IRC log of ixml on 2023-05-09
Timestamps are in UTC.
- 13:59:50 [RRSAgent]
- RRSAgent has joined #ixml
- 13:59:55 [RRSAgent]
- logging to https://www.w3.org/2023/05/09-ixml-irc
- 14:01:02 [norm]
- topic: iXML meeting
- 14:01:03 [cmsmcq]
- cmsmcq has joined #ixml
- 14:01:08 [norm]
- scribe: norm
- 14:01:16 [norm]
- present: Steven, John, Bethan, Michael, Norm
- 14:01:37 [norm]
- date: 9 May 2023
- 14:02:27 [norm]
- meeting: iXML CG
- 14:03:00 [norm]
- topic: Review of action items
- 14:03:23 [norm]
- ACTION 2023-01-10-b continues
- 14:03:27 [norm]
- ACTION 2023-01-10-d continues
- 14:03:31 [norm]
- ACTION 2023-01-10-f continues
- 14:03:36 [norm]
- ACTION 2023-01-10-h continues
- 14:03:57 [norm]
- ACTION 2023-02-07-a continues; Steven had trouble logging in
- 14:04:23 [norm]
- ACTION 2023-02-07-b continues; merge with 2023-01-10-d
- 14:04:37 [norm]
- ACTION 2023-04-11-a continues
- 14:04:47 [norm]
- ACTION 2023-04-11-b continues
- 14:04:54 [norm]
- ACTION 2023-04-11-c continues
- 14:05:46 [norm]
- ACTION 2023-04-11-d completed https://lists.w3.org/Archives/Public/public-ixml/2023May/0005.html
- 14:05:58 [norm]
- topic: Status reports
- 14:06:36 [norm]
- John: I've added the ability to switch off indentation becuase it interferes with test results
- 14:07:18 [norm]
- John: The problem isn't the test suite, it's the XML tree to display. The spec uses the term "elide" but doesn't clearly define it. Are you permitted to completely remove whitespace?
- 14:07:48 [norm]
- Steven: Yes, one of my examples at the web conference has two spaces but they disappear.
- 14:08:14 [norm]
- John: Right, the serializer uses indentation in the indentation method and my reading is that under those circumstances, it's possible for purely whitespace nodes to be removed.
- 14:08:32 [norm]
- John: The processor certainly produces two spaces.
- 14:08:48 [norm]
- Bethan: I'd understand "elide" to mean leave out.
- 14:08:56 [norm]
- John: I think of it as to shorten.
- 14:09:06 [norm]
- Some discussion of the use of "..." in elided text.
- 14:09:30 [norm]
- Norm: I published an update that fixed some bugs.
- 14:09:40 [norm]
- Steven: I haven't made any changes in the last month as I recall.
- 14:09:54 [norm]
- Some discussion of the web conference.
- 14:10:07 [norm]
- Steven: At the last moment, they moved the tutorial but they didn't tell anyone.
- 14:10:44 [norm]
- Much enthusiasm from a small tutorial audience.
- 14:10:55 [norm]
- Steven: I also gave a talk on the dev day and that had about 20-25 people.
- 14:11:02 [norm]
- topic: Publication plans
- 14:11:18 [norm]
- No discussion this month.
- 14:11:23 [norm]
- topic: Issues #174 and #175 BOM
- 14:12:24 [norm]
- Steven: I think we agree to ignore the BOM at the beginning but not to ignore them in the middle (in the case of concatentated files)
- 14:13:19 [norm]
- Micheal: I'm puzzled by one thing. I think of operations like recognizing the BOM, setting the endianness, etc. as something that gets handled by a low-level system routine.
- 14:13:29 [norm]
- ... that provides the program with a sequence of characters.
- 14:13:48 [norm]
- ... If you're iXML processor is seeing a BOM, you have an OS issue. Am I just spoiled?
- 14:13:59 [norm]
- Steven: I do agree, especially since UTF-8 doesn't need a BOM.
- 14:14:47 [norm]
- Norm: If you open a UTF-8 file with a BOM in Java, the first two bytes you read are a BOM!
- 14:15:22 [norm]
- Michael: Can we somehow find a way to say ignore a BOM and also say in passing that you shouldn't be seeing one?
- 14:16:05 [norm]
- Norm: We could, but it isn't going to change anything.
- 14:16:35 [norm]
- Michael: Okay, but I would find it useful as a reader. I don't think we should take the position that that's normal.
- 14:17:26 [norm]
- ... I think we should say it about the input string as well.
- 14:18:03 [norm]
- ... Otherwise, you have to tell users to write their grammars to ignore the BOM.
- 14:18:11 [norm]
- General agreement.
- 14:18:49 [norm]
- Michael: I think that should be a "should" not a "must".
- 14:19:14 [norm]
- Some discussion of BOMs in UTF-16.
- 14:19:30 [norm]
- Norm asserts that in UTF-16 files, the BOM is stripped off by the OS/library.
- 14:20:05 [norm]
- Michael: I think it's sufficient to say "if it's UTF-8" and leave it to implementations to document that.
- 14:20:33 [norm]
- ACTION: 2023-05-09-a: Norm to revise the erratum to include stripping the BOM from UTF-8 input strings (as a should)
- 14:21:03 [norm]
- topic: Issue #176 Encoding CR, NEL, LINE SEPARATOR etc
- 14:22:08 [norm]
- Norm: I don't think there's a bug here.
- 14:22:39 [norm]
- Some discussion of the significance of "If you use the XSLT and XQuery Serialization..."
- 14:23:11 [norm]
- Michael: Can we have a non-binding note somewhere that observes how XSLT and XQuery Serialization works.
- 14:23:45 [norm]
- Norm: I think we just need a note.
- 14:24:06 [norm]
- Michael: It will arise for some people, I'm persuaded we should add a note describing it.
- 14:25:47 [norm]
- Steven: I'm not sure how this effects interoperability. We talk about serialization. To what extent should we more normative than we are?
- 14:26:16 [norm]
- John: We stop at the point where you've produced an XML tree as a serialization, how you get that into text isn't part of our text.
- 14:27:00 [norm]
- Norm: I'd be in favor of a normative reference to XSLT/XQuery serialization.
- 14:27:24 [norm]
- Michael: I'm not sure our requirements are the same. We don't need to be about "round trip" data model instances.
- 14:27:55 [norm]
- ... I wouldn't be at all unhappy if an iXML processor serialized those CRs as literal characters and not character references because they only reason they're there is that the OS convention for line beaks.
- 14:28:21 [norm]
- ... When they get written out, I want the same convention and encoding them as #d; and just using line feeds isn't just surprising, I don't like it very much.
- 14:28:41 [norm]
- ... This is an example of a case where serialization is attempting to deal with problems that aren't ours.
- 14:29:22 [norm]
- Steven: There's the related question of what should happen if you output a newline in an attribute.
- 14:30:08 [norm]
- Some discussion of how the email was rendered.
- 14:31:01 [norm]
- https://lists.w3.org/Archives/Public/public-ixml/2023May/0006.html
- 14:32:15 [norm]
- (It appears that the something went wrong with Steven's mailer's interpretation of 
 in the message Norm sent)
- 14:33:06 [norm]
- Michael: I'm nervous about a normative reference to XSLT and XQuery Serialization because it's a big spec.
- 14:33:20 [norm]
- ... I think an informative reference would be a good idea.
- 14:34:22 [norm]
- Norm: Would you be ok with a literal newline in the attribute value?
- 14:34:46 [norm]
- Michael: I've been asking myself that question...I'm going to give you two answers: yes and no.
- 14:35:27 [norm]
- ... As a spec lawyer, that would trouble me somewhat, but as a markup practitioner, I decided years ago that if you care about details like that, use markup to retain it. Literal whitespace gets rearranged by lots of tools.
- 14:37:18 [norm]
- Norm: Proposal: an informative reference to the XSLT and XQuery Serialization specification and a short discussion somewhere in the specification about the consequences of serialization (attribute values with whitespace, etc.)
- 14:37:32 [norm]
- John: You also have to be aware of indentation changing the output.
- 14:37:59 [norm]
- Michael: Our specification doesn't overtly contemplate indentation.
- 14:38:37 [norm]
- Michael: I think Norm's suggestion is a good one. We have a section on serialization...
- 14:38:50 [norm]
- Steven: We also have a "hints to the implementor" section that might be a better place.
- 14:39:28 [norm]
- Michael: I think an informative reference and prose in either or both of those sections would be a good idea.
- 14:40:09 [norm]
- ACTION 2023-05-09-b: Norm to make a pass attempting to describe serialization
- 14:40:37 [norm]
- ACTION 2023-05-09-c: Steven to produce new sample grammars for issue #139 for discussion in June.
- 14:41:17 [norm]
- Steven: Let's make sure JSON is on the agenda for next month.
- 14:41:35 [norm]
- Steven: Do we have a renaming proposal?
- 14:41:41 [norm]
- John: No, not yet.
- 14:42:10 [norm]
- ACTION 2023-05-09-d: Steven to produce a discussion document for renaming (issue #13)
- 14:42:38 [norm]
- Reminder: everyone should read John's message about dynamic renaming
- 14:43:47 [norm]
- Next meeting is 13 June 2023.
- 14:43:51 [norm]
- Adjourned.
- 14:43:56 [norm]
- rrsagent, set logs world visible
- 14:44:13 [norm]
- rrsagent, publish minutes
- 14:44:14 [RRSAgent]
- I have made the request to generate https://www.w3.org/2023/05/09-ixml-minutes.html norm
- 14:44:37 [cmsmcq]
- chair: Steven
- 14:44:37 [norm]
- Chair: Steven
- 14:44:39 [norm]
- rrsagent, publish minutes
- 14:44:41 [RRSAgent]
- I have made the request to generate https://www.w3.org/2023/05/09-ixml-minutes.html norm
- 14:45:20 [norm]
- s/ACTION 2023/ACTION: 2023/g
- 14:45:24 [norm]
- rrsange, publish minutes
- 14:45:43 [norm]
- rrsagent, publish minutes
- 14:45:44 [RRSAgent]
- I have made the request to generate https://www.w3.org/2023/05/09-ixml-minutes.html norm
- 14:46:20 [norm]
- rrsagent, publish minutes
- 14:46:21 [RRSAgent]
- I have made the request to generate https://www.w3.org/2023/05/09-ixml-minutes.html norm
- 14:47:07 [norm]
- Close enough, I think.
- 15:04:01 [mjoel]
- mjoel has joined #ixml
- 15:04:17 [norm]
- norm has joined #ixml
- 15:04:21 [norm]
- rrsagent, pointer
- 15:04:21 [RRSAgent]
- See https://www.w3.org/2023/05/09-ixml-irc#T15-04-21
- 15:04:44 [mjoel]
- Am I on the right Google Meet?
- 15:05:35 [Steven]
- Call is over. The time was wrong in the agenda I'm afraid
- 15:06:58 [mjoel]
- THere was a typo in the link: https://www.timeanddate.com/worldclock/fixedtime.html?msg=ixml+online+meeting&iso=202305097T1500 (an extra 7 before T) but even correcting for that, it gave me a time of 11am Eastern
- 15:09:27 [mjoel]
- OK, I see the update
- 16:03:45 [cmsmcq]
- cmsmcq has joined #ixml
- 17:38:16 [cmsmcq]
- cmsmcq has joined #ixml
- 20:28:00 [Steven]
- Steven has left #ixml