This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.
4.1.1 Examples [1] para 4 "Hence each token occupies exactly one position, and no overlapping of tokens occurs." That's true for the sample tokenization, but 4.1 just gave an example tokenization with overlapping tokens. What are the relative positions of "dampf", "dampfschiff", and "dampfschifffahrt"? (This might actually be a technical comment.) [2] list 2, bullet 1 "The tokens in the first element are assigned relative paragraph number 1" [2a] s/first element/first 'offer' element/, I think you mean. [2b] But you said that end of line characters are paragraph delimiters, so the tokens in the first 'offer' element would be in paragraphs 1, 2, and 3.
The FTTF considered your item [1] at its F2F in June, 2007. In response to this report, we have clarified that the sentence you cited applies specifically to the example at hand and does not specify a broad rule for the full-text spec as a whole. We have also rewritten the paragraph immediately preceding section 4.1 to make it very clear that the code and XML documents that appear in Section 4 exist for expositional purposes only (to explain what the semantics are for various operations) and not to indicate prescriptive implementation techniques. As other parts of this bug report are resolved, further comments will be added to this bug until they are all resolved. At that time, we will mark the bug RESOLVED and ask you to mark it CLOSED.
WRT 2a: classified as editorial and done.
#2b classified as editorial; done by striking "and end of line characters" This resolves all the items in this bug. Please mark as CLOSED.