IRC log of pronunciation on 2019-09-18

Timestamps are in UTC.

01:43:08 [RRSAgent]
RRSAgent has joined #pronunciation
01:43:08 [RRSAgent]
logging to https://www.w3.org/2019/09/18-pronunciation-irc
01:43:11 [dom]
RRSAgent, make log public
01:47:24 [dom]
Meeting: Improving Spoken Presentation of Content
01:51:20 [koalie]
koalie has joined #pronunciation
01:52:00 [koalie]
RRSAgent, make logs public
01:52:05 [koalie]
koalie has changed the topic to: https://w3c.github.io/tpac-breakouts/sessions.html
01:52:10 [koalie]
koalie has left #pronunciation
01:52:24 [CharlesL]
CharlesL has joined #pronunciation
01:56:06 [Zakim]
Zakim has joined #pronunciation
01:57:22 [dom]
dom has joined #pronunciation
01:58:43 [Irfan]
Irfan has joined #pronunciation
02:00:22 [Avneesh]
Avneesh has joined #pronunciation
02:01:18 [joanie]
joanie has joined #pronunciation
02:01:24 [jihye]
jihye has joined #pronunciation
02:01:34 [achraf]
present+
02:01:42 [CharlesL]
present+
02:01:45 [mhakkinen]
mhakkinen has joined #pronunciation
02:01:58 [joanie]
present+ Joanmarie_Diggs
02:02:24 [CharlesL]
Meeting: Improving Spoken Presentation of Web Content
02:02:41 [CharlesL]
rrsagent, make logs public
02:03:01 [Irfan]
present+
02:03:06 [CharlesL]
chair: mhakkinen
02:03:44 [whsieh]
whsieh has joined #pronunciation
02:04:11 [CharlesL]
scribe+
02:04:33 [jihye]
present+
02:04:35 [CharlesL]
MH: will give you background on what we are doing in the pronunciation TF under the APA
02:04:38 [CharlesL]
present+
02:04:47 [Makoto_]
Makoto_ has joined #pronunciation
02:04:49 [Roy]
present+
02:05:26 [Makoto_]
presen+
02:05:32 [Makoto_]
present+
02:05:41 [CharlesL]
chair+ Irfan
02:07:02 [Avneesh]
present+
02:07:25 [Omar]
Omar has joined #pronunciation
02:07:34 [Judy]
Judy has joined #pronunciation
02:07:38 [Judy]
present+ Judy
02:07:42 [CharlesL]
MH: idea behind the pronunciation TF, under APA; facilitator is Irfan
02:07:46 [Judy]
present+ Janina
02:08:10 [CharlesL]
… Pearson, DAISY, supported by Microsoft, College Board participating
02:08:37 [Irfan]
https://w3c.github.io/pronunciation/user-scenarios/
02:08:40 [CharlesL]
… the need came from the education community; Pearson and College Board do educational assessments. Active since Oct 2018
02:08:44 [Irfan]
https://w3c.github.io/pronunciation/gap-analysis/
02:08:58 [Irfan]
https://w3c.github.io/pronunciation/use-cases/
02:09:24 [CharlesL]
… first working drafts just published: gap analysis, use cases and user scenarios.
02:10:01 [CharlesL]
… students using AT and read-aloud technology were listening to content that was being misspoken
02:10:40 [CharlesL]
… in an education setting even slight problems are major: if a word is not spoken exactly as the teacher says it, that is a problem. Read aloud tools can assist language learners.
02:10:54 [CharlesL]
… and people with learning disabilities trying to understand content on the web.
02:11:12 [CharlesL]
… voice-based assistants (e.g. Alexa, Siri, Google Home).
02:11:19 [aarongu_]
aarongu_ has joined #pronunciation
02:11:39 [CharlesL]
… how do we enable content authors to make these systems speak the content correctly?
02:12:12 [CharlesL]
… We can't do this yet in HTML. Audio books in EPUB with TTS, people reading their ebooks, or books produced en masse using TTS are a use case here.
02:12:28 [CharlesL]
… accurate spoken content is critical in publishing and educational domains.
02:12:40 [CharlesL]
… we are trying to solve the problem today.
02:12:50 [Irfan]
RRSAgent, make minutes
02:12:50 [RRSAgent]
I have made the request to generate https://www.w3.org/2019/09/18-pronunciation-minutes.html Irfan
02:12:54 [igarashi]
igarashi has joined #pronunciation
02:13:07 [igarashi]
present+
02:13:14 [CharlesL]
… there are hacks today: improper use of the ARIA standard with aria-label, but that only helps SR users, not read-aloud tools.
02:14:26 [CharlesL]
… data attributes are being used in proprietary products, but there is no interoperability. Refreshable Braille: what ETS, Pearson, etc. put into ARIA is sent to the speech synth but is then shown on the braille display incorrectly, which is a real problem.
02:15:10 [CharlesL]
… looking for a standards-based solution. SSML: a growing number of speech engines support it, e.g. Amazon Polly. CSS Speech is dead.
02:15:16 [CharlesL]
AT have nothing to support.
02:16:14 [CharlesL]
… decision by the author: speech synths are getting better, but in an education context the author is best placed to suggest the spoken content. In the US there is a consortium for spoken math content.
02:16:58 [CharlesL]
… people put commas in the text to add pauses, but this causes issues on the braille display, which gets ,,,, etc. for a long pause.
02:17:34 [CharlesL]
SSML is a great standard, CSS Speech is dead, PLS (Pronunciation Lexicon Specification) is another specification, for lexicons.
02:17:52 [CharlesL]
… PLS can be domain specific say in chemistry.
02:17:56 [Makoto_]
In the context of EPUB3, we have a standard attribute for embedding SSML within HTML content documents.
02:18:21 [Makoto_]
It has been very heavily used in Japan. For example, by the biggest textbook publisher (Tokyo Shoseki).
02:18:53 [CharlesL]
… Gap analysis covers: changing language of content, voice gender, phonetic pronunciation, substitution, pitch, volume, emphasis, say-as; see the Gap analysis document.
02:19:21 [CharlesL]
… for example, a zip code won't be read as separate digits.
02:19:29 [CharlesL]
… pausing is an issue.
02:20:13 [CharlesL]
… HTML lets us mark up language and semantic emphasis; language is supported, but emphasis is not a widely supported capability.
02:20:30 [CharlesL]
ARIA does not solve the problem; it could be used for substitution, but this would be a hack.
02:20:41 [CharlesL]
PLS helps with phonetic pronunciation.
02:21:06 [CharlesL]
CSS Speech did rate, pitch and volume but not much else.
02:21:27 [CharlesL]
SSML does cover all of these potential gaps.
02:22:05 [CharlesL]
Makoto: a Japanese publishing company uses SSML, but it costs 4x more
02:22:32 [CharlesL]
MH: that's only for phonetic pronunciation; we could make it easier to mark up the language.
02:23:02 [CharlesL]
… say-as digits/numbers, emphasis, break, verbosity: these we want to expose to content creators.
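The features listed above (say-as, break, emphasis) can be illustrated with a small SSML fragment. The following is only a sketch, not markup proposed by the TF: the sample sentence, the zip code, and the `interpret-as="digits"` value are assumptions (SSML leaves the `interpret-as` values to the speech engine), and `build_ssml` is a hypothetical helper name. Note the explicit `<break>` in place of the comma hack mentioned earlier.

```python
import xml.etree.ElementTree as ET


def build_ssml() -> str:
    """Build a small SSML document using say-as, break and emphasis."""
    speak = ET.Element("speak", {"version": "1.1", "xml:lang": "en-US"})
    speak.text = "The zip code is "

    # Ask the engine to read the zip code digit by digit.
    # "digits" is an engine-dependent value, assumed here for illustration.
    say_as = ET.SubElement(speak, "say-as", {"interpret-as": "digits"})
    say_as.text = "08541"
    say_as.tail = "."

    # An explicit pause, instead of padding the text with commas.
    pause = ET.SubElement(speak, "break", {"time": "500ms"})
    pause.tail = " This is "

    emphasis = ET.SubElement(speak, "emphasis", {"level": "strong"})
    emphasis.text = "important"
    emphasis.tail = "."

    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml())
```

Because the pause lives in markup rather than in the text, a braille display would still receive the plain sentence without extra commas.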
02:23:35 [Makoto_]
I am afraid that I have to go to a JLreq TF meeting.
02:23:37 [CharlesL]
Inline SSML within HTML has been a nonstarter; in talks with AT vendors, at this point they are not looking to support inline SSML
02:24:23 [CharlesL]
… attribute model in EPUB3, like data-ssml or just ssml but these are only hacks.
02:24:30 [Makoto_]
But let me ask about the API between browser engines or EPUB reading systems and T2S engines.
02:24:44 [CharlesL]
Key points: content must encode SSML into HTML
02:24:55 [Makoto_]
Text only? Or DOM tree? This issue was raised in the joint meeting of the CSS and I18N.
02:25:09 [CharlesL]
AT and other speech producers must be able to consume the SSML from the content.
02:25:24 [CharlesL]
TTS must consume the SSML and render the correct speech.
02:25:50 [Makoto_]
BTW, DAISY people in Japan are very skeptical about the use of ruby for T2S.
02:25:54 [CharlesL]
Apple can map most of the SSML to the native speech, would be great to support this
02:26:18 [CharlesL]
??: Apple's position on this has not changed
02:26:54 [dontcallmeDOM]
dontcallmeDOM has joined #pronunciation
02:27:26 [whsieh]
CharlesL: whsieh here from Apple (sorry!)
02:27:29 [whsieh]
present+
02:27:33 [CharlesL]
MH: 2015: worked with IMS on adding SSML to the QTI standard (an authoring profile that allowed authors to use SSML in test questions), but QTI gets translated to HTML and the SSML support is then lost.
02:27:50 [CharlesL]
s/??/whsieh
02:27:51 [Roy]
See the Pronunciation Overview at https://www.w3.org/WAI/pronunciation/
02:28:21 [CharlesL]
… the attribute approach, data-ssml, has some support
02:28:36 [CharlesL]
… simple JSON value pairs
02:28:55 [CharlesL]
… some vendors seem to think this is a doable option.
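To make the attribute approach concrete, here is a minimal sketch of how a speech consumer might turn a `data-ssml` attribute holding JSON into SSML. The JSON shape used here (`{"element": {"attribute": "value"}}`) is an assumption for illustration, not the TF's agreed format; `DataSsmlToSsml` and `html_to_ssml` are hypothetical names, and the sketch assumes every element has an explicit closing tag.

```python
import json
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr


class DataSsmlToSsml(HTMLParser):
    """Collect text from HTML, wrapping elements that carry data-ssml."""

    def __init__(self):
        super().__init__()
        self.parts = []
        # One entry per open HTML element: the SSML closing tags to emit
        # when that element ends ("" if it carried no data-ssml).
        self.stack = []

    def handle_starttag(self, tag, attrs):
        raw = dict(attrs).get("data-ssml")
        closing = ""
        if raw:
            # Assumed shape: {"say-as": {"interpret-as": "characters"}}
            for name, params in json.loads(raw).items():
                attr_text = "".join(
                    f" {key}={quoteattr(str(value))}"
                    for key, value in params.items()
                )
                self.parts.append(f"<{name}{attr_text}>")
                closing = f"</{name}>" + closing
        self.stack.append(closing)

    def handle_endtag(self, tag):
        if self.stack:
            self.parts.append(self.stack.pop())

    def handle_data(self, data):
        self.parts.append(escape(data))


def html_to_ssml(html: str) -> str:
    parser = DataSsmlToSsml()
    parser.feed(html)
    parser.close()
    return "<speak>" + "".join(parser.parts) + "</speak>"


if __name__ == "__main__":
    sample = (
        "<p>Angle <span data-ssml='{\"say-as\": "
        "{\"interpret-as\": \"characters\"}}'>CAB</span> is 30 degrees.</p>"
    )
    print(html_to_ssml(sample))
```

The visible text stays untouched in the HTML, so a braille display can show "CAB" while the speech path receives the say-as hint.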
02:29:11 [CharlesL]
Irfan: there is a wiki page for the example
02:29:43 [Irfan]
https://github.com/w3c/pronunciation/wiki
02:29:51 [CharlesL]
MH: example: an angle of 30deg; instead of AT saying "CAB" as a word, C A B should be interpreted as separate characters.
02:30:27 [CharlesL]
… no speech synth can do coordinates in math. There is also the substitution method, where pm gets expanded to picometer, for example.
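The substitution method can be sketched with the SSML `<sub alias="…">` element, which tells the engine to speak the alias in place of the written text. This is only an illustration: `apply_substitutions` is a hypothetical helper, the alias table is supplied by the author, and whole-word matching is an assumed simplification (a bare "pm" could also be a time of day, which is why the author has to decide).

```python
import re
from xml.sax.saxutils import escape, quoteattr


def apply_substitutions(text: str, aliases: dict) -> str:
    """Wrap each whole-word abbreviation in an SSML <sub alias="..."> element."""
    if not aliases:
        return escape(text)

    # Longest keys first so that longer abbreviations win over shorter ones.
    keys = sorted(aliases, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")

    out = []
    pos = 0
    for match in pattern.finditer(text):
        out.append(escape(text[pos:match.start()]))
        word = match.group(1)
        out.append(f"<sub alias={quoteattr(aliases[word])}>{escape(word)}</sub>")
        pos = match.end()
    out.append(escape(text[pos:]))
    return "".join(out)


if __name__ == "__main__":
    print(apply_substitutions("The bond length is 154 pm.", {"pm": "picometers"}))
```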
02:30:49 [CharlesL]
Judy: noun and verb are pronounced with different emphasis,
02:30:56 [CharlesL]
MH: we haven't seen that in practice.
02:32:04 [CharlesL]
… creating web components, inline SSML, multiple attributes
02:32:28 [CharlesL]
… a survey has been sent out to speech consumers asking which options are acceptable.
02:33:25 [CharlesL]
Omar: our use case: we use SSML for a chatbot to serve customers, not for a11y/SR; we send the voice file
02:33:30 [Judy]
[Judy: Wow! (Markku, comprehensive overview, thank you!)]
02:34:02 [CharlesL]
… we would have to stop doing that from the backend to support SR. The other issue is supporting other languages.
02:34:32 [CharlesL]
Janina: when we get to the normative part of the spec, we will need to specify language to ensure all TTS is already loaded.
02:34:32 [Judy]
q+
02:34:45 [CharlesL]
Judy: isn't that a guideline in WCAG?
02:34:53 [Irfan]
ack Judy
02:35:00 [CharlesL]
Janina: with inline you must declare the languages
02:35:25 [CharlesL]
aaron: we could do a prescan automation,
02:35:35 [CharlesL]
Omar: but will that be a refresh of page?
02:36:01 [CharlesL]
aaron: should not be a problem, nor a refresh.
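The prescan idea could look roughly like the following: walk the markup once, collect every declared `lang` value, and hand that list to the TTS layer so voices can be loaded up front. This is a sketch under assumptions, not what any AT does today; `LangScanner` and `declared_languages` are hypothetical names, and how a voice is actually preloaded is engine-specific and left out.

```python
from html.parser import HTMLParser


class LangScanner(HTMLParser):
    """Record each distinct lang attribute value in document order."""

    def __init__(self):
        super().__init__()
        self.languages = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name == "lang" and value and value not in self.languages:
                self.languages.append(value)


def declared_languages(html: str) -> list:
    scanner = LangScanner()
    scanner.feed(html)
    scanner.close()
    return scanner.languages


if __name__ == "__main__":
    page = (
        '<html lang="en"><body><p lang="fr">Bonjour</p>'
        '<p lang="en">Hello</p><span lang="ar">مرحبا</span></body></html>'
    )
    print(declared_languages(page))
```

The scan only reads the markup, which is consistent with the point that no page refresh should be needed.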
02:36:37 [achraf]
q+ Arabic Diacritics (long/short vowels)
02:36:53 [CharlesL]
MH: WCAG 2.2 might help us. Description of spoken content is a AAA requirement; would like to see it as AA.
02:37:00 [Judy]
q+
02:37:12 [Irfan]
ack achraf
02:38:16 [CharlesL]
Arabic: Arabic terms depend on the context of the sentence to add specific diacritics.
02:38:18 [Irfan]
ack Judy
02:38:38 [CharlesL]
MH: ruby text
02:39:06 [Irfan]
https://w3c.github.io/pronunciation/use-cases/#use-case-ruby
02:39:24 [CharlesL]
Judy: found the overview helpful but information-dense; I think a very high-level overview would be good.
02:39:48 [Judy]
q?
02:39:57 [Judy]
ack ar
02:39:59 [Judy]
ack di
02:40:07 [Judy]
ack (long
02:40:11 [aarongu]
aarongu has joined #pronunciation
02:40:28 [CharlesL]
Bobby: requirement for Japanese text layout: the Ruby model has technical issues; markup places Ruby above or on the right side
02:41:20 [Irfan]
zakim, who is here?
02:41:20 [Zakim]
Present: achraf, CharlesL, Joanmarie_Diggs, Irfan, jihye, Roy, Makoto_, Avneesh, Judy, Janina, igarashi, whsieh
02:41:22 [Zakim]
On IRC I see aarongu, dontcallmeDOM, igarashi, Judy, Omar, whsieh, mhakkinen, jihye, joanie, Avneesh, Irfan, Zakim, CharlesL, RRSAgent, shawn-away, achraf, Roy, trackbot
02:42:27 [CharlesL]
… issues with ruby model to support Japanese language, can hear these annotations twice, should just skip the Ruby base.
02:42:35 [CharlesL]
q?
02:43:33 [CharlesL]
… issue with pronunciation with ruby annotations potentially. Chinese traditional / simplified
02:43:46 [CharlesL]
q?
02:44:04 [Roy]
ack v
02:45:41 [CharlesL]
MH: one challenge is getting all the stakeholders in the same room; we haven't had any Chinese companies be part of the TF. Would be great to get review from Apple and Google; would be great to get Apple's involvement. We welcome more input, more eyes looking at what we are doing.
02:46:12 [CharlesL]
… I am looking at Avneesh as representing the Publishing community.
02:46:40 [CharlesL]
Avneesh: Matt Garrish has already been assigned to review your specification.
02:47:17 [CharlesL]
Irfan: FPWD has been published; we will add more, such as to the gap analysis, and add examples
02:47:46 [CharlesL]
… use cases need more examples based on the feedback here. We have some timelines and are working towards meeting those.
02:48:35 [CharlesL]
MH: ETS has some testing tools to explore the different markup approaches; they tend to work across platforms, but for Mac there is extra JS to do the mappings.
02:49:08 [CharlesL]
Irfan: the Survey, we got some feedback but still waiting, so we extended the date to next week.
02:49:44 [CharlesL]
MH: we will send out further surveys as we get closer to some recommendations; we are reaching out to the AT vendors and consumers, the Amazons, Googles, etc.
02:50:23 [CharlesL]
I was working on an Alexa skill that would take content from the web and if there was SSML contained then it would be spoken correctly.
02:51:41 [CharlesL]
MH: content editable is an important case
02:52:36 [CharlesL]
… input text: can speech markup be done there? JS range case
02:53:16 [CharlesL]
Irfan: HTML content editable, JS can manipulate this…
02:53:40 [CharlesL]
MH: a master's student could take this on: WYHIWYS (What You Hear Is What You See)
02:54:15 [CharlesL]
MH: costs: how do we make this easier, cheaper, and easy to maintain?
02:55:10 [CharlesL]
… thank you all for coming here today; ruby text, the cost of SSML, and text entry input were all great topics to bring up.
02:55:20 [CharlesL]
Thanks everyone. great discussion.
02:55:22 [joanie]
joanie has left #pronunciation
02:55:45 [CharlesL]
rrsagent, draft minutes
02:55:45 [RRSAgent]
I have made the request to generate https://www.w3.org/2019/09/18-pronunciation-minutes.html CharlesL
03:44:26 [whsieh]
whsieh has joined #pronunciation
03:45:05 [whsieh]
whsieh has left #pronunciation
04:29:39 [dontcallmeDOM]
dontcallmeDOM has joined #pronunciation
04:34:36 [Roy]
Roy has joined #pronunciation
04:35:20 [CharlesL]
CharlesL has joined #pronunciation
04:36:37 [Judy]
Judy has joined #pronunciation
04:44:11 [dom]
dom has left #pronunciation
04:52:11 [Zakim]
Zakim has left #pronunciation
05:23:38 [CharlesL]
CharlesL has joined #pronunciation
05:35:33 [CharlesL]
CharlesL has joined #pronunciation
05:37:27 [Judy]
Judy has joined #pronunciation
05:39:39 [Roy]
Roy has joined #pronunciation
05:52:00 [aarongu]
aarongu has joined #pronunciation
06:18:38 [Roy]
Roy has joined #pronunciation
06:34:23 [CharlesL]
CharlesL has joined #pronunciation
06:54:51 [Roy]
Roy has joined #pronunciation
07:37:52 [aarongu]
aarongu has joined #pronunciation
07:38:09 [aarongu]
aarongu has left #pronunciation
08:04:00 [CharlesL]
CharlesL has joined #pronunciation
08:26:17 [Roy]
Roy has joined #pronunciation
08:52:05 [CharlesL]
CharlesL has left #pronunciation
08:52:27 [Judy]
Judy has joined #pronunciation