IRC log of htmlspeech on 2011-06-16

Timestamps are in UTC.

16:01:07 [RRSAgent]: RRSAgent has joined #htmlspeech
16:01:07 [RRSAgent]: logging to http://www.w3.org/2011/06/16-htmlspeech-irc
16:01:17 [burn]: trackbot, start telcon
16:01:19 [trackbot]: RRSAgent, make logs public
16:01:21 [trackbot]: Zakim, this will be
16:01:21 [Zakim]: I don't understand 'this will be', trackbot
16:01:22 [trackbot]: Meeting: HTML Speech Incubator Group Teleconference
16:01:22 [trackbot]: Date: 16 June 2011
16:01:26 [burn]: zakim, aacc is Dan_Druta
16:01:26 [Zakim]: +Dan_Druta; got it
16:01:32 [Zakim]: + +1.425.830.aadd
16:01:43 [burn]: zakim, nick DanD is Dan_Druta
16:01:43 [Zakim]: ok, burn, I now associate DanD with Dan_Druta
16:01:52 [burn]: zakim, aadd is Charles_Hemphill
16:01:52 [Zakim]: +Charles_Hemphill; got it
16:02:09 [burn]: zakim, nick Charles is Charles_Hemphill
16:02:09 [Zakim]: ok, burn, I now associate Charles with Charles_Hemphill
16:02:14 [burn]: zakim, who's here?
16:02:14 [Zakim]: On the phone I see +1.818.237.aaaa, Milan_Young, [Microsoft], Michael_Johnston, Dan_Burnett, Michael_Bodell, Olli_Pettay, Dan_Druta, Charles_Hemphill
16:02:16 [Zakim]: On IRC I see RRSAgent, satish, DanD, mbodell, Charles, Zakim, smaug, burn, Milan, Robert, trackbot
16:02:35 [burn]: zakim, aaaa is Patrick_Ehlen
16:02:35 [Zakim]: +Patrick_Ehlen; got it
16:02:49 [burn]: zakim, [Microsoft] is Robert_Brown
16:02:49 [Zakim]: +Robert_Brown; got it
16:03:01 [MichaelJ]: MichaelJ has joined #htmlspeech
16:03:44 [Zakim]: -Olli_Pettay
16:04:20 [burn]: Scribe: Patrick_Ehlen
16:05:37 [ehlen]: ehlen has joined #htmlspeech
16:05:38 [burn]: satish, will you be joining us today? or anyone from Google?
16:05:47 [burn]: zakim, nick ehlen is Patrick_Ehlen
16:05:47 [Zakim]: ok, burn, I now associate ehlen with Patrick_Ehlen
16:05:52 [burn]: ScribeNick: ehlen
16:06:11 [burn]: zakim, who's here?
16:06:11 [Zakim]: On the phone I see Patrick_Ehlen, Milan_Young, Robert_Brown, Michael_Johnston, Dan_Burnett, Michael_Bodell, Dan_Druta, Charles_Hemphill
16:06:14 [Zakim]: On IRC I see ehlen, MichaelJ, RRSAgent, satish, DanD, mbodell, Charles, Zakim, burn, Milan, Robert, trackbot
16:06:14 [smaug]: smaug has joined #htmlspeech
16:06:27 [burn]: Chair: Dan_Burnett
16:06:39 [burn]: Agenda: http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Jun/0033.html
16:06:45 [satish]: burn: I can't join the conf call as I'm on a train, hence only in IRC
16:06:56 [satish]: burn: Bjorn is still on paternity leave, not sure about Glen Shires
16:07:24 [Zakim]: +??P21
16:07:29 [burn]: satish, thanks. Hopefully Glen will join. We will be making decisions today about other discussion topics
16:07:33 [smaug]: Zakim, ??21 is Olli_Pettay
16:07:33 [Zakim]: sorry, smaug, I do not recognize a party named '??21'
16:07:51 [smaug]: Zakim, nick smaug is Olli_Pettay
16:07:51 [Zakim]: sorry, smaug, I do not see a party named 'Olli_Pettay'
16:08:01 [ehlen]: Topic: New design decisions?
16:08:18 [smaug]: Zakim, ??P21 is Olli_Pettay
16:08:20 [Zakim]: +Olli_Pettay; got it
16:08:34 [smaug]: Zakim, nick smaug is Olli_Pettay
16:08:34 [Zakim]: ok, smaug, I now associate you with Olli_Pettay
16:08:37 [ehlen]: robert: is audio recording without recognition be supported?
16:09:00 [ehlen]: are there important scenarios for supporting recording without recognition
16:09:51 [burn]: satish, any update on markup binding?
16:10:12 [ehlen]: topic: markup binding
16:11:33 [satish]: burn: None, Bjorn was collecting input from the chrome team and since he has gone on leave I have no contact on what the status was.
16:12:21 [burn]: satish, can you please check? we are not waiting on the answer, but it would be nice to have the input
16:12:21 [ehlen]: robert: google issue on whether there should be a button to press
16:12:43 [satish]: burn: yes, I can take an action to get a definitive answer in the next few days.
16:13:07 [ehlen]: burn: satish will take this on w/ the chrome team
16:13:12 [ehlen]: topic: discussion time
16:13:36 [ehlen]: topic: do we need to support audio recording with recognition?
16:14:38 [ehlen]: burn: an advantage could be endpointing.
16:14:47 [ehlen]: ... is that an important criteria in this case as well?
16:15:15 [ehlen]: charles: another question is how real-time is the reco response?
16:15:23 [ehlen]: ... a recording may result in reco later
16:16:22 [ehlen]: ... an identifier might later associate the recording with a reco transcription
16:16:50 [ehlen]: burn: brings up question of whether we support reco on recorded audio
16:17:54 [ehlen]: robert: garbage models could be used to make recording in edge cases
16:18:01 [ehlen]: ... "overloading" recognition
16:18:15 [ehlen]: ... or will recording be a more common task
16:18:48 [ehlen]: ... Do we think recording with endpointing is important?
16:20:18 [ehlen]: milan: channel adaptation, sharing headers in same structure, parameters could be reused; sharing the same network paths -- convenient to use same
16:21:09 [ehlen]: Charles: Also, the on-line vs. off-line cases
16:22:20 [ehlen]: milan: would most recording be associated with an attempt to understand the text in the recording?
16:23:00 [ehlen]: burn: Most significant feature is the endpointing
16:23:26 [ehlen]: milan: in that case, why not just use dict model, do reco, and save the waveform as backup?
16:24:06 [ehlen]: ... and how common would that be. If not so common, could use a garbage model (even a "first-class" one)
16:24:41 [ehlen]: burn: seems strange to call recording a weird special case of reco
16:26:04 [ehlen]: burn: in favor of using the recording resource as described in mrcp
16:27:16 [ehlen]: robert: though endpointing may be valuable, would we support a "record" object in the API? how would this go all the way to the developer?
16:27:44 [ehlen]: burn: does not seem to be in our scope
16:28:00 [ehlen]: olli: there are other proposals that would handle recording
16:28:54 [ehlen]: charles: channel adaptation
16:30:39 [ehlen]: burn: channel normalization is not a valid reason for recording support
16:31:32 [ehlen]: charles: should probably also include built-in record grammar
16:32:04 [ehlen]: (milan above)
16:35:03 [ehlen]: milan: use case: may want to to do dictation in parallel with c&c
16:35:21 [ehlen]: ... e.g., provide a c&c followed immediately by dictation
16:36:18 [ehlen]: burn: but does that really belong as a built-in type in a grammar?
16:36:33 [ehlen]: ... sounds like there is not real consensus today vis-a-vis supporting a recording capability
16:37:06 [ehlen]: robert: have not heard a compelling reason to support recording
16:38:08 [ehlen]: burn: consensus not to do it now
16:38:23 [ehlen]: milan: would like a standard way to do it, should the need arise
16:38:55 [ehlen]: burn: we could state that we reserve this for the future
16:39:14 [ehlen]: milan: there should be some consistent and portable way to do this across engines
16:39:59 [ehlen]: robert: could be done as a proprietary extension
16:40:37 [ehlen]: milan: at least provide a consistent hack, like builtin:record
16:40:45 [ehlen]: robert: that's what the garbage model recording would be
16:41:27 [ehlen]: milan: that's fine, as long as all engines support this type of garbage model
16:42:30 [ehlen]: burn: to summarize, can't agree on specific recording scenarios
16:42:46 [ehlen]: (robert above)
16:43:06 [ehlen]: ... should agree on supporting garbage-recording scenario
16:43:26 [ehlen]: burn: as a group, agree not to define an explicit recording capability at this time.
16:43:47 [ehlen]: ... can be supported using a garbage model, or capabilities defined outside this group
16:45:05 [ehlen]: topic: what are the built-ins, and what does that mean?
16:45:55 [ehlen]: milan: existing builtins: dictation, search, address, numbers
16:46:07 [ehlen]: robert: already agreed there should be a certain set of predefined grammars
16:46:14 [ehlen]: ... so how do we refer to those?
16:46:53 [ehlen]: burn: 2 things make builtins interesting: (1) parameterization; (2) no language is required
16:50:05 [ehlen]: milan: markup already has certain defined types, parameters, etc, as native to HTML5. Would make sense to pay attention to that here
16:50:34 [ehlen]: burn: an unconstrained text box should naturally bind to a dictation model
16:50:54 [ehlen]: milan: should we remap the names of the builtins?
16:51:58 [ehlen]: burn: argue strongly for using html as a starting point
16:52:33 [ehlen]: robert: These should be builtins, not re-used vxml grammars
16:53:19 [smaug]: could someone paste a link to voicexml's builtin grammars ?
16:53:27 [ehlen]: charles: they've become a de facto standard; not supporting them is awkward
16:54:16 [Robert]: these are the HTML input types: http://www.w3.org/TR/html5/the-input-element.html#attr-input-type
16:54:24 [ehlen]: burn: if someone wants to support legacy builtins in a way that doesn't break existing builtins, that's not a problem
16:54:28 [Robert]: perhaps have builtins that match these
16:57:00 [ehlen]: charles: there needs to be some way to include these
16:57:23 [ehlen]: (milan above)
16:57:51 [ehlen]: ... is there something about this that can't be represented by a query string?
16:59:06 [ehlen]: michael: do you want to reference, for example, an html number type, or some arbitrary number?
17:00:37 [ehlen]: milan: easier to use old builtins & augment them
17:00:52 [ehlen]: charles: need to look at greater good of using html vs vxml
17:01:21 [mbodell]: Widely implemented? See http://en.wikipedia.org/wiki/URI_scheme
17:04:51 [ehlen]: burn: michael, how would you reference grammars that are assoc. with html input types?
17:06:00 [ehlen]: michael: an html ruleref, with various attributes; or don't specify URI and ref them by markeup AP...
17:06:16 [ehlen]: ... most important is associating grammars with individual input elements
17:08:37 [ehlen]: ... not a strong use case to have URIs for these things, or ability for user to write their own that reference these
17:10:05 [ehlen]: burn: when people want to hack something up quickly, common input types should lend themselves to being included as part of a larger utterance
17:10:20 [ehlen]: michael: may be other ways to specify input for that type of scenarios
17:11:33 [ehlen]: burn: maybe reference not the grammar but the input type itself
17:12:58 [ehlen]: charles: similar input types not always require the same grammar
17:14:02 [ehlen]: burn: but the app author may want a way to link these different types of builtin grammars together
17:14:50 [ehlen]: milan: perhaps just do the proposal
17:15:43 [ehlen]: burn: who on the call is interested in builtin models?
17:16:16 [ehlen]: charles: interested in it; this group seems focused on web search and dictation, as opposed to broader html cases
17:16:25 [mbodell]: <input type="search" name="q" speech required onspeechchange="startSearch">
17:17:28 [ehlen]: michael: there will probably be a standard set of grammar libraries, though perhaps the market will provide those
17:17:57 [ehlen]: johnston: can't see us requiring something like a "zip code" lib, for internationalization reasons
17:18:10 [ehlen]: burn: HTML has already handled a lot of these issues
17:18:21 [ehlen]: (milan above)
17:18:43 [ehlen]: (michael, above, actually)
17:18:51 [mbodell]: s/burn/michael/
17:20:04 [ehlen]: milan: should there be an html binding?
17:20:42 [ehlen]: michael: would be better if you could speech enable certain input types with little work
17:22:09 [ehlen]: robert: if no builtins were specified, what are the consequences?
17:22:49 [ehlen]: burn: if you want broad adoptability and usage, it needs to be as easy to create simple apps as vxml
17:23:15 [ehlen]: robert: we need it to do the html binding.
17:23:49 [ehlen]: ... so how much do we need the html binding part?
17:24:06 [ehlen]: milan: definitely need the capability to specify search, dictation, etc.
17:24:31 [ehlen]: robert: that's different from looking at html input types, etc. that's a complex problem
17:24:55 [ehlen]: milan: would like to have a notion of how to solve binding problem before we do dictation
17:25:58 [ehlen]: robert: does anyone have a proposal to volunteer?
17:26:11 [ehlen]: milan: perhaps can do it after I get the dictation stuff out
17:26:56 [ehlen]: micheal: there is a topic in the API about markup bindings.
17:27:10 [ehlen]: burn: true that it's a binding issue
17:28:13 [ehlen]: burn: without a proposal, it doesn't happen.
17:28:23 [ehlen]: ... so it will be up to someone to write a proposal
17:28:36 [ehlen]: milan: perhaps sending a message to google on this
17:28:42 [ehlen]: robert: or to satish
17:29:11 [ehlen]: burn: action item for milan to talk with satish and ask for help on structuring a proposal
17:29:34 [ehlen]: burn: reminder: no call next week
17:29:49 [ehlen]: robert: but there will be a protocol meeting
17:30:52 [Zakim]: -Michael_Bodell
17:30:53 [Zakim]: -Milan_Young
17:30:56 [Zakim]: -Robert_Brown
17:31:07 [Zakim]: -Olli_Pettay
17:31:08 [Zakim]: -Patrick_Ehlen
17:31:09 [Zakim]: -Michael_Johnston
17:31:23 [Zakim]: -Dan_Burnett
17:31:24 [burn]: zakim, bye
17:31:24 [Zakim]: Zakim has left #htmlspeech
17:31:25 [Zakim]: leaving. As of this point the attendees were +1.818.237.aaaa, Milan_Young, +1.973.452.aabb, Michael_Johnston, Dan_Burnett, Michael_Bodell, Olli_Pettay, +1.425.580.aacc, Dan_Druta,
17:31:27 [Zakim]: ... +1.425.830.aadd, Charles_Hemphill, Patrick_Ehlen, Robert_Brown
17:31:33 [burn]: rrsagent, make log public
17:31:38 [burn]: rrsagent, draft minutes
17:31:38 [RRSAgent]: I have made the request to generate http://www.w3.org/2011/06/16-htmlspeech-minutes.html burn
17:32:34 [burn]: Regrets: Raj_Tumuluri, Bjorn_Bringert
17:35:28 [burn]: s/+1.818.237.aaaa, //
17:35:41 [burn]: s/+1.973.452.aabb, //
17:35:57 [burn]: s/+1.425.580.aacc, //
17:36:12 [burn]: s/+1.425.830.aadd, //
17:36:16 [burn]: rrsagent, draft minutes
17:36:16 [RRSAgent]: I have made the request to generate http://www.w3.org/2011/06/16-htmlspeech-minutes.html burn
17:37:18 [burn]: rrsagent, bye
17:37:18 [RRSAgent]: I see no action items