IRC log of htmlspeech on 2011-06-16

Timestamps are in UTC.

16:01:07 [RRSAgent]
RRSAgent has joined #htmlspeech
16:01:07 [RRSAgent]
logging to http://www.w3.org/2011/06/16-htmlspeech-irc
16:01:17 [burn]
trackbot, start telcon
16:01:19 [trackbot]
RRSAgent, make logs public
16:01:21 [trackbot]
Zakim, this will be
16:01:21 [Zakim]
I don't understand 'this will be', trackbot
16:01:22 [trackbot]
Meeting: HTML Speech Incubator Group Teleconference
16:01:22 [trackbot]
Date: 16 June 2011
16:01:26 [burn]
zakim, aacc is Dan_Druta
16:01:26 [Zakim]
+Dan_Druta; got it
16:01:32 [Zakim]
+ +1.425.830.aadd
16:01:43 [burn]
zakim, nick DanD is Dan_Druta
16:01:43 [Zakim]
ok, burn, I now associate DanD with Dan_Druta
16:01:52 [burn]
zakim, aadd is Charles_Hemphill
16:01:52 [Zakim]
+Charles_Hemphill; got it
16:02:09 [burn]
zakim, nick Charles is Charles_Hemphill
16:02:09 [Zakim]
ok, burn, I now associate Charles with Charles_Hemphill
16:02:14 [burn]
zakim, who's here?
16:02:14 [Zakim]
On the phone I see +1.818.237.aaaa, Milan_Young, [Microsoft], Michael_Johnston, Dan_Burnett, Michael_Bodell, Olli_Pettay, Dan_Druta, Charles_Hemphill
16:02:16 [Zakim]
On IRC I see RRSAgent, satish, DanD, mbodell, Charles, Zakim, smaug, burn, Milan, Robert, trackbot
16:02:35 [burn]
zakim, aaaa is Patrick_Ehlen
16:02:35 [Zakim]
+Patrick_Ehlen; got it
16:02:49 [burn]
zakim, [Microsoft] is Robert_Brown
16:02:49 [Zakim]
+Robert_Brown; got it
16:03:01 [MichaelJ]
MichaelJ has joined #htmlspeech
16:03:44 [Zakim]
-Olli_Pettay
16:04:20 [burn]
Scribe: Patrick_Ehlen
16:05:37 [ehlen]
ehlen has joined #htmlspeech
16:05:38 [burn]
satish, will you be joining us today? or anyone from Google?
16:05:47 [burn]
zakim, nick ehlen is Patrick_Ehlen
16:05:47 [Zakim]
ok, burn, I now associate ehlen with Patrick_Ehlen
16:05:52 [burn]
ScribeNick: ehlen
16:06:11 [burn]
zakim, who's here?
16:06:11 [Zakim]
On the phone I see Patrick_Ehlen, Milan_Young, Robert_Brown, Michael_Johnston, Dan_Burnett, Michael_Bodell, Dan_Druta, Charles_Hemphill
16:06:14 [Zakim]
On IRC I see ehlen, MichaelJ, RRSAgent, satish, DanD, mbodell, Charles, Zakim, burn, Milan, Robert, trackbot
16:06:14 [smaug]
smaug has joined #htmlspeech
16:06:27 [burn]
Chair: Dan_Burnett
16:06:39 [burn]
Agenda: http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Jun/0033.html
16:06:45 [satish]
burn: I can't join the conf call as I'm on a train, hence only in IRC
16:06:56 [satish]
burn: Bjorn is still on paternity leave, not sure about Glen Shires
16:07:24 [Zakim]
+??P21
16:07:29 [burn]
satish, thanks. Hopefully Glen will join. We will be making decisions today about other discussion topics
16:07:33 [smaug]
Zakim, ??21 is Olli_Pettay
16:07:33 [Zakim]
sorry, smaug, I do not recognize a party named '??21'
16:07:51 [smaug]
Zakim, nick smaug is Olli_Pettay
16:07:51 [Zakim]
sorry, smaug, I do not see a party named 'Olli_Pettay'
16:08:01 [ehlen]
Topic: New design decisions?
16:08:18 [smaug]
Zakim, ??P21 is Olli_Pettay
16:08:20 [Zakim]
+Olli_Pettay; got it
16:08:34 [smaug]
Zakim, nick smaug is Olli_Pettay
16:08:34 [Zakim]
ok, smaug, I now associate you with Olli_Pettay
16:08:37 [ehlen]
robert: is audio recording without recognition be supported?
16:09:00 [ehlen]
are there important scenarios for supporting recording without recognition
16:09:51 [burn]
satish, any update on markup binding?
16:10:12 [ehlen]
topic: markup binding
16:11:33 [satish]
burn: None, Bjorn was collecting input from the chrome team and since he has gone on leave I have no contact on what the status was.
16:12:21 [burn]
satish, can you please check? we are not waiting on the answer, but it would be nice to have the input
16:12:21 [ehlen]
robert: google issue on whether there should be a button to press
16:12:43 [satish]
burn: yes, I can take an action to get a definitive answer in the next few days.
16:13:07 [ehlen]
burn: satish will take this on w/ the chrome team
16:13:12 [ehlen]
topic: discussion time
16:13:36 [ehlen]
topic: do we need to support audio recording with recognition?
16:14:38 [ehlen]
burn: an advantage could be endpointing.
16:14:47 [ehlen]
... is that an important criteria in this case as well?
16:15:15 [ehlen]
charles: another question is how real-time is the reco response?
16:15:23 [ehlen]
... a recording may result in reco later
16:16:22 [ehlen]
... an identifier might later associate the recording with a reco transcription
16:16:50 [ehlen]
burn: brings up question of whether we support reco on recorded audio
16:17:54 [ehlen]
robert: garbage models could be used to make recording in edge cases
16:18:01 [ehlen]
... "overloading" recognition
16:18:15 [ehlen]
... or will recording be a more common task
16:18:48 [ehlen]
... Do we think recording with endpointing is important?
16:20:18 [ehlen]
milan: channel adaptation, sharing headers in same structure, parameters could be reused; sharing the same network paths -- convenient to use same
16:21:09 [ehlen]
Charles: Also, the on-line vs. off-line cases
16:22:20 [ehlen]
milan: would most recording be associated with an attempt to understand the text in the recording?
16:23:00 [ehlen]
burn: Most significant feature is the endpointing
16:23:26 [ehlen]
milan: in that case, why not just use dict model, do reco, and save the waveform as backup?
16:24:06 [ehlen]
... and how common would that be. If not so common, could use a garbage model (even a "first-class" one)
16:24:41 [ehlen]
burn: seems strange to call recording a weird special case of reco
16:26:04 [ehlen]
burn: in favor of using the recording resource as described in mrcp
16:27:16 [ehlen]
robert: though endpointing may be valuable, would we support a "record" object in the API? how would this go all the way to the developer?
16:27:44 [ehlen]
burn: does not seem to be in our scope
16:28:00 [ehlen]
olli: there are other proposals that would handle recording
16:28:54 [ehlen]
charles: channel adaptation
16:30:39 [ehlen]
burn: channel normalization is not a valid reason for recording support
16:31:32 [ehlen]
charles: should probably also include built-in record grammar
16:32:04 [ehlen]
(milan above)
16:35:03 [ehlen]
milan: use case: may want to to do dictation in parallel with c&c
16:35:21 [ehlen]
... e.g., provide a c&c followed immediately by dictation
16:36:18 [ehlen]
burn: but does that really belong as a built-in type in a grammar?
16:36:33 [ehlen]
... sounds like there is not real consensus today vis-a-vis supporting a recording capability
16:37:06 [ehlen]
robert: have not heard a compelling reason to support recording
16:38:08 [ehlen]
burn: consensus not to do it now
16:38:23 [ehlen]
milan: would like a standard way to do it, should the need arise
16:38:55 [ehlen]
burn: we could state that we reserve this for the future
16:39:14 [ehlen]
milan: there should be some consistent and portable way to do this across engines
16:39:59 [ehlen]
robert: could be done as a proprietary extension
16:40:37 [ehlen]
milan: at least provide a consistent hack, like builtin:record
16:40:45 [ehlen]
robert: that's what the garbage model recording would be
16:41:27 [ehlen]
milan: that's fine, as long as all engines support this type of garbage model
16:42:30 [ehlen]
burn: to summarize, can't agree on specific recording scenarios
16:42:46 [ehlen]
(robert above)
16:43:06 [ehlen]
... should agree on supporting garbage-recording scenario
16:43:26 [ehlen]
burn: as a group, agree not to define an explicit recording capability at this time.
16:43:47 [ehlen]
... can be supported using a garbage model, or capabilities defined outside this group
16:45:05 [ehlen]
topic: what are the built-ins, and what does that mean?
16:45:55 [ehlen]
milan: existing builtins: dictation, search, address, numbers
16:46:07 [ehlen]
robert: already agreed there should be a certain set of predefined grammars
16:46:14 [ehlen]
... so how do we refer to those?
16:46:53 [ehlen]
burn: 2 things make builtins interesting: (1) parameterization; (2) no language is required
16:50:05 [ehlen]
milan: markup already has certain defined types, parameters, etc, as native to HTML5. Would make sense to pay attention to that here
16:50:34 [ehlen]
burn: an unconstrained text box should naturally bind to a dictation model
16:50:54 [ehlen]
milan: should we remap the names of the builtins?
16:51:58 [ehlen]
burn: argue strongly for using html as a starting point
16:52:33 [ehlen]
robert: These should be builtins, not re-used vxml grammars
16:53:19 [smaug]
could someone paste a link to voicexml's builtin grammars ?
16:53:27 [ehlen]
charles: they've become a de facto standard; not supporting them is awkward
16:54:16 [Robert]
these are the HTML input types: http://www.w3.org/TR/html5/the-input-element.html#attr-input-type
16:54:24 [ehlen]
burn: if someone wants to support legacy builtins in a way that doesn't break existing builtins, that's not a problem
16:54:28 [Robert]
perhaps have builtins that match these
16:57:00 [ehlen]
charles: there needs to be some way to include these
16:57:23 [ehlen]
(milan above)
16:57:51 [ehlen]
... is there something about this that can't be represented by a query string?
16:59:06 [ehlen]
michael: do you want to reference, for example, an html number type, or some arbitrary number?
17:00:37 [ehlen]
milan: easier to use old builtins & augment them
17:00:52 [ehlen]
charles: need to look at greater good of using html vs vxml
17:01:21 [mbodell]
Widely implemented? See http://en.wikipedia.org/wiki/URI_scheme
17:04:51 [ehlen]
burn: michael, how would you reference grammars that are assoc. with html input types?
17:06:00 [ehlen]
michael: an html ruleref, with various attributes; or don't specify URI and ref them by markeup AP...
17:06:16 [ehlen]
... most important is associating grammars with individual input elements
17:08:37 [ehlen]
... not a strong use case to have URIs for these things, or ability for user to write their own that reference these
17:10:05 [ehlen]
burn: when people want to hack something up quickly, common input types should lend themselves to being included as part of a larger utterance
17:10:20 [ehlen]
michael: may be other ways to specify input for that type of scenarios
17:11:33 [ehlen]
burn: maybe reference not the grammar but the input type itself
17:12:58 [ehlen]
charles: similar input types not always require the same grammar
17:14:02 [ehlen]
burn: but the app author may want a way to link these different types of builtin grammars together
17:14:50 [ehlen]
milan: perhaps just do the proposal
17:15:43 [ehlen]
burn: who on the call is interested in builtin models?
17:16:16 [ehlen]
charles: interested in it; this group seems focused on web search and dictation, as opposed to broader html cases
17:16:25 [mbodell]
<input type="search" name="q" speech required onspeechchange="startSearch">
17:17:28 [ehlen]
michael: there will probably be a standard set of grammar libraries, though perhaps the market will provide those
17:17:57 [ehlen]
johnston: can't see us requiring something like a "zip code" lib, for internationalization reasons
17:18:10 [ehlen]
burn: HTML has already handled a lot of these issues
17:18:21 [ehlen]
(milan above)
17:18:43 [ehlen]
(michael, above, actually)
17:18:51 [mbodell]
s/burn/michael/
17:20:04 [ehlen]
milan: should there be an html binding?
17:20:42 [ehlen]
michael: would be better if you could speech enable certain input types with little work
17:22:09 [ehlen]
robert: if no builtins were specified, what are the consequences?
17:22:49 [ehlen]
burn: if you want broad adoptability and usage, it needs to be as easy to create simple apps as vxml
17:23:15 [ehlen]
robert: we need it to do the html binding.
17:23:49 [ehlen]
... so how much do we need the html binding part?
17:24:06 [ehlen]
milan: definitely need the capability to specify search, dictation, etc.
17:24:31 [ehlen]
robert: that's different from looking at html input types, etc. that's a complex problem
17:24:55 [ehlen]
milan: would like to have a notion of how to solve binding problem before we do dictation
17:25:58 [ehlen]
robert: does anyone have a proposal to volunteer?
17:26:11 [ehlen]
milan: perhaps can do it after I get the dictation stuff out
17:26:56 [ehlen]
micheal: there is a topic in the API about markup bindings.
17:27:10 [ehlen]
burn: true that it's a binding issue
17:28:13 [ehlen]
burn: without a proposal, it doesn't happen.
17:28:23 [ehlen]
... so it will be up to someone to write a proposal
17:28:36 [ehlen]
milan: perhaps sending a message to google on this
17:28:42 [ehlen]
robert: or to satish
17:29:11 [ehlen]
burn: action item for milan to talk with satish and ask for help on structuring a proposal
17:29:34 [ehlen]
burn: reminder: no call next week
17:29:49 [ehlen]
robert: but there will be a protocol meeting
17:30:52 [Zakim]
-Michael_Bodell
17:30:53 [Zakim]
-Milan_Young
17:30:56 [Zakim]
-Robert_Brown
17:31:07 [Zakim]
-Olli_Pettay
17:31:08 [Zakim]
-Patrick_Ehlen
17:31:09 [Zakim]
-Michael_Johnston
17:31:23 [Zakim]
-Dan_Burnett
17:31:24 [burn]
zakim, bye
17:31:24 [Zakim]
Zakim has left #htmlspeech
17:31:25 [Zakim]
leaving. As of this point the attendees were +1.818.237.aaaa, Milan_Young, +1.973.452.aabb, Michael_Johnston, Dan_Burnett, Michael_Bodell, Olli_Pettay, +1.425.580.aacc, Dan_Druta,
17:31:27 [Zakim]
... +1.425.830.aadd, Charles_Hemphill, Patrick_Ehlen, Robert_Brown
17:31:33 [burn]
rrsagent, make log public
17:31:38 [burn]
rrsagent, draft minutes
17:31:38 [RRSAgent]
I have made the request to generate http://www.w3.org/2011/06/16-htmlspeech-minutes.html burn
17:32:34 [burn]
Regrets: Raj_Tumuluri, Bjorn_Bringert
17:35:28 [burn]
s/+1.818.237.aaaa, //
17:35:41 [burn]
s/+1.973.452.aabb, //
17:35:57 [burn]
s/+1.425.580.aacc, //
17:36:12 [burn]
s/+1.425.830.aadd, //
17:36:16 [burn]
rrsagent, draft minutes
17:36:16 [RRSAgent]
I have made the request to generate http://www.w3.org/2011/06/16-htmlspeech-minutes.html burn
17:37:18 [burn]
rrsagent, bye
17:37:18 [RRSAgent]
I see no action items