W3C

- DRAFT -

Data on the Web Best Practices Working Group Teleconference

13 Apr 2015

Agenda

See also: IRC log

Attendees

Present
[IPcaller], Makx_Dekkers, deirdrelee, Riccardo, phila, giancarlo_guizzardi_, ericstephan, flavio, newton, annette_g, yaso, BernadetteLoscio, giancarlo_guizzardi, GiselePappa, Sumit
Regrets
Steve
Chair
deirdrelee
Scribe
hadleybeeman, annette_g, phila, ericstephan, BernadetteLoscio, yaso, Newton

Contents


<trackbot> Date: 13 April 2015

<deirdrelee> Hi Antoine

<deirdrelee> sitting in Phil's sunny conservatory :) Waiting for Austin team

<RiccardoAlbertoni> i will survive don't worry :)

<Makx_Dekkers> zakim ipcaler is me

<hadleybeeman> deirdrelee phila RiccardoAlbertoni — we seem to have lost you?

<deirdrelee_> Phil's rewiring his house...

<austincomputer> Hi all, just getting on line with our austin AV system

<deirdrelee_> Hi AustinComputer!

<austincomputer> (this is Eric S)

<austincomputer> :-)

<deirdrelee_> Ok everyone, we have a live zakim line and we will have skype online too so that you can see us :)

<deirdrelee_> my skype id is deirdrelee

<ericstephan> Good morningish everyone

<deirdrelee> it's meant to

<deirdrelee> good morning/afternoon ericstephan

<deirdrelee> Can you hear ipswich?

<laufer> good morning/afternoon all

<ericstephan> Good day would be more appropriate I guess :-)

<ericstephan> ericphb@gmail.com

<BernadetteLoscio> https://plus.google.com/hangouts/_/g4vna23i4mz4l6doovo34sysm4a

<phila> We've lost Austin on Zakim

<phila> antoine, we're only trying to use the hangout for video

<ericstephan> Hi phila we switched over to google chat

<phila> Yeah, Eric, but we have Antoine and Makx on zakim...

<ericstephan> ok...

<ericstephan> just a sec...

<ericstephan> Reconnecting...

<ericstephan> My only concern is Antoine and Makx being able to hear everyone else, Antoine and Makx let us know if you can't hear Austin

<Makx_Dekkers> i cant hear austin

<Makx_Dekkers> lots of echo

<antoine> I can hear Ipswich and Austin only when one person speaks

<Makx_Dekkers> i can hear you

<ericstephan> connecting the old fashioned way by iphone

<ericstephan> yes it is clear

<deirdrelee> Start agenda with BP, after one hour, go back to quality vocab discussion, when makx and antoine are present, then return to bp

<laufer> ...

<deirdrelee> Add your name to scribe for one hour: https://www.w3.org/2013/dwbp/wiki/April_2015_F2F

<yaso> we just made a new decoration of the room

<hadleybeeman> yaso, you're redecorating? :)

<yaso> hadleybeeman: no, It was Summit idea :)

<hadleybeeman> @yaso: that's lovely :)

<deirdrelee> Please add name to scribe before we start!!

<deirdrelee> no austin team volunteers yet....

<annette_g> I can scribe

<annette_g> Should I sign up for 9:00?

<phila> Can Austin hear Ipswich?

<BernadetteLoscio> hi phil!

<phila> Can you hear us BernadetteLoscio?

<hadleybeeman> ericstephan BernadetteLoscio can you hear us on the phone?

<BernadetteLoscio> yes

<laufer> not yet, phil

<BernadetteLoscio> i can hera

<BernadetteLoscio> hear

<annette_g> Eric is hooking up the hangout

<phila> Thanks annette_g

<laufer> we have hi-lo tech in this f2f

<hadleybeeman> While we're waiting for Austin to get on the phone... please sign up for an hour to scribe! https://www.w3.org/2013/dwbp/wiki/April_2015_F2F#Agenda

<BernadetteLoscio> Gisele

<hadleybeeman> scribe: hadleybeeman

<annette_g> lewis is out of the room for an hour or so

<BernadetteLoscio> yes :):):)

<yaso> One hour!

<Makx_Dekkers> going to be disconnecting voice for a while

Best Practices (for one hour)

deirdrelee: What we need from this meeting:

<phila> Meeting: DWBP Face to Face 3, Day 1

deirdrelee: look at our three deliverables. We need a clear plan for the next draft of publication. We hope to have a new version of each of those by one month from now, beginning of May.
... We've been building toward that. We want to get rid of all issues.
... Goal 1: close all issues. 2: set the actions for and 3. clear plan of milestones and dates to publish the next working drafts by early May.
... I put the issues together in groups https://docs.google.com/spreadsheets/d/1aqkh8jJCyHn0GaPk3Xh277b5L1bQjfwDHukmWUTWryg/edit
... 15 minutes maximum per issue, as we agreed.
... At the end of tomorrow, we can go back to ones that may be still open. But for each issue, we read it, discuss any points. Please be as short as you can, so everyone can discuss it. At the end, a concrete decisions of how to resolve this issue and close it.
... Handing over to Bernadette, Caroline and Newton to discuss these issues. I'll remind you when we have 10 minutes left and 15 minutes left, for each issue.

<phila> I think we're looking at this version http://bernafarias.github.io/dwbp/bp.html in Berna's GH repo

Issue-76?

<trackbot> Issue-76 -- What advice do we give about publishing metadata so that we identify the intended outcome without making assumptions that maybe false? -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/76

<phila> And the open issues are listed at https://www.w3.org/2013/dwbp/track/issues/open

issue-76?

<trackbot> issue-76 -- What advice do we give about publishing metadata so that we identify the intended outcome without making assumptions that maybe false? -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/76

BernadetteLoscio: ... This issue was opened in the last f2f. For me, it's not clear. Hadley opened it.

<deirdrelee> Bernadette, there is echo

In the issue, there is a link to the original discussion: http://www.w3.org/2014/10/31-dwbp-irc#T22-37-06

Hadleybeeman: I think we are already addressing this. It's about making sure we don't make assumptions about the type of data that restricts what might be published.

BernadetteLoscio: We are doing this. We are defining metadata without thinking about the data it's referring to.
... can we close this issue?

hadleybeeman: Yes

close issue-76

<trackbot> Closed issue-76.

deirdrelee: We will be using the queuing system, as usual. Put yourself on the queue if you want to talk.

<phila> RESOLUTION: Close issue 76

deirdrelee: Move on to issue 79

<phila> RESOLVED: Close issue 76

Issue-79?

<trackbot> Issue-79 -- Discovery vs structural metadata -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/79

<phila> we don't have any BPs about structural metadata

BernadetteLoscio: We are going to use DCAT vocabulary. But in structured metadata, we don't have a best practice right now for it. I think discovery of metadata is different from structured metadata.
... I'm not sure, we could have a best practice that structured metadata should be available.
... In some case, I'm not sure if it's helpful to provide structured metadata.

<Zakim> phila, you wanted to talk about CSVW and VoID

deirdrelee: reminds eveyrone to use the queue on zakim

<phila> CSVW home page

phila: Re structured metadata: We have two examples to point to.
... The CSV working group is publishing a stable version of this document this week.
... They're pretty much done.

<phila> VoID

phila: Other one is VoID, which has been around a while
... An established way of describing an RDF dataset.
... For XML, you'd publish the schema.
... Any list we give would always be incomplete. I suspect annette_g and ericstephan would give us examples from NetCDF, etc.
... I think we do need to talk about structured metadata, and we can give examples on how.

<SumitPurohit> keyboard noise

<laufer> we are hearing hadley typing

deirdrelee: WE have a best practice on discovery, structured metadata. so you're saying yes.

<BernadetteLoscio> :(

laufer: I think we have discovery as one thing. For discovery, people may want to choose a lot of metadata.
... Descriptions, version, structure. We don't know what they'll choose.
... Structured metadata is one kind of metadata. We have to talk about it giving it. I don't think VoID is structured metadata; it's equivalent to DCAT.

<Ig_Bittencourt> +1 to laufer comment about VoID.

laufer: But a data dictionary would be structure.
... We should talk about someone giving XML schema, or a data dictionary from a relational database. But I think we need a specific best practice saying that someone
... If you don't understand the structure, you don't understand the dataset.

annette_g: Bernadette mentioned that files like audio files may not have a structure and it could be difficult to require that for people.
... My thought is that it has a linear structure in it. You just also need to document that structure

deirdrelee: we're talking about the structure of the metadata, not the data itself.
... also, as discussed on Friday, file formats are out of scope.
... Okay to give as examples, but not in the scope of the group.
... annette_g, is that okay here?

<BernadetteLoscio> i think we are talking about structural metadata not about structured metadata

annette_g: The idea is that there is metadata for the structure of the data.

<laufer> metadata about the structure of the dataset

<BernadetteLoscio> yes!

deirdrelee: I think we all agree that there should be an agreement that says the structure of the actual data should be one of the metadata values.

BernadetteLoscio: I think we agree we should provide info re the structure of the data. It will depend on the data format. We should have a best practice that says structural metadata should be provided whenever possible.
... metadata to describe the structure of the data.

deirdrelee: Close the issue then and create an action for the BP editors to add a BP for structural metadata

close action-79

<trackbot> Closed action-79.

close issue-79

<trackbot> Closed issue-79.

<deirdrelee> close issue 79

<scribe> ACTION: bernadette to to add a BP for structural metadata [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action01]

<trackbot> Created ACTION-147 - to add a bp for structural metadata [on Bernadette Farias Loscio - due 2015-04-20].

rrsagent pointer?

Ig_Bittencourt: For every BP we are proposing, should we define a use case?

BernadetteLoscio: I think we should start with the use case/requirement that we have. If we don't find one, then we need a new one.

issue-84?

<trackbot> issue-84 -- Should DCAT include a pointer to a mirror/back up? -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/84

<phila> RESOLVED: Close issue 79. ACtion on BP editors to add BO for structural metadata

BernadetteLoscio: Steve raised this. Anyone else know about it?

phila: I don't know why steve raised it, but I do have an answer. I would tell Steve that a mirror is another distribution, which is already accommodated in DCAT

deirdrelee: Is this about preservation and archiving? Backups?

<deirdrelee> ack BernadetteLoscio r

deirdrelee: So let's close this issue.

close issue-84

<trackbot> Closed issue-84.

<phila> RESOLVED: CLOSE ISSUE-84

issue-136?

<trackbot> issue-136 -- Proposal for Metadata BP -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/136

BernadetteLoscio: I think we should discuss the proposal we have now.
... We were saying that metadata should be available, machine-readable and to document what metadata we are using, self-documented metadata, and reuse of terms.
... We now have 2 best practices for metadata. 1) general, providing metadata, which includes the three best practices at the top, and a 2) which is about the others.

<BernadetteLoscio> http://bernafarias.github.io/dwbp/bp.html

BernadetteLoscio: This is our new proposal on metadata best practices.
... I'd also like to discuss another ideal. Now we have two BPs on metadata.

<newton> http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata

BernadetteLoscio: The idea of the first best practice was that metadata should be provided in human-readable and machine readable forms.

<antoine> this sounds complex: isn't all this about merging old BP1 and BP2 into a new BP2, and keeping old BP3 as such (except its number)?

BernadetteLoscio: But I was thinking about the DCAT vocab, and how we agreed to use the def of dataset provided by DCAT
... Then, the metadata should be a description of the dataset based on the DCAT definition.
... We have properties for discovery, versioning, etc., which are different.
... In my opinion, "provide metadata" should be "provide a description of the dataset according to the DCAT vocabulary", if we agree we will use the defintions in DCAT

<phila> +1 to saying 'provide a description of the dataset using the terms from DCAT'

BernadetteLoscio: For the human-readable property, I propose to extend DCAT with a new property.
... There we have dataset descriptions with all the topics we want to cover in our different types of metadata.

<Ig_Bittencourt> Perhaps, to this human-readable, we could use owl:seeAlso

BernadetteLoscio: In each one of the specific best practices that we have, we want to explain some properties of the definition to give info about discovery.
... In the implementation, we can give examples,

<newton> +1 to BernadetteLoscio

<BernadetteLoscio> +q

ericstephan: I'm finding, in the Data Usage vocab work, that it's easy to look at standards terms (like W3C) to use. But when we're looking at terms used by other communities — it's hard to determine what is standard.
... It's almost like we need metrics for one person in the world, a speciality.
... It's a problem in how people define standards. There are other vocaburalies out there too. Can we use the best practices to help people work out if their own vocabularies are standard or just made up by one person?

laufer: I think we have to talk about DCAT, but as ericstephan said, we don't know. We know we have a minimum of types of metadata.
... But each community will have its own, which is important to them. We have the example of HCLS community, which Phila gave.
... We'll have lots of examples of how to describe datasets, in different communities.
... We can say that we have a minimum, and that we are supporting DCAT as the minimum.

deirdrelee: Is this similar to what we did with scope, that we don't pick a particular format to discuss. But, phila — do we have to refer to W3C standards?

<Zakim> phila, you wanted to suggest that dcat:landingPage *might* do what Berna is talking about

phila: We're now talking about 'what is a standard'. What a community decides they're going to do. A group of people hacking together once a week, a government choosing something with international recognition. RSS is not a standard, robots.txt is not a standard.

GeoJSON isn't a standard, but often used.

<phila> http://www.w3.org/TR/ld-bp/#vocabulary-creation

phila: The danger is that you are so general that yu don't give actionable advice. Where we have a clear spec we can point to, we can give advice on what you need to look for
... And you can say: "for this BP, use DCAT unless you have a reason not to." "Comply or explain" is more useful and actionable and testable.

<ericstephan> +1 Phil

<annette_g> +1 Phil

phila: It's okay to give people an out, a way to decide not to follow the advice. But it helps to tell people what we think they should do.

<SumitPurohit> +1

<SumitPurohit> +q

BernadetteLoscio: I agree with PhilA. We need something more concrete to work with.

<gguizzardi> Folks, just to be clear, we should differentiate between "de facto" standards and formal standards. Just for the sake of clarity. Formally speaking, standards are the things that are produced by standardization bodies

BernadetteLoscio: Maybe when we say "provide metadata", it should be "provide a description for the dataset"

<gguizzardi> formally speaking, "de facto" standards are not standards

<annette_g> okay

BernadetteLoscio: We could say, "You are free to use other vocabularies for dataset description. But we propose W3C's DCAT.

<gguizzardi> and, of course, there are things that are not even "de facto" standards, they are just "frequently used"

<scribe> scribe: annette_g

np

BernadetteLoscio: what is data? datasets? we need to see which BP applies to data itself or content of dataset. Maybe "data" is too general
... we need a concrete definition

<Ig_Bittencourt> http://bernafarias.github.io/dwbp/bp.html#ShareOpenVocabularies

Ig_Bittencourt: re standards and vocabs, we already have BP 15

assessment is hard

maybe after we find vocabularies, we can do the assessment

we could extend DCAT

<ericstephan> Nice link Ig_Bittencourt, okfn has a vocabulary for assessing vocab stability I'll try to find it

<laufer> oh

<Ig_Bittencourt> Nice ericstephan, perhaps it could be put also in the indication of BP15.

SumitPurohit: agrees with Phil and BernadetteLoscio in saying ppl should use DCAT, but be can also show alternative to use something else. A concrete example would be good.

<laufer> +1 to sumit

<phila> +1 to giving examples

We should give that example in our BP

BernadetteLoscio: agrees with Sumit

<Ig_Bittencourt> +1 to SumitPurohit and BernadetteLoscio

<Zakim> deirdrelee, you wanted to talk about scope

deirdrelee: summarizes that we want to include DCAT by name and also other options

<riccardoAlbertoni> +1 to name dcat and be open

<phila> PROPOSED: That we include concrete standards from W3C but that we leave it open for other standards to be used as well

BernadetteLoscio: this doesn't close the issue

we still need to talk about structured metadata

<phila> I'm happy to support the editors' choices. When I read it, if I don't like it, I'll say

BernadetteLoscio: should we keep the metadata BPs as they are now, or have one type of metadata link to the other?

<BernadetteLoscio> +1

<gguizzardi> Folks, sorry for insisting on this rather picky issue, but something like this "Is it really an error if a standard vocabulary was not used? How people will know which vocabularies should be used?" is not really meaningful unless we have a precise definition of what we mean by standard

<phila> +1

<deirdrelee> +1

<riccardoAlbertoni> +1

+1

<antoine> +1

<ericstephan> +1

<laufer> +1

not meeting a best practice is not an error

muffled audio

<laufer> +1

who is speaking now?

<antoine> why don't we just agree that http://en.wikipedia.org/wiki/Technical_standard is ok? And yes it's vague but the world is such.

<ericstephan> +1

gguizzardi: need a definition of what it means to be standard

<phila> RESOLVED: That we include concrete standards from W3C but that we leave it open for other standards to be used as well

<Ig_Bittencourt> +1 to the proposal

<Gisele> +1 to the proposal

<Ig_Bittencourt> http://bernafarias.github.io/dwbp/bp.html#ShareOpenVocabularies

<Zakim> phila, you wanted to make a proposal about defn of standards

<Ig_Bittencourt> not this

off topic

<Ig_Bittencourt> http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata

<BernadetteLoscio> we have 02 general best practices BP1 and BP2

<ericstephan> +1 to having a definition in the glossary

<BernadetteLoscio> and specific BP for each type of metadata

<phila> ACTION: Yaso to include a definition of 'a standard' in the glossary [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action02]

<trackbot> Created ACTION-148 - Include a definition of 'a standard' in the glossary [on Yaso Córdova - due 2015-04-20].

deirdrelee: asks yaso if it's okay to add to the glossary our definition of standard

yaso: yes

<yaso> The glossary is here, just commited

<yaso> https://github.com/w3c/dwbp/blob/gh-pages/glossary.html

BernadetteLoscio: we should update the glossary in an incremental way, so we don't have to do it all last minute

<phila> The glossary as HTML page

laufer: communities decide to use a standard as a de facto thing. They choose what they think is best at that moment. It's a kind of "light standard"

<antoine> for info the wikipedia def mentioned de facto standards

<BernadetteLoscio> my proposal is to keep the current organization of BP metadata section: two general best practices (provide metadata and use standard terms) and specific BP for specific types of metadata

<gguizzardi> Yes, Antoine, but we are putting together Formal Standards, "light standards", community conventions, de facto standards and putting it all together under the same term. The point is that we are recommending BPs to other people that cannot be sure if something is "light" standard enough to be consider a case of following the BP in question

<BernadetteLoscio> Provide metadata

<BernadetteLoscio> Use standard terms to define metadata

<newton> http://w3c.github.io/dwbp/bp.html#DocumentMetadata and http://w3c.github.io/dwbp/bp.html#ProvideMetadataMachine

<deirdrelee> Proposed: to keep the current organization of BP metadata section: two general best practices (provide metadata and use standard terms) and specific BP for specific types of metadata

<yaso> I made comments at the commit. https://github.com/w3c/dwbp/commit/7bc41017a8a39cc766c920ba791ec43d0def8594 - but we can make it as issues, or new commits with suggestions.

PROPOSED: that we use two general metadata best practices, one to say that people should provide metadata and one to say to use standard terms, and then to provide other BPs for specific metadata.

<antoine> I'm looking at http://www.w3.org/TR/dwbp/#metadata

antoine: what about the third one?

<Ig_Bittencourt> hi gguizzardi, but if we basically provide in the glossary what we are meant by standard, them it could be enough.

<gguizzardi> Sure. I am fine with that

<deirdrelee> +1

<antoine> q

<Ig_Bittencourt> great gguizzardi

<Ig_Bittencourt> +1

<laufer> +1

<deirdrelee> http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata

antoine: was referring to the older version of the BPs

<phila> ACTION: Lee to include the standard resolution in the scope page on the wiki [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action03]

<trackbot> 'Lee' is an ambiguous username. Please try a different identifier, such as family name or username (e.g., dlee8, klee5).

<phila> ACTION: Deirdre to include the standard resolution in the scope page on the wiki [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action04]

<trackbot> Created ACTION-149 - Include the standard resolution in the scope page on the wiki [on Deirdre Lee - due 2015-04-20].

antoine: so, the proposal is to just publish 1 and 2 under metadata

<ericstephan> should we try voting again?

BernadetteLoscio: some people didn't agree that the human and machine readable should be separate BPs
... so, if we accept this proposal, we resolve the issue

<phila> PROPOSED: that we use two general metadata best practices, one to say that people should provide metadata and one to say to use standard terms, and then to provide other BPs for specific metadata.

<ericstephan> +1

+1

<deirdrelee> +1

<laufer> +1

<antoine> +1

<yaso> 1

<BernadetteLoscio> +1

<gguizzardi> +1

<yaso> +1

<riccardoAlbertoni> +1

<Ig_Bittencourt> ++1

<phila> RESOLVED: that we use two general metadata best practices, one to say that people should provide metadata and one to say to use standard terms, and then to provide other BPs for specific metadata.

<ericstephan> woo hoo!

<SumitPurohit> +1

<phila> PROPOSED: The the current structure is fine

<deirdrelee> http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata

<laufer> +1 to woo hoo!

<deirdrelee> +1

<Gisele> +1

+1

<phila> PROPOSED: The the structure at http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata on 2015-04-13 is fine

<BernadetteLoscio> +1

<phila> +1

<ericstephan> +1

<phila> RESOLVED: The the structure at http://bernafarias.github.io/dwbp/bp.html#DocumentMetadata on 2015-04-13 is fine

<Ig_Bittencourt> +1

<deirdrelee> http://www.w3.org/2013/dwbp/track/issues/136

<phila> issue-136?

<trackbot> issue-136 -- Proposal for Metadata BP -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/136

<laufer> close close close

<phila> close issue-136

<trackbot> Closed issue-136.

<ericstephan> :-)

<phila> issue-139?

<trackbot> issue-139 -- Suggests additional content for BP1 implementation -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/139

<antoine> +1 but after that someone will have to explain me what we've resolved. the structure of http://bernafarias.github.io/dwbp/bp.html#metadata is the same as http://www.w3.org/TR/2015/WD-dwbp-20150224/#metadata

BernadetteLoscio: issue 139

is from a really old version of the doc. can we close?

<Ig_Bittencourt> Yes

<phila> close issue-139

<trackbot> Closed issue-139.

<deirdrelee> issue-147?

<trackbot> issue-147 -- We should differentiate a metadata document from documenting metadata (meta metadata) -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/147

<phila> RESOLVED: close 139 as new teplate overrides it.

<ericstephan> Ig_Bittencourt: Vocab status that I mentioned earlier: http://lov.okfn.org/dataset/lov/vocabs/vs

<laufer> clcloclose

BernadetteLoscio: I think we solved this with the new version

laufer: yes

<phila> close issue-147

<trackbot> Closed issue-147.

<phila> issue-145?

<trackbot> issue-145 -- Should we change the title of BP7? -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/145

<antoine> "provide unique identifiers"

<phila> BP on Identifiers perhaps?

<phila> So issue-147 is about http://bernafarias.github.io/dwbp/bp.html#ProvideUniqueIdentifiers

<phila> deirdrelee: So what do you propose to change it to annette_g ?

<SumitPurohit> +q

<phila> annette_g: I'm not sure it means anythinbg to say you need to give something a unique ID whewn it's on the Web - since URIs are the only way to do it

<phila> ... Not sure it's a useful BP

BernadetteLoscio: agrees with Annette, points out that datasets need to be associated with a unique identifier. we need to talk more about what the identifiers are for. resources?

<Ig_Bittencourt> ericstephan: nice. I didn't know this Vocab status. Somehow it is related to the DUV.

deirdrelee: this is not really about metadata, okay to move it?

BernadetteLoscio: yes

<Ig_Bittencourt> ericstephan: perhaps we could reuse it in DUV.

<phila> deirdrelee: Issue postponed until discussion of identifiers

SumitPurohit: yes, it

will be more relevant later

breaking up

BernadetteLoscio: we

are going to change the issue title to show the name of the bP

<ericstephan> Ig_Bittencourt: interesting about including it in DUV, we've been focusing on usage of dataset, metadata would be good.

deirdrelee: 10 minute break?

<Makx_Dekkers> yes

<antoine> https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

<phila> Not getting a response from that wiki page :-(

deirdrelee: come back at half past to talk about Q+G vocab

<antoine> deirdre I'm really not sure we can talk about quality if the wiki's not working :-(

<phila> Seems the wiki is having a hissy fit

<Makx_Dekkers> trying to coenect to https://www.w3.org/2013/dwbp/ but site seems to be dead

<phila> We have wikis!

<phila> (Someone turned them off and on again)

Quality discussion

<phila> scribe: phila

<annette_g> scribe is back

<annette_g> still a couple missing here

<annette_g> thanks, Phil!

<ericstephan> OK!

deirdrelee: We'll do this for 2 hours then break for lunch
... over to Antoine

We're looking at https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

<Makx_Dekkers> link to doc does not work

No, Makx_Dekkers Grr

<gguizzardi> I am the ipcaller

<ericstephan> @phila, having trouble with the wiki again, I can scribe next

<annette_g> wiki is stalled?

<deirdrelee> antoine: the first thing was looking at the bp group and tring to identify concrete things about the scope of quality vocab, what we call competency questions

<Makx_Dekkers> yes i see it now

<gguizzardi> I can't access it

<Ig_Bittencourt> me neither.

<annette_g> Gisele got it, the rest of us are waiting

<newton_> https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

<BernadetteLoscio> https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

<ericstephan> Could anything be sent via email attachment?

<Ig_Bittencourt> Good idea ericstephan. Gisele or BernadetteLoscio, please send the file by email.

Comments

<laufer> maybe you can generate a pdf and send by email

See https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/

<laufer> as eric said

<ericstephan> okay just didn't know if local copies existed somewhere

<deirdrelee> ...https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/

<deirdrelee> https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3003

deirdrelee: Points out that Andrea's comment is at odds with what we just agreed

BernadetteLoscio: I'm not sure ... we have a new proposal now. WE have already discussed the proposal
... so I suggest we close it with what we just agreed

<annette_g> +1 to BernadetteLoscio

https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3003?cid=3003

phila: Explains the comment system, how we can resolve in differnet names

annette_g: It feels like we have resolved no - we don't agree with Andrea

BernadetteLoscio: I think it's more resolve partial because we have another proposal.

<ericstephan> I think resolving partial keeps the door open to get his feedback

<yaso> +1 to BernadetteLoscio

<ericstephan> +1

<flavio> +1

PROPOSED: Mark as Resolved Partial and action on BernadetteLoscio to contact Andrea and let him know about the new wording

+1

<deirdrelee> https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3004

<riccardoAlbertoni> +1

RESOLUTION: Mark as Resolved Partial and action on BernadetteLoscio to contact Andrea and let him know about the new wording

BernadetteLoscio: I've already written to him - OK, no action required

<BernadetteLoscio> I'm gonna write a new message

<BernadetteLoscio> you can create an action if you want

<scribe> ACTION: BernadetteLoscio to write to Andrea Perego informing of the new text [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action05]

<trackbot> Created ACTION-150 - Write to andrea perego informing of the new text [on Bernadette Farias Loscio - due 2015-04-20].

laufer: The three comments that Andrea made are all related

BernadetteLoscio: So I can write to him once and say that his comments are taken into accoiunt in the new version
... of the BP doc

Now looking at https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3005

BernadetteLoscio: I didn't cover this issue in the new version of the BP text arund data discovery

laufer: Our scope should be the dataset... should I say that it's not part of our goals to talk about data discovery?

ericstephan: I agree that we should stick at the dataset level

<Ig_Bittencourt> +1 to ericstephan

<BernadetteLoscio> Bernadette

<Ig_Bittencourt> and annette_g

<BernadetteLoscio> +q

SumitPurohit: I'm not happy with saying you have any gibberish in your data as long as the description is good

deirdrelee: I agree. In terms of the content, we still recommend quality data, standards etc. All we're saying is that for the discoverability, that the metadata should facilitate discoverability
... it's implied that if you do that then there's potential for data being discoverable

scribe is very confused

<ericstephan> I filtered myself :-)

BernadetteLoscio: If we're going to provide structrual metata, this can hekp find more info about the data itself.

<ericstephan> Metadata should be useful and appropriate

deirdrelee: Calls for a proposal

PROPOSED: We're not going to go into detail on the discovery of data content that is addressed by metadata, including quality of data etc. We're not going into discovery of the data itself

<ericstephan> I would ammend this with we arent going to define a BP for going into detail..

phila: happy for someone to put in a better proposal

<SumitPurohit> we lost skype.....

<laufer> I think search engines are the responsibles for creating the metadata about content itself (just thinking...)

<yaso> skype absents

<ericstephan> austin we have a problem

<deirdrelee> :)

<annette_g> working on it

<BernadetteLoscio> one moment

<ericstephan> we are pooling money to get back on skype

<laufer> very near houston...

<deirdrelee> ok, let's leave these comments and get back to quality-

<laufer> our 2F2F

<BernadetteLoscio> ok ;)

<gguizzardi> Folks, I can no longer understand anything

<deirdrelee> bernadette - let's mark all as three as resolved-part, reply to Andrea

<ericstephan> Yaso saves the day

<BernadetteLoscio> yes Deirdre! I agree

<deirdrelee> austin, can you hear us?

<annette_g> no

<yaso> Not yet, deirdre

<laufer> w--e a-- he--..ari--ng y o u ... li---ke thi...s

<BernadetteLoscio> please, create an action for me to send a new message to Andrea

<yaso> Run out of credits on skype, and then run out of battery in our bluetooth micro

action-150?

<trackbot> action-150 -- Bernadette Farias Loscio to Write to andrea perego informing of the new text -- due 2015-04-20 -- OPEN

<trackbot> http://www.w3.org/2013/dwbp/track/actions/150

<deirdrelee> we can hear you, talk to us!

<deirdrelee> what's happening?

<Ig_Bittencourt> gguizzardi: the discussion was about Andrea's comments about the BP Doc.

<BernadetteLoscio> thanks Phil!

<laufer> apollo 14 it´s back home

<Ig_Bittencourt> https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3003?cid=3003

<gguizzardi> Thanks, Ig.

<Ig_Bittencourt> https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3005

<ericstephan> Yes

<gguizzardi> I couldn't hear anything

<gguizzardi> Now, it seems to be working again

<gguizzardi> thanks!

<ericstephan> drums beating

<antoine> https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

deirdrelee: Back to you Antoine - with audio and a functioning wiki

https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP

antoine: First thing on the agenda is trying to identify relevant requirements from the Best Practices
... First outcome is a table https://www.w3.org/2013/dwbp/wiki/Requirements_From_FPWD_BP#Requirements
... We're aware of different levels. Some reqs are more general than making classes and properties in the vocab
... third catageory is what we called competency questions
... for e.g. there has to be a quality score. This calls for classes and props to describe the score
... what we could do is to go through these requirements
... I could go through them all and then get an overview
... for example, the first req stems from BPs 2 and 8,
... this is something about how the vocab shouold be desigend in general. It's a general need for data quality expression as some form of metadata but not what form it should take

Makx_Dekkers: Maybe a more fundamental Q first.
... We have rewuirements and then we jump to the conclusion that we need a vocab for that
... a lot can already be expressed in DCAT et al
... intrinsic characteristics of datasets can be expressed. Why are we jumping to a new vocab - maybe we can do it already?

<laufer> I feel the same...

annette_g: It's in our charter to do this. There are lots of things that we left out of the BP doc because we thought they'd be in the quality vocab

<riccardoAlbertoni> deirdrelee: we are not defining vocabulary for its sake but we want to address specifically quality

<riccardoAlbertoni> perhaps, we don't need a new vocabulary but we want more application profile

<riccardoAlbertoni> antoine: we are identifining requirements if these requirements are already satisfied by existing vocabulary so we can avoid to have a new one, bu first let picture requirements

<riccardoAlbertoni> laufer: maybe we don't have to define a new vocabulary but give advice on how to use the existing

<scribe> scribe: phila

<deirdrelee> close speaker queue

BernadetteLoscio: We don't know exactly *what* should be described to cover quality

<gguizzardi> I fully agree with Bernadette

Ig_Bittencourt: I agree with Berna that we should follow up with a conceptual framework to define the vocab
... I like the way Antoine has started. Also important to define the concepts

<Ig_Bittencourt> http://www.slideshare.net/OpenDataSupport/open-data-quality-29248578

Ig_Bittencourt: e.g. (points to makx's work)

BernadetteLoscio: The table of reqs that we have - fne - but how to provide the quality metrics

ericstephan: I was at a smart release workshop a month ago. One of the things they said was there are a lot of vocabs but very few effots to get things done
... the DUV is a kind of glue

<Ig_Bittencourt> If you look in the link, there are some dimensions: accuracy, availability, completeness, conformance, consistency, credibility, processability, relevance and timeliness

<ericstephan> Has http://qudt.org/ been considered for standard quantities? It might be useful for metrics

ericstephan: The otehr comment - Jeremy Tandy sent me a link to a vocab for quantity that I thought was useful
... a standardised set of metrics would be good

annette_g: I wanted to agree with eric and Berna - we need to focus on the metrics that are in the caht. That's the value
... I appreciate the effort that went into the list
... but I don't want to reiterate what we already have in the BP doc
... requirements to include prov - that's BP

<SumitPurohit> +1 annette_g

<Ig_Bittencourt> +1 to antoine

annette_g: those would be hard to discuss without a vocab

<Ig_Bittencourt> ops... +1 to annette_g

deirdrelee: So, Antoine, what do you see as the next steps

<antoine> https://www.w3.org/2013/dwbp/wiki/Data_quality_schedule

antoine: The schedule includes a lot of references to things like the work already mentioned
... we do want to start from the UCR as a concrete starting poinbt
... Helps to identify what we should focus on
... Right now we're focussed more on looking at ...
... after the f2f we hope we'll have enough consensus on the scope to sgtart looking at the related work

s/looking at.../ looking at the scope/

Makx_Dekkers: Coming back to an earlier poinbt - we're still talking on the meta level. How to express quality. We're not talking about what quality is
... something we discussed at Share-PSI in Romania - what do people think reflects quality
... I wrote a message to the group - we want a metric like processability
... someone said well, a metric for that wouold be the number of stars in the 5 star scheme
... so far we used 'quality' the same way we use otehr words
... they want info about processability, consistency etc.
... then you can see whether there is an objective metric, or a subjective metric, like freshness
... I think we need to get to what are the components of quality
... you can say how fresh data is but you don't need a qulaity vocab for that
... I'm not sure we're going into the right aspects

deirdrelee: What Antoine, riccardo and I have been looking at is how to describe quality in an abstract layer
... then the next layer is the dimensions - and that's what I think makx is talkinbg about
... and then at the lowest level - the metrics, how do you measure it
... If that's all in scope for us is to be decided

<Makx_Dekkers> i think daq describes the abstract layer

<BernadetteLoscio> the sound is really bad!

<Makx_Dekkers> cant hear either

riccardoAlbertoni: I would like to add some words... I am a big supporter of metrics, but I'm not sure they're the only way to express quality

<laufer> daq is kind of "foundation ontology" for quality...

<yaso> +1 to riccardoAlbertoni

<ericstephan> +1 laufer

riccardoAlbertoni: My question is - is there any other way to represent qwuality that we should consider

<Makx_Dekkers> 'metrics' can also be subjective statements not necessarily numbers

riccardoAlbertoni: for example - people publishing open data will assess the quality in some metrics framework, or a short decsription
... what is really used by people?
... I know about metrics and we'll have a framework,

<ericstephan> Could you give an example Makx_Dekkers ?

riccardoAlbertoni: but I guess that at the moment, as metrics aren't so commonly deployed, people figure out other ways to speak out quality

<Gisele> about terminology - a metric is a number, a "measure" can be a different thing

annette_g: hard to hear what you were saying
... looking at what we have to start with, porocessability is embodied in others
... accuracy, completeness etc.
... these are all good, but processability will be determined by the otehr dimensions

<annette_g> the list: accuracy,

<annette_g> availability,

<annette_g> completeness,

<annette_g> conformance,

<annette_g> consistency,

<annette_g> credibility,

<annette_g> processability,

<annette_g> relevance

<annette_g> timeliness

<yaso> deirdre I can try

BernadetteLoscio: We have dimensions, critera, metrics

<antoine> @annette where is this list coming from?

<antoine> @annette is it fromMakx' presentation?

deirdrelee: For me, annette's list is a list of dimensions

<annette_g> Ig pasted it in before

deirdrelee: we can also call them criteria

<Ig_Bittencourt> yes antoine

BernadetteLoscio: We can look at th literature an dsee what is at different levels

<annette_g> +1 to Bernadette

BernadetteLoscio: I think it should have a way to describe these criteria - which I think is a better word than dimensions

group discusses dimension vs criteria

deirdrelee: Please use the queue or remotes can't understand

<yaso> define as items, maybe?

deirdrelee: Want to go back to Antoine... want to focus the conversation

<ericstephan> scribe: ericstephan

deirdrelee: what do you want to spend the next hour on antoine ?

<annette_g> http://onlinehelp.tableau.com/v6.1/public/online/en-us/Id112A8A00YEX.html

antoine: We should add dimensions and metrics and have standards to represent that...
... maybe we could go to the next stage, actually defining dimensions and metrics

<SumitPurohit> +q

<laufer> daq talk about categories...

antoine: In the next hour get some confirmation on dimensions and metrics, looking at the use case owner as well to get some confirmation as well
... do we have the right amount of information as well?

<antoine> @annette we have put definition in our draft questions: https://www.w3.org/2013/dwbp/wiki/QualityQuestionnaire

<Zakim> phila, you wanted to take a stab at that list

phila: (sorry hard to hear) mentioned layered model, Makx_Dekkers mentioned the need for accuracy conformance etc.
... some things can only be described in natural language, how do you declare credibility for instance? Very difficult problems...
... How you express that difficult, how it is expressed in a use case helps provide focus

phila - I hope I captured your thoughts...

SumitPurohit: If the relevance is not there, its all subjective. Quantitative measurements are criteria we can provide guidance.

<phila> phila: I mean that that list of dimensions is helpful. How you declare each of them will vary and may or may not need a new vocab. ANd we'll prob have to be flexible in how it's expressed anyway

<phila> phila: And the fact the our charter says we'll create a vocab is not a reason to do it. Use cases are

deirdrelee: we have stronger feelings around the abstract layer than metrics etc. It should be closed, always will be exceptions.

<riccardoAlbertoni> +1 to deirdrelee dimensions can't be a close set we define

deirdrelee: reliability is subjective, it needs specific critiera.

Makx_Dekkers: phila said some things can be computed (repeatibility) the level of quality based on opinions, relability is perspective based.
... raw data that needs to be verified, for some communities the data is valuable (mashups) might not be for other communities.

<laufer> we are mixing the dimensoins of what we call quality with dimensions of qualities of the dataset...

Makx_Dekkers: maybe step back to the abstract level to answer "who says" its quality. Credintials perhaps? Provenance may help here

BernadetteLoscio: Describing the quality of a dataset is important, the vocabulary should provide the means to provide common terms to describe quality in their own particular way. E.g. dimension concept.
... maybe the vocabulary should be about describing the concepts not the properties

<laufer> who say what are the dimensions of quality that would be important... different consumers could have different qualities needs...

annette_g: maybe part of the solution to this, is not having a good/bad assignment, its more about definiing what raw data means...
... provide clarity on terms as opposed to judgements about the data
... my first question about datasets is: how clean is it, how many zeros are in it etc. These are in my mind important to capture.

deirdrelee: agree with annette_g , its not the most useful thing is to describe how reliable something is...where we can define items we should, defined in vocabulary or examples...

<Zakim> phila, you wanted to talk about PROV of statements

<phila> https://certificates.theodi.org/about

<laufer> what is the quality of a mp3 file...

phila: we talk about the dimensions theodi tries to get around the complexity with ennumerations but not describe what that means.

<riccardoAlbertoni> BTW, PROV of statements happens to be the requirement no.5 in the list of requirements

phila: theodi advangtage is simplicity, compared to provenance it might not always be accurate we need to weigh these different approaches

<riccardoAlbertoni> speak louder please

antoine: (had trouble hearing)
... from the completeness perspective, its difficult to find completeness. Should we attempt to define metrics or is it perhaps to difficult?

<BernadetteLoscio> +q

<phila> ericstephan: We have something to start on. I like Annete's idea of a base vocab that I can adpt to what I need

<phila> ... I like general terms that I can tailor

<phila> ... terms might not always work out but we start from a common definition

<phila> ... start at the concept level and then add properties. That's the starting place I think

<riccardoAlbertoni> a lot echo

<phila> Makx_Dekkers: Eric made a good point. Quality is in the eye of the beholder.

<phila> ... it's about giving people enough info to allow them to decide whether it's useful

<phila> ... is it complete, yes or not? I think it's about perspective. The publisher may not think it's complete

Makx_Dekkers: quality is in the perception in the way a person looks at it. antoine mentioned is it complete yes or no? In the eyes of the data owner, they will provide a subjective answer.

<phila> ... (gives example of subjective statements)

<Ig_Bittencourt> daq

Ig_Bittencourt: starting to rethink about the need for a vocabulary. If we look at each dimension, it could be very difficult to maintain. If we need to go to a new vocabulary we need to think about the ramifications of redefining metrics etc
... maybe we should define a foundational ontology for that.

<Ig_Bittencourt> http://butterbur04.iai.uni-bonn.de/ontologies/daq/daq#

BernadetteLoscio: The vocabulary - the daq could be the starting point for the vocabulary.

<riccardoAlbertoni> it is actually listed among the vocabulary on which we can start building ..

<antoine> well we're already doing this

BernadetteLoscio: we should start with something that is there, if they have a description for dimensions, why not use that as a starting place?

<gguizzardi> Giancarlo Guizzardi

<laufer> I think we have here problems with different complexity levles for the audience of our documents

gguizzardi: One issue which I think is important to address dimensions and metrics, we should provide a vocabulary to describe things in a way that someone else will understand. We are just providing a language.
... I should know if I am publishing data it would be useful to provide a language so that people can talk about it

antoine: I agree with many things, I have a lot of doubts on granularity in terms of what we should address.
... completeness I could think of the amount to represent persons, it could be about the average of the time a name is present about all persons,
... there is a whole range about possibilities and there isn't a lot of concrete clues about what we should focus on.

<phila> That's an interesting point re Use Cases

laufer: I don't know if we have two different complexities around the document we are writing here. Wil the BP document be the same as how to generate information on the quality of data.
... How are we addressing things from the consumers perspective?

<deirdrelee> PROPOSED: Provide a vocab (or vocab AP) to allow someone to define quality of data on the web PROPOSED: This vocab AP should allow someone to define a quality dimension PROPOSED: This vocab AP should allow someone to define a quality metric PROPOSED: Quality vocab should include specific dimentsions (not exhaustive) PROPOSED: Specific dimensions should be defined as separate list PROPOSED: Quality vocab should include specific metrics (not exhaustive)[CUT]

multiset-proposal?

<deirdrelee> PROPOSED: Provide a vocab (or vocab AP) to allow someone to define quality of data on the web

<Ig_Bittencourt> With this proposal, are we agreeing that we should really provide a vocab?

deirdrelee: Do you generally agree with the proposal?

<phila> -1

<Ig_Bittencourt> -1

deirdrelee: does it make sense?

<annette_g> +1

<laufer> -1

<Makx_Dekkers> -1

<Ig_Bittencourt> We should first think if we should really provide a new vocab.

phila: The reason why I am saying no, we have antoine saying its difficult to go to the use cases and come up with something concrete about how to move forward?

<Makx_Dekkers> the list is not 'correct', it's a list of things that are maybe useful

phila: If the list of dimensions are correct, then as the vocab evolves, the list could be used to create other dimensions.

<annette_g> list of caveats?

<Makx_Dekkers> +10 to phil

<laufer> \me it seems like things that maybe will be useful to didcover things...

phila: if one or more of these dimensions shows the need to create a new vocabulary then that provides an argument to move forward on the creation process.

<Ig_Bittencourt> I agree with phila.

<phila> I was pointing at the list of citeria - accessibility, processability etc

<BernadetteLoscio> +q

Makx_Dekkers: the list of dimensions and critiera is what phila was expressing. What dimensions and criteria you have and contextually how you interpret it. Frequency is an example of where you can point to an existing vocab rather than inventing somehting new

<antoine> +1 to Makx about the design principle

<phila> Draft proposal: We will consider whether the Open Data Support list of dimensions is the best list we can identify, based on the use cases that we have

deirdrelee: is there a list of criteria someone has?

<annette_g> accuracy,

<annette_g> availability,

<annette_g> completeness,

<annette_g> conformance,

<annette_g> consistency,

<annette_g> credibility,

<annette_g> processability,

<annette_g> relevance

<annette_g> timeliness

<antoine> https://www.w3.org/2013/dwbp/wiki/Data_quality_notes#Suggested_requirements:

phila: can we evalue this (link) in the context of the use cases?

Makx_Dekkers: Its sort of a meta-list of the open data support?

<Ig_Bittencourt> Some metrics here: http://www.scielo.cl/pdf/jtaer/v9n2/art06.pdf

phila: we should be able to provide some type of comparison between this and the use cases. Can someone spend some time just making an assessment at a starting point?

<Ig_Bittencourt> deirdrelee: maybe, there are some metrics related to some of these dimensions.

<Ig_Bittencourt> s\maybe\

<Makx_Dekkers> Open Data Support presentation http://www.slideshare.net/OpenDataSupport/open-data-quality-29248578

annette_g: distinction between timelyness and frequency. its important to address this in the quality vocabulary

BernadetteLoscio: Its going to be hard to extract from the use cases detailed information. Unless you are familiar with the use case, its difficult to evaluate.

<phila> Draft proposal: We will review the Open Data Support list of dimensions in the light of the use cases that we have, looking for any obvious omissions or unnecessary dimensions

BernadetteLoscio: from the perspective of the consumer (user) what are the needs?

<Makx_Dekkers> I saw one in the requirements: 21 says Quality data must be available in an up-to-date manner and the update frequency made explicit.

BernadetteLoscio: we don't have enough to really make the assessment.

giancarlo_guizzardi: It would be a good idea to use a shared open vocabulary, to describe terms from a standard vocabulary rather than use local terms.
... this is just one example

deirdrelee: any vocabs in mind?

giancarlo_guizzardi: everything we put in the recommendation in the bp practice document, we should be giving people a way to show how they can comply

<annette_g> inaudible

no sound quality

in one half hour /lunch slash dinner break. We should find a path forward, even though the use cases aren't complete, we should try to find use cases where these dimensions could be drawn out.

annette_g: is a survey relevant? asking people what are the barriers?

<BernadetteLoscio> +1 to Annette

<BernadetteLoscio> the sound is bad :(

<giancarlo_guizzardi> very hard to understand

riccardoAlbertoni: (hard to hear muffled)

<annette_g> * type it?

riccardoAlbertoni - could you add your thoughts to the notes?

<Makx_Dekkers> compliance to best practice dct:conformsTo

<riccardoAlbertoni> I was saying compliancy to BP can be a further way to represent quality, that can be used besides metrics,

<Ig_Bittencourt> Ig_Bittencourt: What I said was that we firstly should define the conceptual framework (e.g. those dimensions) we should follow/use in order to guide us in a more detailed discussion. After that, to check if it is really possible to define metrics for that conceptual framework and so on.....

<antoine> https://www.w3.org/2013/dwbp/wiki/QualityQuestionnaire

<riccardoAlbertoni> so the my question was if statements about compliance to Bp should be included in the vocabulary quality

antoine: the idea of the survey, would it be different than what we started to put together? Annette could you send around ideas?

annette_g: yes, you show more of a verification. I suggest something is more open, more qualitatie approach?

antoine: trying to understand what you suggest.

annette_g: the point is to get something from an original source, rather than coming up with something on our own.

antoine: thinking about taking this back to the use case owner...

<annette_g> aren't the use case owners people in this group?

<deirdrelee> sorry guys, we dropped...

<giancarlo_guizzardi_> Agree with Antoine

We didn't do it this time (just being contextual on quality)

<annette_g> W3C has expressed interest in getting better relationships with developers. We should get info from developers for this in particular, not just W3C people.

<antoine> @annette ok I get it

<giancarlo_guizzardi_> If I got his point correctly, that is what I was saying. If there is something which we judge to be a BP, we should give people a way to state how they comply to that. Just to clarify my previous example: IF we say, people should use a Standard Vocabularity. We should have a dimension for them to rank how they comply to this

<annette_g> dogfood, man

lol annette_g sick em :-)

deirdrelee: what proposals should be introduced? annette_g , antoine Makx_Dekkers ?

<annette_g> PROPOSED: that we do a survey to find out what the barriers to data reuse are

<Ig_Bittencourt> giancarlo_guizzardi_: I agree with what you are saying, but I believe that most of the BP is related to Data Usage Vocabulary in spite of Data Quality Vocab. Another point is that if we do not follow a conceptual framework, it will quite hard to go through this vocab and provide an interesting one.

<Makx_Dekkers> PROPOSED: ask use case authors to say which aspects of quality they want to see expressed and how

annette_g: ask anyone who considers themselves a user of data on the web? from a data quality point of view?

<antoine> I think Annette's suggestion is excellent. But I'm afraid of the resources it would consume to make it a success

<deirdrelee> PROPOSED: that we do a survey to find out what the barriers to data reuse are

<giancarlo_guizzardi_> I am fine with using a conceptual framework, of course. However, we might need to adapt existing ones to deal with Datasets on the web. This seems to me a systematic way to find obvious omissions. My example of "how authoritative/standard" a vocabulary is came to mind as an omission without effort

<deirdrelee> -1

<phila> -1 - it's what the W3C process does

-1

<antoine> +0 if someone volunteers I think it's ok

<Makx_Dekkers> -1

deirdrelee: in this stage, this might take too many resources and would delay the work very much..

<phila> Draft proposal: We will review the Open Data Support list of dimensions in the light of the use cases that we have and Makx's Share-PSI session, looking for any obvious omissions or unnecessary dimensions. Publish a draft ASAP and seek review, asking specifically whether we have the right dimensions.

<phila> (so Annette says I'm not against what she's saying, just the method)

<deirdrelee> PRPOSED: ask use case authors to sy whihc aspects of quality they want to see expressed and how

<deirdrelee> 0

<Ig_Bittencourt> agree with phila's way of think ing.

PROPOSED: ask use case authors to say which aspects of quality they want to see expressed and how

<deirdrelee> Draft PRPOSED: ask use case authors to sy which aspects of quality, from list of 12 criteria above they want to see expressed and how

<annette_g> does writing a use case qualify someone in particular to speak for data consumers?

<deirdrelee> +1

<annette_g> -1

<phila> +0

0

<laufer> -101+

DRAFT PROPOSED: ask use case authors what type of feedback they want from data consumers?

<antoine> I would expect use case owners to have good knowledge of the applications, no?

<laufer> one question: when we ask what criteria of quality one consumer whant to see, he will answer completeness or the number of interviewed people?

<annette_g> like asking Microsoft to provide feedback on usability?

<Zakim> phila, you wanted to talk about evidence we already have available

annette_g: trying to find ways to do this, I'm am just worried that going back to the group for the same thing isn't going to do that.

<phila> PROPOSED: We will review the Open Data Support list of dimensions in the light of the use cases that we have and Makx's Share-PSI session, looking for any obvious omissions or unnecessary dimensions. Publish a draft ASAP and seek review, asking specifically whether we have the right dimensions and perhaps highlighting ones that didn't make the cut.

<phila> Makx's mail

phila: we have a starting point as noted in my proposal
... Makx_Dekkers proposed the list and got some feedback, so we have a sense of a review.

<Ig_Bittencourt> +1 to phila

phila: if we take that list and put it in a doc that can be reviewed as a draft recommendation, we can ask, did we get this right?
... we aren't starting from nothing, Makx_Dekkers list is a good starting point, the best way is to get something out in a draft for public review

<phila> PROPOSED: We will review the Open Data Support list of dimensions in the light of the use cases that we have and Makx's Share-PSI session, looking for any obvious omissions or unnecessary dimensions. Publish a draft ASAP and seek review, asking specifically whether we have the right dimensions and perhaps highlighting ones that didn't make the cut.

<riccardoAlbertoni> +1 ( It seems a very pragmatical way to go )

<giancarlo_guizzardi_> +1

+1 for getting public feedback asap

<Ig_Bittencourt> +1

<BernadetteLoscio> +1

<Gisele> +1

<SumitPurohit> +1

<antoine> +1

<annette_g> +1 ~

<Makx_Dekkers> +1

<flavio> _q

does ~ mean ish?

<flavio> +1

<deirdrelee> +1

<annette_g> yes, ish

<Makx_Dekkers> Also see rough notes of Share-PSI session https://www.w3.org/2013/share-psi/wiki/Timisoara/Scribe#Tuesday_17th_March_.2811:30_-_12:40_Parallel_Sessions_B.29

<phila> RESOLVED: We will review the Open Data Support list of dimensions in the light of the use cases that we have and Makx's Share-PSI session, looking for any obvious omissions or unnecessary dimensions. Publish a draft ASAP and seek review, asking specifically whether we have the right dimensions and perhaps highlighting ones that didn't make the cut.

<laufer> +1

high noon in the heart of Texas

<laufer> aftre the approval

deirdrelee: antoine what say you?

<Ig_Bittencourt> s\aftre\after

antoine: it seems like a good approach

<antoine> may I ask Makx to be co-actor on this?

<scribe> ACTION: antoine to follow up on resolution that Phil proposed [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action06]

<trackbot> Created ACTION-151 - Follow up on resolution that phil proposed [on Antoine Isaac - due 2015-04-20].

<phila> ACTION: antoine to follow up on the resolution re starting with the ODS list, Makx's list etc, working towards FPWD ASAP [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action07]

<trackbot> Created ACTION-152 - Follow up on the resolution re starting with the ods list, makx's list etc, working towards fpwd asap [on Antoine Isaac - due 2015-04-20].

did I screw that up phila?

<phila> close action-151

<trackbot> Closed action-151.

<Makx_Dekkers> antoine, afraid that I can't commit myself

thank you

<Makx_Dekkers> already overloaded

<giancarlo_guizzardi_> see u in 40'-ish

<Ig_Bittencourt> Thanks.

<giancarlo_guizzardi_> thanks

<antoine> Bye everyone, thanks for the great discussion, and enjoy the rest of the meeting!

<antoine> I wish I could be with you F2F

<deirdrelee> Thanks Antoine!

<deirdrelee> Talk to you later

<deirdrelee> google hangout: https://plus.google.com/hangouts/_/gu2q3xlnxb4p6sjowowzowteema

<Makx_Dekkers> no that's not me

<Makx_Dekkers> zakim p1 is me

<giancarlo_guizzardi_> The sound quality here is very bad

<giancarlo_guizzardi_> can't understand anything right now

<Makx_Dekkers> i cant understand a word that is said

<Makx_Dekkers> has the meeting started yet?

<Makx_Dekkers> yes

<deirdrelee> google hangout: https://plus.google.com/hangouts/_/gu2q3xlnxb4p6sjowowzowteema

<deirdrelee> Welcome back Austin!

<deirdrelee> yaso, where is everyone??

<deirdrelee> We're nearly ready for bed ;)

<annette_g> us, too

<phila> :-) annette_g

<deirdrelee> google hangout: https://plus.google.com/hangouts/_/gu2q3xlnxb4p6sjowowzowteema

the quality of my credit card usage was called into question when I purchased our external speaker at 1am in the Austin Airport from an electronics vending machine.

<phila> deirdrelee: Re-Starts the meeting

Quality issues

<phila> scribe: BernadetteLoscio

<phila> issue-116?

<trackbot> issue-116 -- Best Practices for Data Quality - Insertion of specific strategies apart from DATA QUALITY Vocabulary -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/116

<phila> issue-117?

<trackbot> issue-117 -- Should Data quality vocabulary be mentioned as specific strategy in BP Document? -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/117

<giancarlo_guizzardi_> really bad audio

<phila> riccardoAlbertoni: At the time we drafted the doc, my concern was that we didn't get much input

<deirdrelee> BP Doc: http://bernafarias.github.io/dwbp/bp.html#ProvideDataQuality

i'm sorry... i cant understand

<giancarlo_guizzardi_> me neither

<phila> riccardoAlbertoni: We shouod mention for sure that we are going to develop a vocab to hekp documetn quality and that should be clearly stated

<annette_g> me neither

<phila> riccardoAlbertoni: My concern/question was, is there any obecjtion to the proposal that...

<yaso> in Austin it is hard to hear

<ericstephan> could the proposal be written?

<phila> ... that we explicitly mention in the BP doc that there will be advice/vocab on quality description

<Ig_Bittencourt> s\hekp\help

<Ig_Bittencourt> s\documetn\document

<phila> PROPOSED: That the data quality work is explicitly mentioned in the data Quality section of the BP document

<ericstephan> +1

<deirdrelee> +1

<phila> +1

<GiselePappa> +1

+1

<giancarlo_guizzardi_> +1

<annette_g> +1

<laufer> +1

<SumitPurohit> +1

<Ig_Bittencourt> 0 I lost the discussion

<phila> RESOLVED: That the data quality work is explicitly mentioned in the data Quality section of the BP document

<phila> close 116

<phila> close issue-116

<trackbot> Closed issue-116.

<phila> close issue-117

<trackbot> Closed issue-117.

<phila> BernadetteLoscio: Someone said that we shouldn't mention the quality work in the BP doc...

<phila> deirdrelee: Can you recall who made the comment?

<phila> BernadetteLoscio: Might have been Carlos I?

Annette: We should have a reference to the vocabulary in the document

<phila> annette_g: What I voted yes to was having a reference to the vocabulary doc from the BP doc

<phila> ... what I would oppose would be having BPs in the vocab document

<annette_g> * yes, Phil

<yaso> Scribe: yaso

<BernadetteLoscio> thanks yaso ;)

deirdre: issue n.55

<deirdrelee> Issue-55?

<trackbot> Issue-55 -- The word "granularity" can been many things. scope, city/state/country, data aggregation -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/55

deirdre: Is it something that we want to include, or we can forget about it and go ahed

Berna: It is not clear what type of granularity we are talking about.
... In the doc we identified some types of granularity. For example, we can talk about different levels of granularity

yes, thanks, Phila

scribe: if we are going to consider this in the document, because we had a section about it, but it was removed because it was empty

laufer: this issue is about the word?

BernadetteLoscio: we need to define what is granularity for us, for the group.
... this issue was opened due the last F2F. We removed because we had nothing written.

<phila> draft proposal - that we effectively ignore the word 'granularity' and just talk about data quality

<deirdrelee> google hangout: https://plus.google.com/hangouts/_/gu2q3xlnxb4p6sjowowzowteema

<ericstephan> https://plus.google.com/hangouts/_/g4vna23i4mz4l6doovo34sysm4a

had to get out, computer was freezing. Sorry

BernadetteLoscio: should we talk about this in the BP document?
... this issue es related to the BP document, but also related to the DQ vocabulary

ericstephan: this is just a draft, if we leave it now, and then add it when it becomes clear

<annette_g> +1 to eric

BernadetteLoscio: we should keep it like this

<giancarlo_guizzardi> yes

<giancarlo_guizzardi> I called via skype

<ericstephan> I don't see it as being useless, its just not really clear about how we articulate it

<SumitPurohit> +q

<giancarlo_guizzardi> my connection is not very stable so I have to reconnect frequently

<phila> PROPOSED: That we put the word 'granularity' on ice until and unless we need it

Thanks, phila

+1

<BernadetteLoscio> +1

<annette_g> +1

<Ig_Bittencourt> +1

<riccardoAlbertoni> +1

<ericstephan> +1 ice cubes, blocks whatever granularity you want

<deirdrelee> +1

<phila> +1

<laufer> +1

<GiselePappa> +1

<phila> RESOLVED: That we put the word 'granularity' on ice until and unless we need it

<phila> close issue-55

<trackbot> Closed issue-55.

<phila> issue-64?

<trackbot> issue-64 -- Jeremy t's expression of concern over 'data must be complete' - not realistic. better to say where it isn't complete -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/64

deirdre: this is about completeness

<phila> (Jeremy T is Jeremy Tandy from the UK Met Office who joined us at TPAC)

SumitPurohit: doo we think that granularity can be related to quality in some way?

ericstephan: I think I remember when Jeremy mentioned this
... Phil had introduced the idea of how do we measure the BPs
... I thin in all the data quality points that we were talking today

<ericstephan> I like that idea deirdrelee

<phila> riccardoAlbertoni: I think the discussion about granularity should be addressed at the level of BPs

can't und :-/erstand

<phila> ... It's not related to quality, it's the way you;re going to provide your data

<phila> ... that's best practice, not quality

<phila> Everyone says they don't want it, BP or quality vocab

deirdre: maybe we should not keep it

<phila> riccardoAlbertoni: If the data quality editors don't want it and the BP editors don't want it then maybe we shouldn't keep it

<annette_g> is this about completeness or granularity?

<giancarlo_guizzardi> I have the same problem

<phila> We're talking about completeness I think...

<giancarlo_guizzardi> As consequence, I am a bit confused as well what we are discussing about now. Completeness or granularity or something more general

<deirdrelee> PROPOSED: Let data quality editors look at completeness as one of the quality dimensions

<ericstephan> +1

<BernadetteLoscio> +1

<annette_g> +1

<phila> +1

+1

<riccardoAlbertoni> +1

<GiselePappa> +1

<newton> +1

<GiselePappa> +1

<laufer> +1

<Ig_Bittencourt> +1

<SumitPurohit> +1

<giancarlo_guizzardi> +1

<phila> RESOLVED: Let data quality editors look at completeness as one of the quality dimensions

<phila> close issue-64

<trackbot> Closed issue-64.

<phila> ACTION: antoine to look at completeness as one of the quality dimensions [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action08]

<trackbot> Created ACTION-153 - Look at completeness as one of the quality dimensions [on Antoine Isaac - due 2015-04-20].

Best Practices

<deirdrelee> https://docs.google.com/spreadsheets/d/1aqkh8jJCyHn0GaPk3Xh277b5L1bQjfwDHukmWUTWryg/edit#gid=0

<phila> Grrr Google Docs :-(

because 54 is to solve tomorrow

<phila> Commente 3006 https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3006?cid=3006

Someone wants to scribe? It is difficult to follow

BernadetteLoscio: we had some discussion about this by email
... dan said that there is not a complete distinction about metadata and data
... this is something that the group have to resolve
... we are going to keep this definition, that metadata is data about data
... or we are going to discuss another definition
... we are going to keep this discussion or leave it as it is

annette_g: ... I like to suggest that we stick with our definition

<laufer> +1

<Zakim> phila, you wanted to argue for the sentence that DanBri suggests

phila: In 2003 we drafted RDF spec

<ericstephan> draft proposal: Dan is right, however we keep the informal standard metadata/data definition because it is more easily understood by the general open data community.

<phila> Draft proposal - that we include the sentence Dan suggests and then say that for this document we are using the simplified definition that metadata is data about data

ericstephan: I'd rather go with that, that go with something too complex
... the community can understand easily

http://lists.w3.org/Archives/Public/public-dwbp-wg/2015Apr/0019.html

<newton> yaso: I think it's related to the Makx point sent on this e-mail; I think people will continue having discussions like this if we don't define in a clear way what metadata is

<ericstephan> metadata is in the eye of the beholder, so to speak

giancarlo_guizzardi: I think that the sentence can mean anything we want
... if we want to be more precise,
... I think that this is intuitive,
... there is not a real distinction of what is metadata and data

<ericstephan> perhaps a more detailed explaination of data/metadata could be in the glossary?

<annette_g> +1 to glossary for this

giancarlo_guizzardi: I think if we want to make this more clear,

<Ig_Bittencourt> +1 to glossary for this too, showing the distinction.

deirdre: should we leave the description of metadata as it is
... or we add this to glossary?

<deirdrelee> PROPOSED: that we include the sentence Dan suggests and then say that for this document we are using the simplified definition that metadata is data about data

<ericstephan> -1 (see the glossary comments)

<annette_g> +1 only if it's in the glossary

<SumitPurohit> +1

<riccardoAlbertoni> +1

<GiselePappa> +1, but it is not simplified, it is the standard

<laufer> -1

-1

<newton> +1 only if it's in the glossary

<giancarlo_guizzardi> +1 to including the definition in the glossary

<phila> PROPOSED: to include the term metadata in the glossary and give a definition (TBD)

<deirdrelee> Current glossary: http://htmlpreview.github.io/?https://github.com/w3c/dwbp/blob/gh-pages/glossary.html

Ig_Bittencourt: we should change the proposal

BernadetteLoscio: just to comment that at the tracker there is something else that we should remove from the document

<laufer> the prefix meta indicates that metadata is data... we do not have to clarify this...

BernadetteLoscio: because it makes the things worse. So it is not that we are going to have a metadata doc separated

<annette_g> +1 to Bernadette

BernadetteLoscio: we should remove the sentence off the document

<Ig_Bittencourt> PROPOSED: that we include an explanation (and Dan's comments) in the glossary and then say that for this document we are using the definition that metadata is data about data

<BernadetteLoscio> "A metadata document must be published together with the data"; taken together this makes the distinction seem more clear-cut than it often seems in practice.

BernadetteLoscio: I just suggested to remove this phrase

<annette_g> +++++

<laufer> I think the comment of Dan is about to confuse the reader

<laufer> we do not have to say that a metalanguage is a language...

<laufer> a metaproperty is a property... and so on...

<annette_g> let's vote on Ig's

<phila> +1 to Ig's proposal

<Ig_Bittencourt> +1

<giancarlo_guizzardi> A metaproperty is a property but one about a property

<BernadetteLoscio> +1

<flavio> +1

<ericstephan> +1

<giancarlo_guizzardi> not about individuals in the domain

<GiselePappa> +1 to Ig's

<annette_g> +1

0

<giancarlo_guizzardi> that makes a lot of difference

<SumitPurohit> +1

<laufer> -1

<newton> 0

<annette_g> did you go away?

<phila> deirdrelee: We're going to leave this comment for now - it will come up again tomorrow I'm sure

<giancarlo_guizzardi> I am very confused right now...

Data Versioning

<phila> issue-94?

<trackbot> issue-94 -- Dataset versioning and dataset replication -- open

<trackbot> http://www.w3.org/2013/dwbp/track/issues/94

issue 94

deirdre: we are following the agenda?

<giancarlo_guizzardi> The discussion between the relative notions of level and meta-level, from a logical point of view is IMHO very well addressed here: http://nemo.inf.ufes.br/mlt

<phila> Moving on to Comment 3007 https://www.w3.org/2006/02/lc-comments-tracker/68239/WD-dwbp-20150224/3007?cid=3007

<phila> (we'll come back to issue-94)

annette_g: My understanding is that memento is for preserving versions of documents, so I would mention this at the implementation section

<SumitPurohit> +1 anne

+1

<phila> About Memento

<Ig_Bittencourt> +1 to annette_g

thanks annette_g

<Zakim> phila, you wanted to talk about referencing Memento

what annette_g is proposing is to mention memento at the implementation section

<annette_g> +1 to Phil

phila: I would include it as a possible implementation

BernadetteLoscio: We agree

<phila> phila: I'm happy to refer to it as a possible approach but I wouldn't put it more stringly than that

laufer: I think that it is a general advise for all the BPS
... for all the implementation approaches

<Ig_Bittencourt> The name of the section in BP doc is "Possible Approach to Implementation"

<Ig_Bittencourt> So it is OK to put there.

deirdre: we can create an action to include this at the possible implementation

Newton: I can do this

<ericstephan> suggested ammendment is there a simpler example that can also be shown?

<phila> PROPOSED: That we refer to Memento in the 'possible implementation' section of the relevent BP; resolve yes to the comment and action Newton to write to Herbert vS

<Ig_Bittencourt> +1

<riccardoAlbertoni> +1

<deirdrelee> +1

<phila> +1

<newton> +1

<laufer> +1

newton: Annette and me worked together at the versioning section

+1

<giancarlo_guizzardi_> +1

<ericstephan> +1 if we can have another simpler example that could also be shown

<BernadetteLoscio> +1

agree with ericstephan

"if we can have another simpler example that could also be shown"

<laufer> s/Laufer/Annette/

<Ig_Bittencourt> ericstephan: there are others indication in the BP 17 (Vocabulary versioning).

<annette_g> +1 to have a second example

ericstephan: I think it is great to have this example at the BP, but it can scare people, so we can have another example, simpler,
... to do not overwhelm anyone with a too much complex implementation approach

Ig_Bittencourt: there are 2 vocabularies that provide purpose to versioning numbers, so maybe it can solve this problem

deirdre: I said that there are 2 examples in the BP-27, and maybe it sufficient to add the memento and it is ok

<phila> Are we looking at http://bernafarias.github.io/dwbp/bp.html#VersioningVocabularies ?

laufer: memento is more that that data versioning
... it is about data preservation also

<newton> Phil, Ig talks about this doc http://w3c.github.io/dwbp/bp.html#VersioningVocabularies

<Ig_Bittencourt> Vocabulary Versioning.

<deirdrelee> http://bernafarias.github.io/dwbp/bp.html#VersioningVocabularies

<newton> are the same BP, only with different numbers...

<Ig_Bittencourt> yes phila. This one http://bernafarias.github.io/dwbp/bp.html#VersioningVocabularies

<phila> Yes, we're looking at the same one. Excpet we have 15 differevnt versions of the doc and I no longer know which one I'm suppoed to be looking at!

<phila> s/I'm/I'm/g#

<annette_g> http://bernafarias.github.io/dwbp/bp.html

<phila> OK, so we're just looking at http://bernafarias.github.io/dwbp/bp.html, not a Google doc. OK, I'm happy enough

<phila> Thanks BernadetteLoscio

<phila> So we 8are* looking at http://bernafarias.github.io/dwbp/bp.html#VersioningVocabularies

BernadetteLoscio: the feedback that we had was for both: versioning of vocabularies and for datasets

<phila> PROPOSED: That we only have one BP about versioning

annette_g: I think that we really should have only one BP about versioning, just deleting n. 16

<laufer> +1

annette_g: proposed is to remove the BP for vocabulary versioning

<phila> PROPOSED: That we only have one BP about versioning, by merging data versioning and vocabulary versioning, favouring the data versioning one

BernadetteLoscio: maybe we need more time for this one

Ig_Bittencourt: thinking about the differences of remove or merge it,
... different things

<phila> PROPOSED: That we only have one BP about versioning, by merging data versioning and vocabulary versioning, favouring the data versioning one

<ericstephan> +1

<annette_g> -1

<newton> -1

<Ig_Bittencourt> -1

<giancarlo_guizzardi> -1

<phila> Then we misunderstood here...

<annette_g> PROPOSED: that we remove the best practice about versioning vocabularies

<phila> No

<laufer> +1

<laufer> -1

<laufer> +1

<laufer> _1

<SumitPurohit> +q

<Ig_Bittencourt> -1

<giancarlo_guizzardi> -1

<deirdrelee> PROPOSED: Keep both bps on versioning and add memento to both

<phila> -1

SumitPurohit: data also includes vocabulary
... but once you start working with data, this distinction actually fades

<SumitPurohit> +q

<giancarlo_guizzardi> "data includes vocabulary" could be interpreted as a version of "there is no distinction between data and metadata"

ericstephan: in some ways what I'm suggesting is dumming down things, unless you can make cristal clear for the 2
... that is why I made my +1 for merging the 2

<phila> PROPOSED: That we remove the best practice on versioning vocabularies, transferring any relevant points to the data versioning BP. We will refer to memento as a possible implementation

<annette_g> +1

BernadetteLoscio: I agree with ericstephan, but when writing the BP, we are talking about data or a dataset?
... if we can solve this we are going to solve a lot of problems at the document

<phila> IMO - for DWBP data = dataset

BernadetteLoscio: a dataset should have a version number,

<Makx_Dekkers> a vocabulary is data, not metadata. you can have metadata about a vocabulary (e.g. ADMS)

<Ig_Bittencourt> +1 to phila

BernadetteLoscio: we need 1 BP for datasets and 1 BP for vocabulary
... BP should be applied on something

<ericstephan> I like the idea about tightening up the language on data

BernadetteLoscio: data will be updated over time - we are talking about data collection or dataset updating?

<giancarlo_guizzardi> Makx: Vocabulary is a kind of content metadata. The fact that we can have metadata about it, it does not mean that it cannot be metadata

<giancarlo_guizzardi> we can have metadata about metadata

<giancarlo_guizzardi> that is what I meant by this being a relative notion

<Makx_Dekkers> giancarlo, don't go there!

<phila> I think we're proving DanBri's point...

<ericstephan> is metadata about metadata metameta?

<giancarlo_guizzardi> I think we are proving that if we don't have precise definitions, it is very easy to get lost in all related discussions

agree with phila

<phila> CHAIR INTERRUPT

<BernadetteLoscio> :)

<Ig_Bittencourt> +1 to phila and giancarlo_guizzardi

<phila> deirdrelee: We have 2 issues. We have a BP on vocab versioning and another on data versioning

<BernadetteLoscio> +1 to giancarlo!

<phila> ... we've gone back to talling about metadata and data. I know they're related bu t we jave to move on

<SumitPurohit> -q

<laufer> semantics is exactly about the confusion, giancarlo...

<phila> deirdrelee: Looks like we have agreement to include Memento as a possible implementation

<Makx_Dekkers> nce between BPs for data and metadata, and BPS for data and vocabularies

<giancarlo_guizzardi> laufer: I am confused and not about semantics :-)

<ericstephan> one best practice for versioning that can be applied to dataset and vocab

<Makx_Dekkers> +1 to eric

<flavio> +1 to eric

<annette_g> +1 to eric

<phila> PROPOSED: That we remove the best practice on versioning vocabularies, transferring any relevant points to the data versioning BP. We will refer to memento as a possible implementation

<Ig_Bittencourt> Maybe the ericstephan's proposal could be the best way

BernadetteLoscio: a BP for versioning that is UNIVERSAL

<Ig_Bittencourt> Not necessarily universal, but that fits to both.

PROPOSED: That we remove the best practice on versioning vocabularies, transferring any relevant points to the data versioning BP. We will refer to memento as a possible implementation

<ericstephan> PROPOSED: We have one best practice for versioning that can be applied to dataset and vocabulary

<annette_g> +1

<flavio> +1

<Ig_Bittencourt> +1 to ericstephan

<GiselePappa> +1

+1 to ericstephan

<ericstephan> +1

<riccardoAlbertoni> +1

<Makx_Dekkers> .+1

<deirdrelee> +1

<newton> +1

<giancarlo_guizzardi> +0

<SumitPurohit> +1

<phila> +1

<laufer> we are just postponing the necessity of the BP for vocabularies versioning...

<BernadetteLoscio> +1

<laufer> +1

<deirdrelee> +1

giancarlo_guizzardi: I think that we should ask is why people would want versioning, in the first place

It has to do with the compliance to this vocabulary

scribe: that could be ok, for a given application
... why do we think that this is relevant
... this BP are important to reflect some quality dimensions?

<Makx_Dekkers> you may not want versioning, but it is a fact of life. things chang

<SumitPurohit> +q

deirdre: moving to the next point

<phila> RESOLVED: We have one best practice for versioning that can be applied to dataset and vocabulary

<laufer> the versioning is necessary because we have the same identifier for the dataset during the life of data...

<Makx_Dekkers> no! different version, different identifier!

annette_g: our document is really long

<phila> +1 Makx

<laufer> not for vocabularies, makx

annette_g: we should look for opportunities to make it shorter

SumitPurohit: that inconsistent with the rest of the document. If quality is only about dataset, and not about versioning, it is kind of confusing

<Makx_Dekkers> laufer, even for vocabularies

SumitPurohit: just scan the document and make sure that we used all the words when needed

<laufer> even for uris, makx?

<Makx_Dekkers> laufer, the terms may not change and keep the same identifiers, but the vocabulary as a whole may be versioned

BernadetteLoscio: I think what SumitPurohit is saying that in the document we are explicit about the terms

<giancarlo_guizzardi> Folks, unfortunately I will need to leave right now. This has been very interesting. Thanks.

<Makx_Dekkers> a vocab with terms a, b, and c is different from the vocabulary with a, b, c and d, even if a, b, and c stay the same

BernadetteLoscio: in several parts we talk about datasets, then we need to review the document and see if the BP apply to both cases

<phila> Thanks giancarlo_guizzardi

<laufer> the vocabulary is versioned but is the same uri

<Ig_Bittencourt> Thanks giancarlo_guizzardi. Look at my e-mail after.

<giancarlo_guizzardi> ok. will do. talk to you tomorrow

<giancarlo_guizzardi> ciao

BernadetteLoscio: when we talk about data licences, for instance, vocabulary and dataset? Why not include the vocabulary?
... this is going to be a big change to the document

<Makx_Dekkers> no! different version, different uri

<SumitPurohit> +1

<ericstephan> Are dataset and vocab to be assumed unless otherwise noted?

<Zakim> phila, you wanted to talk about dataset/vocab

<laufer> so, schem.org has different uris for each new verion of its vocabularies, maks... is this?

phila: we don't want to have 2 separated BPs

<deirdrelee> http://htmlpreview.github.io/?https://github.com/w3c/dwbp/blob/gh-pages/glossary.html

<deirdrelee> o 52, 55, 59, 68, 80, 82, 133, 134

https://w3c.github.io/dwbp/glossary.html

<Makx_Dekkers> laufer, I don't know about schema.org. I know what I've been doing for ADMS vocabularies.

<ericstephan> perhaps a "context" field needs to be added to describe what 'thing' it should be applied

BernadetteLoscio: we talked about how to use this vocabularies, we should talk about licences

<laufer> ok, makx

BernadetteLoscio: the second question is: the BPs for metadata apply to datasets and vocabs?

<phila> deirdrelee: Can we frame things so that it's clear whether we're talking about data, vocabs and/or metadata?

<Zakim> annette_g, you wanted to question the assumption that a vocabulary is a dataset

annette_g: the more I think about the less I think it is true that a vocabulary is metadata
... I think it is out of scope

<Makx_Dekkers> In my opinion, a vocabulary is a dataset. It fits the definition of dataset in DCAT

<SumitPurohit> +q

BernadetteLoscio: it is just that: what is the difference between publishing a dataset and a vocabulary

<Makx_Dekkers> I describe vocabularies with ADMS as a adms:Asset which is a subclass of dcat:Dataset

BernadetteLoscio: in the document we are talking about publishing dataset and publishing vocabulary

annette_g: I think we should mention vocabs but I do not think that we are here to teach people how to make a vocabulary

deirdre: it is in scope?
... we really want to make some decisions

<phila> deirdrelee: I think Annette brings up an important issue, not only are datasets and vocabs different and is that question is scope

<ericstephan> Could we be onto a new glossary term "vataset" or "docab"? ;-)

<phila> ... if it's in scope, how do we include it in the docs

<phila> +1 ericstephan

<Makx_Dekkers> The issue is, I think, that people have a mental model of what a dataset is. I've been asking to make that mental model explicit.

<laufer> a vocabulary is a language to talk with machines... as a natural language is a language to talk with humans...

<ericstephan> +1 Makx_Dekkers

<Makx_Dekkers> I favour an inclusive mental model that includes vocabularies, images, text collections, spreadsheets, video streams etc.

+1 to Makx_Dekkers

<ericstephan> +q

<newton> Makx_Dekkers, I agree with you. Do you have any suggestions or would like to start it?

<Makx_Dekkers> But I have also expressed my opinion that we can't write BPs for all those types of data. We just don't have the knowledge for that.

<laufer> vocabularies are replicating the way humans establish communication...

maintenance is different of creation imho

Laufer: do you want to scribe?

<Makx_Dekkers> My proposal was to focus our work just on spreadsheet-like data and leave the rest ot other experts.

<ericstephan> Makx_Dekkers - I like the schema.org term https://schema.org/CreativeWork which is a parent to many mental model digital objects

This is a bargain, deirdre

<Makx_Dekkers> Otherwise, BPs can only be very general. What's the similarity between publishing a spreadsheet and a video feed? Not much.

<laufer> thank you, phil... I almost believed...

sorry, deirdre, I lost that

<Makx_Dekkers> I like https://schema.org/Thing better...

<ericstephan> it is the parent of https://schema.org/CreativeWork

thanks, deirdre!

<laufer> tomorrow...

<phila> scribe: Newton

SumitPurohit: for this group probably we may distinguish between data and vocabulary

<Makx_Dekkers> Apologies, have to sign off. Supper calls...

Ig: even this situation we are talking about data on the web, not necessary about vocabulary or ontology

<yaso> Ig_Bittencourt: we are talking about creation of BP for vocabulary creation

<phila> Thanks Makx - bon apetite

Ig: In my opinion is still the same, and I agree on removing versioning vocabulary BP

<yaso> If anyone wants to explore this question about a mental model with me after the meeting, We could spend an hour on this

<annette_g> we need a definition of data

<laufer> remove remove remove

<yaso> I really believe that this could help the grou to stick to situations related to context

ericstephan: Makx was sharing great thoughts
... BernadetteLoscio and I were thinking about different kinds of data to put on the web
... Makx shared thoghts about a mental model concerning about more kinds of data

<yaso> I agree with ericstephan

<Ig_Bittencourt> s\talking\not talking

ericstephan: I think that might be example that people use everyday, such as videos, experimental datasets

<annette_g> data = recorded observations

ericstephan: I think we need a glossary

<Zakim> phila, you wanted to make a proposal that Berna might not like but might move us on a little

ericstephan: we could have a glossary not only for general terms, but to specific things too

<ericstephan> draft proposal: define example "mental models" as examples of dataset, vocabulary in glossary.

<phila> phila: I was saying thagt most BPs will apply to all kinds of data. Where they only apply to specific types, say, vocabs, then we should say so. perhaps organise the doc from general down to specific.

<phila> In the glossary or BP doc, eric?

yaso: I was going to propose about the mental model, but ericstephan already said about it

annette_g: I think this is a very fundamental distinction; versioning is important to data, and it's necessary to have a meaning of what is data

<phila> I'm recalling the conversation in Croke Park a few years ago when we came to the conclusion that dcat:Dataset = Information Resource, which is counter to what Annette is saying

<phila> CHAIR INTERRUPT

<phila> 20 mins to go...

deirdrelee: 20 minutes left... and a couple of things to decide
... first of all is the vocabulary is not a dataset (?)

<phila> deirdrelee: Options are - vocabs, metadata are treated as datasets, or we agree that they are not, or that it's out of scope and we don't have to decide

<phila> ... and there's the added thing of deciding which BP applies to each

<phila> ... that may be a very long conversation

<phila> deirdrelee: Consensus seems to be that we treat metadata and vocabs as datasets... most of the time

<phila> ... what gets included in the glossary, what gets included in the BP doc

<ericstephan> draft proposal: conclude that vocabs, datasets aren't the same thing, but the same technique can at times be applied to both

<yaso> 1+

<BernadetteLoscio> +1

<deirdrelee> conclude that vocabs, datasets aren't the same thing, but the same technique can at times be applied to both

ericstephan: we have different mental model on what those things are
... but we can use the same BP technique on vocab, datasets
... we need to focus on methodology and on how methodology could be applied on it

<riccardoAlbertoni> +1 to conclude that vocabs, datasets aren't the same thing, but the same technique can at times be applied to both

annette_g: I agree that a specific methodology can be applied on both

<deirdrelee> draft proposal: to decide if vocab and dataset are equal is out of scope, but if BPs specifically to metadata or vocab, we call it out in the BP (not that we have to say it necessarily for each BP)

<Zakim> annette_g, you wanted to ask what is not data?

<Zakim> phila, you wanted to make a concrete proposal

<phila> One can treat vocabularies, metadata and data as one species - the boundaries between them are not always absolute. However, for the purpose of the Best Practices, we have a number of mental models to help:

<phila> A dataset is...

<phila> A vocbulary is...

<phila> Metadata is...

<yaso> annette_g: jpeg picture published with encoded exif data, for instance: jpeg is not data, but has data encoded in exif tags. This metadata can be extracted. Jpeg is not data, but can be read by spiders that identify some features and put them together in a dataset. Then is data extracted from the picture, I think...

<BernadetteLoscio> https://www.w3.org/2013/dwbp/wiki/Glossary

<annette_g> yaso, yes, one can use it as data if that is the intention

<ericstephan> +1 phila

<yaso> annette_g: that's why I think we should think in mental models, or "situations"

deirdrelee: we have a glossary that yaso and bernadette are working on
... we also may include a disclaimer to recognize

<annette_g> yaso: if we cover all situations, we will have an endless document

<yaso> and then say: in this specific situations, this wg will not work, like best formats to encode metadata, or best softwares to extract data from media

deirdrelee: we can assume that all BPs apply to dataset

<scribe> scribe: yaso

defining situations mean that we can get rid of some of them :-)

<laufer> and we have to define what we assume of beeing a dataset...

<newton> yaso: I was talking to annette_g to define some situations that could help us

<phila> PROPOSED: That we state that the boundaries between metadata, dataset and vocabulary is not always clear, however, we have the following mentalmodels in mind... and then talk about the three

<ericstephan> +1

<deirdrelee> +1

<newton> ... we need to defines the situations before, for instance, we cannot recommend BPs to extract metadata from media files

<newton> ... we have to recognize are data on the web also, but we don't necessary work on those situations

<SumitPurohit> +q

+ 1 to phila

<newton> ... we can't ignore that, so we can recognize that but not necessary to work on that

<laufer> +1

<newton> SumitPurohit: have we the distinction between vocabularies and metadata

<SumitPurohit> +1

<annette_g> +1

<GiselePappa> +1

<scribe> scribe: yaso

<newton> +1

<phila> RESOLVED: That we state that the boundaries between metadata, dataset and vocabulary is not always clear, however, we have the following mental models in mind... and then talk about the three

+1

<Ig_Bittencourt> +1

<phila> ACTION: yaso to add the definitions of dataset, vocabulary and metadata in the glossary, noting that the boundaries are not always distinct (or helpful). [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action09]

<trackbot> Created ACTION-154 - Add the definitions of dataset, vocabulary and metadata in the glossary, noting that the boundaries are not always distinct (or helpful). [on Yaso Córdova - due 2015-04-20].

<SumitPurohit> +q

<phila> Yaso - I'm happy to help write that

<ericstephan> Thank you all, see you tomorrow!

phila: great! I will appreciate it indeed

<annette_g> maybe discuss over email?

phila: I'try to write something today, then you can review as soon as I finish, is that ok for you?

<Ig_Bittencourt> There is a definition about dataset and vocab in the glossary. So we just need to give a definition to metadata.

<phila> deirdrelee: I'd like to park the conversation about defining the three things or we'll spend endless time discussing it.

<phila> Thanks yaso - that would be great

Nice!

<phila> Eneergy levels at critical here now I'm afraid

people get too much philosophical when tired

<phila> The DCAT definition Dataset is "A collection of data, published or curated by a single agent, and available for access or download in one or more formats."

<phila> phila: Suggests you have plenty to talk about over dinner in Austin. Please come back with a consensus decision that we can all agree to tomorrow

<BernadetteLoscio> ;)

<ericstephan> Yes I think we need to hit the bars on this one

<phila> deirdrelee: Wraps up the meeting

<phila> phila: Records thanks to Dee for excellent chairing today

<ericstephan> bye bye

<phila> Night all

<riccardoAlbertoni> bye, thanks

Tks, bye, see you tomorrow

Summary of Action Items

[NEW] ACTION: antoine to follow up on resolution that Phil proposed [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action06]
[NEW] ACTION: antoine to follow up on the resolution re starting with the ODS list, Makx's list etc, working towards FPWD ASAP [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action07]
[NEW] ACTION: antoine to look at completeness as one of the quality dimensions [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action08]
[NEW] ACTION: bernadette to to add a BP for structural metadata [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action01]
[NEW] ACTION: BernadetteLoscio to write to Andrea Perego informing of the new text [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action05]
[NEW] ACTION: Deirdre to include the standard resolution in the scope page on the wiki [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action04]
[NEW] ACTION: Lee to include the standard resolution in the scope page on the wiki [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action03]
[NEW] ACTION: yaso to add the definitions of dataset, vocabulary and metadata in the glossary, noting that the boundaries are not always distinct (or helpful). [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action09]
[NEW] ACTION: Yaso to include a definition of 'a standard' in the glossary [recorded in http://www.w3.org/2015/04/13-dwbp-minutes.html#action02]
 
[End of minutes]

Minutes formatted by David Booth's scribe.perl version 1.140 (CVS log)
$Date: 2015/04/13 20:59:12 $

Scribe.perl diagnostic output

[Delete this section before finalizing the minutes.]
This is scribe.perl Revision: 1.140  of Date: 2014-11-06 18:16:30  
Check for newer version at http://dev.w3.org/cvsweb/~checkout~/2002/scribe/

Guessing input format: RRSAgent_Text_Format (score 1.00)

Succeeded: s/WE/we/
Succeeded: s/phil/hadley/
Succeeded: s/CLS/HCLS/
Succeeded: s/PROPOSAL/PROPOSED/
Succeeded: s/1/+1/
Succeeded: s/goof/good/
Succeeded: s/shoujld/should/
Succeeded: s/ion/on/
Succeeded: s/BO doc/BP doc/
Succeeded: s/a plrofile/application profile/
FAILED: s/looking at.../ looking at the scope/
Succeeded: s/jear/hear/
Succeeded: s/(something)/of the time a name is present/
Succeeded: s/PRPOSED/PROPOSED/
Succeeded: s/make a/make it a/
Succeeded: s/sy whihc/say which/
Succeeded: s/thinkin/thin in/
Succeeded: s/thinkin/think in/
Succeeded: s/preserving/preserving versions of/
Succeeded: s/implnementation sectio/implementation section/
Succeeded: s/Laufer/Annette/
FAILED: s/Laufer/Annette/
WARNING: Bad s/// command: s/I;m/I'm/g#
Succeeded: s/I;m/I'm/g
Succeeded: s/60/16/
Succeeded: s/not/no/
Succeeded: s/wauallity/quality/
Succeeded: s/Makz/Makx/g
Succeeded: s/menatl /mental/
Found Scribe: hadleybeeman
Inferring ScribeNick: hadleybeeman
Found Scribe: annette_g
Inferring ScribeNick: annette_g
Found Scribe: phila
Inferring ScribeNick: phila
Found Scribe: phila
Inferring ScribeNick: phila
Found Scribe: ericstephan
Inferring ScribeNick: ericstephan
Found Scribe: BernadetteLoscio
Inferring ScribeNick: BernadetteLoscio
Found Scribe: yaso
Inferring ScribeNick: yaso
Found Scribe: Newton
Inferring ScribeNick: newton
Found Scribe: yaso
Inferring ScribeNick: yaso
Found Scribe: yaso
Inferring ScribeNick: yaso
Scribes: hadleybeeman, annette_g, phila, ericstephan, BernadetteLoscio, yaso, Newton
ScribeNicks: hadleybeeman, annette_g, phila, ericstephan, BernadetteLoscio, yaso, newton

WARNING: Replacing list of attendees.
Old list: deirdrelee RiccardoAlbertoni phila antoine HadleyBeeman Makx_Dekkers Austin ericstephan austincomputer [IPcaller] annette_g laufer Ig_Bittencourt flavio newton yaso Gisele Sumit
New list: deirdrelee RiccardoAlbertoni phila antoine HadleyBeeman Makx_Dekkers ericstephan austincomputer [IPcaller] annette_g laufer Ig_Bittencourt flavio newton yaso Gisele Sumit BernadetteLoscio SumitPurohit +31.20.420.aaaa Ipswich


WARNING: Replacing list of attendees.
Old list: deirdrelee RiccardoAlbertoni phila antoine HadleyBeeman Makx_Dekkers ericstephan austincomputer [IPcaller] annette_g laufer Ig_Bittencourt flavio newton yaso Gisele Sumit BernadetteLoscio SumitPurohit +31.20.420.aaaa Ipswich
New list: [IPcaller] Makx_Dekkers deirdrelee Riccardo phila giancarlo_guizzardi_ ericstephan flavio newton annette_g yaso BernadetteLoscio giancarlo_guizzardi

Default Present: [IPcaller], Makx_Dekkers, deirdrelee, Riccardo, phila, giancarlo_guizzardi_, ericstephan, flavio, newton, annette_g, yaso, BernadetteLoscio, giancarlo_guizzardi
Present: [IPcaller] Makx_Dekkers deirdrelee Riccardo phila giancarlo_guizzardi_ ericstephan flavio newton annette_g yaso BernadetteLoscio giancarlo_guizzardi GiselePappa Sumit
Regrets: Steve
Agenda: https://www.w3.org/2013/dwbp/wiki/April_2015_F2F#Agenda
Found Date: 13 Apr 2015
Guessing minutes URL: http://www.w3.org/2015/04/13-dwbp-minutes.html
People with action items: antoine bernadette bernadetteloscio deirdre lee yaso

[End of scribe.perl diagnostic output]