20:44:59 RRSAgent has joined #dxwgdcat 20:44:59 logging to https://www.w3.org/2020/06/17-dxwgdcat-irc 20:45:01 RRSAgent, make logs public 20:45:01 Zakim has joined #dxwgdcat 20:45:03 Meeting: Dataset Exchange Working Group Teleconference 20:45:03 Date: 17 June 2020 20:45:12 present+ 20:51:29 rrsagent, draft minutes 20:51:29 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html riccardoAlbertoni 20:52:21 rrsagent, make log public 20:52:31 rrsagent, draft minutes 20:52:31 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html riccardoAlbertoni 20:58:33 PWinstanley_ has joined #dxwgdcat 21:00:25 scribenick: PWinstanley_ 21:00:34 present+ 21:02:33 AndreaPerego has joined #dxwgdcat 21:02:42 present+ 21:03:07 agenda: https://www.w3.org/2017/dxwg/wiki/Meetings:Telecon2020.06.17 21:03:10 RRSAgent, draft minutes v2 21:03:10 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego 21:04:16 regrets+ SimonCox 21:04:27 Chair: RiccardoAlbertoni 21:04:51 RRSAgent, draft minutes v2 21:04:51 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego 21:05:34 PROPOSED: approve last meeting minutes https://www.w3.org/2020/06/03-dxwgdcat-minutes 21:05:37 +1 21:05:37 +1 21:05:43 +1 21:05:52 RESOLVED: approve last meeting minutes https://www.w3.org/2020/06/03-dxwgdcat-minutes 21:06:00 RRSAgent, draft minutes v2 21:06:00 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego 21:06:03 https://www.w3.org/2017/dxwg/wiki/Meetings:Telecon2020.06.17 21:06:34 topic: versioning 21:07:03 https://github.com/w3c/dxwg/issues/89 21:07:22 riccardoAlbertoni: issue #89 - version delta 21:07:40 ... in the last meeting we discussed a lot but didn't come to a clear conclusion 21:08:19 ... I am proposing that we use adms:versionNotes 21:08:24 +1 21:08:30 Proposed: Consider adms:versionNotes as a working solution for expressing textual description of the version delta. 21:09:13 +1 21:09:17 ... principally because it does not imply any entailments. It is a good flexible starting point 21:09:35 +1 21:09:35 +1 21:09:51 resolve:Consider adms:versionNotes as a working solution for expressing textual description of the version delta. 21:10:05 resolved: Consider adms:versionNotes as a working solution for expressing textual description of the version delta. 21:10:12 RRSAgent, draft minutes v2 21:10:12 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego 21:10:17 https://github.com/w3c/dxwg/issues/91 Version release date 21:10:46 riccardoAlbertoni: version release date - 21:11:26 ... we need to have some mapping between dcterms and prov 21:11:39 s/prov/prof/ 21:11:58 ... we had already included dc:modified 21:12:04 q+ 21:12:14 ... what are the feelings about dc:issued? 21:12:35 AndreaPerego: I think we should start with the simple solution, in the absence of a motivating use case 21:13:46 ... until we have a good use case then we should not tie ourselves to anything . a version of a dataset is a dataset 21:14:36 riccardoAlbertoni: so you agree with adopting dct:created ... assuming that each time a new version is created it is a new dataset 21:16:52 AndreaPerego: perhaps the idea was to have a creation date with the initiation of the dataset, then a version for each publication. Until there is something tangible, using the same properties means that it is like another dataset. Using this approach we are not explicitly providing a solution for a release date 21:18:13 riccardoAlbertoni: one discussion related to the one you are trying is the PAV which distinguishes between two types of version - one for long-standing entities and another for more short-lived ones 21:18:29 AndreaPerego_ has joined #dxwgdcat 21:18:46 RRSAgent, draft minutes v2 21:18:46 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego_ 21:19:03 q+ 21:19:15 ack AndreaPerego 21:19:16 ... since one of the driving ideas I was following was to make DCAT a lingua franca, I wonder if we need to distinguish between long-standing assets and shorter lived assets 21:20:16 AndreaPerego_: we need a use case. 21:20:54 https://docs.google.com/spreadsheets/d/1kOp810ep3gQ2iezVXH-abX2q2QubqxNmyJ2bcX6WAFw/edit?usp=sharing 21:21:02 ... for the time-being we assume that a version of a dataset is another dataset. Without a use case it is difficult to know how to handle this point 21:21:06 RRSAgent, draft minutes v2 21:21:06 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego_ 21:21:14 q- 21:22:19 riccardoAlbertoni: To explain why I am dealing with this type of issue, 2 of the most powerful vocabularies about versioning make this distinction (long-lived vs short-lived) and so perhaps it is an implicit requirement 21:24:06 ... we could ignore fine-tuning but we lose expressivity. We need to think about the need to be able to map a vocab like PAV to DCAT 21:24:29 q+ 21:24:44 ... So, although we don't have a formal requirement, my concern is that we should be able to map these vocabularies. 21:25:40 AndreaPerego: I understand the point, but I'm still concerned about addressing this in DCAT without a statement about how and why we are doing this. Perhaps it is just to get alignment with PAV or Version 21:25:43 q+ 21:25:49 ack AndreaPerego 21:26:18 ... so perhaps it is an alignment issue rather than an extension of DCAT 21:26:25 ack PWinstanley_ 21:28:06 PWinstanley_: there is already a link to Prov and use thing like wasGeneratedBy 21:29:19 q+ 21:29:57 riccardoAlbertoni: the PAV shows how dct:isVersionOf is the prop that links something that is not version with something that it.... this kind of property occurs in so many places it makes it difficult to actually find the actual version that an entity is. We need something that is explicit 21:30:21 ack AndreaPerego 21:31:00 AndreaPerego: so we can provide something simple in DCAT and if that is not enough we can direct users to PAV or other version vocabularies 21:31:34 riccardoAlbertoni: I think that if you provide only the simplest thing you are providing something that is not very useful/useable 21:32:15 q+ 21:33:21 ... This isn't an issue that we absolutely need to deal with , but we risk providing a solution that is not really satisfactory - but it depends on what kind of user you have in mind. Simple annotation of the next version, then that simple case is enough, but with the need to know the most recent version then something more sophisticated is needed 21:34:16 AndreaPerego: I agree that the key information needed is whether a version is the most recent or not. 21:35:53 ... I think with the prof and adms properties we can cover things like next and last. This is ok for the use cases that I can figure out, but there may be some that cannot be covered by this. For the time-being I think that what we have from dc and adms could be enough 21:37:15 riccardoAlbertoni: the relation which is the pointer to the latest version - what does that point to? there needs to be an update to earlier versions at publication time 21:37:57 AndreaPerego: if there is a URL pointing to the latest version, then that is OK, but otherwise there is a need to have updates 21:38:27 ... so we need to distinguish between a vocabulary requirement and an implementation approach 21:38:39 ... this is something from DWBP 21:39:27 riccardoAlbertoni: there needs to be a property that returns the latest version without assuming anything about versioning practice 21:40:02 ... my hunch is that if we don't want to enter into this best practice we need a more complex solution 21:41:03 ... each time the version is updated the prev relationship needs to be entered to the previous version 21:41:57 AndreaPerego: the unversioned entity always corresponds to the most recent? 21:42:47 ... this is about the ways of publishing data. It is platform-dependent. Otherwise it has to be done another way 21:43:10 ... zenodo has a mechanism for providing a list of previous versions 21:43:30 ... but in this case it is related to the publication mechanism 21:44:05 riccardoAlbertoni: you can cover both approaches with the same vocabulary 21:44:20 AndreaPerego: in CKAN I don't think this is possible 21:45:03 AndreaPerego: so there is no single option. PAV would fit with some platforms; adms into others 21:45:34 ... so instead of mandating a single approach we can show alternatives and people can select depending on the data publishing platform 21:45:53 q+ 21:46:01 ack AndreaPerego 21:46:29 AndreaPerego: I know use cases where conceptually this relationship wouldn't work 21:47:25 ... gathering data for a specific purpose, what may happen is that the approach may change from year to year. So they are different datasets because the survey approach changed from year to year 21:48:12 ... so it is important to know that datasets are related, but they are not versions of the same entity because of the survey changes. Nevertheless I would still want to know the latest version 21:48:29 ?q 21:48:33 q? 21:48:33 ack PWinstanley_ 21:50:29 PWinstanley_: we need to keep a broad approach that will allow the versioning of any of the main classes of DCAT 21:50:34 q+ 21:51:08 riccardoAlbertoni: one possible approach is to describe the cases that AndreaPerego was suggesting and see how different vocabs measure up 21:52:13 AndreaPerego: we are not going to address versioning in detail, we are going to deal with a subset - but we can refer to other vocabularies whilst looking at the subset of cases 21:52:35 ... we can provide pointers to other vocabularies 21:54:18 AndreaPerego: prov is powerful but the same thing can be said in different ways. 21:55:35 riccardoAlbertoni: we just need to discuss this. we might not arrive at a solution, but we need to see what we can do 21:55:57 AndreaPerego: we can include dc and adms, they are stable and related to DCAT 21:57:06 riccardoAlbertoni: if there are terms from vocabularies that are not standard we could implement something similar within DCAT. If we are precise enough we can have a mapping with existing vocabularies 21:58:22 riccardoAlbertoni: People still want a 'solution' for versioning in spite of there being many vocabularies.... I think we should take this opportunity 21:59:36 topic: how to catalog a relational database 22:00:16 https://github.com/w3c/dxwg/issues/1240 22:00:26 AndreaPerego: this is related to cataloguing a service or API. there was a similar requirement for a triplestore recently. We need to put this on the agenda 22:00:43 ... this is something we might want to address for the next version 22:01:11 riccardoAlbertoni: this is also something for the primer, and we might not be able to provide a full solution, but we can give some pointers 22:04:43 ... We need to get more thoughts about versioning, but for the next meeting we perhaps need to focus on something else and see if we can make progress there 22:07:30 riccardoAlbertoni: I will send out an email asking for topics for the next call 22:07:45 RRSAgent, draft minutes v2 22:07:45 I have made the request to generate https://www.w3.org/2020/06/17-dxwgdcat-minutes.html AndreaPerego 22:07:52 bye