IRC log of dpub-arch on 2016-02-04

Timestamps are in UTC.

16:55:38 [RRSAgent]
RRSAgent has joined #dpub-arch
16:55:38 [RRSAgent]
logging to http://www.w3.org/2016/02/04-dpub-arch-irc
16:55:57 [Zakim]
Zakim has joined #dpub-arch
16:56:11 [TimCole]
rrsagent, set log public
16:56:27 [TimCole]
Meeting: DPUB Archival TF
16:57:01 [TC2]
TC2 has joined #dpub-arch
16:57:14 [tzviya]
tzviya has joined #dpub-arch
16:59:28 [HeatherF]
present +Heather_Flanagan
17:00:09 [TimCole]
present+ Tim_Cole
17:01:09 [tzviya]
present+ Tzviya
17:01:39 [dkaplan3]
dkaplan3 has joined #dpub-arch
17:01:42 [dauwhe]
dauwhe has joined #dpub-arch
17:01:53 [dkaplan3]
present+ Deborah_Kaplan
17:02:01 [mgylling]
mgylling has joined #dpub-arch
17:02:34 [HeatherF]
scribenick: HeatherF
17:03:26 [HeatherF]
HeatherF has joined #dpub-arch
17:03:39 [HeatherF]
scribenick: HeatherF
17:03:44 [Bill_Kasdorf]
Bill_Kasdorf has joined #dpub-arch
17:03:53 [Bill_Kasdorf]
present+ Bill_Kasdorf
17:04:03 [dauwhe]
present+
17:04:06 [mgylling]
Present+ Markus
17:04:20 [TimCole]
Wiki page: https://www.w3.org/dpub/IG/wiki/Task_Forces/archival
17:04:32 [tzviya]
agenda: https://lists.w3.org/Archives/Public/public-digipub-ig/2016Feb/0013.html
17:05:19 [TimCole]
Leonard's email: https://lists.w3.org/Archives/Public/public-digipub-ig/2016Feb/0021.html
17:05:45 [HeatherF]
TimCole: Reviewing task force goals (see wiki page for initial draft)
17:06:15 [HeatherF]
...: any changes, either in structure or in content?
17:07:37 [mgylling]
q+
17:07:49 [mgylling]
ack me
17:07:55 [HeatherF]
...: Should we keep the potential for expanded scope of material going beyond the PWP?
17:08:05 [dkaplan3]
q+
17:08:34 [HeatherF]
mgylling: the problem statement/goal is spot on; any output from this TF should feed the use cases more than the PWP directly.
17:08:40 [tzviya]
q+
17:08:54 [HeatherF]
...: so, produce use cases and functional requirements
17:09:30 [HeatherF]
dkaplan: agree. Would add that in the long run, the product of this TF would be an archival profile for the PWP.
17:09:39 [Bill_Kasdorf]
q+
17:09:47 [HeatherF]
...: right now, however, we're creating functional requirements and use cases.
17:09:56 [TimCole]
ack dkaplan
17:10:13 [TimCole]
ack tzviya
17:10:20 [HeatherF]
tzviya: +1. If other cases come up that fall outside of this remit, we can always record them on the wiki to save for later.
17:10:50 [TimCole]
ack Bill
17:11:05 [HeatherF]
Bill_Kasdorf: as a point of clarification, it seems clear that with the existing goals statement, that we are focusing on formal archives.
17:11:30 [dkaplan3]
q+
17:11:30 [HeatherF]
...: we are not talking about a publisher who wants to archive a version of a publication for future use. Is that correct? Should we explicitly state this?
17:11:58 [TimCole]
ack dka
17:12:32 [HeatherF]
dkaplan: rather than saying that's out of scope, we should consider it a subset. This is about preservation, not just archiving.
17:13:59 [HeatherF]
...: we are talking about the formal archivist definition of preservation. We are talking about long-term, persistent ability to access content.
17:14:00 [mgylling]
+1
17:14:31 [HeatherF]
TimCole: Suggests that we need to have some mods to the goals, including that we are going to create use cases, and that we need to confine scope to formal archiving.
17:14:48 [HeatherF]
...: hopefully we won't have to define "formal archiving" from scratch; want to use someone else's.
17:14:57 [dkaplan3]
http://www2.archivists.org/glossary/terms/p/preservation
17:15:22 [HeatherF]
ACTION: dkaplan3 to pull together the formal definition of archiving/preservation
17:15:44 [HeatherF]
ACTION: TimCole to add the creation of use cases to the goals on the wiki
17:16:36 [HeatherF]
TimCole: Next topic - experts we should consult. We have some pointers to documents on the wiki; do we need specific people brought in as well?
17:16:57 [tzviya]
q
17:17:05 [Bill_Kasdorf]
q+
17:17:07 [dkaplan3]
q+
17:18:04 [HeatherF]
tzviya: Our goal is to define use cases. To get a broader set of use cases, it would be useful to either interview or invite others from that community.
17:18:30 [HeatherF]
...: Deborah has formal archival training. (So does Heather)
17:18:54 [HeatherF]
:-)
17:19:51 [HeatherF]
TimCole: have been in touch with people at Portico for ideas about their workflows; they ingest data and normalize it on a regular basis. Want to know how the format of what they get impacts their workflow.
17:20:41 [HeatherF]
...: To avoid duplication, suggests that we keep track on the wiki re: who we are reaching out to.
17:20:56 [TimCole]
ack Bill
17:22:20 [HeatherF]
Bill_Kasdorf: Portico and Lockss/Clokss are interesting contrasting organizations in this space.
17:23:04 [TimCole]
ack dka
17:23:05 [HeatherF]
...: Portico normalizes the content, whereas Lockss/Cloks harvests documents and so has a lot of web documents.
17:24:17 [HeatherF]
dkaplan3: Outreach - yes, we should do that, and not just to organizations, and we should keep a list. Another problem to be aware of, this TF is currently mostly US participants.
17:24:48 [HeatherF]
...: if we can get someone not anglophone, at least as consulting expert, that would be helpful.
17:25:37 [HeatherF]
TimCole: What about resources or documents? Anything to add there? Please add if you think of anything.
17:25:44 [Bill_Kasdorf]
Also British Library, KB (Nat. Lib. of Netherlands), Bibliotheque Francaise
17:26:56 [Bill_Kasdorf]
Important issue with BL, KB, etc. is that they are they are mandated to archive content, "legal depository"
17:27:12 [HeatherF]
TimCole: Regarding logistics, this TF has enough work to keep us busy for a while. Should we get on a regular call schedule? What would be a good timeline for this work?
17:27:23 [HeatherF]
+1
17:27:26 [mgylling]
+1
17:27:31 [tzviya]
+1
17:28:15 [HeatherF]
We will aim for twice a month, though perhaps not at this time.
17:30:53 [HeatherF]
TimCole: Will search in the range of 10am-12pm Eastern, M-Th. This will narrow down the doodle poll.
17:31:37 [HeatherF]
...: Emails should go to the main dpub list, but email authors should remember to put in [dpub-arch] in the subject for easier sorting.
17:31:46 [dkaplan3]
+1
17:32:14 [dkaplan3]
q+
17:32:18 [HeatherF]
...: What's our timeline? How long should this TF expect to run?
17:32:25 [tzviya]
q+ to discuss goals and timeline
17:32:39 [tzviya]
ack dk
17:32:42 [TimCole]
ack dka
17:33:24 [HeatherF]
dkaplan3: As long as we keep the scope narrow, we start with what is not already defined (don't reinvent definitions where we don't have to)
17:33:32 [TimCole]
ack tzv
17:33:32 [Zakim]
tzviya, you wanted to discuss goals and timeline
17:34:00 [mgylling]
q+
17:34:36 [TimCole]
ack mgyl
17:34:46 [HeatherF]
tzviya: Let's not let this be something that just happens at the meetings; do work between meetings. We can target writing use cases and seeing how much we can do in three months.
17:35:26 [HeatherF]
mgylling: +1 to tzviya. In terms of timeline, we haven't set a final delivery date to the larger use case effort, but we will soon. Having a note by TPAC this year (end of September) would be a reasonable target.
17:35:41 [HeatherF]
and music ensues
17:36:15 [HeatherF]
mgylling: NISO also has work going on in this space; make sure we don't duplicate effort.
17:36:26 [HeatherF]
no more classical music. sadness.
17:36:51 [HeatherF]
ACTION: TimCole to reach out to Todd Carpenter at NISO re their work in this space
17:37:49 [HeatherF]
TimCole: so, three to four month slot. Target end of May.
17:38:08 [HeatherF]
...: Is there a deadline on the PWP?
17:38:30 [HeatherF]
tzviya: The IG chairs need to talk about that.
17:38:49 [HeatherF]
mgylling: if this group comes up with a new paragraph, that will be enough to refresh the PWP regardless of its state.
17:39:22 [HeatherF]
...: it is a lightweight process to update that when needed.
17:40:09 [tzviya]
q+
17:40:31 [HeatherF]
TimCole: does anyone have comments on Leonards presentation re: PDF/A? Might schedule time on a future call for Leonard to talk about this directly.
17:40:38 [dkaplan3]
q+
17:40:49 [mgylling]
q?
17:40:54 [mgylling]
ack tzv
17:40:55 [HeatherF]
...: PDF/A is a recognized standard, but probably not sensible to turn everything into PDF/A
17:41:36 [HeatherF]
tzviya: An interesting presentation, but don't put the cart before the horse. We are not recommending one particular solution here.
17:42:17 [HeatherF]
dkaplan: There are very good reasons that PDF/A is not the appropriate recommendation. We are (probably) not headed towards ISO standardization. In generating the PDF/A standard, many contacts were made and use cases developed.
17:42:30 [HeatherF]
...: to the extent that the archival community participated, we should find that input and use it
17:42:38 [TimCole]
q?
17:42:43 [TimCole]
ack dka
17:43:46 [HeatherF]
TimCole: Do people want to start commenting on what use cases? What libraries have done historically is collect content from publishers at time of publication, so there are use cases of library services telling publishers what they need
17:43:59 [dkaplan3]
q+
17:44:09 [HeatherF]
...: but often libraries are coming to content well after publication. That's another category of use cases.
17:44:15 [TimCole]
ack dka
17:45:18 [HeatherF]
dkaplan: Archivists can ingest just about anything. Anything you come to after-the-fact, anything that hasn't been made as an archival document to begin with, is just like anything else (games, etc) that they might have to archive.
17:46:03 [HeatherF]
TimCole: so should we make clear some of the potential trade-offs about what happens if you don't consider archival requirements up front?
17:46:16 [Bill_Kasdorf]
q+
17:46:31 [HeatherF]
...: Print materials were simpler. Digital material introduces problems of versioning.
17:46:51 [TimCole]
ack Bill
17:47:49 [dkaplan3]
q+
17:47:53 [HeatherF]
Bill_Kasdorf: Would like to see a basic definition that a PWP is natively amenable to archiving, similar to how EPUB is natively amenable to accessibility
17:47:57 [TimCole]
ack dka
17:48:27 [HeatherF]
dkaplan: Agree with limitations. Accessibility should be the default, and the same thing with preservation. This is, however, a huge limitation.
17:49:00 [HeatherF]
...: A preservable document can't be preservable unless it is entirely offline with all its essential elements.
17:49:23 [HeatherF]
Bill_Kasdorf: That is a fundamental principle of PWP.
17:50:08 [HeatherF]
TimCole: Evn when you can take everything offline, if you try and open it in 5 years, it likely won't look the same as it did at time of publication.
17:50:57 [HeatherF]
Bill_Kasdorf: What is it that's being preserved? Is it the appearance or the essential content?
17:51:18 [HeatherF]
dkaplan: That is a question that even in the preservation community must be decided on a document-by-document basis.
17:53:05 [TimCole]
q?
17:53:34 [tzviya]
rrsagent, make logs public
17:53:50 [tzviya]
rrsagent, make minutes
17:53:50 [RRSAgent]
I have made the request to generate http://www.w3.org/2016/02/04-dpub-arch-minutes.html tzviya
17:54:00 [tzviya]
rrsagent, bye
17:54:00 [RRSAgent]
I see 3 open action items saved in http://www.w3.org/2016/02/04-dpub-arch-actions.rdf :
17:54:00 [RRSAgent]
ACTION: dkaplan3 to pull together the formal definition of archiving/preservation [1]
17:54:00 [RRSAgent]
recorded in http://www.w3.org/2016/02/04-dpub-arch-irc#T17-15-22
17:54:00 [RRSAgent]
ACTION: TimCole to add the creation of use cases to the goals on the wiki [2]
17:54:00 [RRSAgent]
recorded in http://www.w3.org/2016/02/04-dpub-arch-irc#T17-15-44
17:54:00 [RRSAgent]
ACTION: TimCole to reach out to Todd Carpenter at NISO re their work in this space [3]
17:54:00 [RRSAgent]
recorded in http://www.w3.org/2016/02/04-dpub-arch-irc#T17-36-51
17:54:05 [tzviya]
zakim, bye
17:54:05 [Zakim]
leaving. As of this point the attendees have been Tim_Cole, Tzviya, Deborah_Kaplan, Bill_Kasdorf, dauwhe, Markus
17:54:05 [Zakim]
Zakim has left #dpub-arch