W3C

– DRAFT –
RCH WG bi-weekly meeting

01 February 2023

Attendees

Present
aidhog, dlehn, gkellogg, Kazue, manu, pchampin, phila, yamdan
Regrets
-
Chair
markus_sabadello
Scribe
gkellogg

Meeting minutes

<markus_sabadello> gkellogg: Work is advancing in RDF* WG, issue activity and some work on getting Editors drafts of those specs in shape

<markus_sabadello> gkellogg: Started on some of the work on canonicalization, we just need definitions for quads

<markus_sabadello> phila: gkellogg is there anything in RDF* that we here need to pay attention to?

<markus_sabadello> gkellogg: Activity has been on semantics.. Some background activity about the mechanical bits

<markus_sabadello> gkellogg: Some things still need to be done before FPWD, hopefully in February

<Zakim> pchampin, you wanted to react to phila

pchampin: The blocking point right now is assigning editors, which is in progress now.
… Once we have editors, I don't see an obstacle on having FPWD.

<Zakim> manu, you wanted to note some items happening in other WGs. and to also mention Gordian Envelope and ACDC work.

<manu> Demonstration of Support for EdDSA Cryptosuite Adoption into VCWG: https://lists.w3.org/Archives/Public/public-vc-wg/2023Jan/0027.html

manu: One of the first crypto suites is up for a WG adoption call today.
… We've gotten signatures from a variety of implementers.
… We have a list of 16 implementors who have implemented the EDD crypto suite.
… there's a vote today on adopting the suite. We've passed three of four hurdles.
… Having interoperable implementations isn't a guarantee that the WG will accept it.
… Please mail into the VCWG mailing list to support it, if you're a member.
… The call today is challenging for people in Europe or asia to make.

<manu> Alternatives to RDF*: https://lists.w3.org/Archives/Public/public-credentials/2023Jan/0073.html

<aidhog> Gordian Envelopes

manu: There are some alternatives to RDF-star. One called Gordian Envelopes.
… It's a new graph structure that allows you to do interesting things.
… There's been some "crankiness" to RDF in parts of the group.

<TallTed> Nested containers. Graphs all the way down.

manu: THese other systems are rooted in the cryptographic community, rather than RDF.

markus_sabadello: What we're doing here is fundamental to support the VCWG work.

Horizontal review process

markus_sabadello: This is a part of the W3C process, and phila has created a number of issues on this.

phila: I made some issues for us to track progress. Accessibility and Internationalization are the "easy" ones.
… I don't think for these there's much to say, but it needs to be done.
… There are potentially issues for text strings in triples for internationalization.

<manu> Phil is correct, I don't think we considered I18N canonicalization rules on text...

phila: But, these come up in the RDF specs anyway.
… For Privacy and Security, we need text in the document to be reviewed.
… I'd ask for that to be pushed up the priority order.

<Zakim> manu, you wanted to reach out to IETF CFRG?

manu: We didn't consider i18n specific c14n of strings.
… It would be a good idea to reach out to the crypto forum research group and IETF and ask for a form or something.
… That could backfire, as there may be some lack of sympathy for the c14n approach.
… If we go to the TAG, they'll ask if we had any external review.
… Of course, we have our own people in the group, but they may want some external review.

gkellogg: I don't have anything to contribute to privacy and security, so looking for support.

phila: perhaps Kazue or yamdan have something to contribute here?
… Should I write to the CFRG group? I don't know anyone there myself.

<Zakim> manu, you wanted to offer /some/ text -- which we know about... and ask Chairs to reach out to IETF CFRG.

manu: On the S&C considerations, we have at least two sections to add there.
… One around BBS signatures and why HMAC.
… We should also point to the published mathematical proof, as well as Aiden's work to indicate that there has been mathematical review.
… WRT contacting CFRG, probably best if the chairs reach out rather than some other member.
… Wendy S used to be the link, but that has changed ...

ACTION: phila to reach out to CFRG

yamdan: As for S&P section, I can also contribute something from the viewpoint of the user of the C14N algorithm.

<manu> ooh, poison graphs is another good addition to security section!

yamdan: As for security, my understanding of C14N is not enough to contribute. Particularly for "poison graph" attacks. This requires some deep knowledge and specific types of "bad" graphs.
… iherman had already referenced some papers, which is important for the section.

markus_sabadello: I'm sure you and kazue have deep knowledge on the privay aspects, so that would be very valuable.

phila: for the process, getting these sections complete are really important.

markus_sabadello: We were just talking about security and poison graphs, which leads into the next topic.

Comparison and confidence in algorithms, with invited guest Aidan Hogan

<yamdan> "academic paper about poison graphs" I mentioned is https://arxiv.org/abs/1705.03686 , referred by Aidan in w3c/rdf-canon#10 (comment)

markus_sabadello: As a reminder, we discussed the two algorithms at the kick-off if this group.
… We had presentations from both dlongley and aidhog.
… We've begun work on dlongley's algorithm, but we've had open issues on comparing the algorithms.
… Aiden commented on #6 and #10 already.

aidhog: WRT #10, there's a great list of comparison points.
… We know that both algorithms are similar, and Ivan has implemented both.
… It's not clear to me which is more efficient, and what does "common dataset" mean.
… The only way to determine this would be to select some representative datasets and run against each algorithm.
… We know the poison datasets are something to work out.
… Both algorithms have had mathematical proofs, and I think the one done on dlongley's was particularly good.
… Of course, they stoped short of saying that "we're certain this is correct"
… Also, Dave's addresses dataset and not graphs.
… I'd suggest that RDF-star use cases be reified into RDF datasets and canonicalize that rather than trying to extend the algorithms.
… On correctness: I'm certain that it's possible to define an algorithm that handles these.
… I'm highly confident that both algorithms work correctly.
… But, it depends on the risk/benefit analysis in case something goes wrong.
… What is sufficient due diligence before we can go forward.
… For example, there is a graph isomorphism algorithm that was widely thought to be correct, before it was found not to be.
… But, I think it's highly likely that both algorithms are fine.
… I don't think you can say that just because algorithms have peer review that they're necessarily correct.
… To convince people, we should probably avoid people and look at a proof verification system.
… That would be the ultimate of due diligence. But, this would be a slow and difficult process.
… On poison graphs, there will always be some that take a long time barring some quantum breakthrough.

markus_sabadello: Thanks, this is really helpful. Not all of us have such deep technical knowledge.

<Zakim> manu, you wanted to comment on time/resource complexity has come up in other groups as well. and to note "backup c14n algorithm"

manu: I agree with everything Aiden says.
… On the time/resource complexity question, this has come up in other groups.
… For example, people are complaining that RDF C14N is slow.
… We do have a benchmark which shows just the opposite. RDF C14N is 100's to 1000's of times faster than JSON Schema, for example.
… There's quite a lot of misrepresentation on how slow or fast these algorithms are.
… We do have a benchmark sure (from 5 years ago).
… We have other datasets that are either going into production are are already there. But people may still claim that it's not representative of their datasets.
… +1 for finding time to do benchmarking for the algorithms.
… Regarding formal verification: +1 to what Aiden said.
… We should try to find someone or some group to do a formal proof, because there will always be nay-sayers.
… Finding the people with the expertise and time to do this will be challenging.
… In parallel, we can look at alternative schemas such as JCS (JSON C14N).
… This might be easier to prove, but could be a backup scheme.
… If our C14N scheme fails, we should have a backup.
… You would loose all kinds of features (selective disclosure, etc.)
… We could suggest that if rdf-canon fails, the fallback is a non-RDF C14N scheme (JCS or straight hashing of JSON-LD)

<manu> oh, AND, we could make aiden's alg the backup (which is what I meant to lead with)! :)

<aidhog> Coq

pchampin: I happen to be in contact with a group of people that know Coq.
… I can reach out to them and know if they might be interested in working on this.

<TallTed> see http://dbpedia.org/resource/Coq

aidhog: I think it would be great to have a benchmark.

<manu> Here's the current benchmark DB has: https://github.com/digitalbazaar/rdf-canonize/tree/main/benchmark

aidhog: I think Ted mentioned some datasets to consider.
… In the case of my algorithm, it runs in memory, which is a MUST.
… Most RDF graphs are not anything like graphs you see in isomorphism problems.
… There are graphs that will run in milliseconds, and others that will never complete.
… Manu asked about a backup, but there's a question about damage done if the original one fails.
… It would have an effect on the trust of the group, if not the W3C.
… I don't think it's necessary to be super formal, but if the damage done by failing would be critical, we should look further.
… I'm not sure this is all compatible with the timeframe of the group.

<Zakim> manu, you wanted to note "cost and damage" of initial failing... it's "bad"... very bad. :P

manu: It would be very, very bad if it was shown to be a problem after deployment.
… It's being used for things like personal identification by government agencies.
… That would be Trillions of dollars of trade documents.
… We should do what we can, but the timeline question is key.
… Having a backup we can switch to is what organizations should be doing.
… We could do that in parallel to formal verification.
… If there was a problem, we could rescind, or extend the group's lifetime until it's happened.

markus_sabadello: We've been pursuing a parallel path from the beginning.
… We can look at other things to work on at the same time.
… Thanks to Aiden.

F2F meetings and W3C TPAC

markus_sabadello: The VCWG has an F2F in a couple of weeks, so we maybe skip the next meeting on 15 Feb.
… We can also have our own F2F, but there didn't seem to be much support for that last time.

<manu> +1 to skip next meeting due to VCWG F2F meeting.

markus_sabadello: Next meeting in four weeks. on 1 March.

gkellogg: we need to consider the outputs of the algorithm, hash and selective disclosure.

Summary of action items

  1. phila to reach out to CFRG
Minutes manually created (not a transcript), formatted by scribe.perl version 210 (Wed Jan 11 19:21:32 2023 UTC).

Diagnostics

Succeeded: s/Envelops/Envelopes/

Succeeded: s/we have/DB has/

Maybe present: markus_sabadello

All speakers: aidhog, gkellogg, manu, markus_sabadello, pchampin, phila, yamdan

Active on IRC: aidhog, dlehn, gkellogg, Kazue, manu, markus_sabadello, pchampin, phila, TallTed, yamdan