The work in MARMOTTA-438 introduces a new legal issue: there we are caching the test cases in Turtle.
The original HTML+RDFa document is https://dvcs.w3.org/hg/ldpwg/raw-file/default/Test%20Cases/LDP%20Test%20Cases.html
And this is the generated RDF content in Turtle we are caching for implementing our test suite: http://www.w3.org/2012/pyRdfa/extract?uri=https%3A%2F%2Fdvcs.w3.org%2Fhg%2Fldpwg%2Fraw-file%2Fb3683634c29f%2FTest%2520Cases%2FLDP%2520Test%2520Cases.html&rdfa_lite=false&vocab_expansion=false&embedded_rdf=true&validate=yes&space_preserve=true&vocab_cache_report=false&vocab_cache_bypass=false
The document, and therefore the data, is under the W3C Document License: http://www.w3.org/Consortium/Legal/2002/copyright-documents-20021231
It contains a paragraph that could be problematic:
No right to create modifications or derivatives of W3C documents is granted pursuant to this license. However, if additional requirements (documented in the Copyright FAQ) are satisfied, the right to create modifications or derivatives is sometimes granted by the W3C to individuals complying with those requirements.
The question is: is the RDFa->Turtle format transformation considered a modification? I'll ask at LEGAL.
A possible workaround could be to fetch the data live before running the test suite, instead of caching it.
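A rough sketch of the live-fetch idea, in Python only for brevity. It rebuilds the pyRdfa extraction URL quoted above from the source document URI and downloads the Turtle at test-setup time, so nothing derived from the W3C document is stored in our repository. The function names (build_extract_url, fetch_test_cases), the timeout, and the UTF-8 decoding are assumptions for illustration, not existing Marmotta code, and it assumes the pyRdfa service is reachable when the tests run.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and source document are the ones quoted in this comment.
PYRDFA_EXTRACT = "http://www.w3.org/2012/pyRdfa/extract"
TEST_CASES_HTML = (
    "https://dvcs.w3.org/hg/ldpwg/raw-file/default/"
    "Test%20Cases/LDP%20Test%20Cases.html"
)


def build_extract_url(source_uri):
    """Build the pyRdfa URL with the same query parameters as the cached one."""
    params = {
        "uri": source_uri,
        "rdfa_lite": "false",
        "vocab_expansion": "false",
        "embedded_rdf": "true",
        "validate": "yes",
        "space_preserve": "true",
        "vocab_cache_report": "false",
        "vocab_cache_bypass": "false",
    }
    # urlencode percent-encodes the source URI, including its own '%' signs.
    return PYRDFA_EXTRACT + "?" + urlencode(params)


def fetch_test_cases(source_uri=TEST_CASES_HTML, timeout=30):
    """Hypothetical helper: download the Turtle just before the suite runs."""
    with urlopen(build_extract_url(source_uri), timeout=timeout) as response:
        # Assumption: the service answers with UTF-8 encoded Turtle.
        return response.read().decode("utf-8")
```

The trade-off is that the test suite would then need network access and would depend on the availability of the W3C service, so the tests should probably be skipped rather than failed when the fetch does not succeed.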