Details
-
Wish
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Apache Rat and Apache Whiskers are tools which help people to audit and comprehend licenses within releases.
Both Rat and Whiskers lack both a comprehensive library and consistent terminology for licenses: a license lexicon. Though this ticket is raised under Rat, a successful GSOC would hopefully lead to the development of an independent product.
This GSOC project would both compile a set of meta-data, plus perhaps support libraries which could be reused both by these projects and more widely. Collate comprehensive license meta-data, beginning with all OSI approved open source licenses and working outwards to include other common binary only licenses. This meta-data might include:
- descriptions of common source boilerplates
- generation and recognition templates
- consistent language, both machine and human readable
This is most suitable for students whose interests focus more in semantics than coding. The scope and direction are negotiable.
For students interested in text parsing, this meta-data could be used to create fast parsers. For students interested in technology law, the semantics could be extend to include relationships between licenses. For students more interested in informatics, there would be an opportunity to work on a comprehensive collection. For all students, this is an opportunity to showcase communication skills.