René Pickhardt’s blog post title was: Slides of Related work application presented in the Graphdevroom at FOSDEM, which is unlikely to catch your eye. The paper title is: A neo4j powered social networking and Question & Answer application to enhance scientific communication.
I took the liberty of crafting a shorter title for this post. 😉
The problems René addresses are shared by all academics:
- Finding new relevant publications
- Connecting people interested in the same topic
This project is the result of the merger of the Open Citation and Related Work project, on which see: Open Citations and Related Work projects merge.
The terminology for the project components:
- Open Citations Corpus: data corpus
- Open Citations Corpus Datastore (OCCD): infrastructure of the data corpus
- Related Work: user-oriented services built on top of the citation data
Resources:
- Work-Related.net source code
- A neo4j powered social networking and Question & Answer application to enhance scientific communication. (slides)
- Free Open Data from Related-Work.net
- Related-work.net – Product Requirement Document released!
- Get the full neo4j power by using the Core Java API for traversing your Graph data base instead of Cypher Query Language
- Related-Work.net blog
You need to take a long look at the project in general but the data in particular.
From the data webpage:
We downloaded the source files of all arxiv articles published until 2012-09-31, extracted the references and matched them against the metadata using these python scripts. The result is a 2.0Gb sized *.txt file with more than 16m lines representing the citaiton graph in the following format:
Document level linking so there is still topic map work to be done merging the same subjects identified differently but this data set is certainly a “leg up” on that task.
We should all encourage if not actively contribute to the Related Work project.