Book-NLP: Natural language processing pipeline for book-length documents.
From the webpage:
BookNLP is a natural language processing pipeline that scales to books and other long documents (in English), including:
- Part-of-speech tagging (Stanford)
- Dependency parsing (MaltParser)
- Named entity recognition (Stanford)
- Character name clustering (e.g., “Tom”, “Tom Sawyer”, “Mr. Sawyer”, “Thomas Sawyer” -> TOM_SAWYER)
- Quotation speaker identification
- Pronominal coreference resolution
I can think of several classes of documents where this would be useful. Congressional hearing documents for example. Agency reports would be another.
Not the final word for mapping but certainly an assist to an author.