The Shortcomings of Full-Text Searching, by Jeffrey Beall of the University of Colorado Denver.
- The synonym problem.
- Obsolete terms.
- The homonym problem.
- Spamming.
- Inability to narrow searches by facets.
- Inability to sort search results.
- The aboutness problem.
- Figurative language.
- Search words not in web page.
- Abstract topics.
- Paired topics.
- Word lists.
- The Dark Web.
- Non-textual things.
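
As a rough illustration of the first and third items (synonyms and homonyms), here is a minimal sketch of a naive full-text matcher. The three-document corpus and the names `DOCS`, `tokenize`, and `full_text_search` are invented for this example and do not come from Beall's slides; real search engines are far more elaborate, but exact word matching fails in the same way.

```python
import re

# Hypothetical three-document corpus, invented for illustration only.
DOCS = {
    "d1": "The automobile was parked by the river bank.",
    "d2": "She bought a new car last spring.",
    "d3": "The bank approved the loan on Friday.",
}


def tokenize(text):
    """Lowercase the text and return its set of alphabetic word tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def full_text_search(query, docs):
    """Return the sorted ids of documents that contain every query word."""
    terms = tokenize(query)
    return sorted(
        doc_id for doc_id, text in docs.items() if terms <= tokenize(text)
    )


# Synonym problem: d1 is about a car but never uses the word "car".
print(full_text_search("car", DOCS))   # ['d2']

# Homonym problem: "bank" matches a riverbank and a financial institution.
print(full_text_search("bank", DOCS))  # ['d1', 'd3']
```

The matcher only compares strings, so it has no way to know that "automobile" and "car" name the same thing, or that the two senses of "bank" are different subjects.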
Questions:
- Watch the slide presentation.
- Can you give three examples of each shortcoming? (excluding #5 and #6, the facet and sorting items, which strike me as interface issues, not searching issues)
- How would you “solve” the word list issue? (Don’t assume quantum computing, etc. There are simpler answers.)
- Is metadata the only approach for “non-textual things”? Can you cite three papers offering other approaches?