Evaluating Entity Linking with Wikipedia by Ben Hachey, et al.
Abstract:
Named Entity Linking (NEL) grounds entity mentions to their corresponding node in a Knowledge Base (KB). Recently, a number of systems have been proposed for linking entity mentions in text to Wikipedia pages. Such systems typically search for candidate entities and then disambiguate them, returning either the best candidate or NIL. However, comparison has focused on disambiguation accuracy, making it difficult to determine how search impacts performance. Furthermore, important approaches from the literature have not been systematically compared on standard data sets.
We reimplement three seminal NEL systems and present a detailed evaluation of search strategies. Our experiments find that coreference and acronym handling lead to substantial improvement, and search strategies account for much of the variation between systems. This is an interesting finding, because these aspects of the problem have often been neglected in the literature, which has focused largely on complex candidate ranking algorithms.
A very deep survey of entity linking literature (including record linkage) and implementation of three complete entity linking systems for comparison.
At forty-eight (48) pages it isn’t a quick read but should be your starting point for pushing the boundaries on entity linking research.
I first saw this in a tweet by Alyona Medelyan.