Day 14: Stanford NER–How To Setup Your Own Name, Entity, and Recognition Server in the Cloud by Shekhar Gulati.
From the post:
I am not a huge fan of machine learning or natural text processing (NLP) but I always have ideas in mind which require them. The idea that I will explore during this post is the ability to build a real time job search engine using twitter data. Tweets will contain the name of the company which if offering a job, the location of the job, and name of the contact person at the company. This requires us to parse the tweet for Person, Location, and Organisation. This type of problem falls under Named Entity Recognition.
A continuation of Shekhar’s Learning 30 Technologies in 30 Days… but one that merits a special shout out.
In part because you can consume the entities that other “recognize” or you can be in control of the recognition process.
It isn’t easy but on the other hand, it isn’t free from hidden choices and selection biases.
I would prefer those were my hidden choices and selection biases, if you don’t mind. 😉