Archive for the ‘Yahoo!’ Category

Streaming IN Hadoop: Yahoo! release Storm-YARN

Saturday, June 15th, 2013

Streaming IN Hadoop: Yahoo! release Storm-YARN by Jim Walker.

From the post:

Over the past year, customers have told us they want to store all their data in one place and interact with it in multiple ways… they want to use Hadoop, but in order to do so, it needs to extend beyond batch. It also needs to be interactive and real-time (among others).

This is the entire principle behind YARN, which together with others in the community, Arun Murthy and the team at Hortonworks have been working on for more than 5 years! The YARN based architecture of Hadoop 2.0 is hugely significant and we have been working closely with many partners to incorporate it into their applications.

Storm-YARN Released as Open Source

Yahoo! has been testing Hadoop 2 and its YARN-based architecture for quite some time. All the while they have worked on the convergence of the streaming framework Storm with Hadoop. This work has resulted in a YARN based version of Storm that will radically improve performance and resource management for streaming.

The release blog post from Yahoo.

Processing of data, even big data, is approaching “interactive and real-time,” although I suspect definitions of those terms vary. What is “interactive” for an automated trader might be too fast for human trader.

What I haven’t seen is concurrent development on the handling of the semantics of big data.

After the initial hysteria over the scope of NSA snooping, except for cases where the NSA was given the identity of a suspect (and not always then), was its data gathering of any use.

In topic map terms, the semantic impedance between the data systems was too great for useful manipulation of the data sets as one.

Streaming in Hadoop is welcome news, but until we can robustly manages the semantics of data in streams, much gold is going to pass uncollected from streams.

Introducing BOSS Geo – the next chapter for BOSS

Friday, September 28th, 2012

Introducing BOSS Geo – the next chapter for BOSS

From the post:

Today, the Yahoo! BOSS team is thrilled to announce BOSS Geo, new additions to our Search API that’s designed to help foster innovation in the search industry. BOSS Geo, comprised of two popular services – PlaceFinder and PlaceSpotter – now offers powerful, new geo services to BOSS developers.

Geo is increasingly important in today’s always-on, mobile world and adding features like these have been among the most requested we’ve received from our developers. With mobile devices becoming more pervasive, users everywhere want to be able to quickly pull up relevant geo information like maps or addresses. By adding PlaceFinder and PlaceSpotter to BOSS, we’re arming developers with rich new tools for driving more valuable and personalized interactions with their users.

PlaceFinder – Geocoding made simple

PlaceFinder is a geocoder (and reverse geocoder) service. The service helps developers convert an address into a latitude/longitude and alternatively, if you provide a latitude/longitude it can resolve it to an address. Whether you are building a check-in service or want to show an address on a map, we’ve got you covered. PlaceFinder already powers several popular applications like foursquare. which uses it to power check-ins on their mobile application. BOSS PlaceFinder offers tiered pricing and one simple monthly bill.

(graphics omitted)

PlaceSpotter – Adding location awareness to your content

The PlaceSpotter API (formerly known as PlaceMaker) allows developers to take a piece of content, pull rich information about the locations mentioned and provide meaning to those locations. A news article is no longer just text but has rich, meaningful geographical information associated with it. For instance, the next time your users are reading a review of a cool new coffee shop in the Mission neighborhood in San Francisco, they can discover another article about a hip new bakery in the same neighborhood. Learn more on the new PlaceSpotter service.

What information would you merge using address data as a link point?

Amsterdam (Netherlands) is included. Perhaps sexual preferences in multiple languages, keyed to your cell phone’s location? (Or does that exist already?)

BTW,

We intend to shut down the current free versions of PlaceFinder and PlaceMaker on November 17, 2012.

Development using YQL tables will still be available.

Yahoo! Search Scientists Break New Ground on Search Results

Saturday, March 17th, 2012

Yahoo! Search Scientists Break New Ground on Search Results

From the post:

Understanding a person’s intent when searching on the web is critical to the quality of search results offered and at Yahoo! Search, the science team is constantly working to refine our technology and provide people with more relevant answers, not links, to their search query.

Recently, Yahoo! Search scientists built a new search platform from the ground up with machine learning technology that improves Yahoo!’s vertical intent triggering system and, as a result, our ability to better anticipate the needs of the individual user as he or she searches online. With this new platform, our search algorithm has the ability to adapt to what users are really interested in, by continuously monitoring how they engage with the search results. The system then continuously and automatically improves itself to provide the most engaging web search experience.

This technology was recently launched for news and movie search queries, two categories that tested extremely well with the technology. For example, with breaking news search terms constantly changing, humans can’t instantly track which queries are now breaking news stories. The intended result for a user can change for the same search query on a daily or even hourly basis. The technology can determine what the users are looking for and bring it to the top. And the key results that may have been at the top this morning, can be moved to the middle of the search results page at the end of day if user behavior determines other content is now more relevant.

Based on the positive feedback we’ve received in testing this platform for news and movie searches, we plan to roll out this new technology to support shopping, local, travel and mobile searches in the coming months, as well as other experiences across the Yahoo! network.

There wasn’t enough information in the post to evaluate the claims of improvement. I tried to post a comment asking when more details will appear but it was with FireFox on Ubuntu so it may not have taken.

If you know what Yahoo! has done differently and can say what it is, please do. I am sure we would all like to know.

As you know, enabling users to state their intent seems like a better strategy to me. At least better than simply running the numbers like a network rating sweeps. It works, but only just.