Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

May 2, 2011

Neo4J – @emileifrem

Filed under: Neo4j,NoSQL — Patrick Durusau @ 10:31 am

Need a graph database like Twitter is built on? @neo4j delivers, @emileifrem tells why

Emil Eifrem is the CEO of Neo Technology and co-founder of the Neo4J project.

Nothing technical but an engaging overview of why Neo4J matters.

It was amusing when the interviewer asked about scaling.

Emil said the then current release goes up to 12 billion nodes.

Calls transparent (to the user) partitioning of a graph, for further scaling, a “non-trivial CS problem.”

But also says that will be solved in Neo4J 2.0.

Triplification Challenge 2011

Filed under: Conferences,RDF — Patrick Durusau @ 10:31 am

Triplification Challenge 2011

From the website:

The yearly organized Linked Data Triplification Challenge awards prizes to the most promising application demonstrations and approaches in three fields related to Linked Data.

For the success of the Semantic Web it is from our point of view crucial to overcome the chicken-and-egg problem of missing semantic representations on the Web and the lack of their utilization within concrete applications, to solve real-world problems. The Triplification Challenge aims to expedite this process by raising awareness and showcasing best practices.

3.000 € in prize money will be awarded to the winners of the open track and the special Open Government Data track.

The challenge is open to anyone interested in applying Semantic Web and Linked Data technologies. This might include students, developers, researchers, and people from industry. Individual or group submissions are both acceptable.

Could be an interesting opportunity to expose topic maps as triples. Not to mention an attractive prize!

Important dates:

Extended Submission Deadline: May 30, 2011

Notification of Acceptance: June 27, 2011

Camera-Ready Paper: July 18, 2011

I-SEMANTICS 2011: September 7 – 9, 2011

May 1, 2011

That’s What She Said: Double Entendre Identification

Filed under: Ambiguity,Metaphors — Patrick Durusau @ 5:26 pm

That’s What She Said: Double Entendre Identification by Chloé Kiddon and Yuriy Brun.

Abstract:

Humor identification is a hard natural language understanding problem. We identify a subproblem — the “that’s what she said” problem—with two distinguishing characteristics: (1) use of nouns that are euphemisms for sexually explicit nouns and (2) structure common in the erotic domain. We address this problem in a classification approach that includes features that model those two characteristics. Experiments on web data demonstrate that our approach improves precision by 12% over baseline techniques that use only word-based features.

A highly entertaining paper that examines a particular type of double entendre, which is itself a particular type of metaphor.

The authors note:

A “that’s what she said” (TWSS) joke is a type of double entendre. A double entendre, or adianoeta, is an expression that can be understood in two different ways: an innocuous, straightforward way, given the context, and a risqué way that indirectly alludes to a different, indecent context. To our knowledge, related research has not studied the task of identifying double entendres in text or speech. The task is complex and would require both deep semantic and cultural understanding to recognize the vast array of double entendres. We focus on a subtask of double entendre identification: TWSS recognition. We say a sentence is a TWSS if it is funny to follow that sentence with “that’s what she said”. (emphasis added)

It would be interesting to see a crowd-sourced topic map project on double entendre.

BTW, strictly for non-office enjoyment, see: TWSS, a site that collects TWSS stories.

Installing and using Apache Cassandra With Java Part 1 (Installation)

Filed under: Cassandra,NoSQL — Patrick Durusau @ 5:25 pm

Installing and using Apache Cassandra With Java Part 1 (Installation)

This series starts here and goes for five (5) parts for Cassandra 0.6.4.

From the introduction:

I’m going to write a few postings on how to use the Cassandra database with Java, although i am in no way an expert on how to use Cassandra i am very intrigued about the database because of it’s small installation, high performance and scalability. During the writing of these posts i am also learning the Cassandra database and i’m sharing my experiences with it through my posts on this blog.

Like i said before, Cassandra is a very high performing and scalable database, it doesn’t follow the normal SQL database principles like schema’s, tables / columns, datatypes and a query language like SQL. Instead it’s a non-relational database similar to Google’s BigTable. Cassandra was initially developed by Facebook which has contributed it to the open source community. Currently it is used by websites like Facebook, Twitter, Digg, Rackspace and many others. So even though it is still only version 0.6 at the time of writing this it has already proven itself in production environments.

It isn’t possible to say which (if any) of the NoSQL databases will prove to be the best fits for topic maps in particular or general situations.

What is clear is that a lot of experimentation and development is underway and hopefully the results will be interesting.

Ontology: A Practical Guide

Filed under: Ontology — Patrick Durusau @ 5:24 pm

Ontology: A Practical Guide by Adam Pease.

From the announcement:

This new book reports on a decade of work developing SUMO and its associated tools, models and domain ontologies. Written for a wide audience, it should be accessible to anyone with a general computer science background. It includes introductions to topics such as formal theorem proving and the properties of different formal knowledge representation languages.

The book is suitable as a self-study guide for the professional, student or researcher. It also includes a number of exercises with selected answers, making it appropriate as a textbook for a senior year or graduate level course in AI knowledge representation.

Adam Pease, is the Technical Editor for the Suggested Upper Merged Ontology (SUMO) project.

I am waiting for my copy to arrive! More comments to follow.

Overview of Neo4j.rb 1.0.0

Filed under: Graphs,Neo4j,Ruby — Patrick Durusau @ 5:23 pm

Overview of Neo4j.rb 1.0.0

While specific for JRuby, this is a very nice set of guides, examples and documentation that will benefit anyone using Neo4J.

BTW, the main page for Neo4j.rb.

You may also be interested in Introduction to Neo4j.rb. (Note, the first slide really is blank except for “JAYWAY.” Just go to the next slide.)

« Newer Posts

Powered by WordPress