Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

June 29, 2011

NoSQL should be in your business…

Filed under: MongoDB,NoSQL — Patrick Durusau @ 9:03 am

NoSQL should be in your business, and MongoDB could lead the way by Savio Rodrigues.

From the post:

NoSQL is still not well understood, as a term or a database market category, by IT decision makers. However, one NoSQL vendor — 10gen, creators of the open source MongoDB — appears to be growing into enterprise accounts and distancing itself from competitors. If you’re considering, or curious about, NoSQL databases, I recommend you spend some time looking at MongoDB.

One important fact is that the demand for Mongo and MongoDB skills is getting larger.

If you are looking for more information on MongoDB, check out www.mongodb.org and www.10gen.com but in particular see: www.10gen.com/presentations.

I saw the notice about the videos in an email alert so had to delete all the tracking URL crap, then delete the path to the specific video with all its tracking crap, then I was able to give you a link to the page with the videos so you could make your own choice. Less tracking, more choice. That sounds like a better plan.

May 12, 2011

Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison

Filed under: Cassandra,CouchDB,HBase,MongoDB,NoSQL,Redis,Riak — Patrick Durusau @ 7:56 am

Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison

Good thumb-nail comparison of the major features of all six (6) NoSQL databases by Kristóf Kovács.

Sorry to see that Neo4J didn’t make the comparison.

April 8, 2011

MongoDB Videos

Filed under: MongoDB — Patrick Durusau @ 7:20 pm

MongoDB Videos

Since no doubt your favorite contestant has been voted off American Idol, some MongoDB videos to pass the weekend. 😉

March 30, 2011

The Little MongoDB Book

Filed under: MongoDB,NoSQL — Patrick Durusau @ 12:35 pm

The Little MongoDB Book

From the webpage:

I’m happy to freely release The Little MongoDB Book; an ebook meant to help people get familiar with MongoDB and answer some of the more common questions they have.

Not complete but a useful short treatment.

March 29, 2011

MongoDB Manual

Filed under: Database,MongoDB,NoSQL — Patrick Durusau @ 12:46 pm

MongoDB Manual

More of a placeholder for myself than anything else.

I am going to create a page of links to the documentation for all the popular DB projects.

MongoDB with Style

Filed under: MongoDB,NoSQL — Patrick Durusau @ 12:46 pm

MongoDB with Style

One of the more amusing introductions to use of MongoDB.

March 19, 2011

Implementing Replica Set Priorities

Filed under: MongoDB,NoSQL — Patrick Durusau @ 5:54 pm

Implementing Replica Set Priorities, Kristina Chodorow explains replica sets:

Replica set priorities will, very shortly, be allowed to vary between 0.0 and 100.0. The member with the highest priority that can reach a majority of the set will be elected master. (The change is done and works, but is being held up by 1.8.0… look for it after that release.) Implementing priorities was kind of an interesting problem, so I thought people might be interested in how it works. Following in the grand distributed system lit tradition I’m using the island nation of Replicos to demonstrate.

Should be of interest for anyone planning distributed topic map stores/distributions.

March 17, 2011

The Joy of Indexing

Filed under: Indexing,MongoDB,NoSQL,Subject Identity — Patrick Durusau @ 6:52 pm

The Joy of Indexing

Interesting blog post on indexing by Kyle Banker of MongoDB.

Recommended in part to understanding the limits of traditional indexing.

Ask yourself, what is the index in Kyle’s examples indexing?

Kyle says the example are indexing recipes but is that really true?

Or is it the case that the index is indexing the occurrence of a string at a location in the text?

Not exactly the same thing.

That is to say there is a difference between a token that appears in a text and a subject we think about when we see that token.

It is what enables us to say that two or more words that are spelled differently are synonyms.

Something other that the two words as strings is what we are relying on to make the claim they are synonyms.

A traditional indexing engine, of the sort described here, can only index the strings it encounters in the text.

What would be more useful would be an indexing engine that indexed the subjects in a text.

I think we would call such a subject-indexing engine a topic map engine. Yes?

Questions:

  1. Do you agree/disagree that a word indexing engine is not a subject indexing engine? (3-5 pages, no citations)
  2. What would you change about a word indexing engine (if anything) to make it a subject indexing engine? (3-5 pages, no citations)
  3. What texts/subjects would you use as test cases for your engine? (3-5 pages, citations of the test documents)

MongoDB 1.8 Released

Filed under: MongoDB,NoSQL — Patrick Durusau @ 6:49 pm

MongoDB 1.8 Released

Highlights from the announcement:

March 12, 2011

Social Analytics on MongoDB

Filed under: Analytics,MongoDB,NoSQL — Patrick Durusau @ 6:48 pm

Social Analytics on MongoDB, Patrick Stokes of Buddy Media does a highly entertaining presentation on MongoDB and its adoption by Buddy Media.

Unfortunately the slides don’t display during the presentation.

Still, refreshing in the honesty about the development process.

PS: I have written to ask about where to find the slides.

Update

You can find the slides at: http://www.slideshare.net/pstokes2/social-analytics-with-mongodb

Learn MongoDB Basics

Filed under: MongoDB,NoSQL — Patrick Durusau @ 6:47 pm

Learn MongoDB Basics

Covers the basics of the MongoDB.

One nice aspect is the immediate feedback/reinforcement of the principles being taught.

I can imagine someone creating a resource for topic maps along these lines.

Hopefully both a web as well as local version.

I don’t think the benefits of using topic maps are in dispute.

What is unclear is how to convey those benefits to user?

Comments/suggestions?

Questions:

What aspects of the Learn MongoDB site did you find the least/most helpul? (2-3 pages, no citations)
  • How would you construct a topic map tutorial using this as a guide? (4-6 pages, no citations)
  • What other illustrations would you use to convey the advantages of topic maps? (4-6 pages no citations)
  • March 10, 2011

    MongoDB Development Prize!

    Filed under: MongoDB,NoSQL,Topic Maps — Patrick Durusau @ 11:39 am

    MongoDB Development Prize!

    From the website:

    10gen is pleased to announce the first MongoDB developer blog contest of 2011. We’ll be announcing the contest categories at the beginning of each month, and the 10gen engineering team will pick the winner. We hope that these contests will be a way for developers to share their experiences with MongoDB and learn from one another. And we are giving away some pretty cool prizes 🙂 This month we’re teaming up with (mt) Media Temple, who is offering free hosting to the winner.

    Check out the blog for the March contest.

    Could be a good way to get some PR for topic maps, your project, not to mention yourself.

    March 8, 2011

    Summify’s Technology Examined

    Filed under: Data Analysis,Data Mining,MongoDB,MySQL,Redis — Patrick Durusau @ 9:54 am

    Summify’s Technology Examined

    Phil Whelan writes an interesting review of the underlying technology for Summify.

    Many those same components are relevant to the construction of topic map based services.

    Interesting that Summify uses MySQL, Redis and MongoDB.

    I rather like the idea of using the best tool for a particular job.

    Worth a close read.

    March 3, 2011

    MongoDB 1.8 Released!

    Filed under: MongoDB,NoSQL — Patrick Durusau @ 10:12 am

    MongoDB 1.8 Released

    Release notes for MongoDB 1.8.

    Incremental map/reduce supported to enable incremental updating of collections.

    Reminds me to ask about incremental updating of topic maps.

    Introduction to MongoDB – Post

    Filed under: MongoDB,NoSQL — Patrick Durusau @ 10:06 am

    Introduction to MongoDB

    If you want a quick introduction to MongoDB with some lite examples, this post is for you.

    March 1, 2011

    Social Data and Log Analysis Using MongoDB

    Filed under: Data Mining,Log Analysis,MongoDB — Patrick Durusau @ 11:33 am

    Social Data and Log Analysis Using MongoDB

    Interesting use of MongoDB.

    Work through the slide deck and consider the following questions along the way:

    1. How would your analysis of the logs (the process of analysis) be different if you were using topic maps?
    2. How would your results from #1 be different?
    3. Choose a set of logs and test your answers to #1 and #2.

    (Credit will be equally rewarded whether #3 confirms or contradicts your analysis in #1 and #2. The purpose of the exercise is to develop a “fee” for fruitful areas of investigation.)

    NoSQL Databases: Why, what and when

    NoSQL Databases: Why, what and when by Lorenzo Alberton.

    When I posted RDBMS in the Social Networks Age I did not anticipate returning the very next day with another slide deck from Lorenzo. But, after viewing this slide deck, I just had to post it.

    It is a very good overview of NoSQL databases and their underlying principles, with useful graphics as well (as opposed to the other kind).

    I am going to have to study his graphic technique in hopes of applying it to the semantic issues that are at the core of topic maps.

    February 20, 2011

    On Distributed Consistency — Part 1 (MongoDB)

    Filed under: Consistency,Distributed Consistency,MongoDB,NoSQL — Patrick Durusau @ 1:04 pm

    On Distributed Consistency — Part 1 (MongoDB)

    The first of a six part series on consistency in distributed databases.

    From the website:

    See also:

    • Part 2 – Eventual Consistency
    • Part 3 – Network Partitions
    • Part 4 – Multi Data Center
    • Part 5 – Multi Writer Eventual Consistency
    • Part 6 – Consistency Chart

    For distributed databases, consistency models are a topic of huge importance. We’d like to delve a bit deeper on this topic with a series of articles, discussing subjects such as what model is right for a particular use case. Please jump in and help us in the comments.

    Consistency is an issue that will confront distributed topic maps so best to start learning the options now.

    February 18, 2011

    Large Scale Packet Dump Analysis with MongoDB

    Filed under: Marketing,MongoDB,Subject Identity — Patrick Durusau @ 6:51 am

    Large Scale Packet Dump Analysis with MongoDB

    I mention this because it occurs to me that distributed topic maps could be a way to track elusive web traffic that passes through any number of servers from one location to another.

    I will have to pull Stevens’ TCP/IP Illustrated off the shelf to look up the details.

    Thinking that subject identity in this case would be packet content and not the usual identifiers.

    And that with distributed topic maps, no one map would have to process all the load.

    Instead, upon request, delivering up proxies to be merged with other proxies, which could then be displayed as partial paths through the next works with the servers where changes took place being marked.

    The upper level topic maps being responsible for processing summaries of summaries of data, but with the ability to drill back down into the actual data.

    True, there is a lot of traffic, but simply by dumping all the porn, that reduces the problem by a considerable percentage. I am sure there are other data collection improvements that could be made.

    February 17, 2011

    MongoDB – Usage Posts

    Filed under: MongoDB — Patrick Durusau @ 6:52 am

    MongoDB

    A series of posts and presentations on use of MongoDB by Boxed Ice.

    From the website:

    Post index

    Presentation slides:

    Good resource on MongoDB and an example of a type of posting that be useful for topic maps.

    As much a note to myself as anyone, thinking a series of posts on a data set of general interest might attract favorable attention.

    February 16, 2011

    Wordnik – 10 million API Requests a Day on MongoDB and Scala – Post

    Filed under: MongoDB,Scala — Patrick Durusau @ 1:08 pm

    Wordnik – 10 million API Requests a Day on MongoDB and Scala

    From the website:

    Wordnik is an online dictionary and language resource that has both a website and an API component. Their goal is to show you as much information as possible, as fast as we can find it, for every word in English, and to give you a place where you can make your own opinions about words known. As cool as that is, what is really cool is the information they share in their blog about their experiences building a web service. They’ve written an excellent series of articles and presentations you may find useful: (see the post)

    Of course, what I find fascinating is the “…make your own opinions about words known” aspect of the system.

    Even so, from a scaling standpoint, this sounds like an impressive bit of work.

    Definitely worth a look.

    February 13, 2011

    O’Reilly Book Sale

    Filed under: Books,MongoDB,NoSQL,Topic Maps — Patrick Durusau @ 7:14 am

    OK, ok, one more not-strictly topic map item and I promise no more for today!

    Buy 2, Get 1 Free at O’Reilly caught my eye this morning.

    I think it is justified to appear here for two reasons:

    1) It has a lot of books, such as those on databases, that are relevant to implementing topic map systems.

    But, just as importantly:

    2) The O’Reilly online catalog illustrates the need for topic maps.

    Look at the catalog listings under Other Databases for MongoDB (you may have heard about it). Now look under Database Design and Analysis. Opps! There you will find: MongoDB: The Definitive Guide. (at least as of 13 February 2010, 7:01 AM Eastern time)

    One way (not the only way) to implement a topic map here would result in a single source of updates across the catalog. And the catalog could also act as a resource pointer to other materials. The Other Resources for the MongoDB book, isn’t terribly inspiring.

    *****
    PS: I am hopeful the interest in NoSQL will drive greater exploration of MySQL, PostgresSQL and Oracle databases in general and as part of topic maps systems in particular.

    February 5, 2011

    Mapping Wikileaks’ Cablegate topics using Python, MongoDB, Neo4j and Gephi

    Filed under: Gephi,MongoDB,Neo4j — Patrick Durusau @ 7:55 am

    Mapping Wikileaks’ Cablegate topics using Python, MongoDB, Neo4j and Gephi

    Data and slides and movies while the conference is ongoing! Oh My!

    This is the sort of effort that topic maps needs to step up to and compete against.

    I have some thoughts on what that would take with the Afghan war diaries that I will be posting later today.

    February 2, 2011

    Mapping Wikileaks’ Cablegate using Python, mongoDB and Gephi – Saturday, 5 Feburary 2011

    Filed under: Gephi,MongoDB,Natural Language Processing — Patrick Durusau @ 10:34 am

    Mapping Wikileaks’ Cablegate using Python, mongoDB and Gephi

    From the website:

    Text analysis and graph visualization on the Wikileaks Cablegate dataset.

    We propose to present a complete work-flow of textual data analysis, from acquisition to visual exploration of a complex network. Through the presentation of a simple software specifically developed for this talk, we will cover a set of productive and widely used softwares and libraries in text analysis, then introduce some features of Gephi, an open-source network visualization & analysis software, using the data collected and transformed with cablegate-semnet.

    See: cablegate-semnet

    If you are in (or can be) Brussels, Belgium this coming Saturday and Sunday, don’t miss this presentation!

    There will be many others worthy of your attention as well.

    February 1, 2011

    A Short eBook on Scaling MongoDB

    Filed under: MongoDB,NoSQL — Patrick Durusau @ 7:52 am

    A Short eBook on Scaling MongoDB

    Kristina Chodorow’s blog, Snail in a Turtleneck announced a short eBook by Kristina.

    I haven’t read it, yet, but am sure to be doing so in the near future.

    Comments on the same are welcome!

    January 31, 2011

    MongoVUE

    Filed under: MongoDB,NoSQL — Patrick Durusau @ 1:49 pm

    MongoVUE

    From the website:

    What Is MongoVUE?

    • MongoVUE is a GUI (graphical user interface) application that helps you administer, develop and learn MongoDB.
    • MongoVUE is FREE to use.
    • To run properly, MongoVUE requires Microsoft .NET Framework 2.0 SP1 installed on your computer.

    Tools for working with NoSQL databases are starting to appear.

    Any thoughts on this one that you would like to share?

    January 26, 2011

    A Quick WebApp with Scala, MongoDB, Scalatra and Casbah – Practice for TMs

    Filed under: MongoDB,Scala,Software,Topic Maps — Patrick Durusau @ 8:41 am

    A Quick WebApp with Scala, MongoDB, Scalatra and Casbah

    However clever, topic maps aren’t of much interest unless they are delivered to users.

    In the general case that means a web based application.

    This post is a short introduction to several tools you may find handy with building and/or delivering topic maps.

    *****
    PS: We will know topic maps have arrived when the technology keeps changing but management of subject identity is inherent in both programming languages and application design. Ways to go yet.

    January 25, 2011

    Translate SQL to MongoDB MapReduce

    Filed under: MapReduce,MongoDB,SQL — Patrick Durusau @ 10:18 am

    Translate SQL to MongoDB MapReduce

    There is a growing sense that SQL vs. MapReduce or NoSQL is a question of fitness of the tool for the purpose at hand.

    If your problem is best solved by commodity hardware working in parallel, then NoSQL solutions may be the path to take.

    I have seen that expressed in a number of ways but not with a lot of detail on what factors drive the choice one way or the other.

    With enough detail, that could make both a very good guide and topic map for those faced with this sort of issue.

    First noticed on Alex Popescu’s myNoSQL blog.

    Yet another MongoDB Map Reduce tutorial – Post

    Filed under: MapReduce,MongoDB,NoSQL — Patrick Durusau @ 6:32 am

    Yet another MongoDB Map Reduce tutorial

    From the post:

    As the title says, this is yet-another-tutorial on Map Reduce using MongoDB. But two things that are different here:

    1. A problem solving approach is used, so we’ll take a problem, solve it in SQL first and then discuss Map Reduce.

    2. Lots of diagrams, so you’ll hopefully better understand how Map Reduce works.

    First noticed on Alex Popescu’s myNoSQL blog.

    January 23, 2011

    Rogue – Some Details

    Filed under: MongoDB,NoSQL — Patrick Durusau @ 1:41 pm

    Rogue: A Type-Safe Scala DSL for querying MongoDB a blog post from Foursquare that gives some examples and details on Rogue. More to follow.

    « Newer PostsOlder Posts »

    Powered by WordPress