Another Word For It Patrick Durusau on Topic Maps and Semantic Diversity

September 16, 2013

Cassandra – A Decentralized Structured Storage System [Annotated]

Filed under: Cassandra,CQL - Cassandra Query Language,NoSQL — Patrick Durusau @ 4:11 pm

Cassandra – A Decentralized Structured Storage System by Avinash Lakshman, Facebook and Prashant Malik, Facebook.

Abstract:

Cassandra is a distributed storage system for managing very large amounts of structured data spread out across many commodity servers, while providing highly available service with no single point of failure. Cassandra aims to run on top of an infrastructure of hundreds of nodes (possibly spread across different data centers). At this scale, small and large components fail continuously. The way Cassandra manages the persistent state in the face of these failures drives the reliability and scalability of the software systems relying on this service. While in many ways Cassandra resembles a database and shares many design and implementation strategies therewith, Cassandra does not support a full relational data model; instead, it provides clients with a simple data model that supports dynamic control over data layout and format. Cassandra system was designed to run on cheap commodity hardware and handle high write throughput while not sacrificing read efficiency.

Annotated version of the original 2009 Cassandra paper.

Not a guide to future technology but a very interesting read about how Cassandra arrived at the present.

No Comments

No comments yet.

RSS feed for comments on this post.

Sorry, the comment form is closed at this time.

Powered by WordPress