Emre Sokullu
Sep 26
Note to Self: the difference between Hive and Cassandra
Sat 26 Sep 2009 15:38:31 | 1 comment
After long hours of research and a heated debate at our office, I've finally come to a conclusion on the difference between two fairly new open source projects that are also used at Facebook: Hive and Cassandra.

Cassandra: (roughly comparable to BigTable and HBase) saves computational power and time by indexing with a column-based approach, but is a resource hog when it comes to storage.
Hive: doesn't consume that much storage space, but responds more slowly.
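
To make the trade-off concrete, here is a toy Python sketch. It uses neither Hive nor Cassandra; the data and the function names (scan_lookup, build_index, indexed_lookup) are made up for illustration. It only shows the general idea: scanning needs no extra storage but touches every row, while a keyed index answers quickly at the cost of holding a second copy of the data.

```python
from collections import defaultdict

# Toy data: (user, message) rows, standing in for a large table.
rows = [
    ("alice", "hi"),
    ("bob", "lunch?"),
    ("alice", "see you at 5"),
    ("carol", "ok"),
]

def scan_lookup(rows, user):
    """Scan-style read: no extra storage, but every row is examined."""
    return [msg for u, msg in rows if u == user]

def build_index(rows):
    """Index-style layout: a second copy of the data, keyed by user."""
    index = defaultdict(list)
    for u, msg in rows:
        index[u].append(msg)
    return index

def indexed_lookup(index, user):
    """Keyed read: one dictionary lookup instead of a full scan."""
    return index.get(user, [])

index = build_index(rows)
```

Both lookups return the same messages for a given user; the difference is where the cost lands (read time versus storage).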

So how are they used at Facebook?

As far as we were able to figure out, friend recommendations are produced by Hive, with the Hive computations run by cron jobs. Cassandra is used where immediate responsiveness is a requirement, for example in the inbox system.
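
A rough sketch of that batch-then-serve pattern, in plain Python rather than Hive or Cassandra. Everything here is an assumption for illustration: the friendship data, the function names, and especially the "count mutual friends" heuristic, which is just a common guess and not something we know Facebook uses.

```python
from collections import Counter

# Hypothetical friendship graph; names are made up for illustration.
friends = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice", "dave"},
    "dave": {"bob", "carol"},
}

def batch_recommendations(friends, top_n=3):
    """Offline job (the kind a cron job might run): rank friends-of-friends
    by number of mutual friends. The heuristic itself is an assumption."""
    results = {}
    for user, direct in friends.items():
        counts = Counter()
        for f in direct:
            for fof in friends.get(f, ()):
                # Skip the user themselves and people they already know.
                if fof != user and fof not in direct:
                    counts[fof] += 1
        results[user] = [name for name, _ in counts.most_common(top_n)]
    return results

# The slow computation runs once; later reads are just keyed lookups.
precomputed = batch_recommendations(friends)

def recommendations_for(user):
    """Fast read path: no computation at request time."""
    return precomputed.get(user, [])
```

The point is only the split: the expensive pass over all the data happens on a schedule, and anything that must answer immediately reads from a store keyed for fast lookups.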

Know more than that? Have any input on how they are being used? Please let me know in the comments.
Comments
www.opticalbase.com 5 months ago
Emree, you've gotta do something about the sites speed man.....