
Author Archives: shabazza

Storm

While looking around for a suitable system to do realtime analytics for the project group, I came across Storm.

https://github.com/nathanmarz/storm

Storm is a system for distributing realtime analytics across a cluster of servers. It uses ZooKeeper to coordinate the master and worker nodes, and message queues to build the data streams that connect the worker nodes. Such a network of connected worker nodes is called a topology. A topology can be deployed locally on a single server or submitted to the cluster.
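To make the idea of a topology more concrete, here is a small toy model in plain Python. It is only a sketch of the concept and does not use Storm's API: the node functions (`sentence_source`, `split_words`, `count_words`), the `DONE` sentinel and `run_local` are names made up for this illustration. Each node runs in its own thread and the nodes are connected by queues, roughly like a topology deployed locally on one server.

```python
import queue
import threading
from collections import Counter

# Hypothetical sentinel that marks the end of a stream (not a Storm concept).
DONE = object()


def sentence_source(out_q, sentences):
    """Source node: pushes raw sentences onto its output queue."""
    for sentence in sentences:
        out_q.put(sentence)
    out_q.put(DONE)


def split_words(in_q, out_q):
    """Worker node: reads sentences and emits one word at a time."""
    while True:
        item = in_q.get()
        if item is DONE:
            break
        for word in item.split():
            out_q.put(word)
    out_q.put(DONE)  # pass the end-of-stream marker downstream


def count_words(in_q, counts):
    """Worker node: keeps a running count per word."""
    while True:
        item = in_q.get()
        if item is DONE:
            break
        counts[item] += 1


def run_local(sentences):
    """Wire the three nodes together with queues and run them as threads."""
    sentences_q = queue.Queue()
    words_q = queue.Queue()
    counts = Counter()
    nodes = [
        threading.Thread(target=sentence_source, args=(sentences_q, sentences)),
        threading.Thread(target=split_words, args=(sentences_q, words_q)),
        threading.Thread(target=count_words, args=(words_q, counts)),
    ]
    for node in nodes:
        node.start()
    for node in nodes:
        node.join()
    return counts
```

Calling `run_local(["storm is fast", "storm is distributed"])` returns a `Counter` with the word counts. In a real Storm cluster the nodes would be spread over several machines and the stream would never end, so there would be no end-of-stream marker.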

I have read up on the system, and it should be easy to run it on our cluster and connect it to, e.g., HBase. All we need to do is set up the master process and the worker nodes; ZooKeeper is already running for HBase.

Here is some further information:

A presentation on Storm:
http://www.infoq.com/presentations/Storm

The Storm wiki:
https://github.com/nathanmarz/storm/wiki

 

 

Posted by on 26.03.2012 in Hadoop, Zookeeper