RSS

Storm

26 Mar

By looking around for a suitable system to do realtime analytics for the project group I have come to Storm.

https://github.com/nathanmarz/storm

Storm is a system to distribute realtime analytics to a cluster of server. For this the storm system uses Zookeeper to coordinate master and worker nodes and use a message queues to build create datastream to connect the workernodes. Such a connection of several workernode is called a topology. A topology could by deployed local on one server or could be submitted to the cluster.

I have read some stuff about the system and it would be easy to run this on our cluster and connect this to e.g. hbase. All we need ist to set up the masterprocess and the worker nodes, zookeeper ist already running for hbase.

Here some further information:

A presentation of storm:
http://www.infoq.com/presentations/Storm

The stormwiki:
https://github.com/nathanmarz/storm/wiki

 

Advertisements
 
Leave a comment

Posted by on 26.03.2012 in Hadoop, Zookeeper

 

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

 
%d bloggers like this: