Monthly Archives: July 2014

Running a Storm Cluster on Ubuntu

Step One: Install the Ubuntu ZooKeeper package: sudo apt-get install zookeeper
ZooKeeper is located in /usr/share/zookeeper. Edit /etc/zookeeper/conf/zoo.cfg to make any configuration changes.
Step Two: Download and unpack Storm, for instance in /opt/, and create a storm symlink pointing to the distribution …

Posted in Distributed Computing

Reading Avro files from HDFS

If you want to read Avro files from HDFS and you’re using schema-generated classes instead of GenericRecords, you’ll have to use the specific datum reader. Otherwise, it’s basically as easy as reading GenericRecords. Don’t forget to add …

Posted in Uncategorized