Hadoop: Useful links

In the following, I’ve listed some very useful resources for hadoop:

http://www.cloudera.com/blog/2009/02/02/the-small-files-problem/
http://stackoverflow.com/questions/964332/java-large-files-disk-io-performance
http://www.cloudera.com/blog/2010/01/hadoop-world-building-data-intensive-apps-with-hadoop-and-ec2/
http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_%28Single-Node_Cluster%29

https://twiki.grid.iu.edu/bin/view/Storage/HadoopOperations#Cleaning_Up_a_CORRUPT_Filesystem

Advertisements
This entry was posted in Distributed Computing, Enterprise Java and tagged . Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s