The irony of the story (Microsoft, Hadoop and Dryad)

  Microsoft’s relationship with Hadoop was for a long time ambiguous: from a rumor about “Hadoop on Azure”

Understanding Spark caching

Spark is great for cached data.  Take a look here to understand various caching options for spark. Read

Elephant Scale is Building on the Success of its First Houston Hadoop Bootcamp

Elephant Scale, a provider of Big Data training, implementations, and vertical Hadoop product applications, is pleased to announce

Houston Hadoop Bootcamp was a real success

What was so amazing about our March 28-30 bootcamp? A number of things: We collected more than twenty students

Big Data, Hadoop, and NoSQL Testing

Abstract In this paper we discuss best practices and real world testing strategies for Big Data, Hadoop, and

maven project pom.xml for Hadoop/HBase dependencies

If you are building a Java project that has Hadoop or HBase dependency  (for example a Java mapreduce

printing out configurations of Hadoop, HBase, Accumulo clusters

Hadoop has hundreds of configurable parameters. Â And Hadoop admins and developers spend a lot of time tweaking

Hadoop in Cloud — Plenty of Choices

Want to run Hadoop in the cloud? Good news is, right about now you have some pretty good