Understanding Spark caching
Spark is great for cached data. Take a look here to understand various caching options for spark. Read
HBase Design Patterns
Readers would be pleased to know that we have teamed up with Packt Publishing to organize a Giveaway
KDNuggets Interview With Sujee Maniyam
KDNuggets interviewed Sujee Maniyam about Big Data eco system / open source …etc Part 1 & Part
Elephant Scale is Building on the Success of its First Houston Hadoop Bootcamp
Elephant Scale, a provider of Big Data training, implementations, and vertical Hadoop product applications, is pleased to announce
Houston Hadoop Bootcamp was a real success
What was so amazing about our March 28-30 bootcamp? A number of things: We collected more than twenty students
Big Data, Hadoop, and NoSQL Testing
Abstract In this paper we discuss best practices and real world testing strategies for Big Data, Hadoop, and
Review on “Cassandra Design Patterns” book
A new book by Packt, on which I am a reviewer. Also, see my Amazon review for
maven project pom.xml for Hadoop/HBase dependencies
If you are building a Java project that has Hadoop or HBase dependency  (for example a Java mapreduce
printing out configurations of Hadoop, HBase, Accumulo clusters
Hadoop has hundreds of configurable parameters. Â And Hadoop admins and developers spend a lot of time tweaking
Hadoop in Cloud — Plenty of Choices
Want to run Hadoop in the cloud? Good news is, right about now you have some pretty good