Big Data, Hadoop, and NoSQL Testing

Abstract In this paper we discuss best practices and real world testing strategies for Big Data, Hadoop, and NoSQL. The subjects of testing and software correctness take an even more important role in the world of Big Data, and that is why taking them into account throughout the project lifetime, from design to implementation and […]

Cartoon – Announcing Hadoop (TM) Coin

With Bitcoin market cap at reported 12B, and with the Hadoop yearly market targeting 4B by 2017, this is an apples-to-oranges comparison, and it is as hard to decide between the two as to answer the old children’s question, “if an elephant steps on a whale, who will win?” For this reason we decided to create a […]

Cartoon – Apache Hadoop Books

It used to be that Hadoop books (good or bad) were far and few in between. Now, however, it’s different. In the words of a wise man, “And furthermore, my son, be admonished: of making many books there is no end; and much study is a weariness of the flesh.”There are many Hadoop books (good […]

maven project pom.xml for Hadoop/HBase dependencies

If you are building a Java project that has Hadoop or HBase dependency  (for example a Java mapreduce application), here is a simple POM.xml to get you started 1) project structure: pom.xml src/main/java   <— all the java code goes here 2) POM.XML here is the pom.xml on github : https://gist.github.com/sujee/7916669 <project xmlns=”http://maven.apache.org/POM/4.0.0″ xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance” xsi:schemaLocation=”http://maven.apache.org/POM/4.0.0 […]