Linus Torvalds is an amazing man, and without him much of Big Data development would not be where it is now. This is our artist’s homage to Linus.
Which is better, Enterprise Data Warehouse or Hadoop? Here is our artist’s simple answer.
According to many – and here is an article on this that I liked – data scientist is one of the most in-demand professions. Over the coming ten years, there will be only one data scientist for every three or four open positions. But this is not all. Being a data scientist also makes you popular at […]
Big Data has become Big Business in 2013 – you read about it everywhere. I read it in SD Times. But sometimes it can become so overwhelming that I just leave it to the artist to explain. Please enjoy the cartoon.
Abstract: In this paper we discuss best practices and real-world testing strategies for Big Data, Hadoop, and NoSQL. The subjects of testing and software correctness take on an even more important role in the world of Big Data, and that is why taking them into account throughout the project lifetime, from design to implementation and […]
A new book from Packt, for which I was a reviewer. See my Amazon review of it here. I am also listed as an official Packt reviewer of the book – see the title page. For further review requests, or if you would like to review this book, contact Elephant Scale.
With Bitcoin's market cap at a reported 12B, and with the yearly Hadoop market targeting 4B by 2017, this is an apples-to-oranges comparison, and it is as hard to decide between the two as to answer the old children’s question, “if an elephant steps on a whale, who will win?” For this reason we decided to create a […]
This step-by-step guide walks through installing Hadoop 2 on a single node. We use TAR files. This is ideal for setting up a development environment on a laptop or workstation.
It used to be that Hadoop books (good or bad) were few and far between. Now, however, it’s different. In the words of a wise man, “And furthermore, my son, be admonished: of making many books there is no end; and much study is a weariness of the flesh.” There are many Hadoop books (good […]
If you are building a Java project that has a Hadoop or HBase dependency (for example, a Java MapReduce application), here is a simple pom.xml to get you started.
1) Project structure:
pom.xml
src/main/java <-- all the Java code goes here
2) pom.xml
Here is the pom.xml on GitHub: https://gist.github.com/sujee/7916669
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 […]
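For orientation only, a pom.xml for a project laid out as above might look roughly like the sketch below. This is not the file from the gist. The `com.example` / `hadoop-sample` coordinates are hypothetical, and the choice of the `hadoop-client` artifact, its 2.2.0 version, and the `provided` scope are assumptions made for illustration; use whatever matches your cluster.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
  <modelVersion>4.0.0</modelVersion>

  <!-- Hypothetical project coordinates; replace with your own -->
  <groupId>com.example</groupId>
  <artifactId>hadoop-sample</artifactId>
  <version>1.0-SNAPSHOT</version>
  <packaging>jar</packaging>

  <dependencies>
    <!-- Assumed dependency: the Hadoop client libraries.
         The version is an assumption; match it to your cluster. -->
    <dependency>
      <groupId>org.apache.hadoop</groupId>
      <artifactId>hadoop-client</artifactId>
      <version>2.2.0</version>
      <!-- "provided" assumes the cluster supplies the Hadoop jars at run time -->
      <scope>provided</scope>
    </dependency>
    <!-- An HBase dependency would be added here in the same way -->
  </dependencies>
</project>
```

With a file like this in the project root, `mvn package` should build a jar from the code under src/main/java.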