Optimizing Retail Discounts with Machine Learning

Abstract In this paper, we show how to apply machine learning to pricing and discounts. The goal is

How to prepare for the Cloudera Data Scientist Certification Exam

At our Houston Hadoop Meetup, Austin Sun showed how to prepare for the Cloudera Data Scientist Certification exam.

Review of “Learning Spark” by Karau, Konwinski, Wendell & Zaharia

  “Learning Spark” was the first published book on the subject. Six months later, there appeared a plethora

Review of “Monitoring Hadoop” by Gurmukh Singh

This book is recently published, April 2015, and it covers Nagios, Ganglia, Hadoop monitoring and monitoring best practices.

Review of “Apache Flume” second edition by Steve Hoffman

This is the second edition of the Apache Flume book, and it covers the latest Flume version 5.2. The

Review of “Hadoop in Action,” second edition

Manning Publications, by Chuck P. Lam, Mark W. Davis, and Ajit Gaddam Four years have passed since the

The Power of Text Analytics at DARPA/Memex

Elephant Scale is proud to be part of the DARPA Memex team. One of the things we are

Spark Summit 2015 highlights and recap

(Disclaimer : This is not an official post from Databricks) Spark Summit 2015 in San Francisco was well attended.  Kudos

Tale of two conferences : Hadoop / Spark

Hadoop summit was in San Jose this week — and next week there is Spark summit in San

Review of Apache Flume book (Packt), second edition

This is a second edition of the Apache Flume book, and it covers the latest Flume version 5.2.