Optimizing Retail Discounts with Machine Learning
Abstract In this paper, we show how to apply machine learning to pricing and discounts. The goal is
How to prepare for the Cloudera Data Scientist Certification Exam
At our Houston Hadoop Meetup, Austin Sun showed how to prepare for the Cloudera Data Scientist Certification exam.
Review of “Learning Spark” by Karau, Konwinski, Wendell & Zaharia
“Learning Spark” was the first published book on the subject. Six months later, there appeared a plethora
Review of “Monitoring Hadoop” by Gurmukh Singh
This book is recently published, April 2015, and it covers Nagios, Ganglia, Hadoop monitoring and monitoring best practices.
Review of “Apache Flume” second edition by Steve Hoffman
This is the second edition of the Apache Flume book, and it covers the latest Flume version 5.2. The
Review of “Hadoop in Action,” second edition
Manning Publications, by Chuck P. Lam, Mark W. Davis, and Ajit Gaddam Four years have passed since the
The Power of Text Analytics at DARPA/Memex
Elephant Scale is proud to be part of the DARPA Memex team. One of the things we are
Spark Summit 2015 highlights and recap
(Disclaimer : This is not an official post from Databricks) Spark Summit 2015 in San Francisco was well attended. Kudos
Tale of two conferences : Hadoop / Spark
Hadoop summit was in San Jose this week — and next week there is Spark summit in San
Review of Apache Flume book (Packt), second edition
This is a second edition of the Apache Flume book, and it covers the latest Flume version 5.2.