Posts in Big data & data science

All posts in Big data & data science in reverse chronological order, newest first.

Jul 06, 2015 big-data

Good Hands-on Introduction to Apache Spark

Anyone who wants to learn the basics of Spark is well-advised to read the book “Learning Spark”. I particularly liked that the book is very practice-oriented and that you can...

#review #distributed #object-functional

Read more

Jul 02, 2015 big-data

k-d-trees with Apache Spark and Scala

This article shows how to use k-d-trees with Apache Spark.

#data #scala #jvm #object-functional

Read more

Jun 02, 2015 big-data

Definitive Overview of the Hadoop Ecosystem

The Hadoop ecosystem has grown significantly over time. “Hadoop: The Definitive Guide” provides an overview of the framework’s most important topics and projects.

#review #distributed

Read more

May 15, 2015 big-data

Good Introduction to 'Data Culture', but Too Uncritical

In “Data Driven - Creating a Data Culture”, the authors explain what they mean by a “data culture”.

#review

Read more

Mar 27, 2015 big-data

Java MapReduce with Hadoop

MapReduce is a “corset” that forces the developer into narrow constraints. It therefore makes sense to read “MapReduce Design Patterns” to quickly learn the common tricks and techniques. It is...

#review #data #distributed

Read more

Jun 04, 2013 big-data

Good Introduction to Non-Relational Databases

The short book “NoSQL Distilled” provides a good overview of various NoSQL databases.

#review #data

Read more

Feb 01, 2010 big-data

Column-oriented databases

From 2002 to 2006, I worked at a Canadian manufacturer of a column-oriented database.

#data

Read more

Jan 02, 2005 big-data

Fraud Detection with Artificial Intelligence

From 1999 to 2004, I collected information on the topic of ‘Fraud detection’ on my website. When I started this in 1999 as a research assistant at the University of...

#data #machine-learning #ai

Read more