Forays into Kafka – Enabling Flexible Data Pipelines

October 28, 2015 0 Comments

One of the defining features of “Big Data” from a technologist’s point of view is the sheer number of tools and permutations at one’s disposal..

Continue Reading

Trickle-Feeding Log Data into the HBase NoSQL Database using Flume

May 21, 2014

The other day I posted an article on the blog around using Flume to transport Apache web log entries from our website into Hadoop, with.

Continue Reading

Trickle-Feeding Log Files to HDFS using Apache Flume

May 18, 2014

In some previous articles on the blog I’ve analysed Apache webserver log files sitting on a Hadoop cluster using Hive, Pig and most recently, Apache.

Continue Reading