Forays into Kafka - Enabling Flexible Data Pipelines
One of the defining features of “Big Data” from a technologist’s point of view is the sheer number of tools and permutations at one’s »
One of the defining features of “Big Data” from a technologist’s point of view is the sheer number of tools and permutations at one’s »
The other day I posted an article on the blog around using Flume to transport Apache web log entries from our website into Hadoop, with the »
In some previous articles on the blog I’ve analysed Apache webserver log files sitting on a Hadoop cluster using Hive, Pig and most recently, Apache »