Tagged

hadoop

A collection of 20 posts

Technical

Introducing Oracle Big Data Discovery Part 2: Data Transformation, Wrangling and Exploration

In yesterday’s post I looked at Oracle Big Data Discovery and how it brought the search and analytic capabilities of Endeca to Hadoop [https://www.rittmanmead.com/blog/2015/02/introducing-oracle-big-data-discovery-part-1-the-visual-face-of-hadoop/]. We looked at how the Oracle Endeca Information Discovery Studio application works with a version of the Endeca

Technical

Connecting OBIEE11g on Windows to a Kerberos-Secured CDH5 Hadoop Cluster using Cloudera HiveServer2 ODBC Drivers

In a few previous posts and magazine articles [http://www.oracle.com/technetwork/issue-archive/2014/14-sep/o54ba-2279189.html] I’ve covered connecting OBIEE11g to a Hadoop cluster [https://www.rittmanmead.com/blog/2014/01/obiee-11-1-1-7-cloudera-hadoop-hiveimpala-part-2-load-data-into-hivehcatalog-analyze-using-impala/], using OBIEE 11.1.1.7 and Cloudera CDH4 and CDH5 as the examples. Things

Technical

Analytics with Kibana and Elasticsearch through Hadoop - part 2 - Getting data into Elasticsearch

Introduction: In the first part of this series [https://www.rittmanmead.com/blog/2014/11/analytics-with-kibana-and-elasticsearch-through-hadoop-part-1-introduction/] I described how I made several sets of data relating to the Rittman Mead blog from various sources available through Hive. This included blog hits from the Apache webserver log, tweets, and metadata from

Technical

Trickle-Feeding Log Files to HDFS using Apache Flume

In some previous articles on the blog I’ve analysed Apache webserver log files sitting on a Hadoop cluster using Hive [https://www.rittmanmead.com/blog/2014/04/simple-data-manipulation-and-reporting-using-hive-impala-and-cdh5/], Pig [https://www.rittmanmead.com/blog/2014/05/simple-hadoop-dataflows-using-apache-pig-and-cdh4-6/] and, most recently, Apache Spark [https://www.rittmanmead.com/blog/2014/05/