Tagged

Big Data

A collection of 112 posts

Technical

Trickle-Feeding Log Files to HDFS using Apache Flume

In some previous articles on the blog I’ve analysed Apache webserver log files sitting on a Hadoop cluster using Hive [https://www.rittmanmead.com/blog/2014/04/simple-data-manipulation-and-reporting-using-hive-impala-and-cdh5/], Pig [https://www.rittmanmead.com/blog/2014/05/simple-hadoop-dataflows-using-apache-pig-and-cdh4-6/] and, most recently, Apache Spark [https://www.rittmanmead.com/blog/2014/05/

Technical

Using Oracle R Enterprise to Analyze Large In-Database Datasets

The other week I posted an article on the blog about Oracle R Advanced Analytics for Hadoop [https://www.rittmanmead.com/blog/2014/03/running-r-on-hadoop-using-oracle-r-advanced-analytics-for-hadoop/], part of Oracle’s Big Data Connectors [http://www.oracle.com/technetwork/database/database-technologies/bdc/big-data-connectors/overview/index.html] and used for running certain types

Oracle BI Suite EE

Testing Oracle Direct Connector for HDFS as an Alternative to Hive ODBC for OBIEE11g

In a post on the blog a couple of weeks ago, I looked at loading up a set of flight delays data into Apache Hadoop, then analysing it using Apache Hive, Cloudera Impala and OBIEE [https://www.rittmanmead.com/blog/2014/01/obiee-11-1-1-7-cloudera-hadoop-hiveimpala-part-2-load-data-into-hivehcatalog-analyze-using-impala/]. In this scenario, OBIEE connects to the

Oracle BI Suite EE

Rittman Mead BI Forum 2014 Call for Papers Closing Soon - And News on This Year's Masterclass

It’s a couple of days to go until the call for papers for the Rittman Mead BI Forum 2014 closes [https://docs.google.com/a/rittmanmead.com/spreadsheet/viewform?formkey=dEdBdDNJb0JUTVBrNTJhaS00MmFjRXc6MA], with suggested topics this year including OBIEE (of course), Essbase, Endeca, Big Data, Visualizations, In-Memory analysis and data integration.

Oracle BI Suite EE

OBIEE 11.1.1.7, Cloudera Hadoop & Hive/Impala Part 2 : Load Data into Hive Tables, Analyze using Hive & Impala

In yesterday’s post on analyzing Hadoop data using Cloudera CDH4, Amazon EC2 and OBIEE 11.1.1.7 [https://www.rittmanmead.com/blog/2014/01/obiee-11-1-1-7-cloudera-hadoop-hiveimpala-part-1-install-and-set-up-an-ec2-hadoop-cluster/], I went through the setup process for Cloudera Manager Standard and then used it to set up a four-node Hadoop cluster in Amazon

Oracle BI Suite EE

Connecting OBIEE 11.1.1.7 to Cloudera Impala

A few months ago I posted a series of articles about connecting OBIEE 11.1.1.7 [http://www.rittmanmead.com/blog/2013/04/obiee-odi-and-hadoop-part-1-so-what-is-hadoop-mapreduce-and-hive], Exalytics [http://www.rittmanmead.com/blog/2013/07/accelerating-hadoophive-obiee-queries-using-exalytics-and-the-summary-advisor/] and ODI [http://www.rittmanmead.com/blog/2013/04/obiee-odi-and-hadoop-part-3-a-closer-look-at-hive-hfds-and-cloudera-chd3/] to Apache Hadoop through Hive [http: