Robin Moffatt

127 posts published

Using SparkSQL and Pandas to Import Data into Hive and Big Data Discovery
Big Data

Big Data Discovery [https://www.oracle.com/big-data/big-data-discovery/index.html] (BDD) is a great tool for exploring, transforming, and visualising data stored in your organisation’s Data Reservoir. I presented a workshop on it at a recent conference [https://speakerdeck.com/rmoff/unlock-the-value-in-your-big-data-reservoir-using-oracle-big-data-discovery-and-oracle-big-data-spatial-and-graph], and got an interesting question from

Forays into Kafka - Logstash transport / centralisation
Big Data

The holy trinity of Elasticsearch, Logstash, and Kibana (ELK) is a powerful trio of tools for data discovery [https://www.rittmanmead.com/blog/2015/04/using-the-elk-stack-to-analyse-donors-choose-data/] and systems diagnostics [https://www.rittmanmead.com/blog/2014/10/monitoring-obiee-with-elasticsearch-logstash-and-kibana/]. In a nutshell, they enable you to easily search through your log files,