The other day I posted an article on the blog about using Flume to transport
Apache web log entries from our website into Hadoop
[https://www.rittmanmead.com/blog/2014/05/trickle-feeding-webserver-log-files-to-hdfs-using-apache-flume/],
with the final destination for the entries being an HDFS file - with the HDFS
file essentially mirroring