↧
Camus Sweeper
I’ve used LinkedIn’s Kafka->HDFS pipeline Camus. Unfortunately the generated HDFS files are too small (something about 20k to 4m) in my case. That small files are a killer for MapReduce jobs running...
View ArticleRealtime Analysis with Kafka and Vertx
This is a follow-up of Sliding Window and Adaptive Counting and Vertx, Twitter and Top-K. Code for this ist on here. Lambda Architecture There is a good architectural concept, where to locate Realtime...
View ArticleCharting for Splout with AngularJS and NVD3.js
I’ve created a AngularJS based charting solution for Splout called Dabado. Why this? Necessary, beside automatic application of realtime analyzed or by M/R jobs calculated data, is charting. There are...
View Article