From Simple CQL to Time-Series Event Tracking and Aggregation Using Cassandra and Hadoop
's is a classifieds website and Norway's busiest website. This session will go through various product development where c* has shown to be the best choice, focusing on our primary c* use-case: our in-house tracking solution that's collects raw time-series data in c* and aggregates minute-by-minute it using hadoop into various new datasets from advert-centric statistics to user-centric behavioural analysis. I'll cover the final technical design chosen after a number of development iterations touching on technologies: scribe, thrift, kafka, hadoop, pig, mahout; the hurdles faced along the way, and the throughput and performance of today's systems.
Programmer at FINN.no, Norway's largest classifieds website, working on core platform systems. Also a committer for Apache Tiles.
Remove this from your schedule?
This session is full and you may not be able to get back in.