Top-K Queries in Realtime with Cassandra and Intravert
Performing ranking queries to find the most relevant documents, most popular urls, etc on huge datasets is trivial —if you're willing to wait a while for the answers. For those with less time to waste, this session describes techniques for performing such queries efficiently. We'll describe the ranking queries problem, outline the Cassandra CQL3 data structures and code that can be used to solve it and describe the trade-offs available. We describe intravert, an innovative server-side programming solution for Cassandra, and show how it can be used to reduce network usage and improve performance by filtering data closer to source.
Core Developer, JBoss
Jonathan Halliday is a core developer at JBoss, where he builds open source solutions for big-data analytics.
Postgrad Student, Newcastle University
Rui Vieira is a postgraduate student at Newcastle University, researching the adaptation of statistical algorithms to modern nosql execution environments.
Remove this from your schedule?
This session is full and you may not be able to get back in.