Title: Real-time applications have long been considered off-limits for Hadoop clusters


This talk will show how latencies of less than 5 seconds can be achieved for a sample metrics application while still retaining years of data with high availability and durability. This is done using a hybrid system in which Storm and Hadoop cooperate to do something neither can do alone.

This talk will provide a theory of operation, a system description, and possibly a demo of a live system. All code will be made available on GitHub.

 
About Ted

Ted Dunning has been involved with a number of startups, the latest being MapR Technologies, where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache ZooKeeper and Mahout projects. Opinionated about software and data mining and passionate about open source, he is an active participant in the Hadoop and related communities and loves helping projects get going with new technologies.

Ted Dunning Portland JUG June 2012




