DATE: Thursday, Novemeber 14, 2013
TIME: Noon - 1:00 pm
PLACE: CIC - 4th floor (ISTC Panther Hollow Room)

SPEAKER: Dmitriy Ryaboy, Engineering Manager, Analytics Infrastructure, Twitter

TITLE: Realtime and Batch Data Processing @ Twitter *

ABSTRACT:
Summingbird, an open-source project recently released by Twitter, allows engineers to easily build data processing pipelines that work both in a streaming context provided by Twitter Storm, and in offline batch context through Apache Hadoop. This talk will cover the practical motivation for building such a thing, and explain the core Summingbird architecture and components.

BIO:
Dmitriy Ryaboy (@squarecog) manages the Twitter Analytics Infrastructure team. He's previously worked at Cloudera, Ask.com, and Lawrence Berkeley National Laboratory. He holds a Master's degree in VLIS from CMU and a Bachelor's in EECS from UC Berkeley.

VISITOR HOST: Andy Pavlo
VISITOR COORDINATOR: Jennifer Landefeld (jennsbl@cs.cmu.edu)

SDI / ISTC SEMINAR QUESTIONS?
Karen Lindenfelser, 86716, or visit www.pdl.cmu.edu/SDI/

Joint with MCDS
*partially funded by