DATE: Thursday, December 10, 2015
TIME: 12:00 pm - 1:00 pm
PLACE: RMCIC 4th Floor Panther Hollow Room

SPEAKER: Indranil Gupta, University of Illinois at Urbana-Champaign

TITLE: Graph-Busters: Distributed Graph Processing Engines Must Tread Carefully

ABSTRACT:
Clusters are increasingly being used in industry to process large graphs, and many distributed graph processing systems have emerged out of both the research community and industry. In this class of distributed systems we explore certain design decisions that appear to be natural and intuitive at first glance, but in fact may turn out to be harmful, in that these decisions hurt performance. Concretely, we first show how intelligent partitioning schemes (for splitting the graph across servers) may not always be worth the cost--in many cases they can be replaced by simpler hash-based schemes that achieve an orders of magnitude improvement in performance. Second we show how checkpointing in such engines is expensive, and that it can be replaced by opportunistic failure recovery mechanisms, which achieve highly accurate answers even under failures. The talk will also present experimental results from our two resulting systems, LFGraph and Zorro, respectively.

BIO:
Indranil Gupta (Indy) is an Associate Professor of Computer Science at the University of Illinois at Urbana-Champaign. His Distributed Protocols Research Group (http://dprg.cs.uiuc.edu) works on large-scale distributed systems, with a recent focus on cloud and big data systems. Indranil is recipient of the the NSF CAREER award in 2005, and Best Paper Awards (IEEE ICAC 2015, BigMine 2012). He has (co-)chaired leading distributed computing conferences (ICDCS 2016 Cloud Computing and Datacenters Track, IEEE P2P 2014, ACM/IFIP/Usenix Middleware 2010, IEEE SASO 2010, StoDiS 2005, ACM PODC 2007 General Chair). Previously, Indranil received his PhD from Cornell University in 2004 and Bachelors degree from IIT Madras in 1998. He worked at Google (2011-12) as a full-time employee/Visiting Scientist in Mountain View, and previously at IBM Research and Microsoft Research.

VISITOR HOST: Garth Gibson

SDI / ISTC SEMINAR QUESTIONS?
Karen Lindenfelser, 86716, or visit www.pdl.cmu.edu/SDI/