PARALLEL DATA LAB 

PDL Publications by Project

Big Learning

  • DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving. Foteini Strati, Sara McAllister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic. Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, July 21-27, 2024.
    Abstract / PDF [2.1M]

  • Erasure Coded Neural Network Inference via Fisher Averaging. Divyansh Jhunjhunwala, Neharika Jali, Gauri Joshi, Shiqiang Wang. IEEE International Symposium on Information Theory (ISIT), Athens, Greece,July 7-12, 2024.
    Abstract / PDF [175K]

  • Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs. Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak. arXiv:2406.01566v1 [cs.DC] 3 Jun 2024.
    Abstract / PDF [775K]

  • Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems. Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi. International Conference on Artificial Intelligence and Statistics (AISTATS), May 2nd - May 4th, 2024, Valencia, Spain.
    Abstract / PDF [800K]

  • Baleen: ML Admission & Prefetching for Flash Caches. Daniel Lin-Kit Wong, Hao Wu, Carson Molder, Sathya Gunasekar, Jimmy Lu, Snehal Khandkar, Abhinav Sharma, Daniel S. Berger, Nathan Beckmann, Gregory R. Ganger. 22nd USENIX Conference on File and Storage Technologies (FAST'24), Feb. 27–29, 2024, Santa Clara, CA.
    Abstract / PDF [2.7M] / Code / Traces

  • Sia: Heterogeneity-aware, Goodput-optimized ML-cluster Scheduling. Suhas Jayaram Subramanya, Daiyaan Arfeen, Shouxu Lin, Aurick Qiao, Zhihao Jia, and Gregory R. Ganger. 2023. ACM SIGOPS 29th Symposium on Operating Systems Principles (SOSP ’23), October 23–26, 2023, Koblenz, Germany.
    Abstract / PDF [1.23M]

  • Validating Large Language Models with ReLM. Michael Kuchnik, Virginia Smith, George Amvrosiadis. 6th MLSys Conference, Miami Beach, FL, USA, June 4-8, 2023. OUTSTANDING PAPER AWARD AT MLSYS23!
    Abstract / PDF [1.2M]

  • Federated Learning Under Distributed Concept Drift. Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons. International Conference on Artificial Intelligence and Statistics (AISTATS), Apr 2023. In preprint arXiv:2206.00799v1.
    Abstract / PDF [956K]

  • GL-Cache: Group-level Learning for Efficient and High-performance Caching. Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi. 21st USENIX Conference on File and Storage Technologies (FAST '23). Feb. 21–23, 2023, Santa Clara, CA.
    Abstract / PDF [1.84M]

  • Rateless Sum-Recovery Codes For Distributed Non-Linear Computations. Ankur Mallick,Gauri Joshi. Information Theory Workshop (ITW), November 6-9, 2022. Mumbai, India.
    Abstract / PDF [728K]

  • MATCHA: A Matching-Based Link Scheduling Strategy to Speed up Distributed Optimization. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. IEEE Transactions on Signal Processing, Oct 2022.
    Abstract / PDF [1.85M]

  • Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines. Michael Kuchnik, Ana Klimovic, Jirı Simsa, Virginia Smith, George Amvrosiadis. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [7M]

  • The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [1.3M]

  • Matchmaker: Data Drift Mitigation in Machine Learning for Large-scale Systems. Ankur Mallick, Kevin Hsieh, Behnaz Arzani, Gauri Joshi. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [500K]

  • RACOD: Algorithm/Hardware Co-design for Mobile Robot Path Planning. Mohammad Bakhshalipour, Seyed Borna Ehsani, Mohamad Qadri, Dominic Guri, Maxim Likhachev, Phillip B. Gibbons. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [2.2M]

  • Varuna: Scalable, Low-cost Training of Massive Deep Learning Models. Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, Nipun Kwatra. EuroSys ’22, April 5-8, 2022, Rennes, France. BEST PAPER AWARD!
    Abstract / PDF [1.5M]

  • C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery. Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman. 19th USENIX Symposium on Networked Systems Design and Implementation. April 4–6, 2022 • Renton, WA, USA.
    Abstract / PDF [1.9M] / Slides / Talk Video

  • Arithmetic-Intensity-Guided Fault Tolerance for Neural Network Inference on GPUs. Jack Kosaian, K. V. Rashmi. SC’21, November 14–19, 2021, St. Louis, MO, USA.
    Abstract / PDF [256K] / Slides / Code

  • A Novel Framework for the Analysis and Design of Heterogeneous Federated Learning. Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, H. Vincent Poor. IEEE Transactions on Signal Processing, Sept 2021.
    Abstract / PDF [835K]

  • Cooperative SGD: A Unified Framework for the Analysis of Local-Update. Jianyu Wang, Gauri Joshi. SGD Journal of Machine Learning Research (JMLR), 2021. September 2021.
    Abstract / PDF [860K]

  • Personalized Federated Learning for Heterogeneous Clients with Clustered Knowledge Transfer. Yae Jee Cho, Jianyu Wang, Tarun Chiruvolu, Gauri Joshi. arXiv:2109.08119v1 [cs.LG] 16 Sep 2021.
    Abstract / PDF [852K]

  • Rateless Codes for Distributed Non-linear Computations. Ankur Mallick, Sophie Smith, Gauri Joshi. International Symposium on Topics in Coding, Montréal, Québec, Canada, from August 30th to September 3rd, 2021.
    Abstract / PDF [1.1M]

  • Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size. Jack Kosaian, Amar Phanishayee, Matthai Philipose, Debadeepta Dey, K. V. Rashmi. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 18-24 July 2021, Virtual Event.
    Abstract / PDF [518K] / Appendix / Code / Slides and Talk Video

  • Progressive Compressed Records: Taking a Byte out of Deep Learning Data. Michael Kuchnik, George Amvrosiadis, Virginia Smith. Proceedings of the VLDB Endowment, Vol. 14, No. 11 ISSN 2150-8097, July 2021.
    Abstract / PDF [3.86M]

  • Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing. 15th USENIX Symposium on Operating Systems Design and Implementation, Virtual Event, July 14–16, 2021. BEST PAPER AT OSDI'21!
    Abstract / PDF [930K] / Slides / Talk Video

  • Learning on Distributed Traces for Data Center Storage Systems. Giulio Zhou, Martin Maas Conference on Machine Learning and Systems '21, April 5-9, 2021. Virtual Event.
    Abstract / PDF [1.3M] / Talk Video

  • CORTEX: A Compiler for Recursive Deep Learning Models. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 4th MLSys Conference, San Jose, CA, USA, Apr 4-7, 2021.
    Abstract / PDF [622K] / Talk Video (starts at 29:41)

  • DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift. Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phillip B. Gibbons. arXiv:2003.06508 [cs.LG], August, 2020.
    Abstract / PDF [1.2M]

  • Access-optimal Linear MDS Convertible Codes for All Parameters. Francisco Maturana, V. S. Chaitanya Mukka, K. V. Rashmi. 2020 IEEE International Symposium on Information Theory 21-26 June 2020 • Virtual Los Angeles, California, USA.
    Abstract / PDF[287K] / Talk Video

  • Active Learning for ML Enhanced Database Systems. Lin Ma, Bailu Ding, Sudipto Das, Adith Swaminathan. SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.4M]

  • Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. Conglong Li, Minjia Zhang, David G. Andersen, Yuxiong He. SIGMOD ’20, June 14–19, 2020, Virtual Portland, OR, USA.
    Abstract / PDF [800K]

  • Learning-Based Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. IEEE Journal on Selected Areas in Information Theory, March 2020.
    Abstract / PDF [654K]

  • Convertible Codes: New Class of Codes for Efficient Conversion of Coded Data in Distributed Storage. Francisco Maturana, K. V. Rashmi. 11th Innovations in Theoretical Computer Science Conference (ITCS 2020). Seattle, WA, January 12-14, 2020.
    Abstract / PDF [687K]

  • Parity Models: Erasure-Coded Resilience for Prediction Serving Systems. Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • PipeDream: Generalized Pipeline Parallelism for DNN Training. Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil R. Devanur, Gregory R. Ganger, Phillip B. Gibbons, Matei Zaharia. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • Rateless Codes for Distributed Computations with Sparse Compressed Matrices. Ankur Mallick, Gauri Joshi. IEEE International Symposium on Information Theory (ISIT), July 7-12, 2019, Paris, France.
    Abstract / PDF [672K]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Fast and Efficient Distributed Matrix-Vector Multiplication Using Rateless Fountain Codes. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 12 - 17 May, 2019 · Brighton, UK.
    Abstract / PDF [485K]

  • Automating Dependence-Aware Parallelization of Machine Learning Training on Distributed Shared Memory. Jinliang Wei, Garth A. Gibson, Phillip B. Gibbons, Eric P. Xing. EuroSys '19: Proceedings of the Fourteenth EuroSys Conference, March 2019, Dresden, Germany.
    Abstract / PDF [1.1M]

  • Towards Lightweight and Robust Machine Learning for CDN Caching. Daniel S. Berger. HotNets-XVII, November 15–16, 2018, Redmond, WA, USA.
    Abstract / PDF [610K]

  • Focus: Querying Large Video Datasets with Low Latency and Low Cost. Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Oct. 8–10, 2018, Carlsbad, CA.
    Abstract / PDF [1.2M]

  • Tributary: Spot-dancing for Elastic Services with Latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. 2018 USENIX Annual Technical Conference. July 11–13, 2018 Boston, MA, USA. Supersedes Carnagie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102.
    Abstract / PDF [1.25M]

  • Geriatrix: Aging What You See and What You Don’t See -- A File System Aging Approach for Modern Storage Systems. Saurabh Kadekodi, Vaishnavh Nagarajan, Gregory R. Ganger, Garth A. Gibson. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.44M]

  • Cavs: An Efficient Runtime System for Dynamic Neural Networks. Shizhen Xu, Hao Zhang, Graham Neubig, Wei Dai, Jin Kyu Kim, Zhijie Deng, Qirong Ho, Guangwen Yang, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.7M]

  • Litz: Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [298K]

  • Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. arXiv:1806.01259v1 [cs.LG], 4 Jun 2018
    Abstract / PDF [575K]

  • The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality in GPUs. Nandita Vijaykumar, Eiman Ebrahimi, Kevin Hsieh, Phillip B. Gibbons, Onur Mutlu. The 45th International Symposium on Computer Architecture - June 2-6, ISCA 2018. Los Angeles, California, USA.
    Abstract / PDF [3.1M]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. arXiv:1803.07445v1 [cs.LG] 20 Mar 2018.
    Abstract / PDF [1M]

  • 3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. arXiv:1802.07389v1 [cs.LG] 21 Feb 2018.
    Abstract / PDF [586K]

  • PipeDream: Fast and Efficient Pipeline Parallel DNN Training. Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Gregory R. Ganger, Phil Gibbons. SysML '18, Feb. 15-16, 2018 , Stanford, CA.
    Abstract / PDF [615K]

  • Intermittent Deep Neural Network Inference. Graham Gobieski, Nathan Beckmann, Brandon Lucia. SysML 2018, February 15-16, 2018, Stanford, CA.
    Abstract / PDF [450K]

  • Tributary: Spot-dancing for elastic services with latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102, Jan. 2018.
    Abstract / PDF [990K]

  • Aging Gracefully with Geriatrix: A File System Aging Tool. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-106, October 2017. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [560K]

  • Litz: An Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. Carnegie Mellon Univedrsity Parallel Data Laboratory Technical Report CMU-PDL-17-103. June 2017.
    Abstract / PDF [424K]

  • Proteus: Agile ML Elasticity through Tiered Reliability in Dynamic Resource Markets. Aaron Harlap, Alexey Tumanov, Andrew Chung, Gregory R. Ganger, Phil Gibbons. ACM European Conference on Computer Systems, 2017 (EuroSys'17), 23rd-26th April, 2017, Belgrade, Serbia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [743K]

  • Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds. Kevin Hsieh, Aaron Harlap, Nandita Vijaykumar, Dimitris Konomis, Gregory R. Ganger, Phillip B. Gibbons, Onur Mutlu. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI), March 27–29, 2017, Boston, MA.
    Abstract / PDF [1.5M]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, and Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-108, October 2016.
    Abstract / PDF [900K]

  • Benchmarking Apache Spark with Machine Learning Applications. Jinliang Wei, Jin Kyu Kim, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-107 October 2016.
    Abstract / PDF [360K]

  • Addressing the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2016. Oct 5-7, Santa Clara, CA. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • TierML: Using Tiers of Reliability for Agile Elasticity in Machine Learning. Aaron Harlap, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [590K]

  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [617K]

  • Scalable Deep Learning on Distributed GPUs with a GPU-specialized Parameter Server. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-107, October 2015.
    Abstract / PDF [537K]

  • SMPFRAME: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Hoy, Seunghak Lee Xun Zheng, Wei Dai, Garth A. Gibson, Eric Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-103, May 2015.
    Abstract / PDF [1.57M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho*, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-105. April 2015.
    Abstract / PDF [2.62M]

  • Solving the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • High-Performance Distributed ML at Scale through Parameter Server Consistency Models. Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho, Garth A. Gibson, Eric P. Xing. 29th AAAI Conf. on Artificial Intelligence (AAAI-15), Jan 25-29, 2015, Austin, Texas.
    Abstract / PDF [733K]

  • Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • On Model Parallelization and Scheduling Strategies for Distributed Machine Learning. S. Lee, J. K. Kim, X. Zheng, Q. Ho, G. A. Gibson, E. P. Xing. Proceedings of 2014 Neural Information Processing Systems (NIPS’14), December 2014.
    Abstract / PDF [336K]

  • Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 USENIX Annual Technical Conference (ATC'14). June 19-20, 2014. Philadelphia, PA. Supersedes CMU-PDL-14-101.
    Abstract / PDF [731K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth A. Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
    Abstract / PDF [174K]

Cloud Computing

  • Reducing Cross-Cloud/Region Costs with the Auto-Configuring MACARON Cache. Hojin Park, Ziyue Qiu, Gregory R. Ganger, George Amvrosiadis. SOSP ’24, November 4–6, 2024, Austin, TX, USA.
    Abstract / PDF [2.7M]

  • Erasure Coded Neural Network Inference via Fisher Averaging. Divyansh Jhunjhunwala, Neharika Jali, Gauri Joshi, Shiqiang Wang. IEEE International Symposium on Information Theory (ISIT), Athens, Greece,July 7-12, 2024.
    Abstract / PDF [175K]

  • Designing Cloud Servers for Lower Carbon. Jaylen Wang, Daniel S. Berger, Fiodar Kazhamiaka, Celine Irvene, Chaojie Zhang, Esha Choukse, Kali Frost, Rodrigo Fonseca, Brijesh Warrier, Chetan Bansal, Jonathan Stern, Ricardo Bianchini, Akshitha Sriraman. Proceedings of the 51st Intl. Symposium on Computer Architecture (ISCA 2024), Buenos Aires, Argentina, June 2024.
    Abstract / PDF [1.35M]

  • SIEVE is Simpler than LRU: An Efficient Turn-Key Eviction Algorithm for Web Caches. Yazhuo Zhang, Juncheng Yang, Yao Yue, Ymir Vigfusson, K. V. Rashmi. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI'24), April 16–18, 2024. Santa Clara, CA. COMMUNITY AWARD FOR BEST PAPER!
    Abstract / PDF [1M]

  • Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments. Ziqi Wang, Kaiyang Zhao, Pei Li, Andrew Jacob, Michael Kozuch, Todd Mowry, Dimitrios Skarlatos. MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture. October 2023. Toronto, Canada.
    Abstract / PDF [935K]

  • FIFO Queues Are All You Need for Cache Eviction. Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, Rashmi Vinayak SOSP '23: Proceedings of the 29th Symposium on Operating Systems Principles, October 2023. Koblenz, Germany.
    Abstract / PDF [1.6M]

  • Peeling Back the Carbon Curtain: Carbon Optimization Challenges in Cloud Computing. Jaylen Wang, Udit Gupta, Akshitha Sriraman. HotCarbon 2023.July 9, 2023, Boston, MA, USA.
    Abstract / PDF [840K]

  • FIFO Can Be Better than LRU: The Power of Lazy Promotion and Quick Demotion. Juncheng Yang, Ziyue Qiu, Yazhuo Zhang*, Yao Yue^, K. V. Rashmi. HotOS ’23, June 22–24, 2023, Providence, RI, USA.
    Abstract / PDF [1.2M]

  • Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud. Hojin Park, Gregory R. Ganger, George Amvrosiadis. SYSTOR '23: Proceedings of the 16th ACM International Conference on Systems and Storage, Haifa, Israel, June 5-7, 2023.
    Abstract / PDF [1.4M]

  • Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, Mark D. Hill, Marcus Fontoura, Ricardo Bianchini. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. DISTINGUISHED PAPER AWARD!
    Abstract / PDF [1.7M]

  • Realizing Value in Shared Compute Infrastructures. Andrew Chung. Carnegie Mellon University PhD Dissertation CMU-CS-22-151, December 2022.
    Abstract / PDF [3M]

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. SC ’21, November 14–19, 2021, St. Louis, MO, USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M] / Slides / Talk Video

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M]

  • Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing. 15th USENIX Symposium on Operating Systems Design and Implementation, Virtual Event, July 14–16, 2021. BEST PAPER AT OSDI'21!
    Abstract / PDF [930K] / Slides / Talk Video

  • Open Problems in Queueing Theory Inspired by Datacenter Computing. Mor Harchol-Balter. Queueing Systems, vol. 97, no. 1, February 2021, pp. 3-37.
    Abstract / PDF [690K]

  • Distributed Metadata and Streaming Data Indexing as Scalable Filesystem Services. Qing Zheng. Carnegie Mellon University School of Computer Science Ph.D. Dissertation, CMU-CS-21-103. February 2021.
    Abstract / PDF [2.1M]

  • Unearthing Inter-job Dependencies for Better Cluster Scheduling. Andrew Chung, Subru Krishnan, Konstantinos Karanasos, Carlo Curino, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.0M] / Slides / Talk Video

  • The CacheLib Caching Engine: Design and Experiences at Scale. Benjamin Berg, Daniel S. Berger, Sara McAllister, Isaac Grosof, Sathya Gunasekar, Jimmy Lu, Michael Uhlar, Jim Carrig, Nathan Beckmann, Mor Harchol-Balter, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [606K] / Slides / Talk Video

  • Streaming Data Reorganization at Scale with DeltaFS Indexed Massive Directories. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. ACM Transactions on Storage, Vol. 16, No. 4, Article 23. September 2020.
    Abstract / PDF [2.1M]

  • Caching with Delayed Hits. Nirav Atre, Justine Sherry, Weina Wang, Daniel S. Berger. SIGCOMM ’20, August 10–14, 2020, Virtual Event, NY, USA.
    Abstract / PDF [2.7M] / Talk Video

  • More IOPS for Less: Exploiting Burstable Storage in Public Clouds. Hojin Park, Gregory R. Ganger, George Amvrosiadis. 12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud ’20). Virtual Boston, MA, July 13-14, 2020.
    Abstract / PDF [600K] / Talk Video / Slides

  • Machine Learning on Volatile Instances. Xiaoxi Zhang, Jianyu Wang, Gauri Joshi, Carlee Joe-Wong. IEEE Intl. Conf. on Computer Communications (INFOCOM). Virtual Toronto, Canada, July 6-9, 2020.
    Abstract / PDF [516K]

  • Lookahead Converges to Stationary Points of Smooth Non-Convex Functions. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICASSP 2020: 45th International Conference on Acoustics, Speech, and Signal Processing. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [242K]

  • Efficient Remote Procedure Calls for Datacenters. Anuj Kalia. Carnegie Mellon University PhD Dissertation CMU-CS-19-126, September 2019.
    Abstract / PDF [1.7M]

  • Peering through the Dark: An Owl’s View of Inter-job Dependencies and Jobs’ Impact in Shared Clusters. Andrew Chung, Carlo Curino, Subru Krishnan, Konstantinos Karanasos, Panagiotis Garefalakis, Gregory R. Ganger. SIGMOD ’19, June 30–July 5, 2019, Amsterdam, Netherlands.
    Abstract / PDF [1.6M]

  • Distribution-based Cluster Scheduling. Jun Woo Park. Carnegie Mellon University School of Computer Science PhD Dissertation, June 2019.
    Abstract / PDF [1.47M]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Gregory R. Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
    Abstract / PDF [735K]

  • Scaling Video Analytics on Constrained Edge Nodes. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. 2nd SysML Conference (SysML ’19). March 31-April 2, 2019, Palo Alto, CA.
    Abstract / PDF [8.5M]

  • Datacenter RPCs can be General and Fast. Anuj Kalia, Michael Kaminsky, David G. Andersen. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Feb. 26–28, 2019, Boston, MA. Best Paper award!
    Abstract / PDF [555K]

  • Scaling Embedded In-Situ Indexing with DeltaFS. Qing Zheng, Charles D. Cranor, Danhao Guo, Gregory R. Ganger, George Amvrosiadis, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider, Fan Guo. SC18, November 11-16, 2018, Dallas, Texas, USA.
    Abstract / PDF [927K]

  • Stratus: Cost-aware Container Scheduling in the Public Cloud. Andrew Chung, Jun Woo Park, Gregory R. Ganger. ACM Symposium on Cloud Computing, 2018 (SoCC’18), Carlsbad, CA October 11-13, 2018.
    Abstract / PDF [1.5M]

  • RobinHood: Tail Latency Aware Caching—Dynamic Reallocation from Cache-Rich to Cache-Poor. Daniel S. Berger, Benjamin Berg, Timothy Zhu, Siddhartha Sen, Mor Harchol-Balter. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’18). October 8–10, 2018 • Carlsbad, CA, USA.
    Abstract / PDF [2.9M]

  • Putting the “Micro” Back in Microservice. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [740K]

  • Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela H. Jiang, Daniel L.K. Wong, Christopher Canel, Lilia Tang, Ishan Misra, Michael Kaminsky*, Michael A. Kozuch*, Padmanabhan Pillai*, David G. Andersen Gregory R. Ganger. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA, USA.
    Abstract / PDF [1.5M]

  • On the Diversity of Cluster Workloads and its Impact on Research Results. George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. 2018 USENIX Annual Technical Conference (ATC '18), Boston, MA, July 11-13, 2018.
    Abstract / PDF [285K]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson, Gregory R. Ganger. 10th USENIX Workshop on Hot Topics in Cloud Computing, July 9, 2018, Boston, MA. Supersedes CMU-PDL-17-105.
    Abstract / PDF [250K]

  • Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela Jiang, Christopher Canel, Daniel Wong, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, David G. Andersen, Gregory R. Ganger. SysML 18, February 15–16, 2018. Stanford, CA.
    Abstract / PDF [450K]

  • Picking Interesting Frames in Streaming Video. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [913K]

  • 3Sigma: Distribution-based Cluster Scheduling for Runtime Uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. EuroSys ’18, April 23–26, 2018, Porto, Portugal. Supersedes CMU-PDL-17-107, Nov. 2017.
    Abstract / PDF [1.4M]

  • 3Sigma: Distribution-based cluster scheduling for runtime uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-107, November 2017.
    Abstract / PDF [800K]

  • Software-Defined Storage for Fast Trajectory Queries using a DeltaFS Indexed Massive Directory. Qing Zheng, George Amvrosiadis, Saurabh Kadekodi, Garth A. Gibson, Chuck Cranor, Brad Settlemyer, Gary Grider, Fan Guo. PDSW-DISCS 2017: 2nd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems held in conjunction with SC17, Denver, CO, November 2017.
    Abstract / PDF [1.25M]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-105, October 2017.
    Abstract / PDF [280K]

  • Bigger, Longer, Fewer: What do cluster jobs look like outside Google? George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-104, October 2017.
    Abstract / PDF [360K]

  • WorkloadCompactor: Reducing datacenter cost while providing tail latency SLO guarantees. Timothy Zhu, Michael A. Kozuch & Mor Harchol-Balter. ACM Symposium on Cloud Computing (SoCC'17) , Santa Clara, Oct 2017.
    Abstact / PDF [3.25M]

  • Principled Workflow-centric Tracing of Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Jonathan Mace, Benjamin H. Sigelman, Rodrigo Fonseca, Gregory R. Ganger. ACM Symposium on Cloud Computing 2016 (SoCC ’16) October 5-7, 2016, Santa Clara, CA, USA.
    Abstract / PDF [590K]

  • SNC-Meister: Admitting More Tenants with Tail Latency SLOs. Timothy Zhu, Daniel S. Berger, Mor Harchol-Balter. SoCC ’16, October 05-07, 2016, Santa Clara, CA, USA.
    Abstract / PDF [500K]

  • Online Deduplication for Distributed Databases. Lianghong Xu. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, September 2016.
    Abstract / PDF [1.8M]

  • JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes. Alexey Tumanov, Angela Jiang, Jun Woo Park, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-104. September 2016.
    Abstract / PDF [1.6M]

  • Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [617K]

  • TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters. Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [8M]

  • DeltaFS: Exascale File Systems Scale Better Without Dedicated Servers. Qing Zheng, Kai Ren, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider. PDSW2015: 10th Parallel Data Storage Workshop, held in conjunction with SC15, Austin, TX, November 16, 2015.
    Abstract / PDF [930K]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth A. Gibson. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [275K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy, Krishnamurthy Viswanathan, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI. See the extended Technical Report for more information.
    Abstract / PDF [1.3M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [369K]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Jin Li, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [501K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy Krishnamurthy Viswanathan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-106. April 2015. Extended version of the 2015 SoCC paper.
    Abstract / PDF [925K]

  • A Cloud Computing Course: From Systems To Services. M. Suhail Rehman, Jason Boles, Mohammad Hammoud, Majd F. Sakr. Proceedings of the 46th ACM Special Interest Group on Computer Science Education Conference (SIGCSE 2015), Kansas City, USA, March 2015.
    Abstract / PDF [356K]

  • STOVE: Strict, Observable, Verifiable Data and Execution Models for Untrusted Applications. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014 (Doctoral Symposium), pp.644,649, 15-18 Dec. 2014.
    Abstract / PDF [541K]

  • STOVEPipe: Observable Access Control of User Data for Untrusted Applications on Mobile Devices. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. Poster at the IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014, 15-18 Dec. 2014.
    Abstract / PDF [149K]

  • Exploiting Iterative-ness for Parallel ML Computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Greg R. Ganger, Phil B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. Supersedes Carnegie Mellon University Parallel Data Technical Report CMU-PDL-14-107.
    Abstract / PDF [609K]

  • PriorityMeister: Tail Latency QoS for Shared Networked Storage. Timothy Zhu, Alexey Tumanov, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014.
    prioritymeister-SoCC14.pdf
    Abstract / PDF [940K]

  • Cloudlets: at the Leading Edge of Mobile-Cloud Convergence. M. Satyanarayanan, Z. Chen, K. Ha, W. Hu, W. Richter, P. Pillai. Proceedings of MobiCASE 2014: Sixth International Conference on Mobile Computing, Applications and Services, Austin, TX, November 2014.
    Abstract / PDF [859K]

  • A Brief History of Cloud Offload. M. Satyanarayanan. GetMobile, Volume 18, Issue 4, October 2014.
    Abstract / PDF [360K]

  • Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, And Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. ACM Transactions on Storage, Vol. 10, No. 4, Article 16, Publication date: October 2014.
    Abstact / PDF [1.34M]

  • Towards Wearable Cognitive Assistance. Kiryong Ha, Zhuo Chen, Wenlu Hu, Wolfgang Richter, Padmanabhan Pillai, Mahadev Satyanarayanan. Proceedings of the 12th ACM International Conference on Mobile Computing, Systems and Services (MobiSys’14), June 2014.
    Abstract / PDF [1.54M]

  • QuiltView: Glass-Sourced Video for Google Maps Queries. Zhuo Chen, Wenlu Hu, Kiryong Ha, Jan Harkes, Benjamin Gilbert, Jason Hong, Asim Smailagic, Dan Siewiorek, Mahadev Satyanarayanan. The 15th International Workshop on Mobile Computing Systems and Applications (HotMobile'14), February 2014.
    Abstract / PDF [4.51M]

  • Agentless Cloud-wide Streaming of Guest File System Updates. Wolfgang Richter, Canturk Isci, Jan Harkes, Benjamin Gilbert, Vasanth Bala, Mahadev Satyanarayanan. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. Best Paper.
    Abstract / PDF [978K]

  • SpringFS: Bridging Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. 12th USENIX Conference on File and Storage Technologies (FAST '14), Santa Clara, CA, February 17–20, 2014.
    Abstract / PDF [319K]

  • Tetrisched: Space-Time Scheduling for Heterogeneous Datacenters. Alexey Tumanov, Timothy Zhu, Michael A. Kozuch†, Mor Harchol-Balter, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-112, December, 2013.
    Abstract / PDF [716K]

  • Just-in-Time Provisioning for Cyber Foraging. Kiryong Ha, Padmanabhan Pillai, Wolfgang Richter, Yoshihisa Abe, Mahadev Satyanarayanan The 11th International Conference on Mobile Systems, Applications, and Services (MobiSys'13), June 25–28, 2013, Taipei, Taiwan.
    ha-mobisys-vmsynthesis-2013.pdf cloud
    Abstract / PDF [2.29M]

  • vQuery: A Platform for Connecting Configuration and Performance. Ilari Shafer, Snorri Gylfason, Gregory R. Ganger. vwWare Labs Technical Report, Palo Alto, CA. December 2012.
    Abstract / PDF [288K]

  • alsched: Algebraic Scheduling of Mixed Workloads in Heterogeneous Clouds. Alexey Tumanov, James Cipar, Michael A. Kozuch, Gregory R. Ganger. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
    Abstract / PDF [379K]

  • Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA. 2021 SoCC Test of Time Award!
    Abstract / PDF [3.1M]

  • Saving Cash by Using Less Cache. Timothy Zhu, Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. 4th USENIX Workshop of Hot Topics in Cloud Computing (Hotcloud 2012). June 12-13, 2012, Boston, MA.
    Abstract / PDF [177K]

  • Towards Understanding Heterogeneous Clouds at Scale: Google Trace Analysis Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. Intel Science and Technology Center for Cloud Computing Technical Report ISTC-CC-TR-12-101, April 27, 2012.
    Abstract / PDF [876K]

  • Near-Real-Time Inference of File-Level Mutations from Virtual Disk Writes. Wolfgang Richter, Mahadev Satyanarayanan, Jan Harkes, Benjamin Gilbert. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-103. February 2012.
    Abstract / PDF [343K]

  • ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gundawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
    Abstract / PDF [567K]

  • Privacy-Sensitive VM Retrospection. Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode, Nilton Bila, Eyal De Lara, Vas Bala, Mahadev Satyanarayanan. HotCloud 2011 3rd USENIX Workshop on Hot Topics in Cloud Computing. Portland, OR, June 14-17, 2011.
    Abstract / PDF [1.97M]

  • On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth A. Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • Exertion-based Billing for Cloud Storage Access. Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger. Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '11). June 14-15, 2011, Portland, OR. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-105. March 2011.
    Abstract / PDF [65K]

  • The Case for Content Search of VM Clouds. Mahadev Satyanarayanan, Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode. 34th Annual IEEE Computer Software and Applications Conference Workshops (COMPSACW), July 19-23, 2010, Seoul, Korea.
    Abstract / PDF [831K]

  • Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
    Abstract / PDF [1.1M]

Database Systems

  • The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions. William Zhang, Wan Shen Lim, Matthew Butrovich, Andrew Pavlo. Proceedings of the VLDB Endowment, 17(11): 3373-3387, 2024. July 2024.
    Abstract / PDF [2.6M]

  • Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems. Wan Shen Lim, Lin Ma, William Zhang, Matthew Butrovich, Samuel Arch, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 17, No. 11, ISSN 2150-8097. July 2024.
    Abstract / PDF [1.25M]

  • Survey and Evaluation of Database Management System Extensibility. Abigale Kim. Carnegie Mellon University School of Computer Science M.S.Thesis CMU-CS-23-144. January 2024.
    Abstract / PDF [1.25M]

  • Dear User-Defined Functions, Inlining isn’t working out so great for us. Let’s try batching to make our relationship work. Sincerely, SQL. Kai Franz, Samuel Arch, Denis Hirn, Torsten Grust, Todd C. Mowry, Andrew Pavlo. Conference on Innovative Data Systems Research (CIDR 2024), Chaminade, CA, USA, January 14-17, 2024.
    Abstract / PDF [545K]

  • Rethinking the Encoding of Integers for Scans on Skewed Data. Martin Prammer, Jignesh M. Patel. Proc. ACM Manag. Data, Vol. 1, No. 4 (SIGMOD), Article 257. December 2023.
    Abstract / PDF [1.5M]

  • Tigger: A Database Proxy That Bounces With User-Bypass. Matthew Butrovich, Karthik Ramanathan, John Rollinson, Wan Shen Lim, William Zhang, Justine Sherry, Andrew Pavlo Proceedings of the VLDB Endowment, Vol. 16, No. 11, 2023.
    Abstract / PDF [1.3M]

  • Simple Adaptive Query Processing vs. Learned Query Optimizers: Observations and Analysis. Yunjia Zhang, Yannis Chronis, Jignesh M. Patel, Theodoros Rekatsinas. Proceedings of the VLDB Endowment, Vol. 16, No. 11.
    Abstract / PDF [950K]

  • Database Gyms. Wan Shen Lim, Matthew Butrovich, William Zhang, Andrew Crotty, Lin Ma, Peijing Xu, Johannes Gehrke, Andrew Pavlo. CIDR 2023. 13th Annual Conference on Innovative Data Systems Research (CIDR ’23). January 8-11, 2023, Amsterdam, The Netherlands.
    Abstract / PDF [800K] / Slides

  • Tastes Great! Less Filling! High Performance and Accurate Training Data Collection for Self-Driving Database Management Systems. Matthew Butrovich, Wan Shen Lim, Lin Ma, John Rollinson, William Zhang, Yu Xia, Andrew Pavlo. SIGMOD ’22, June 12–17, 2022, Philadelphia, PA, USA.
    Abstract / PDF [1.1M]

  • MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems. Lin Ma, William Zhang, Jie Jiao, Wuwen Wang, Matthew Butrovich, Wan Shen Lim, Prashanth Menon, Andrew Pavlo. SIGMOD ’21, June 20–25, 2021, Virtual Event, China.
    Abstract / PDF [1.25M]

  • Filter Representation in Vectorized Query Execution. Amadou Ngom, Prashanth Menon, Matthew Butrovich, Lin Ma, Wan Shen Lim, Todd C. Mowry, Andrew Pavlo. International Workshop on Data Management on New Hardware, pages. 6:1—6:7, June 2021.
    Abstract / PDF [720K]

  • Are You Sure You Want to Use MMAP in YourDatabase Management System? Andrew Crotty, Viktor Leis, Andrew Pavlo. 12th Annual Conference on Innovative Data Systems Research (CIDR ’22). January 9-12, 2022, Chaminade, USA.
    Abstract / PDF [690K] / Talk Video

  • Everything is a Transaction: Unifying Logical Concurrency Control and Physical Data Structure Maintenance in Database Management Systems. Ling Zhang, Matthew Butrovich, Tianyu Li, Yash Nannapanei, Andrew Pavlo, John Rollinson, Huanchen Zhang, Ambarish Balakumar, Daniel Biales, Ziqi Dong, Emmanuel Eppinger, Jordi Gonzalez, Wan Shen Lim, Jianqiao Liu, Lin Ma, Prashanth Menon, Soumil Mukherjee, Tanuj Nayak, Amadou Ngom, Jeff Niu, Deepayan Patra, Poojita Raj, Stephanie Wang, Wuwen Wang, Yao Yu, William Zhang. Conference on Innovative Data Systems Research (CIDR) 2021. January 11-15, 2021. Virtual Event.
    Abstract / PDF [352K] / Talk Video

  • Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats. Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 14, No. 4 ISSN 2150-8097, pp. 534-546, Dec. 2020.
    Abstract / PDF [633K]

  • Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling. Prashanth Menon, Amadou Ngom, Lin Ma, Todd C. Mowry, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 14, iss. 2, pages. 101—113, October 2020.
    Abstract / PDF [904K]

  • External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems. Andrew Pavlo, Matthew Butrovich, Ananya Joshi, Lin Ma, Prashanth Menon, Dana Van Aken, Lisa Lee, Ruslan Salakhutdinov. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 42(2): 32-46 (2019).
    Abstract / PDF [555K]

  • Query-based Workload Forecasting for Self-Driving Database Management Systems. Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, Geoffrey J. Gordon. SIGMOD/PODS '18 International Conference on Management of Data, Houston, TX, USA, June 10 - 15, 2018.
    Abstract / PDF [1.25M]

  • Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last. Prashanth Menon, Todd C. Mowry & Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 11, No. 1, 2017.
    Abstact / PDF [970K]

  • Automatic Database Management System Tuning Through Large-scale Machine Learning. Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, Bohan Zhang. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017. Chicago, IL, USA.
    Abstract / PDF [760K]

  • Online Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017.
    Abstract / PDF [890K]

  • An Empirical Evaluation of In-Memory Multi-Version Concurrency Control. Yingjun Wu, Joy Arulraj, Jiexi Lin, Ran Xian, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 10, iss. 7, pages. 781—792, March 2017.
    Abstract / PDF [660K]

  • An Evaluation of Distributed Concurrency Control. Rachael Harding, Dana Van Aken, Andrew Pavlo, Michael Stonebraker. Proceedings of the VLDB Endowment, vol. 10, iss. 5, pages. 553—564, January 2017.
    Abstract / PDF [421K]

  • Self-Driving Database Management Systems. A. Pavlo, G. Angulo, J. Arulraj, H. Lin, J. Lin, L. Ma, P. Menon, T. Mowry, M. Perron, I. Quah, S. Santurkar, A. Tomasic, S. Toor, D. V. Aken, Z. Wang, Y. Wu, R. Xian, and T. Zhang. In CIDR 2017, Conference on Innovative Data Systems Research. January 8-11, 2017, Chaminade, CA.
    Abstract / PDF [680K]

  • Write-Behind Logging. J. Arulraj, M. Perron, A. Pavlo. Proc. VLDB Endow., vol. 10, pp. 337-348, December, 2016.
    Abstract / PDF [931K]

  • Online Deduplication for Distributed Databases. Lianghong Xu. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, September 2016.
    Abstract / PDF [1.8M]

  • Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems. Lin Ma, Joy Arulraj, Sam Zhao, Andrew Pavlo, Subramanya R. Dulloor, Michael J. Giardino, Jeff Parkhurst, Jason L. Gardner, Kshitij Dosh*, Col. Stanley Zdonik. DaMoN’16, June 26-July 01 2016, San Francisco, CA, USA.
    Abstract / PDF [1.25M]

  • Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. Joy Arulraj, Andrew Pavlo, Prashanth Menon. SIGMOD’16, June 26-July 01, 2016, San Francisco, CA, USA.
    Abstract / PDF [575K]

  • Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

  • Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes. Huanchen Zhang, Andy Pavlo, David G. Andersen, Michael Kaminsky, Lin Ma, Rui Shen. ACM SIGMOD International Conference on Management of Data 2016 (SIGMOD'16), June 2016.
    Abstract / PDF [715K]

  • BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed. D. Van Aken, D. E. Difallah, A. Pavlo, C. Curino, and P. Cudré-Mauroux. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, 2015, pp. 1069-1073.
    Abstract / PDF [1.2M]

  • Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. Joy Arulraj, Andrew Pavlo, Subramanya R. Dulloor. Proceedings ACM SIGMOD, Melbourne, Victoria, Australia, May 31-June 4, 2015.
    Abstract / PDF [1M]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta Jin Li, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-108. December 2014.
    Abstract / PDF [646K]

Decentralized Caching

  • Cuckoo Linear Algebra. Li Zhou, David G. Andersen, Mu Li, Alexander J. Smola. KDD’15, August 10-13, 2015, Sydney, NSW, Australia.
    Abstract / PDF [611K]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014.
    Abstract / PDF [343K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Integrating Portable and Distributed Storage. Niraj Tolia, Jan Harkes, Michael Kozuch, and M. Satyanarayanan. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [881K] / PDF [211K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • Opportunistic Use of Content Addressable Storage for Distributed File Systems. Niraj Tolia, Michael Kozuch, Mahadev Satyanarayanan, Brad Karp, Thomas Bressoud, and Adrian Perrig. Proceedinge USENIX Annual Technical Conference, General Track 2003: 127-140, June 9-14, San Antonio, TX.
    Abstract / Postscript [1M] / PDF [284K]

  • Data Staging on Untrusted Surrogates. Jason Flinn, Shafeeq Sinnamohideen, Niraj Tolia, M. Satyanarayanan. Proceedings 2nd USENIX Conference on File and Storage Technologies (FAST03), Mar31-Apr2, 2003, San Francisco, CA.
    Abstract / Postscript [1.5M] / PDF [325K]

  • Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • My Cache or Yours? Making Storage More Exclusive. Theodore M. Wong, John Wilkes. USENIX Annual Technical Conference (USENIX 2002), pp. 161-175, 10-15 June 2002, Monterey, CA. Supercedes CMU SCS Tech. Report CMU-CS-02-186, which supercedes CMU-CS-00-157, originally published in November 2000.
    Abstract / Postscript [759K] / PDF [253K]

Energy Efficiency

  • RACER: Bit-Pipelined Processing Using Resistive Memory. Minh S. Q. Truong, Eric Chen, Deanyone Su, Alexander Glass, Liting Shen, L. Richard Carley, James A. Bain, Saugata Ghose. 54th IEEE/ACM International Symposium on Microarchitecture, ser. MICRO 2021, Oct. 2021.
    Abstract / PDF [2.2M]

  • MANIC: A Vector-Dataflow Architecture for Ultra-Low-Power Embedded Systems. Graham, Amolak Nagi, Nathan Serafin, Mehmet Meric Isgenc, Nathan Beckmann, Brandon Lucia. MICRO '52: Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Columbus, OH, October 2019.
    Abstract / PDF [1.2M]

  • A Scalable Priority-Aware Approach to Managing Data Center Server Power. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu. HPCA 2019: The 25th International Symposium on High-Performance Computer Architecture, February 16 - 20, 2019, Washington D.C.
    Abstract / PDF [610K]

  • LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching. Mohammad Sadrosadati, Amirhossein Mirhosseini, Seyed Borna Ehsani, Hamid Sarbazi-Azad, Mario Drumond, Babak Falsafi, Rachata Ausavarungnirun, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.M]

  • Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability. Maciej Besta, Syed Minhaj Hassan, Sudhakar Yalamanchili, Rachata Ausavarungnirun, Onur Mutlu, Torsten Hoefler. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.6M]

  • MASK: Redesigning the GPU Memory Hierarchy to Support Multi-Application Concurrency. Rachata Ausavarungnirun, Vance Miller, Joshua Landgraf, Saugata Ghose, Jayneel Gandhi, Adwait Jog, Christopher J. Rossbach, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.1M]

  • μC-States: Fine-grained GPU Datapath Power Management. Onur Kayıran, Adwait Jog, Ashutosh Pattnaik, Rachata Ausavarungnirun, Xulong Tang, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das. Proceedings of the The 25th International Conference on Parallel Architectures and Compilation Techniques (PACT 2016), Haifa, Israel, September 2016.
    Abstract / PDF [823K]

  • SizeCap: Efficiently Handling Power Surges in Fuel Cell Powered Data Centers. Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu. 22nd International Symposium on High Performance Computer Architecture (HPCA), March 12-16, Barcelona, Spain, 2016.
    Abstract / PDF [1.32M]

  • A Case for Toggle-Aware Compression for GPU Systems. Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [713K]

  • Low-Cost Inter-Linked Subarrays (LISA): Enabling Fast Inter-Subarray Data Movement in DRAM. Kevin K. Chang, Prashant J. Nair, Donghyuk Lee, Saugata Ghose, Moinuddin K. Qureshi, and Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [768K]

  • Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture. Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen China, February 2013.
    Abstract / PDF [3.17M]

  • Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
    Abstract / PDF [184K]

  • Bottleneck Identification and Scheduling in Multithreaded Applications. José A. Joao, M. Aater Suleman, Onur Mutlu, Yale N. Patt. Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), London, UK, March 2012.
    Abstract / PDF [828K]

  • ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gundawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
    Abstract / PDF [567K]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park , Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • The Case for Sleep States in Servers. Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. HotPower'11, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [621K]

  • Minimizing Data Center SLA Violations and Power Consumption via Hybrid Resource Provisioning. Anshul Gandhi, Yuan Chen, Daniel Gmach, Martin Arlitt, Manish Marwah. 2nd IGCC 2011 (IEEE International Green Computing Conference 2011) July 25-28, 2011 Orlando, Florida, USA. -- BEST PAPER AWARD
    Abstract / PDF [503K]

  • Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
    Abstract / PDF [463K]

  • Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael A. Kozuch. 5th International Open Cirrus Summit. June 01 – 03, 2011, Moscow, Russia.
    Abstract / PDF [317K]

FAWN

  • SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data. Kai Ren, Qing Zheng, Joy Arulraj, Garth A. Gibson. Proceedings of the VLDB Endowment, Vol. 10, No. 13, 2017.
    Abstract / PDF [2.15M]

  • FaSST: Fast, Scalable and Simple Distributed Transactions with Two-sided (RDMA) Datagram RPCs. Anuj Kalia, Michael Kaminsky, David G. Andersen.12th USENIX Symposium on Operating Systems Design and Implementation November 2–4, 2016, Savannah, GA, USA.
    Abstract / PDF [608K]

  • Achieving One Billion Key-Value Requests Per Second on a Single Server. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. IEEE Micro's Top Picks from the Computer Architecture Conferences 2016, May/June 2016. Top Picks 2016 Award!
    Abstract / PDF [176K]

  • Design Guidelines for High Performance RDMA Systems. Anuj Kalia, Michael Kaminsky, David G. Andersen. 2016 USENIX Annual Technical Conference (USENIX ATC'16), June 2016.
    Abstract / PDF [553K]

  • Cuckoo Linear Algebra. Li Zhou, David G. Andersen, Mu Li, Alexander J. Smola. KDD’15, August 10-13, 2015, Sydney, NSW, Australia.
    Abstract / PDF [611K]

  • Full-Stack Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. ACM Transactions on Computer Systems (TOCS), Vol. 34, No. 2, April 2016.
    Abstract / PDF [1.14M]

  • Be Fast, Cheap and in Control with SwitchKV. Xiaozhou Li, Raghav Sethi, Michael Kaminsky, David G. Andersen, Michael J. Freedman. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI'16), Santa Clara, CA, March 2016.
    Abstract / PDF [594K]

  • Towards Accurate and Fast Evaluation of Multi-Stage Log-Structured Designs. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. In 14th USENIX Conference on File and Storage Technologies (FAST'16), Santa Clara, CA, February 2016.
    Abstract / PDF [2M]

  • Resource-Efficient Data-Intensive System Designs for High Performance and Capacity. Hyeontaek Lim. Carnegie Mellon University PhD Dissertation CMU-CS-15-132, September 2015.
    Abstract / PDF [3.1M]

  • Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. In Proceedings of the 42nd International Symposium on Computer Architecture (ISCA 2015), Portland, OR, June 2015. Fast-tracked to Transactions on Computer Systems (TOCS).
    Abstract / PDF [350K]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014. Abstract / PDF [343K]
    Abstract / PDF [343K]

  • Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David G. Andersen. ACM SIGCOMM 2014. Chicago, Illinois, August 17-22, 2014. Supersedes CMU-PDL-14-106, June 2014.
    Abstract / PDF [462K]

  • Algorithmic Improvements for Fast Concurrent Cuckoo Hashing. Xiaozhou Li, David G. Andersen, Michael Kaminsky, Michael J. Freedman. Proceedings of the European Conference on Computer Systems (EuroSys '14), April 2014.
    Abstract / PDF [4.3M]

  • MICA: A Holistic Approach to Fast In-Memory Key-Value Storage. Hyeontaek Lim, Dongsu Han, David G. Andersen, Michael Kaminsky. 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI'14), April 2014.
    Abstract / PDF [1.36M]

  • Scalable, High Performance Ethernet Forwarding with CuckooSwitch. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky. Proc. 9th International Conference on emerging Networking EXperiments and Technologies (CoNEXT), Dec. 2013.
    Abstract / PDF [479K]

  • Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
    Abstract / PDF [648K]

  • FAWNSort: Energy-efficient Sorting of 10GB. Vijay Vasudevan Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010
    Abstract / PDF [90K]

  • Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru . Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
    Abstract / PDF [645K]

  • FAWN: A Fast Array of Wimpy Nodes. David Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Proc. 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), Big Sky, MT. October 2009. BEST PAPER AWARD!
    Abstract / PDF [332K]

  • FAWNdamentally Power-efficient Clusters. Vijay Vasudevan, Jason Franklin, David Andersen, Amar Phanishayee, Lawrence Tan, Michael Kaminsky, Iulian Moraru. 12th Workshop on Hot Topics in Operating Systems (HotOS XII). May 2009.
    Abstract / PDF [236K]

File System Virtual Appliances (FSVA)

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek , Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. ACM Transactions on Storage, Vol. 8, No. 3, Article 39, May 2012.
    Abstract / PDF [518K]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-105, April 2010.
    Abstract / PDF [513K]

  • File System Virtual Appliances. Michael Abd-El-Malek. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-109, August 2009.
    Abstract / PDF [1.15M]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-102. May 2009.
    Abstract / PDF [486K]

  • File System Virtual Appliances: Third-party File System Implementations without the Pain. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-106, May 2008.
    Abstract / PDF [508K]

  • FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

New Storage Interfaces

  • Morph: Efficient File-Lifetime Redundancy Management for Cluster File Systems. Timothy Kim, Sanjith Athlur, Saurabh Kadekodi, Francisco Maturana Dax Delvira, Arif Merchant, Gregory R. Ganger, K. V. Rashmi. SOSP ’24, November 4–6, 2024, Austin, TX, USA.
    Abstract / PDF [4.6M]

  • Data Caching for Enterprise-Grade Petabyte-Scale OLAP. Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian (James) Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen. Proceedings of the 2024 USENIX Annual Technical Conference. July 10–12, 2024 • Santa Clara, CA, USA.
    Abstract / PDF [932K]

  • A Call for Research on Storage Emissions. Sara McAllister, Fiodar Kazhamiaka, Daniel S. Berger, Rodrigo Fonseca, Kali Frost, Aaron Ogus, Maneesh Sah, Ricardo Bianchini, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger. HotCarbon’24, July 9, 2024, Santa Cruz, CA.
    Abstract / PDF [2.9M] / Slides

  • Ensō: A Streaming Interface for NIC-Application Communication. Hugo Sadok and Nirav Atre, Carnegie Mellon University; Zhipeng Zhao, Microsoft; Daniel S. Berger, Microsoft Research & University of Washington; James C. Hoe, Carnegie Mellon University; Aurojit Panda, New York University; Justine Sherry, Carnegie Mellon University; Ren Wang, Intel.17th USENIX Symposium on Operating Systems Design and Implementation (OSDI). July 10–12, 2023. Boston, MA.
    Abstract / PDF [800K]

  • FrozenHot Cache: Rethinking Cache Management for Modern Hardware. Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu. EuroSys 2023, Rome, Italy, May 8th-12th, 2023.
    Abstract / PDF [1.14M]

  • Design Principles for Replicated Storage Systems Built on Emerging Storage Technologies. Thomas Kim. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-23-109. March 2023.
    Abstract / PDF [47.5M]

  • RAIZN: Redundant Array of Independent Zoned Namespaces. Thomas Kim, Jekyeom Jeon, Nikhil Arora, Huaicheng Li, Michael Kaminsky, David G. Andersen, Gregory R. Ganger, George Amvrosiadis, Matias Bjørling. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. Supoercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-22-101, January 2022.
    Abstract / PDF [1.65M]

  • Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S Berger, Nathan Beckmann, Gregory R Ganger. ACM Transactions on Storage, Vol. 18, No. 3, Article 21. August 2022.
    Abstract / PDF [1.4M]

  • Bandwidth Cost of Code Conversions in the Split Regime. Francisco Maturana and K. V. Rashmi. 2022 IEEE International Symposium on Information Theory (ISIT22). June 26-July 1, 2022, Espoo, Finland.
    Abstract / PDF [1.22M]

  • Bandwidth Cost of Code Conversions in Distributed Storage: Fundamental Limits and Optimal Constructions. Francisco Maturana, K. V. Rashmi 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
    Abstract / PDF [325K]

  • Kangaroo: Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel Berger, Nathan Beckmann, Gregory R. Ganger. Proceedings of the 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021. Virtual Event. BEST PAPER AT SOSP'21!
    Abstract / PDF [7.8M] / Talk Video-Short / Talk Video-Long / Blog Post

  • Irregular Array Codes with Arbitrary Access Sets for Geo-Distributed Storage. Francisco Maturana, K. V. Rashmi Carnegie Mellon University, Pittsburgh, PA, USA Email: fmaturan@cs.cmu.edu, rvinayak@cs.cmu.edu 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
    Abstract / PDF [288K]

  • ZNS: Avoiding the Block Interface Tax for Flash-based SSDs. Matias Bjørling, Abutalib Aghayev, Hans Holmberg, Aravind Ramesh, Damien Le Moal, Gregory R. Ganger, George Amvrosiadis. USENIX Annual Technical Conference (USENIX 2021), July 14-16, 2021, Virtual Event.
    Abstract / PDF [305K] / Slides / Talk Video

  • Spitfire: A Three-Tier Buffer Manager for Volatile and Non-Volatile Memory. Xinjing Zhou, Joy Arulraj, Andrew Pavlo, David Cohen. SIGMOD/PODS '21: Proceedings of the 2021 International Conference on Management of Data. June 2021.
    Abstract / PDF [1.28M]

  • Segcache: A Memory-efficient and Scalable In-memory Key-value Cache for Small Objects. Juncheng Yang, Yao Yue, K. V. Rashmi. 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI). Virtual Event, April 12–14, 2021. NSDI'21 Community Award and NSDI'21 BEST PAPER AWARD!
    Abstract / PDF [517K] / Slides / Talk Video

  • Fast Software Cache Design for Network Appliances. Dong Zhou, Huacheng Yu, Michael Kaminsky, David Andersen. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [11M] / Talk Video / Slides

  • More IOPS for Less: Exploiting Burstable Storage in Public Clouds. Hojin Park, Gregory R. Ganger, George Amvrosiadis. 12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud ’20). Virtual Boston, MA, July 13-14, 2020.
    Abstract / PDF [600K] / Talk Video

  • Order-Preserving Key Compression for In-Memory Search Trees. Huanchen Zhang, Xiaoxuan Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo
    SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.15M]

  • The Case for Custom Storage Backends in Distributed Storage Systems. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. To appear in ACM Transactions on Storage, Volume 16, Issue 1, March 2020.
    Abstract / PDF [2.6M]

  • File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [870K]

  • STRADS-AP: Simplifying Distributed Machine Learning Programming without Introducing a New Programming Model. Jin Kyu Kim, Abutalib Aghayev, Garth A. Gibson, Eric P. Xing. Proceedings of the 2019 USENIX Annual Technical Conference, July 10–12, 2019 • Renton, WA.
    Abstract / PDF [490K]

  • SuRF: Practical Range Query Filtering with Fast Succinct Tries. Huanchen Zhang, Hyeontaek Lim, Viktor Leis, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.BEST PAPER AWARD!
    Abstract / PDF [1.9M]

  • Building a Bw-Tree Takes More Than Just Buzz Words. Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis, Huanchen Zhang, Michael Kaminsky, David G. Andersen. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [2.2M]

  • Addressing the Long-Lineage Bottleneck in Apache Spark. Haoran Wang, Jinliang Wei, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-101, January 2018.
    Abstract / PDF [250K]

  • Evolving Ext4 for Shingled Disks. Abutalib Aghayev, Theodore Ts’o, Garth A. Gibson, Peter Desnoyers. 15th USENIX Conference on File and Storage Technologies (FAST '17), Feb 27–Mar 2, 2017. Santa Clara, CA.
    Abstract / PDF [1.4M]

  • Aging Gracefully with Geriatrix: A File System Aging Suite. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [503K]

  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth A. Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [1.6M]

  • Scaling Up Clustered Network Appliances with ScaleBricks. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Michael Mitzenmacher, Ren Wang, Ajaypal Singh. Proc. ACM SIGCOMM 2015, August 17-21, 2015, London, United Kingdom.
    Abstract / PDF [626K]

  • Exploiting Compressed Block Size as an Indicator of Future Reuse. Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip P. Gibbons, Michael A. Kozuch, and Todd C. Mowry. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [2.4M]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014.
    Abstract / PDF [343K]

  • Comparing Performance of Different Cleaning Algorithms for SMR Disks. Mukul Kumar Singh. M.S. Thesis: Master of Science in Information Networking, April 2014.
    Abstract / PDF [623K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. 2013 ACM Symposium on Cloud Computing (SoCC'13), Oct. 01-03 2013, Santa Clara, CA, USA.
    Abstract / PDF [944K]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin, Gregory R. Ganger. Proceedings of the ACM Int'l Conference on Supercomputing (ICS), Eugene, OR, June 2013.
    Abstract / PDF [677K]

  • Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • Specialized Storage for Big Numeric Time Series. Ilari Shafer, Raja R. Sambasivan, Anthony Rowe, Gregory R. Ganger. Proceedings of the 5th Workshop on Hot Topics in Storage and File Systems, June 2013.
    Abstract / PDF [161K]

  • MemC3: Compact and Concurrent Memcache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen and Michael Kaminsky. In Proc. 10th USENIX NSDI, Apr 2013. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012. Source code: https://github.com/efficient/libcuckoo
    Abstract / PDF [280K]

  • Practical Batch-Updatable External Hashing with Sorting. Hyeontaek Lim and David G. Andersen and Michael Kaminsky. In Proc. Meeting on Algorithm Engineering and Experiments (ALENEX), Jan 2013.
    Abstract / PDF [536K]

  • Memory-Efficient Group-By-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. Georgia Tech Center for Experimental Research in Computer Systems Technical Report GIT-CERCS-12-08.
    Abstract / PDF [450K]

  • MemC3: Compact and Concurrent MemCache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012.
    Abstract / PDF [824K]

  • JackRabbit: Improved Agility In Elastic Distributed Storage. James Cipar, Lianghong Xu, Elie Krevat, Alexey Tumanov Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-112, October 2012.
    Abstract / PDF [395K]

  • SILT: A Memory-Efficient, High-Performance Key-Value Store. Hyeontaek Lim, Bin Fan, David Andersen and Michael Kaminsky. ACM Symposium on Operating Systems Principles (SOSP'11), Cascais, Portugal, October 2011.
    Abstract / PDF [1.15M]

  • Switching the Optical Divide: Fundamental Challenges for Hybrid Electrical/Optical Datacenter Networks. Hamid Hajabdolali Bazzaz, Malveeka Tewari, Guohui Wang, George Porter, T. S. Eugene Ng, David G. Andersen, Michael Kaminsky, Michael A. Kozuch, Amin Vahdat. Proc. 2nd ACM Symposium on Cloud Computing (SOCC), Oct 2011.
    Abstract / PDF [190K]

  • Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS.
    Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, David G. Andersen. Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP), Oct 2011.
    Abstract / PDF [689K]

  • RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yashihisa Abe, Gregory R. Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
    Abstract / PDF [1.5M]

  • The Case for VOS: The Vector Operating System. Vijay Vasudevan, David Andersen, Michael Kaminsky. In 13th Workshop on Hot Topics in Operating Systems (HotOS 2011). May 2011.
    Abstract / PDF [430K]

  • Principles of Operation for Shingled Disk Devices. Garth A. Gibson, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth A. Gibson. MapReduce'11, June 8, 2011, San Jose, California, USA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
    Abstract / PDF [2.5M]

  • pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth A. Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
    Abstract / PDF [321K]

  • FAWNSort: Energy-efficient Sorting of 10GB. Vijay Vasudevan Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010
    Abstract / PDF [90K]

  • Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru . Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
    Abstract / PDF [645K]

  • Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
    Abstract / PDF [1.1M]

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
    Abstract / PDF [944K]

  • FAWN: A Fast Array of Wimpy Nodes. David Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Proc. 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), Big Sky, MT. October 2009. BEST PAPER AWARD!
    Abstract / PDF [332K]

  • Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Brian Mueller. SIGCOMM’09, August 17–21, 2009, Barcelona, Spain.
    Abstract / PDF [755K]

  • Tashi: Location-aware Cluster Management. Michael A. Kozuch, Michael P. Ryan, Richard Gass, Steven W. Schlosser, David O’Hallaron, James Cipar, Elie Krevat, Julio López, Michael Stroucken, Gregory R. Ganger. First Workshop on Automated Control for Datacenters and Clouds (ACDC'09), Barcelona, Spain, June 2009.
    Abstract / PDF [160K]

  • FAWNdamentally Power-efficient Clusters. Vijay Vasudevan, Jason Franklin, David Andersen, Amar Phanishayee, Lawrence Tan, Michael Kaminsky, Iulian Moraru. 12th Workshop on Hot Topics in Operating Systems (HotOS XII). May 2009.
    Abstract / PDF [236K]

  • Enabling Enterprise Solid State Disks Performance. Milo Polte, Jiri Simsa, Garth A. Gibson. 1st Workshop on Integrating Solid-state Memory into the Storage Hierarchy, March 7, 2009, Washington DC.
    Abstract / PDF [302K]

  • Solving TCP Incast in Cluster Storage Systems. Vijay Vasudevan, Hiral Shah, Amar Phanishayee, Elie Krevat, David Andersen, Gregory R. Ganger, Garth A. Gibson. FAST 2009 Work in Progress Report. 7th USENIX Conference on File and Storage Technologies. Feb 24-27, 2009, San Francisco, CA.
    PDF [70K]

  • A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-101, Feb. 2009.
    Abstact / PDF [317K]

  • FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

  • Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • On Multidimensional Data and Modern Disks. Steven W. Schlosser, Jiri Schindler, Stratos Papadomanolakis , Minglong Shao Anastassia Ailamaki, Christos Faloutsos, Gregory R. Ganger. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005.
    Abstract / PDF [220K]

  • Replication Policies for Layered Clustering of NFS Servers. Raja R. Sambasivan, Andrew J. Klosterman, Gregory R. Ganger. 13th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). September 26 - 29, 2005, Atlanta, GA.
    Abstract / PDF [199K]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003.
    Abstract / PDF [281K]

  • A Framework for Building Unobtrusive Disk Maintenance Applications. Eno Thereska, Jiri Schindler, John Bucy, Brandon Salmon, Christopher R. Lumb, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-192, October 2003.
    Abstract / Postscript [5.1M] / PDF [148K]

  • Design and Implementation of a Freeblock Subsystem. Eno Thereska, Jiri Schindler, Christopher R. Lumb, John Bucy, Brandon Salmon, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-107, December, 2003.
    Abstract / Postscript [6.5M] / PDF [165K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • Object-Based Storage. Mike Mesnier, Gregory R. Ganger, Erik Riedel. IEEE Communications Magazine, v.41 n.8 pp 84-90, August 2003.
    Abstract / PDF [85K]

  • Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003. To appear in VLDB 03, Berlin, Sept 9-12, 2003.
    Abstract / PDF [152K]

  • Exposing and Exploiting Internal Parallelism in MEMS-based Storage. Steven W. Schlosser, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-125, March 2003.
    Abstract / Postscript [1.67M] / PDF [136K]

  • Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • Examining Semantics In Multi-Protocol Network File Systems. Edward P. A. Hogan, Garth A. Gibson, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-103, January 2002.
    Abstract / Postscript [981K] / PDF [408K]

  • Blurring the Line Between Oses and Storage Devices. Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-166, December 2001.
    Abstract / Postscript [2.3M] / PDF [974K]

  • Freeblock Scheduling Outside of Disk Firmware. Christopher R. Lumb, Jiri Schindler, Gregory R. Ganger. Conference on File and Storage Technologies (FAST), January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-149.
    Abstract / Postscript [643K] / PDF [150K]

  • Towards Higher Disk Head Utilization: Extracting "Free" Bandwidth From Busy Disk Drives. Lumb, C., Schindler, J., Ganger, G.R., Nagle, D.F. and Riedel, E. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-130, May 2000.
    Abstract / Postscript [2.3M] / PDF [422K]

Non-volatile Memory

  • FairyWREN: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces. Sara McAllister, Yucong “Sherry” Wang, Benjamin Berg*, Daniel S. Berger†, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI '24), July 10–12, 2024. Santa Clara, CA, USA.
    Abstract / PDF [6.7M] / Talk Video & Slides

  • Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays. Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi. ACM Transactions on Storage, Vol. 19, No. 1, Article 5. January 2023.
    Abstract / PDF [1.6M]

  • Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S Berger, Nathan Beckmann, Gregory R Ganger. ACM Transactions on Storage, Vol. 18, No. 3, Article 21. August 2022.
    Abstract / PDF [1.4M]

  • Adapting the RACER Architecture to Integrate Improved In-ReRAM Logic Primitives. Minh S. Q. Truong, Liting Shen, Alexander Glass, Alison Hoffmann, L. Richard Carley, James A. Bain, Saugata Ghose. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Early Access, 12 May 2022.
    Abstract / PDF [4.3M]

  • TMO: Transparent Memory Offloading in Datacenters. Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos. ASPLOS ’22, February 28 – March 4, 2022, Lausanne, Switzerland. BEST PAPER AWARD AT ASPLOS '22!
    Abstract / PDF [1.65M]

  • Kangaroo: Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel Berger, Nathan Beckmann, Gregory R. Ganger. Proceedings of the 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021. Virtual Event. BEST PAPER AT SOSP'21!
    Abstract / PDF [7.8M] / Talk Video-Short / Talk Video-Long / Blog Post

  • IODA: A Host/Device Co-Design for Strong Predictability Contract on Modern Flash Storage. Huaicheng Li, Martin L. Putra, Ronald Shi, Xing Lin, Gregory R. Ganger, Haryadi S. Gunawi. SOSP ’21, October 26-29, 2021, Virtual Event, Germany.
    Abstract / PDF [710K] / Talk Video

  • WineFS: A Hugepage-aware File System for Persistent Memory that Ages Gracefully. Rohan Kadekodi, Saurabh Kadekodi, Soujanya Ponnapalli, Harshad Shirwadkar, Gregory R. Ganger, Aasheesh Kolli, Vijay Chidambaram. 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021.
    Abstract / PDF [3M]

  • RACER: Bit-Pipelined Processing Using Resistive Memory. Minh S. Q. Truong, Eric Chen, Deanyone Su, Alexander Glass, Liting Shen, L. Richard Carley, James A. Bain, Saugata Ghose. 54th IEEE/ACM International Symposium on Microarchitecture, ser. MICRO 2021, Oct. 2021.
    Abstract / PDF [2.2M]

  • Challenges and Solutions for Fast Remote Persistent Memory Access. Anuj Kalia, David Andersen, Michael Kaminsky. SoCC ’20, October 19–21, 2020, Virtual Event, USA. BEST PAPER AWARD!
    Abstract / PDF [710K] / Talk Video

  • High Availability in Cheap Distributed Key Value Storage. Thomas Kim, Daniel Lin-Kit Wong, Gregory R. Ganger, Michael Kaminsky, David G. Andersen. SoCC ’20, October 19–21, 2020, Virtual Event, USA.
    Abstract / PDF [2.6M] / Talk Video

  • TVARAK: Software-Managed Hardware Offload for Redundancy in Direct-Access NVM Storage. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. 47th International Symposium on Computer Architecture, May 30 – June 3, 2020, Virtual Valencia, Spain.
    Abstract / PDF [1.6M]

  • Block-Granularity-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
    Abstract / PDF [880K]

  • Sage: Parallel SemiAsymmetric Graph Algorithms for NVRAMs. Laxman Dhulipala, Charles McGuffey, Hongbo Kang, Yan Gu, Guy E. Blelloch, Phillip B. Gibbons, Julian Shun, Proceedings of the VLDB Endowment, Vol. 13, No. 9. May 2020.
    Abstract / PDF [630K]

  • Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM. Rajat Kateja, Andy Pavlo, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-20-101, April 2020.
    Abstract / PDF [665K]

  • Writeback-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Bernhard Haeupler, Charles McGuffey. Society for Industrial and Applied Mathematics. 2020.
    Abstract / PDF [847K]

  • TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
    Abstract / PDF [975K]

  • Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Gregory R. Ganger Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
    Abstract / PDF [800K]

  • Non-Volatile Memory Database Management Systems. Joy Arulraj, Andrew Pavlo. Synthesis Lectures on Data Management, Morgan & Claypool Publishers, February 2019.
    Abstract / PDF currently unavailable

  • Improving 3D NAND Flash Memory Lifetime by Tolerating Early Retention Loss and Process Variation. Y. Luo, S. Ghose, Y. Cai, E. F. Haratsch, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [3.2M]

  • The Parallel Persistent Memory Model. Guy E. Blelloch, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. SPAA ’18, July 16–18, 2018, Vienna, Austria.
    Abstract / PDF [760K]

  • FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. A. Tavakkol, M. Sadrosadati, S. Ghose, J. Kim, Y. Luo, Y. Wang, N. M. Ghiasi, L. Orosa, J. Gómez-Luna, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Los Angeles, CA, June 2018.
    Abstract / PDF [888K]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • Implicit Decomposition for Write-Efficient Connectivity Algorithms. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, and Julian Shun. 2018 International Parallel and Distributed Processing Symposium (IPDPS '18). May 21-25, 2018, Vancouver, BC, Canada.
    Abstract / PDF [716K]

  • MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. A. Tavakkol, J. Gómez-Luna, M. Sadrosadati, S. Ghose, and O. Mutlu. USENIX Conference on File and Storage Technologies (FAST), Oakland, CA, February 2018.
    Abstract / PDF [2.25M]

  • Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach & Onur Mutlu. Proc. of the International Symposium on Microarchitecture (MICRO), Cambridge, MA, October 2017.
    Abstact / PDF [1.32M]

  • Ambit: In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology. Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons & Todd C. Mowry. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [2.5M]

  • Detecting and Mitigating Data-Dependent DRAM Failures by Exploiting Current Memory Content. Samira Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee & Onur Mutlu. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [1.5M]

  • Utility-Based Hybrid Memory Management. Yang Li, Saugata Ghose, Jongmoo Choi, Jin Sun, Hui Wang & Onur Mutlu. In Proc. of the IEEE Cluster Conference (CLUSTER), Honolulu, HI, September 2017.
    Abstact / PDF [588K]

  • Error Characterization, Mitigation, and Recovery in Flash-Memory-Based Solid-State Drives. Yu Cai, Saugata Ghose, Erich F. Haratsch, Yixin Luo & Onur Mutlu. Proceedings of the IEEE Volume: 105, Issue: 9, Sept. 2017.
    Abstact / PDF [5.3M]

  • Viyojit: Decoupling Battery and DRAM Capacities for Battery-Backed DRAM. Rajat Kateja, Anirudh Badam, Sriram Govindan, Bikash Sharma, Gregory R. Ganger. ISCA ’17, June 24-28, 2017, Toronto, ON, Canada.
    Abstract / PDF [1M]

  • Understanding Reduced-Voltage Operation in Modern DRAM Devices: Experimental Characterization, Analysis, and Mechanisms. Kevin K. Chang, A. Giray Yaglikçi, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O’Connor, Hasan Hassan & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [4M]

  • Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms. Donghyuk Lee, Samira Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [2.5M]

  • Improving the Reliability of Chip-off Forensic Analysis of NAND Flash Memory Devices. Aya Fukami, Saugata Ghose, Yixin Luo, Yu CaI, Onur Mutlu. DFRWS Digital Forensics Research Conference Europe (DFRWS EU), March 21 - 23, 2017 Lake Constance, Germany.
    Abstract / PDF [1.5M]

  • Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques. Yu Cai, Saugata Ghose, Yixin Luo, Ken Mai, Onur Mutlu, Erich F. Haratsch. 23rd IEEE Symposium on High Performance Computer Architecture, Industrial session, February 2017.
    Abstract / PDF [8.4M]

  • SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies. Hasan Hassan,Nandita Vijaykumar, Samira Khan, Saugata Ghose, Kevin Chang, Gennady Pekhimenko, Donghyuk Lee, Oguz Ergin, Onur Mutlu. International Symposium on High-Performance Computer Architecture (HPCA), February 2017.
    Abstract / PDF [1.6M]

  • Efficient Algorithms with Asymmetric Read and Write Costs. Guy E Blelloch, Jeremy T Fineman, Phillip B Gibbons, Yan Gu, Julian Shun. 24th European Symposium on Algorithms (ESA’16). August, 2016.
    Abstract / PDF [623K]

  • PARBOR: An Efficient System-Level Technique to Detect Data-Dependent Failures in DRAM. Samira Khan, Donghyuk Lee, Onur Mutlu. Proceedings of the 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Toulouse, France, June 28 - July 1 2016.
    Abstract / PDF [630K]

  • Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems. Kevin Hsieh, Eiman Ebrahimi, Gwangsun Kim, Niladrish Chatterjee, Mike O'Connor, Nandita Vijaykumar, Onur Mutlu§, Stephen W. Keckler. Proceedings of the 43rd International Symposium on Computer Architecture (ISCA), Seoul, South Korea, June 18 - 22, 2016.
    Abstract / PDF [1M]

  • Understanding Latency Variation in Modern DRAM Chips: Experimental Characterization, Analysis, and Optimization. Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Khan, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), Antibes Juan-Les-Pins, France, June 14 - 18, 2016.
    Abstract / PDF [3M]

  • A Case for Toggle-Aware Compression for GPU Systems. Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [713K]

  • ChargeCache: Reducing DRAM Latency by Exploiting Row Access Locality. Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [2M]

  • Low-Cost Inter-Linked Subarrays (LISA): Enabling Fast Inter-Subarray Data Movement in DRAM. Kevin K. Chang, Prashant J. Nair, Donghyuk Lee, Saugata Ghose, Moinuddin K. Qureshi, and Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [768K]

  • A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Saugata Ghose, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu. arXiv:1602.01348v1 [cs.AR]. 3 Feb 2016.
    Abstract / PDF [1.87M]

  • Simultaneous Multi-Layer Access: Improving 3D-Stacked Memory Bandwidth at Low Cost. Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Khan, Onur Mutlu. ACM Transactions on Architecture and Code Optimization (TACO), Vol. 12, January 2016. Presented at the 11th HiPEAC Conference, Prague, Czech Republic, January 2016.
    Abstract / PDF [2M]

  • Enabling Accurate and Practical Online Flash Channel Modeling for Modern MLC NAND Flash Memory. Yixin Luo, Saugata Ghose, Yu Cai, Erich F. Haratsch, Onur Mutlu JSAC Special Issue, 2016.
    Abstract / PDF [4.2M]

  • ThyNVM: Enabling Software-Transparent Crash Consistency in Persistent Memory Systems. Jinglei Ren, Jishen Zhao, Samira Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [460K]

  • The Application Slowdown Model: Quantifying and Controlling the Impact of Inter-Application Interference at Shared Caches and Main Memory. Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Khan, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [604K]

  • Gather-Scatter DRAM: In-DRAM Address Translation to Improve the Spatial Locality of Non-unit Strided Accesses. Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [874K]

  • High-Performance and Lightweight Transaction Support in Flash-Based SSDs. Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, Onur Mutlu. IEEE Transactions on Computers (TC), October 2015.
    Abstract / PDF [1.4M]

  • WARM: Improving NAND Flash Memory Lifetime with Write-hotness Aware Retention Management. Yixin Luo, Yu Cai, Saugata Ghose, Jongmoo Choi, Onur Mutlu.MSST 2015: 31st International Conference on Massive Storage Systems and Technologies, Jun 1, 2015 - Jun 5, 2015, Santa Clara, CA.
    Abstract / PDF [1.5M]

  • Page Overlays: An Enhanced Virtual Memory Framework to Enable Fine-grained Memory Management. Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul Chilimbi. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [2.1M]

  • Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. Joy Arulraj, Andrew Pavlo, Subramanya R. Dulloor. Proceedings ACM SIGMOD, Melbourne, Victoria, Australia, May 31-June 4, 2015.
    Abstract / PDF [1M]

  • Data Retention in MLC NAND Flash Memory: Characterization, Optimization and Recovery. Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu. HPCA-21, February 7-11, 2015 — Best Paper Runner Up.
    Abstract / PDF [1.6M]

  • Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case. Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Khan, Vivek Seshadri, Kevin Chang, Onur Mutlu. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [1.67M]

  • Research Problems and Opportunities in Memory Systems. Onur Mutlu, Lavanya Subramanian. Invited Article in Supercomputing Frontiers and Innovations (SUPERFRI), 2015.
    Abstract / PDF [1.72M]

  • The Main Memory System: Challenges and Opportunities. Onur Mutlu, Justin Meza, Lavanya Subramanian. Invited Article in Communications of the Korean Institute of Information Scientists and Engineers (KIISE), 2015.
    Abstract / PDF [813K]

  • Main Memory Scaling: Challenges and Solution Directions. Onur Mutlu. Invited Book Chapter in More than Moore Technologies for Next Generation Computer Design, pp. 127-153, Springer, 2015.
    Abstract / PDF [1.02M]

  • Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories. Hanbin Yoon, Justin Meza, Naveen Mural Imanohar, Norman P. Jouppi, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 11, No. 4, Article 40, December 2014.
    Abstract / PDF [1.06M]

  • FIRM: Fair and High-Performance Memory Control for Persistent Memory Systems. Jishen Zhao, Onur Mutlu, Yuan Xie. Proceedings of the 47th International Symposium on Microarchitecture (MICRO), Cambridge, UK, December 2014.
    Abstract / PDF [626K]

  • Loose-Ordering Consistency for Persistent Memory. Youyou Lu, Jiwu Shu, Long Sun, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [389K]

  • The Blacklisting Memory Scheduler: Achieving High Performance and Fairness at Low Cost. Lavanya Subramanian, Donghyuk Lee, Vivek Seshadri, Harsha Rastogi, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [240K]

  • Characterizing Application Memory Error Vulnerability to Optimize Datacenter Cost via Heterogeneous- Reliability Memory. Yixin Luo, Sriram Govindan, Bikash Sharma, Mark Santaniello, Justin Meza, Aman Kansal, Jie Liu, Badriddine Khessib, Kushagra Vaid, Onur Mutlu Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Atlanta, GA, June 2014.
    Abstract / PDF [1.58]

  • The Efficacy of Error Mitigation Techniques for DRAM Retention Failures: A Comparative Experimental Study. Samira Khan, Donghyuk Lee, Yoongu Kim, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’14), June 2014.
    Abstract / PDF [8M]

  • Bounding Memory Interference Delay in COTS-based Multi-Core Systems. Hyoseung Kim, Dionisio de Niz, Björn Andersson, Mark Klein, Onur Mutlu, Ragunathan (Raj) Rajkumar. Proceedings of the 20th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), Berlin, Germany, April 2014.
    Abstract / PDF [2.5M]

  • Memory Systems. Yoongu Kim, Onur Mutlu. Invited Book Chapter in Computing Handbook, Third Edition: Computer Science and Software Engineering, CRC Press, April 2014.
    Abstract / PDF [453K]

  • Improving DRAM Performance by Parallelizing Refreshes with Accesses. Kevin Chang, Donghyuk Lee, Zeshan Chishti, Chris Wilkerson, Alaa Alameldeen, Yoongu Kim, Onur Mutlu. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA'14), February 2014.
    Abstract / PDF [2.86M]

  • Consistent, Durable, and Safe Memory Management for Byte-addressable Non Volatile Main Memory. Iulian Moraru, David G. Andersen, Michael Kaminsky, Niraj Tolia, Nathan Binkert, Parthasarathy Ranganathan. TRIOS: Conference on Timely Results in Operating Systems. Held in conjunction with SOSP '13. Farmington, PA, November 3, 2013.
    Abstract / PDF [967K]

  • RowClone: Fast and Energy-Efficient In-DRAM Bulk Data Copy and Initialization. Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, and Todd C. Mowry, 46th IEEE/ACM International Symposium on Microarchitecture (MICRO-46), December 2013.
    Abstract / PDF [2.42M]

  • LightTx: A Lightweight Transactional Design in Flash-based SSDs to Support Flexible Transactions. Youyou Lu, Jiwu Shuy, Jia Guo, Shuai Li, Onur Mutlu. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [262K]

  • Program Interference in MLC NAND Flash Memory: Characterization, Modeling, and Mitigation. Yu Cai, Onur Mutlu, Erich F. Haratsch, Ken Mai. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [1.18M]

  • Threshold Voltage Distribution in MLC NAND Flash Memory: Characterization, Analysis, and Modeling. Yu Cai, Erich F. Haratsch, Onur Mutlu and Ken Mai. Design Automation and Test in Europe (DATE 2013), Mar 19-22, 2013, Grenoble, France.
    Abstract / PDF [1.44M]

  • Memory Scaling: A Systems Architecture Perspective. Onur Mutlu. MemCon 2013 (MEMCON), Santa Clara, CA, August 2013.
    Abstract / PDF [114K]

  • A Case for Efficient Hardware/Software Cooperative Management of Storage and Memory. Justin Meza, Yixin Luo, Samira Khan, Jishen Zhao, Yuan Xie, Onur Mutlu. Fifth Workshop on Energy-Efficient Design (WEED 2013). Held in conjunction with the 2013 International Symposium on Computer Architecture (ISCA-40). June 24, 2013, Tel-Aviv, Israel.
    Abstract / PDF [667K]

  • Evaluating STT-RAM as an Energy-Efficient Main Memory Alternative. Emre Kultursay, Mahmut Kandemir, Anand Sivasubramaniam, and Onur Mutlu. 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2013), April 21-23, 2013, Austin, TX.
    Abstract / PDF [1.83M]

  • Error Analysis and Retention-Aware Error Management for NAND Flash Memory. Yu Cai, Gulay Yalcin, Onur Mutlu, Erich F. Haratsch, Adrian Cristal, Osman Unsal, Ken Mai. Intel Technology Journal (ITJ) Special. Issue on Memory Resiliency, 2013.
    Abstract / PDF [270K]

  • Asymmetry-aware Execution Placement on Manycore Chips. Alexey Tumanov, Joshua Wise, Onur Mutlu, Gregory R. Ganger. In Proc. of the 3rd Workshop on Systems for Future Multicore Architectures (SFMA'13), EuroSys'13, Apr. 14-17, 2013, Prague, Czech Republic.
    Abstract / PDF [703K]

  • Application-to-Core Mapping Policies to Reduce Memory System Interference in Multi-Core Systems. Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [623K]

  • MISE: Providing Performance Predictability and Improving Fairness in Shared Main Memory Systems. Lavanya Subramanian, Vivek Seshadri, Yoongu Kim, Ben Jaiyen, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [607K]

  • Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture. Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen China, February 2013.
    Abstract / PDF [3.17M]

  • Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
    Abstract / PDF [648K]

  • Row Buffer Locality Aware Caching Policies for Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael A. Harding, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Montreal, Quebec, Canada, September 2012. Best paper award in Computer Systems and Applications track.
    Abstract / PDF [577K]

  • A Case for Small Row Buffers in Non-Volatile Main Memories. Justin Meza, Jing Li, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Poster Session, Montreal, Quebec, Canada, September 2012.
    Abstract / PDF [172K]

  • Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
    Abstract / PDF [184K]

  • Persistent, Protected and Cached: Building Blocks for Main Memory Data Stores. Iulian Moraru, David G. Andersen, Michael Kaminsky, Nathan Binkert, Niraj Tolia, Reinhard Munz,Parthasarathy Ranganathan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-114v2, Nov. 2012. Supersedes CMU-PDL-11-114. Dec. 2011.
    Abstract / PDF [1.0M]

  • Row Buffer Locality-Aware Data Placement in Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael Harding, Onur Mutlu. SAFARI Technical Report, TR-SAFARI-2011-005, Carnegie Mellon University, September 2011.
    Abstract / PDF [272K]

  • A Case for Exploiting Subarray-level Parallelism (SALP) in DRAM. Yoongu Kim, Vivek Seshadri, Donghyuk Lee, Jamie Liu, Onur Mutlu. Proceedings of the 39th International Symposium on Computer Architecture, June 2012.
    Abstract / PDF [927K]

  • RAIDR: Retention-Aware Intelligent DRAM Refresh. Jamie Liu, Ben Jaiyen, Richard Veras, Onur Mutlu. In Proceedings of the 39th International Symposium on Computer Architecture, Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [480K]

  • Staged Memory Scheduling: Achieving High Performance and Scalability in Heterogeneous Systems. Rachata Ausavarungnirun, Kevin Kai-Wei Chang, Lavanya Subramanian, Gabriel H. Loh, Onur Mutlu. The 39th International Symposium on Computer Architecture (ISCA), Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [700K]

  • Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
    Abstract / PDF [463K]

  • Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior. Yoongu Kim, Michael Papamichael, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 43rd International Symposium on Microarchitecture (MICRO), Atlanta, GA, December 2010.
    Abstract / PDF [478K]

  • Phase Change Memory Architecture and the Quest for Scalability. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Communications of the ACM (CACM), Research Highlight, Vol. 53, No. 7, pages 99-106, July 2010.
    Abstract / PDF [1.34M]

  • Phase Change Technology and the Future of Main Memory. Benjamin C. Lee, Ping Zhou, Jun Yang, Youtao Zhang, Bo Zhao, Engin Ipek, Onur Mutlu, Doug Burger. IEEE Micro, Special Issue: Micro's Top Picks from 2009 Computer Architecture Conferences (MICRO TOP PICKS), Vol. 30, No. 1, pages 60-70, January/February 2010.
    Abstract / PDF [600K]

  • ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers. Yoongu Kim, Dongsu Han, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 16th International Symposium on High-Performance Computer Architecture (HPCA), Bangalore, India, January 2010.
    Abstract / PDF [333K]

  • Architecting Phase Change Memory as a Scalable DRAM Alternative. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Proceedings of the 36th International Symposium on Computer Architecture (ISCA), pages 2-13, Austin, TX, June 2009.
    Abstract / PDF [2.6M]

Paxos

  • Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-105. May 2014.
    Abstract / PDF [444K]

  • There Is More Consensus in Egalitarian Parliaments. Iulian Moraru, David G. Andersen, Michael Kaminsky. Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP'13), November 3-6, 2013, Nemacolin Woodlands Resort, Farmington, PA.
    Abstract / PDF [713K]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-111. August 2013.
    Abstract / PDF [2.3M]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-109. September 2012. Superseded by CMU-PDL-13-111, August 2013.
    Abstract / PDF [2.3M]

  • Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-108. July 2012.
    Abstract / PDF [363K]

Problem Analysis

  • So, You Want To Trace Your Distributed System? Key Design Insights from Years of Practical Experience. Raja R. Sambasivan, Rodrigo Fonseca, Ilari Shafer, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-102, April 2014.
    Abstract / PDF [870K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. IEEE Transactions on Visualization and Computer Graphics (Proceedings Information Visualization 2013), vol. 19, no. 12, Dec. 2013.
    Abstract / PDF [1.9M] / TRAILER VIDEO [5.6M] / VIDEO [17.9M]

  • Making Problem Diagnosis Work for Large-Scale, Production Storage Systems. Michael P. Kasick, Priya Narasimhan, Kevin Harms. Proceedings of the 27th Large Installation System Administration Conference (LISA '13), Washington, DC, November 2013.
    Abstract / PDF [2.23M]

  • Automated Diagnosis of Chronic Performance Problems in Production Systems. Soila P. Kavulya. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-109, May 2013.
    Abstract / PDF [12.6M]

  • Diagnosing Performance Changes in Distributed Systems by Comparing Request Flows. Raja R. Sambasivan. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-105, May 2013.
    Abstract / PDF [3.9M]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. USENIX ;login, 38(2), April 2013.
    Abstract / PDF [961K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-104 (supersedes CMU-PDL-12-102), April 2013.
    Abstract / PDF [1.93M]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 26th Usenix Large Installation System Administration Conference (LISA'12), Dec. 9-14, San Diego, CA. Best Student Paper.
    Abstract / PDF [913K]

  • Failure Diagnosis of Complex Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Felicita Di Giandomenico (ISTI-CNR, Pisa, Italy), Priya Narasimhan. Chapter in "Resilience Assessment and Evaluation". Editors. Katinka Wolter, Alberto Avritzer, Marco Vieira, Aad van Moorsel. Springer Verlag, December 2012.
    Abstract / PDF [288K]

  • Light-weight Black-box Failure Detection for Distributed Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-107. July 2012
    Abstract / PDF [300K]

  • Automated Diagnosis without Predictability is a Recipe for Failure. Raja R. Sambasivan & Gregory R. Ganger. Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '12), June 12-13, 2012, Boston, MA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-101.
    Abstract / PDF [368K]

  • Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. Soila P. Kavulya, Scott Daniels (AT&T), Kautubh Joshi (AT&T), Matti Hiltunen (AT&T), Rajeev Gandhi, Priya Narasimhan.IEEE/IFIP Conference on Dependable Systems and Networks (DSN), June 2012.
    Abstract / PDF [859K]

  • End-to-end Tracing in HDFS. William Wang Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
    Abstract / PDF [489K]

  • Diagnosis in Automotive Systems: A Survey. Patrick E. Lanigan, Soila Kavulya, Priya Narasimhan, Thomas E. Fuhrman, Mutasim A. Salman. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-110. June 2011.
    Abstract / PDF [369K]

  • Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
    Abstract / PDF [336K]

  • Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-109, April 2011.
    Abstract / PDF [787K]

  • Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI'11). March 30 - April 1, 2011. Boston, MA.
    Abstract / PDF [388K]

  • Behavior-Based Problem Localization for Parallel File Systems. Michael P. Kasick, Rajeev Gandhi, Priya Narasimhan. HotDep '10. October 3, 2010, Vancouver, BC, Canada.
    Abstract / PDF [149K]

  • To Upgrade or Not to Upgrade: Impact of Online Upgrades across Multiple Administrative Domains. T. Dumitras, E. Tilevich, P.Narasimhan. ACM Onward! Conference, Oct. 2010.
    Abstract / PDF [425K]

  • Why Do Upgrades Fail And What Can We Do About It? Toward Dependable, Online Upgrades in Enterprise Systems. T. Dumitras, P. Narasimhan. ACM/IFIP/USENIX Middleware Conference, Nov-Dec. 2009.
    Abstract / PDF [835K]

  • Toward Upgrades-as-a-Service in Distributed Systems. T. Dumitras, P. Narasimhan. Poster Session at Middleware 2009. 10th International Middleware Conference Urbana Champaign, Illinois, USA.
    Abstract / PDF [602K]

Storage for High-End Computing

  • CARP: Range Query-Optimized Indexing for Streaming Data. Ankush Jain, Charles D. Cranor, Qing Zheng, Bradley W. Settlemeyer, George Amvrosiadis, Gary Grider. SC24, November 17-22, 2024, Atlanta, Georgia, USA.
    Abstract / PDF [1M]

  • Extending the Mochi Methodology to Enable Dynamic HPC Data Services. M. Dorier, P. Carns, R. Ross, S. Snyder, R. Latham, A. Gueroudji, G. Amvrosiadis, C. Cranor, J. Soumagne. In 5th Workshop on Extreme-Scale Storage and Analysis (ESSA 2024), May 2024.
    Abstract / PDF [250K]

  • FIFO Queues Are All You Need for Cache Eviction. Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, Rashmi Vinayak SOSP '23: Proceedings of the 29th Symposium on Operating Systems Principles, October 2023. Koblenz, Germany.
    Abstract / PDF [1.6M]

  • Matchmaker: Data Drift Mitigation in Machine Learning for Large-scale Systems. Ankur Mallick, Kevin Hsieh, Behnaz Arzani, Gauri Joshi. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [500K]

  • Tiger: Disk-Adaptive Redundancy Without Placement Restrictions. Saurabh Kadekodi, Francisco Maturana, Sanjith Athlur, Arif Merchant, K. V. Rashmi, Gregory R. Ganger. Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI'22), July 11–13, 2022, Carlsbad, CA, USA.
    Abstract / PDF [1.25M]

  • It’s Time to Talk About HPC Storage: Perspectives on the Past and Future. Bradley Settlemyer, George Amvrosiadis, Philip Carns, Robert Ross. IEEE Computer Society Computing in Science & Engineering November/December 2021.
    Abstract / PDF [483K]

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. SC ’21, November 14–19, 2021, St. Louis, MO, USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M] / Slides / Talk Video

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M]

  • Learning on Distributed Traces for Data Center Storage Systems. Giulio Zhou, Martin Maas Conference on Machine Learning and Systems '21, April 5-9, 2021. Virtual Event.
    Abstract / PDF [1.3M] / Talk Video

  • Distributed Metadata and Streaming Data Indexing as Scalable Filesystem Services. Qing Zheng. Carnegie Mellon University School of Computer Science Ph.D. Dissertation, CMU-CS-21-103. February 2021.
    Abstract / PDF [2.1M]

  • PACEMAKER: Avoiding HeART Attacks in Storage Clusters with Disk-adaptive Redundancy. Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [2.1M] / Slides / Talk Video

  • Streaming Data Reorganization at Scale with DeltaFS Indexed Massive Directories. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. ACM Transactions on Storage, Vol. 16, No. 4, Article 23. September 2020.
    Abstract / PDF [2.1M]

  • Mochi: Composing Data Services for High-Performance Computing Environments. Robert B. Ross, George Amvrosiadis, Philip Carns, Charles D. Cranor, Matthieu Dorier, Kevin Harms, Gregory R. Ganger, Garth A. Gibson, Samuel K. Gutierrez, Robert Latham, Bob Robey, Dana Robinson, Bradley Settlemyer, Galen Shipman, Shane Snyder, Jerome Soumagne, Qing Zheng. Journal of Computer Science and Technology 35(1): 121–144 Jan. 2020.
    Abstract / PDF [1.3M]

  • Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. Ziqi Wang, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri. 28th Parallel Architecture and Compiler Technologies 2019 (PACT'19), Sept 21-25, 2019, Seattle, WA.
    Abstract / PDF [475K]

  • Compact Filters for Fast Online Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. IEEE CLUSTER 2019. September 23 - 26, 2019, Albuquerque, New Mexico, USA.
    Abstract / PDF [1M]

  • Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
    Abstract / PDF[574K]

  • Cluster Storage Systems Gotta Have HeART: Improving Storage Efficiency by Exploiting Disk-reliability Heterogeneity. Saurabh Kadekodi, K. V. Rashmi, Gregory R. Ganger. 17th USENIX Conference on File and Storage Technologies (FAST '19) Feb. 25–28, 2019 Boston, MA.
    Abstract / PDF [1.1M]

  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth A. Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [1.6M]

  • DeltaFS: Exascale File Systems Scale Better Without Dedicated Servers. Qing Zheng, Kai Ren, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider. PDSW2015: 10th Parallel Data Storage Workshop, held in conjunction with SC15, Austin, TX, November 16, 2015.
    Abstract / PDF [930K]

  • High-Performance and Lightweight Transaction Support in Flash-Based SSDs. Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, Onur Mutlu. IEEE Transactions on Computers (TC), October 2015.
    Abstract / PDF [1.4M]

  • Caveat-Scriptor: Write Anywhere Shingled Disks. Saurabh Kadekodi, Swapnil Pimpale, Garth A. Gibson. Proc. Of the Seventh USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’15), Santa Clara, CA, July 2015. Expanded paper available: Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-101.
    Abstract / PDF [3.4M]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-104, April 2015.
    Abstract / PDF [696K]

  • Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson. ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis (SC'14), November 16-21, 2014, New Orleans, LA. BEST PAPER AWARD!
    Abstract / PDF [939K] / Slides [1M]

  • BatchFS: Scaling the File System Control Plane with Client-Funded Metadata Servers. Qing Zheng, Kai Ren, Garth A. Gibson. Proceedings of the 9th international Petascale Data Storage Workshop (PDSW '14) held in conjunction with Supercomputing '14. November 16, 2014, New Orleans, LA.
    Abstract / PDF [651K]

  • Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage. Ellis H. Wilson III, Mahmut T. Kandemir, Garth A. Gibson. The 34th International Conference on Distributed Computing Systems, ICDCS 2014, June 30 - July 3, 2014, Madrid, Spain.
    Abstract / PDF [332K]

  • Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-103. May 2014.
    Abstract / PDF [763K]

  • SpringFS: Bridging Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. 12th USENIX Conference on File and Storage Technologies (FAST '14), Santa Clara, CA, February 17–20, 2014.
    Abstract / PDF [319K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. 2013 ACM Symposium on Cloud Computing (SoCC'13), Oct. 01-03 2013, Santa Clara, CA, USA.
    Abstract / PDF [944K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth A. Gibson. 2013 USENIX Annual Technical Conference, June 26-28, 2013, San Jose, CA.
    Abstract / PDF [867K]

  • PRObE: A Thousand-Node Experimental Cluster for Computer Systems Research. Garth A. Gibson, Gary Grider, Andree Jacobson, Wyatt Lloyd. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.5M]

  • Shingled Magnetic Recording: Areal Density Increase Requires New Data Management. Tim Feldman, Garth A. Gibson. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.17M]

  • I/O Acceleration with Pattern Detection. Jun He, John Bent, Aaron Torres, Gary Grider, Garth A. Gibson, Carlos Maltzahn, Xian-He Sun. The 22nd Int. ACM Symposium on High Performance Parallel and Distributed Computing (HPDC'13), New York City, June 17-21, 2013.
    Abstract / PDF [458K]

  • Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-102, January 2013. Revised version of CMU-PDL-12-110.
    Abstract / PDF [798K

  • Giga+TableFS on PanFS: Scaling Metadata Performance on Cluster File Systems. Kartik Kulkarni, Kai Ren, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-101, January 2013.
    Abstract / PDF [679K]

  • Memory-Efficient Group-By-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. Georgia Tech Center for Experimental Research in Computer Systems Technical Report GIT-CERCS-12-08.
    Abstract / PDF [450K]

  • Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • HPC Computation on Hadoop Storage with PLFS. Chuck Cranor, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-115. Nov. 2012.
    Abstract / PDF [170K]

  • A Case for Scaling HPC Metadata Performance through De-specialization. Swapnil Patil, Kai Ren, Garth A. Gibson. 7th Petascale Data Storage Workshop held in conjunction with Supercomputing '12, November 12, 2012. Salt Lake City, UT. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-111, November 2012.
    Abstract / PDF [512K]

  • RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yashihisa Abe, Gregory R. Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
    Abstract / PDF [1.5M]

  • Shingled Magnetic Recording for Big Data Applications. Anand Suresh, Garth A. Gibson, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-105. May 2012.
    Abstract / PDF [561K]

  • SkyeFS: Distributed Directories using Giga+ and PVFS. Anthony Chivetta, Swapnil Patil & Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-104, May 2012.
    Abstract / PDF [398K]

  • A Statistical Study for File System Meta Data On High Performance Computing Sites. Yifan Wang. M.S. Thesis, Information Networking Institute, Carnegie Mellon University. May 2012.
    Abstract / PDF [5.3M]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park , Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • LazyBase: Trading Freshness for Performance in a Scalable Database. James Cipar, Gregory R. Ganger, Kimberly Keeton, Charles B. Morrey III, Craig A. N. Soules, Alistair Veitch. EuroSys 2012 April 10-13, 2012, Bern, Switzerland.
    Abstract / PDF [236K]

  • DiskReduce: Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth A. Gibson. Carnegie Mellon Univsersity Parallel Data Laboratory Technical Report CMU-PDL-11-112, October, 2011.
    Abstract / PDF [897K]

  • Small Cache, Big Effect: Provable Load Balancing for Randomly Partitioned Cluster Services. Bin Fan, Hyeontaek Lim, David Andersen and Michael Kaminsky. ACM Symposium on Cloud Computing (SOCC'11), Cascais, Portugal, October, 2011.
    Abstract / PDF [336K]

  • Applying Idealized Lower-bound Runtime Models to Understand Inefficiencies in Data-intensive Computing (Extended Abstract). Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger: SIGMETRICS 2011: 125-126, San Jose, CA, June 7-11, 2011.
    Abstract / PDF [297K]

  • Six Degrees of Scientific Data: Reading Patterns for Extreme Scale Science IO. Lofstead, Jay, Milo Polte, Garth A. Gibson, Scott A. Klasky, Karsten Schwan, Ron Oldfield, Matthew Wolf, Qing Liu. 20th ACM Int. Symp. On High-Performance Parallel and Distributed Computing (HPDC'11), June 2011.
    Abstract / PDF [595K]

  • On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth A. Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Swapnil Patil, Milo Polte, Kai Ren, Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, Garth A. Gibson, Adam Fuchs, Billie Rinaldi. Proc. of the 2nd ACM Symposium on Cloud Computing (SOCC '11), October 27–28, 2011, Cascais, Portugal. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-111, August 2011.
    Abstract / PDF [1.2M]

  • Principles of Operation for Shingled Disk Devices. Garth A. Gibson, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth A. Gibson. MapReduce'11, June 8, 2011, San Jose, California, USA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
    Abstract / PDF [2.5M]

  • Disks Are Like Snowflakes: No Two Are Alike. Elie Krevat, Joseph Tucek, Gregory R. Ganger. 13th Workshop on Hot Topics in Operating Systems (HotOS 2011), Napa Valley, CA. May 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-102, February 2011.
    Abstract / PDF [1.8M]

  • Applying Simple Performance Models to Understand Inefficiencies in Data-Intensive Computing. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-103. February 2011.
    Abstract / PDF [476K]

  • Scale and Concurrency of GIGA+: File System Directories with Millions of Files. Swapnil Patil, Garth A. Gibson. Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose CA, February 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-110, Sept. 2010.
    Abstract / PDF [508K]

  • pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth A. Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
    Abstract / PDF [321K]

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
    Abstract / PDF [944K]

  • Applying Performance Models to Understand Data-intensive Computing Efficiency. Elie Krevat, Tomer Shiran, Eric Anderson†, Joseph Tucek†, Jay J. Wylie†, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-108. May 2010.
    Abstract / PDF [304K]

  • ...And eat it too: High read performance in write-optimized HPC I/O middleware file formats. Milo Polte, Jay Lofstead, John Bent, Garth A. Gibson, Scott A. Klasky, Qing Liu, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Meghan Wingate, Matthew Wolf. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-111, November 2009.
    Abstract / PDF [388K]

  • PLFS: A Checkpoint Filesystem for Parallel Applications. John Bent, Garth A. Gibson, Gary Grider, Ben McClelland, Paul Nowoczynski, James Nunez, Milo Polte, Meghan Wingate. Supercomputing '09, November 15, 2009. Portland, Oregon.
    Abstract / PDF [388K]

  • DiskReduce: RAID for Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth A. Gibson. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-112, November 2009.
    Abstract / PDF [304K]

  • Understanding and Maturing the Data-Intensive Scalable Computing Storage Substrate. Garth A. Gibson, Bin Fan, Swapnil Patil, Milo Polte, Wittawat Tantisiriroj, Lin Xiao. Microsoft Research eScience Workshop 2009, Pittsburgh, PA, October 16-17, 2009.
    Abstract / PDF [520K]

  • In Search of an API for Scalable File Systems: Under the table or above it? Swapnil Patil, Garth A. Gibson, Gregory R. Ganger, Julio Lopez, Milo Polte, Wittawat Tantisiroj, and Lin Xiao. USENIX HotCloud Workshop 2009. June 2009, San Diego CA.
    Abstract / PDF [260K]

  • System-Call Based Problem Diagnosis for PVFS. Michael P. Kasick, Keith A. Bare, Eugene E. Marinelli III, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep '09). Lisbon, Portugal. June 2009.
    Abstract / PDF [117K]

  • Directions for Shingled-Write and Two-Dimensional Magnetic Recording System Architectures: Synergies with Solid-State Disks. Garth A. Gibson, Milo Polte. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-104. May 2009.
    Abstract / PDF [70K]

  • Enabling Enterprise Solid State Disks Performance. Milo Polte, Jiri Simsa, Garth A. Gibson. 1st Workshop on Integrating Solid-state Memory into the Storage Hierarchy, March 7, 2009, Washington DC.
    Abstract / PDF [302K]

  • Fast Log-based Concurrent Writing of Checkpoints. Milo Polte, Jiri Simsa, Wittawat Tantisiriroj, Garth A. Gibson, Shobhit Dayal, Mikhail Chainani, Dilip Kumar Uppugandla. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [262K]

  • Comparing Performance of Solid State Devices and Mechanical Disks. Milo Polte, Jiri Simsa, Garth A. Gibson. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [99K]

  • Data-intensive file systems for Internet services: A rose by any other name ... Wittawat Tantisiriroj, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-114. October 2008
    Abstract / PDF [350K]

  • GIGA+ : Scalable Directories for Shared File Systems. Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-110. October 2008.
    Abstract / PDF [400K]

  • Characterizing HEC Storage Systems at Rest. Shobhit Dayal. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-109, July 2008.
    Abstract / PDF [603K]
  • User Level Implementation of Scalable Directories (GIGA+). Sanket Hase, Aditya Jayaraman, Vinay K. Perneti, Sundararaman Sridharan, Swapnil V. Patil, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-107, May 2008.
    Abstract / PDF [1.67M]

  • Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

  • On Application-level Approaches to Avoiding TCP Throughput Collapse in Cluster-Based Storage Systems. E. Krevat, V. Vasudevan, A. Phanishayee, D. Andersen, G. Ganger, G. Gibson, S. Seshan. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [124K]

  • GIGA+: Scalable Directories for Shared File Systems. Swapnil V. Patil, Garth A. Gibson, Sam Lang, Milo Polte. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [114K]

Associated Publications

  • The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining. Samuel Arch, Yuchen Liu, Todd Mowry, Jignesh Patel, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 18, No. 1., December 2024.
    Abstract / PDF [780K]

  • Perspective: A Principled Framework for Pliable and Secure Speculation in Operating Systems. Tae Hoon Kim, David Rudo, Kaiyang Zhao, Zirui Neil Zhao, Dimitrios Skarlatos. Proceedings of the 51st Intl. Symposium on Computer Architecture (ISCA 2024), Buenos Aires, Argentina, June 2024.
    Abstract / PDF [600K]

  • Agents of Autonomy: A Systematic Study of Robotics on Modern Hardware. Mohammad Bakhshalipour and Phillip B. Gibbons. In Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems. (SIGMETRICS/PERFORMANCE Abstracts ’24), June 10–14, 2024, Venice, Italy. ACM, New York, NY, USA. BEST PAPER AT SIGMETRICS '24!
    Abstract / PDF [425K] / Code

  • Agents of Autonomy: A Systematic Study of Robotics on Modern Hardware. Mohammad Bakhshalipour, Phillip B. Gibbons. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Volume 7, Issue 3, Article No.: 43, December 2023. To appear ACM SIGMETRICS / IFIP PERFORMANCE 2024, Venice, Italy, June 10-14, 2024.
    Abstract / PDF [2.1M]

  • BBQ: A Fast and Scalable Integer Priority Queue for Hardware Packet Scheduling. Nirav Atre, Hugo Sadok, Justine Sherry. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI' 24), April 16–18, 2024. Santa Clara, CA.
    Abstract / PDF [1M]

  • Is Perfect Hashing Practical for OLAP Systems? Kevin P. Gaffney, Jignesh M. Patel. 4th Annual Conference on nnovative Data Systems Research (CIDR ’24) January 14-17, 2024, Chaminade, USA.
    Abstract / PDF [765K]

  • Address Scaling: Architectural Support for Fine-Grained Thread-Safe Metadata Management. Deepanjali Mishra, Konstantinos Kanellopoulos, Ashish Panwar, Akshitha Sriraman, Vivek Seshadri, Onur Mutlu, Todd C. Mowry. IEEE Computer Architecture Letters, Volume: 23, Issue: 1, Jan.-June 2024.
    Abstract / PDF [540K]

  • UDIR: Towards a Unified Compiler Framework for Reconfigurable Dataflow Architectures. Nikhil Agarwal, Mitchell Fream, Souradip Ghosh, Brian C. Schwedock, Nathan Beckmann. IEEE Computer Architecture Letters ( Volume: 23, Issue: 1, Jan.-June 2024)UDIR: Towards a Unified Compiler Framework for Reconfigurable Dataflow Architectures. Nikhil Agarwal, Mitchell Fream, Souradip Ghosh, Brian C. Schwedock, Nathan Beckmann. IEEE Computer Architecture Letters (Volume: 23, Issue: 1, Jan.-June 2024).
    Abstract / PDF [1.28M]

  • Simple Adaptive Query Processing vs. Learned Query Optimizers: Observations and Analysis. Yunjia Zhang, Yannis Chronis, Jignesh M. Patel, Theodoros Rekatsinas. Proceedings of the VLDB Endowment, Vol. 16, No. 11.
    Abstract / PDF [950K]

  • Runahead A*: Speculative Parallelism for A* with Slow Expansions. Mohammad Bakhshalipour, Mohamad Qadri, Dominic Guri, Seyed Borna, Ehsani, Maxim Likhachev, Phillip B. Gibbons. ICAPS 2023, Prague, Czech Republic, July 8-13, 2023.
    Abstact / PDF [710K]

  • Contiguitas: The Pursuit of Physical Memory Contiguity in Datacenters. Kaiyang Zhao, Kaiwen Xue, Ziqi Wang, Dan Schatzberg, Leon Yang, Antonis Manousis, Johannes Weiner, Rik van Riel, Bikash Sharma, Chunqiang Tang, Dimitrios Skarlatos. ISCA ’23, June 17–21, 2023, Orlando, FL, USA. BEST PAPER AWARD!
    Abstract / PDF [468K]

  • PIM-trie: A Skew-resistant Trie for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. SPAA '23: Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, June 2023, Orlando, FL.
    Abstract / PDF [1.23M]

  • MANIC: A 19µW @ 4MHz, 256 MOPS/mW, RISC-V Microcontroller with Embedded MRAM Main Memory and Vector-Dataflow Co-Processor in 22nm Bulk FinFET CMOS. Graham Gobieski, Oguz Atli, Cagri Erbagci, Ken Mai, Nathan Beckmann, Brandon Lucia. IEEE International Symposium on Circuits and Systems (ISCAS), Monterey, CA, May 21-25, 2023.
    Abstract / PDF [3.6 M]

  • PIM-tree: A Skew-resistant Index for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. Proc. VLDB Endow. 16(4): 946-958 (2022). In preprint. BEST PAPER RUNNER UP.
    Abstract / PDF [1.1M]

  • RipTide: A Programmable, Energy-minimal Dataflow Compiler and Architecture. Graham Gobieski, Souradip Ghosh, Marijn Heule, Todd Mowry, Tony Nowatzki, Nathan Beckmann, Brandon Lucia. MICRO 2022 - 55th IEEE/ACM International Symposium on Microarchitecture, October 1–5, 2022 Chicago, Illinois, USA.
    Abstract / PDF [3.7M]

  • SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling. Nirav Atre, Hugo Sadok, Erica Chiang, Weina Wang, Justine Sherry. SIGCOMM ’22, August 22–26, 2022, Amsterdam, Netherlands.
    Abstract / PDF [2M]

  • Thermometer: Profile-Guided BTB Replacement for Data Center Applications. Shixin Song, Tanvir Ahmed Khan, Sara Mahdizadeh Shahri, Akshitha Sriraman, Niranjan K Soundararajan, Sreenivas Subramoney, Daniel A. Jiménez, Heiner Litz, Baris Kasikci. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [1.45M]

  • täkō: A Polymorphic Cache Hierarchy for General-Purpose Optimization of Data Movement. Brian C. Schwedock, Piratach Yoovidhya, Jennifer Seibert, Nathan Beckmann. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [1.75M]

  • RTRBench: A Benchmark Suite for Real-Time Robotics. Mohammad Bakhshalipour, Maxim Likhachev, Phillip B. Gibbons 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 22-24 May 2022, Singapore.
    Abstract / PDF [1.85M]

  • MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations. Nandita Vijaykumar, Ataberk Olgun, Konstantinos Kanellopoulos, F. Nisa Bostanci, Hasan Hassan, Mehrshad Lotfi, Phillip B. Gibbons, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 19, No. 2, Article 26. Publication date: March 2022.
    Abstract / PDF [1.75M]

  • Client Selection in Federated Learning: Convergence Analysis and Power-of-Choice Selection Strategies. Yae Jee Cho, Jianyu Wang, Gauri Joshi. International Conference on Artificial Intelligence and Statistics (AISTATS), March 2022.
    Abstract / PDF [1.85M]

  • FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients. Jianyu Wang, Hang Qi, Ankit Singh Rawat, Sashank Reddi, Sagar Waghmare, Felix X. Yu, Gauri Joshi. arXiv:2201.11865v2 [cs.LG], 16 Feb 2022.
    Abstract / PDF [660K]

  • Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation. Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi. 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Dec. 6-14, 2021. Virtual Event.
    Abstract / PDF [650K]

  • The Most Common Queueing Theory Questions Asked by Computer Systems Practitioners. Mor Harchol-Balter and Ziv Scully. First International Workshop on Teaching Performance Analysis of ComputerSystems (TeaPACS 2021) In conjunction with the IFIP Performance 2021 Conference. Milan, Italy, Nov 2021.
    Abstract / PDF [286K]

  • The Case for Phase-Aware Scheduling of Parallelizable Jobs. Ben Berg, Justin Whitehouse, Ben Moseley, Weina Wang, Mor Harchol-Balter. IFIP Performance 2021. Milan, Italy, November 2021.
    Abstract / PDF [978K]

  • The Gittins Policy in the M/G/1 Queue. Ziv Scully, Mor Harchol-Balter. 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt 2021) Philadelphia, PA, Oct 2021.
    Abstract / PDF [265K]

  • The Processing-in-Memory Model. Hongbo Kang, Phillip B. Gibbons, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
    Abstract / PDF [1.25M]

  • Ripple: Profile-Guided Instruction Cache Replacement for Data Center Applications. Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles A Pokam, Heiner Litz, Baris Kasikci.International Symposium on Computer Architecture (ISCA), June 2021.
    Abstract / PDF [770K]

  • HerQules: Securing Programs via Hardware-Enforced Message Queues. Daming D. Chen, Wen Shih Lim, Mohammad Bakhshalipour, Phillip B. Gibbons, James C. Hoe, Bryan Parno. ASPLOS ’21, April 19–23, 2021, Virtual, USA.
    Abstract / PDF [1.7M] / Talk Video

  • The Read-Only Semi-External Model. Guy E. Blelloch, Laxman Dhulipala, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. APOCS 2021, January 13, 2021 Virtual Conference, Alexandria, Virginia, U.S.
    Abstract / PDF [740K]

  • A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. Juncheng Yang, Yao Yue, K. V. Rashmi. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.6M] / Slides / Talk Video

  • GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu. MICRO’20. 53rd IEEE/ACM International Symposium on Microarchitecture, Oct 17-21, 2020. Virtual Event.
    Abstract / PDF [1.3M] / Slides / Talk Video

  • Jumanji: The Case for Dynamic NUCA in the Datacenter. Brian Schwedock, Nathan Beckmann. MICRO '53: Proceedings of the 53nd Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Athens, Greece, October 17-21, 2020.
    Abstract / PDF [2.3M] / Slides / Talk Video

  • Unleashing In-network Computing on Scientific Workloads. Daehyeok Kim, Ankush Jain, Zaoxing Liu, George Amvrosiadis, Damian Hazen, Bradley Settlemyer, Vyas Sekar. arXiv:2009.02457v1 [cs.NI], 5 Sep 2020.
    Abstract / PDF [1.25M]

  • Accelerating Genome Analysis: A Primer on an Ongoing Journey. Mohammed Alser, Zülal Bingöl, Damla Senol Cali, Jeremie Kim, Saugata Ghose, Can Alkan, Onur Mutlu. This is an extended and updated version of a paper published in IEEE Micro, vol. 40, no. 5, pp. 65-75, 1 Sept.-Oct. 2020.
    Abstract / PDF [320K]

  • Lightweight Preemptible Functions. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [1M] / Talk Video / Slides

  • Simple Near-Optimal Scheduling for the M/G/1. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proceedings of the ACM Measurement and Analysis of Computer Systems - SIGMETRICS, June 2020, Boston, MA.
    Abstract / PDF [885K] / Talk Video

  • Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD. Jianyu Wang, Hao Liang, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [442K]

  • Correlated Multi-armed Bandits with a Latent Random Source. Samarth Gupta, Gauri Joshi, Osman Yağan. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [1.1M]

  • SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICLR 2020: International Conference on Learning Representations, Apr 26-May 1, 2020, Virtual Addis Ababa, Ethiopia.
    Abstract / PDF [640K] Talk Video & Slides

  • Livia: Data-Centric Computing Throughout the Memory Hierarchy. Elliot Lockerman, Axel Feldmann, Mohammad Bakhshalipour, Alexandru Stanescu, Shashwat Gupta, Daniel Sanchez, Nathan Beckmann. ASPLOS '20: Proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Lausanne, Switzerland, March 16-20, March 2020.
    Abstract / PDF [1.6M] / Talk Video

  • Learning Relaxed Belady for Content Distribution Network Caching. Zhenyu Song, Daniel S. Berger, Kai Li, Wyatt Lloyd. 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’20). February 25–27, 2020. Santa Clara, CA.
    Abstract / PDF [2.25M]

  • Scalable Pointer Analysis of Data Structures using Semantic Models. Pratik Fegade, Christian Wimmer. 29th Conference on Compiler Construction (CC ’20), February 22–23, 2020, San Diego, CA, USA.
    Abstract / PDF [700K]

  • MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. NeurIPS workshop of Federated Learning for Data Privacy and Confidentiality, Dec 13, 2019. Vancouver, BC, Canada. Distinguished Student Paper Award.
    Abstract / PDF [1.1M]

  • Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-vector Multiplication. Ankur Mallick, Malhar Chaudhari, Ganesh Palanikumar, Utsav Sheth, Gauri Joshi. Proc. ACM Meas. Anal. Comput. Syst., Vol. 3, No. 3, Article 58. December 2019.
    Abstract / PDF [1.9M]

  • Demystifying Complex Workload–DRAM Interactions: An Experimental Study. Saugata. Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019. To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 3, No. 3, December 2019.
    Abstract / PDF [4M]

  • Processing-in-Memory: A Workload-Driven Perspective. Saugata Ghose, Amarali Boroumand, Jeremie. S. Kim, Juan. Gómez-Luna, Onur Mutlu. IBM Journal of Research and Development (JRD), Vol. 63, No. 6, November/December 2019.
    Abstract / PDF [2.1M]

  • Vantage: Optimizing Video Upload for Time-shifted Viewing of Social Livestreams. Devdeep Ray, Jack Kosaian, K. V. Rashmi, Srinivasan Seshan. ACM SIGCOMM, August 19-24, 2019, Beijing, China.
    Abstract / PDF [6.75M]

  • Enabling Practical Processing in and Near Memory for Data-Intensive Computing. Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun. Proc. of the Design Automation Conference (DAC), Las Vegas, NV, June 2019.
    Abstract / PDF [477K]

  • CROW: A Low-Cost Substrate for Improving DRAM Performance, Energy Efficiency, and Reliability. Hasan Hassan, Minesh Patel, Jeremie. S. Kim, A. Giray Yaglikçi, Nandita Vijaykumar, Nika Mansouri Ghiasi, Saugata Ghose, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.45M]

  • CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators. Amarali Boroumand, Saugata Ghose, Minesh Patel, Hasan Hassan, Brandon Lucia, Rachata Ausavarungnirun, Kevin Hsieh, Nastaran Hajinazar, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.1M]

  • Understanding the Interactions ofWorkloads and DRAM Types: A Comprehensive Experimental Study. Saugata Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019; To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2019.
    Abstract / PDF [2M]

  • Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems. Graham Gobieski, Brandon Lucia, Nathan Beckmann Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’19), April 13th – April 17th, Providence, RI.
    Abstract / PDF [3.35M]

  • What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study. Suagata. Ghose, Abdullah Giray Yaglikçi, Raghav Gupta, Donghyuk Lee, Kais. Kudrolli, William. X. Liu, Hasan Hassan, Kevin K. Chang, Niladrish Chatterjee, Aditya Agrawal, Mike O'Connor, Onur Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [2.6M]

  • SRPT for Multiserver Systems. Isaac Grosof, Ziv Scully, Mor Harchol-Balter. Performance Evaluation , vol. 127-128, Nov. 2018, pp. 154-175. Also in Proc. 36th International Symposium on Computer Performance, Modeling, Measurements, and Evaluation (Performance 2018) , Toulouse, France, December 2018. Best Student Paper Award.
    Abstract / PDF [780K]

  • SOAP Bubbles: Robust Scheduling Under Adversarial Noise. Ziv Scully, Mor Harchol-Balter. 56th Annual Allerton Conference on Communication, Control, and Computing, 2-5 Oct. 2018. Monticello, IL.
    Abstract / PDF [245K]

  • Exploiting Locality in Graph Analytics through Hardware-Accelerated Traversal Scheduling. Anurag Mukkara, Nathan Beckmann, Maleen Abeydeera, Xiaosong Ma, Daniel Sanchez. 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 20-24 Oct. 2018, Fukuoka, Japan.
    Abstract / PDF [660K]

  • Practical Bounds on Optimal Caching with Variable Object Sizes. Daniel S. Berger, Nathan Beckmann, Mor Harchol-Balter. Proceedings of the ACM on Measurement and Analysis of Computing Systems. Vol. 2, No. 2, Article 32, June 2018.
    Abstract / PDF [1.2M]

  • Practical Bounds on Offline Caching with Variable Object Sizes. Daniel Berger, Nathan Beckmann, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 2, Article 32. June 2018. POMACS 2018.
    Abstract / PDF [1.2M]

  • LHD: Improving Cache Hit Rate by Maximizing Hit Density. Nathan Beckmann, Haoxian Chen, Asaf Cidon. 15th USENIX Symposium on Networked Systems Design and Implementation ({NSDI} 18), April 9-11, 2018, Renton, WA..
    Abstract / PDF [1.1M]

  • GoogleWorkloads for Consumer Devices: Mitigating Data Movement Bottlenecks. Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu. ASPLOS’18, March 24–28, 2018, Williamsburg, VA, USA.
    Abstract / PDF [885K]

  • SOAP: One Clean Analysis of All Age-Based Scheduling Policies. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 1, Article 16, March 2018.
    Abstract / PDF [885K]

  • Efficient Multi-Tenant Inference on Video using Microclassifiers. Giulio Zhou, Thomas Kim, Christopher Canel, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [1.5M]

  • Towards Optimality in Parallel Job Scheduling. Benjamin Berg, Jan-Pieter Dorsman, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 1, No. 2, Article 40. Publication date: December 2017.
    Abstract / PDF [4.3M]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size. Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf & Benny Van Houdt. Transactions on Networking, September 2017.
    Abstact / PDF [544K]

  • Workload Analysis and Caching Strategies for Search Advertising Systems. Conglong Li, David G. Andersen, Qiang Fu, Sameh Elnikety, Yuxiong He. SoCC ’17, September 24–27, 2017, Santa Clara, CA, USA.
    Abstract / PDF [650K]

  • Scheduling for Efficiency and Fairness in Systems with Redundancy. Kristen Gardner, Mor Harchol-Balter, Esa Hyyti & Rhonda Righter. Performance Evaluation, July 2017.
    Abstact / PDF [784K]

  • Carpool: A Bufferless On-Chip Network Supporting Adaptive Multicast and Hotspot Alleviation. Xiyue Xiang, Wentao Shi, Saugata Ghose, Lu Peng, Onur Mutlu & Nian-Feng Tzeng. In Proc. of the International Conference on Supercomputing (ICS), Chicago, IL, June 2017.
    Abstact / PDF [6.7M]

  • Cachier: Edge-caching for Recognition Applications. Utsav Drolia, Katherine Guo (Bell Labs), Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. The 37th IEEE International Conference on Distributed Computing Systems (ICDCS 2017), June 5 – 8, 2017, Atlanta, GA, USA.
    Abstract / PDF [5.4M]

  • Efficient Redundancy Techniques for Latency Reduction in Cloud Systems. Gauri Joshi, Emina Soljanin & Gregory Wornell. ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS) Volume 2 Issue 2, May 2017.
    Abstact / PDF [1.38M]

  • AdaptSize: Orchestrating the Hot Object Memory Cache in a Content Delivery Network. Daniel S. Berger, Ramesh K. Sitaraman, Mor Harchol-Balter. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17). March 27–29, 2017, Boston, MA.
    Abstract / PDF [560K]

  • Towards Edge-caching for Image Recognition. Utsav Drolia, Katherine Guo, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. First Workshop on Smart Edge Computing and Networking (SmartEdge) '17, held in conjunction with PerCom 2017, March 13 - 17, 2017, Hawaii, USA.
    Abstract / PDF [5.1M]

  • Prescriptive Safety-Checks through Automated Proofs for Control-Flow Integrity. Jiaqi Tan. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, November 2016.
    Abstract / PDF [5.75M]

  • A Survey of Security Vulnerabilities in Bluetooth Low Energy Beacons. Hui Jun Tay, Jiaqi Tan, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-109. November 2016.
    Abstract / PDF [110K]

  • AUSPICE-R: Automatic Safety-Property Proofs for Realistic Features in Machine Code. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, Priya Narasimhan.14th Asian Symposium on Programming Languages and Systems (APLAS), November 2016.
    Abstract / PDF [325K]

  • EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding. K. V. Rashmi, Mosharaf Chowdhury, Jack Kosaian, Ion Stoica & Kannan Ramchandran. 12th USENIX Symposium on Operating Systems Design and Implementation, Nov. 2–4, 2016, Savannah, GA.
    Abstract / PDF [830K]

  • Stateless Model Checking with Data-Race Preemption Points. Ben Blum, Garth A. Gibson. SPLASH 2016 OOPSLA, Oct 30 - Nov 4, 2016, Amsterdam, Netherlands.
    Abstract / PDF [704K]

  • Zorua: A Holistic Approach to Resource Virtualization in GPUs. Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha,Saugata Ghose, Adwait Jogu, Phillip B. Gibbons, Onur Mutlu. 49th IEEE/ACM International Symposium on Microarchitecture (MICRO’16), October 15-19, 2016, Taipei, Taiwan.
    Abstract / PDF [1.5M]

  • A Model for Application Slowdown Estimation in On-Chip Networks and Its Use for Improving System Fairness and Performance. Xiyue Xiang, Saugata Ghose, Onur Mutlu, Nian-Feng Tzeng. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [399K]

  • Accelerating Pointer Chasing in 3D-Stacked Memory: Challenges, Mechanisms, Evaluation.Kevin Hsieh, Samira Khan, Nandita Vijaykumar, Kevin K. Chang, Amirali Boroumand, Saugata Ghose, Onur Mutlu. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [1.67M]

  • PCFIRE: Towards Provable Preventative Control-Flow Integrity Enforcement for Realistic Embedded Software. Jiaqi Tan, Hui Jun Tay, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. EMSOFT’16, October 01-07, 2016, Pittsburgh, PA, USA.
    Abstract / PDF [722K]

  • Poster Abstract: BUFS: Towards Bottom-Up Foundational Security for Software in the Internet-of-Things. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 1st IEEE/ACM Symposium on Edge Computing (SEC 2016), October 2016.
    Abstract / PDF [682K]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf. IEEE Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS 2016), London, UK, September 2016.
    Abstract / PDF [244K]

  • Soundness Proofs for Iterative Deepening. Ben Blum. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-103, September 6, 2016.
    Abstract / PDF [356K]

  • Parallel Algorithms for Asymmetric Read-Write Costs. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. 28th ACM Symposium on Parallelism in Algorithms and Architectures Jul 11, 2016 - Jul 13, 2016. Asilomar State Beach, California, USA.
    Abstract / PDF [386K]

  • A Case for Hierarchical Rings with Deflection Routing: An energy-efficient on-chip communication substrate. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu, Parallel Computing, Volume 54, May 2016, Pages 29-45, ISSN 0167-8191.
    Abstract / PDF [2M]

  • Achieving both High Energy Efficiency and High Performance in On-Chip Communication using Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu. arXiv:1602.06005v1 [cs.DC], 18 Feb 2016.
    Abstract / PDF [576K]

  • Scheduling Techniques for Hybrid Circuit/Packet Networks. He Liu, Matthew K. Mukerjee, Conglong Li, Nicolas Feltman, George Papen, Stefan Savage, Srinivasan Seshan, Geoffrey M. Voelker, David G. Andersen, Michael Kaminsky, George Porter, Alex C. Snoeren. In 11th International Conference on emerging Networking EXperiments and Technologies (CoNEXT 2015), Heidelberg, Germany, December 2015. Nominated for Best Paper.
    Abstract / PDF [510K]

  • Decoupled Direct Memory Access: Isolating CPU and IO Traffic by Leveraging a Dual-Data-Port DRAM. Donghyuk Lee, Lavanya Subramanian, Rachata Ausavarungnirun, Jongmoo Choi, Onur Mutlu. Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques (PACT), San Francisco, CA, USA, October 2015.
    Abstract / PDF [1.8M]

  • Tracking and Reducing Uncertainty in Dataflow Analysis-Based Dynamic Parallel Monitoring. Michelle Goodstein, Phillip Gibbons, Michael Kozuch, Todd Mowry. International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), Oct 18, 2015 - Oct 21, 2015, San Francisco, CA.
    Abstract / PDF [341K]

  • Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance. Rachata Ausavarungnirun, Saugata Ghose, Onur Kayiran, Gabriel H. Loh, Chita R. Das, Mahmut T. Kandemir, Onur Mutlu. Proceedings of the The 24th International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), San Francisco, October 2015.
    Abstract / PDF [556K]

  • Krowd: A Key-Value Store for Crowded Venues. Utsav Drolia, Nathan Mickulicz, Rajeev Gandhi, Priya Narasimhan.10th ACM Workshop on Mobility in the Evolving Internet Architecture (MobiArch), held in Paris, France in September 2015. Best Paper.
    Abstract / PDF [696K]

  • A Low-Overhead, Fully-Distributed, Guaranteed-Delivery Routing Algorithm for Faulty Network-on-Chips. Mohammad Fattah, Antti Airola, Rachata Ausavarungnirun, Nima Mirzaei, Pasi Liljeberg, Juha Plosila, Siamak Mohammadi, Tapio Pahikkala, Onur Mutlu, Hannu Tenhunen. Proceedings of the 9th ACM/IEEE International Symposium on Networks on Chip (NOCS), Vancouver, BC, Canada, September 2015.
    Abstract / PDF [1M]

  • AUSPICE: Automated Safety Property Verification for Unmodified Executables. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, and Priya Narasimhan. In 7th Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), July 2015.
    Abstract / PDF [390K]

  • Reducing Latency via Redundant Requests: Exact Analysis. Kristen Gardner, Sam Zbarsky, Sherwin Doroudi, Mor Harchol-Balter, Esa Hyytia, Alan Scheller-Wolf. Proceedings of ACM Sigmetrics/Performance 2015 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 15), Portland, OR. June 2015.
    Abstract / PDF [725K]

  • A Case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling Efficient Data Compression. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita Das, Mahmut Kandemir, Todd C. Mowry, Onur Mutlu. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [1M]

  • PocketTrend: Timely Identification and Delivery of Trending Search Content to Mobile Users. Gennady Pekhimenko, Dimitrios Lymberopoulos, Oriana Riva, Karin Strauss, Doug Burger. Proceedings of the 24th International World Wide Web Conference (WWW), Florence, Italy, May 2015.
    Abstract / PDF [504K]

  • Raising the Bar for Using GPUs in Software Packet Processing. Anuj Kalia, Dong Zhou, Michael Kaminsky, David G. Andersen. 12th Usenix Symposium on Networked Systems Design (NSDI'15). May 4-6, 2015, Oakland, CA.
    Abstract / PDF [386K]

  • Efficient Hypervisor Based Malware Detection. Peter Friedrich Klemperer. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, May 2015.
    Abstract / PDF [1.3M]

  • Optimal Scheduling for Jobs with Progressive Deadlines. Kristen Gardner, Sem Borst, Mor Harchol-Balter. IEEE INFOCOM 15, Hong Kong, April, 2015.
    Abstract / PDF [558K]

  • Mitigating Prefetcher-Caused Pollution Using Informed Caching Policies for Prefetched Blocks Vivek Seshadri, Samihan Yedkar, Hongyi Xin, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. ACM Transactions on Architecture and Code Optimization (TACO), Volume 11 Issue 4, January 2015, Article No. 51.
    Abstract / PDF [1.1M]

  • Having Your Cake and Eating It Too: Jointly Optimal Erasure Codes for I/O, Storage, and Network-bandwidth. KV Rashmi, Preetum Nakkiran, Jingyan Wang, Nihar B. Shah & Kannan Ramchandran. USENIX FAST, Feb 2015, Santa Clara, CA. Best paper.
    Abstract / PDF [560K]

  • Toggle-Aware Compression for GPUs. Gennady Pekhimenko, Evgeny Bolotin, Mike O'Connor, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. IEEE Computer Architecture Letters (CAL).
    Abstract / PDF [346K]