Recent Publications
- CARP: Range Query-Optimized Indexing for Streaming Data. Ankush Jain, Charles D. Cranor, Qing Zheng, Bradley W. Settlemyer, George Amvrosiadis, Gary Grider. SC24, November 17-22, 2024, Atlanta, Georgia, USA.
Abstract / PDF [1M]
- Morph: Efficient File-Lifetime Redundancy Management for Cluster File Systems. Timothy Kim, Sanjith Athlur, Saurabh Kadekodi, Francisco Maturana, Dax Delvira, Arif Merchant, Gregory R. Ganger, K. V. Rashmi. SOSP ’24, November 4–6, 2024, Austin, TX, USA.
Abstract / PDF [4.6M]
- Reducing Cross-Cloud/Region Costs with the Auto-Configuring MACARON Cache. Hojin Park, Ziyue Qiu, Gregory R. Ganger, George Amvrosiadis. SOSP ’24, November 4–6, 2024, Austin, TX, USA.
Abstract / PDF [2.7M]
- The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions. William Zhang, Wan Shen Lim, Matthew Butrovich, Andrew Pavlo. Proceedings of the VLDB Endowment, 17(11): 3373-3387, 2024. July 2024.
Abstract / PDF [2.6M]
- Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems. Wan Shen Lim, Lin Ma, William Zhang, Matthew Butrovich, Samuel Arch, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 17, No. 11, ISSN 2150-8097. July 2024.
Abstract / PDF [1.25M]
- DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving. Foteini Strati, Sara McAllister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic. Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, July 21-27, 2024.
Abstract / PDF [2.1M]
- Erasure Coded Neural Network Inference via Fisher Averaging. Divyansh Jhunjhunwala, Neharika Jali, Gauri Joshi, Shiqiang Wang. IEEE International Symposium on Information Theory (ISIT), Athens, Greece, July 7-12, 2024.
Abstract / PDF [175K]
- Data Caching for Enterprise-Grade Petabyte-Scale OLAP. Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian (James) Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen. Proceedings of the 2024 USENIX Annual Technical Conference. July 10–12, 2024 • Santa Clara, CA, USA.
Abstract / PDF [932K]
- A Call for Research on Storage Emissions. Sara McAllister, Fiodar Kazhamiaka, Daniel S. Berger, Rodrigo Fonseca, Kali Frost, Aaron Ogus, Maneesh Sah, Ricardo Bianchini, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger. HotCarbon’24, July 9, 2024, Santa Cruz, CA.
Abstract / PDF [2.9M] / Slides
- FairyWREN: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces. Sara McAllister, Yucong “Sherry” Wang, Benjamin Berg, Daniel S. Berger, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger. 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI '24), July 10–12, 2024. Santa Clara, CA, USA.
Abstract / PDF [6.7M]
- Designing Cloud Servers for Lower Carbon. Jaylen Wang, Daniel S. Berger, Fiodar Kazhamiaka, Celine Irvene, Chaojie Zhang, Esha Choukse, Kali Frost, Rodrigo Fonseca, Brijesh Warrier, Chetan Bansal, Jonathan Stern, Ricardo Bianchini, Akshitha Sriraman. Proceedings of the 51st Intl. Symposium on Computer Architecture (ISCA 2024), Buenos Aires, Argentina, June 2024.
Abstract / PDF [1.35M]
- Perspective: A Principled Framework for Pliable and Secure Speculation in Operating Systems. Tae Hoon Kim, David Rudo, Kaiyang Zhao, Zirui Neil Zhao, Dimitrios Skarlatos. Proceedings of the 51st Intl. Symposium on Computer Architecture (ISCA 2024), Buenos Aires, Argentina, June 2024.
Abstract / PDF [600K]
- Agents of Autonomy: A Systematic Study of Robotics on Modern Hardware. Mohammad Bakhshalipour and Phillip B. Gibbons. In Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS/PERFORMANCE Abstracts ’24), June 10–14, 2024, Venice, Italy. ACM, New York, NY, USA. BEST PAPER AT SIGMETRICS '24!
Abstract / PDF [425K] / Code
- Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs. Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak. arXiv:2406.01566v1 [cs.DC] 3 Jun 2024.
Abstract / PDF [775K]
- Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems. Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi. International Conference on Artificial Intelligence and Statistics (AISTATS), May 2nd - May 4th, 2024, Valencia, Spain.
Abstract / PDF [800K]
- Extending the Mochi Methodology to Enable Dynamic HPC Data Services. M. Dorier, P. Carns, R. Ross, S. Snyder, R. Latham, A. Gueroudji, G. Amvrosiadis, C. Cranor, J. Soumagne. In 5th Workshop on Extreme-Scale Storage and Analysis (ESSA 2024), May 2024.
Abstract / PDF [250K]
- BBQ: A Fast and Scalable Integer Priority Queue for Hardware Packet Scheduling. Nirav Atre, Hugo Sadok, Justine Sherry. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI '24), April 16–18, 2024. Santa Clara, CA.
Abstract / PDF [1M]
- SIEVE is Simpler than LRU: An Efficient Turn-Key Eviction Algorithm for Web Caches. Yazhuo Zhang, Juncheng Yang, Yao Yue, Ymir Vigfusson, K. V. Rashmi. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI'24), April 16–18, 2024. Santa Clara, CA. COMMUNITY AWARD FOR BEST PAPER!
Abstract / PDF [1M]
- Baleen: ML Admission & Prefetching for Flash Caches. Daniel Lin-Kit Wong, Hao Wu, Carson Molder, Sathya Gunasekar, Jimmy Lu, Snehal Khandkar, Abhinav Sharma, Daniel S. Berger, Nathan Beckmann, Gregory R. Ganger. 22nd USENIX Conference on File and Storage Technologies (FAST'24), Feb. 27–29, 2024, Santa Clara, CA.
Abstract / PDF [2.7M] / Code / Traces
- Survey and Evaluation of Database Management System Extensibility. Abigale Kim. Carnegie Mellon University School of Computer Science M.S. Thesis CMU-CS-23-144. January 2024.
Abstract / PDF [1.25M]
- Is Perfect Hashing Practical for OLAP Systems? Kevin P. Gaffney, Jignesh M. Patel. 4th Annual Conference on Innovative Data Systems Research (CIDR ’24) January 14-17, 2024, Chaminade, USA.
Abstract / PDF [765K]
- Dear User-Defined Functions, Inlining isn’t working out so great for us. Let’s try batching to make our relationship work. Sincerely, SQL. Kai Franz, Samuel Arch, Denis Hirn, Torsten Grust, Todd C. Mowry, Andrew Pavlo. Conference on Innovative Data Systems Research (CIDR 2024), Chaminade, CA, USA, January 14-17, 2024.
Abstract / PDF [545K]
- Address Scaling: Architectural Support for Fine-Grained Thread-Safe Metadata Management. Deepanjali Mishra, Konstantinos Kanellopoulos, Ashish Panwar, Akshitha Sriraman, Vivek Seshadri, Onur Mutlu, Todd C. Mowry. IEEE Computer Architecture Letters, Volume: 23, Issue: 1, Jan.-June 2024.
Abstract / PDF [540K]
- UDIR: Towards a Unified Compiler Framework for Reconfigurable Dataflow Architectures. Nikhil Agarwal, Mitchell Fream, Souradip Ghosh, Brian C. Schwedock, Nathan Beckmann. IEEE Computer Architecture Letters, Volume: 23, Issue: 1, Jan.-June 2024.
Abstract / PDF [1.28M]
- Rethinking the Encoding of Integers for Scans on Skewed Data. Martin Prammer, Jignesh M. Patel. Proc. ACM Manag. Data, Vol. 1, No. 4 (SIGMOD), Article 257. December 2023.
Abstract / PDF [1.5M]
- Tigger: A Database Proxy That Bounces With User-Bypass. Matthew Butrovich, Karthik Ramanathan, John Rollinson, Wan Shen Lim, William Zhang, Justine Sherry, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 16, No. 11, 2023.
Abstract / PDF [1.3M]
- Simple Adaptive Query Processing vs. Learned Query Optimizers: Observations and Analysis. Yunjia Zhang, Yannis Chronis, Jignesh M. Patel, Theodoros Rekatsinas. Proceedings of the VLDB Endowment, Vol. 16, No. 11.
Abstract / PDF [950K]
- Agents of Autonomy: A Systematic Study of Robotics on Modern Hardware. Mohammad Bakhshalipour, Phillip B. Gibbons. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Volume 7, Issue 3, Article No.: 43, December 2023. To appear at ACM SIGMETRICS / IFIP PERFORMANCE 2024, Venice, Italy, June 10-14, 2024.
Abstract / PDF [2.1M]
- Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments. Ziqi Wang, Kaiyang Zhao, Pei Li, Andrew Jacob, Michael Kozuch, Todd Mowry, Dimitrios Skarlatos. MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture. October 2023. Toronto, Canada.
Abstract / PDF [935K]
- FIFO Queues Are All You Need for Cache Eviction. Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, Rashmi Vinayak. SOSP '23: Proceedings of the 29th Symposium on Operating Systems Principles, October 2023. Koblenz, Germany.
Abstract / PDF [1.6M]
- Sia: Heterogeneity-aware, Goodput-optimized ML-cluster Scheduling. Suhas Jayaram Subramanya, Daiyaan Arfeen, Shouxu Lin, Aurick Qiao, Zhihao Jia, Gregory R. Ganger. ACM SIGOPS 29th Symposium on Operating Systems Principles (SOSP ’23), October 23–26, 2023, Koblenz, Germany.
Abstract / PDF [1.23M]
- Ensō: A Streaming Interface for NIC-Application Communication. Hugo Sadok and Nirav Atre, Carnegie Mellon University; Zhipeng Zhao, Microsoft; Daniel S. Berger, Microsoft Research & University of Washington; James C. Hoe, Carnegie Mellon University; Aurojit Panda, New York University; Justine Sherry, Carnegie Mellon University; Ren Wang, Intel. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI). July 10–12, 2023. Boston, MA.
Abstract / PDF [800K]
- Peeling Back the Carbon Curtain: Carbon Optimization Challenges in Cloud Computing. Jaylen Wang, Udit Gupta, Akshitha Sriraman. HotCarbon 2023. July 9, 2023, Boston, MA, USA.
Abstract / PDF [840K]
- Runahead A*: Speculative Parallelism for A* with Slow Expansions. Mohammad Bakhshalipour, Mohamad Qadri, Dominic Guri, Seyed Borna Ehsani, Maxim Likhachev, Phillip B. Gibbons. ICAPS 2023, Prague, Czech Republic, July 8-13, 2023.
Abstract / PDF [710K]
- FIFO Can Be Better than LRU: The Power of Lazy Promotion and Quick Demotion. Juncheng Yang, Ziyue Qiu, Yazhuo Zhang, Yao Yue, K. V. Rashmi. HotOS ’23, June 22–24, 2023, Providence, RI, USA.
Abstract / PDF [1.2M]
- Contiguitas: The Pursuit of Physical Memory Contiguity in Datacenters. Kaiyang Zhao, Kaiwen Xue, Ziqi Wang, Dan Schatzberg, Leon Yang, Antonis Manousis, Johannes Weiner, Rik van Riel, Bikash Sharma, Chunqiang Tang, Dimitrios Skarlatos. ISCA ’23, June 17–21, 2023, Orlando, FL, USA. BEST PAPER AWARD!
Abstract / PDF [468K]
- PIM-trie: A Skew-resistant Trie for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. SPAA '23: Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, June 2023, Orlando, FL.
Abstract / PDF [1.23M]
- Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud. Hojin Park, Gregory R. Ganger, George Amvrosiadis. SYSTOR '23: Proceedings of the 16th ACM International Conference on Systems and Storage, Haifa, Israel, June 5-7, 2023.
Abstract / PDF [1.4M]
- Validating Large Language Models with ReLM. Michael Kuchnik, Virginia Smith, George Amvrosiadis. 6th MLSys Conference, Miami Beach, FL, USA, June 4-8, 2023. OUTSTANDING PAPER AWARD AT MLSYS23!
Abstract / PDF [1.2M]
- MANIC: A 19µW @ 4MHz, 256 MOPS/mW, RISC-V Microcontroller with Embedded MRAM Main Memory and Vector-Dataflow Co-Processor in 22nm Bulk FinFET CMOS. Graham Gobieski, Oguz Atli, Cagri Erbagci, Ken Mai, Nathan Beckmann, Brandon Lucia. IEEE International Symposium on Circuits and Systems (ISCAS), Monterey, CA, May 21-25, 2023.
Abstract / PDF [3.6M]
- FrozenHot Cache: Rethinking Cache Management for Modern Hardware. Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu. EuroSys 2023, Rome, Italy, May 8th-12th, 2023.
Abstract / PDF [1.14M]
- Federated Learning Under Distributed Concept Drift. Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons. International Conference on Artificial Intelligence and Statistics (AISTATS), Apr 2023. In preprint arXiv:2206.00799v1.
Abstract / PDF [956K]
- Design Principles for Replicated Storage Systems Built on Emerging Storage Technologies. Thomas Kim. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-23-109. March 2023.
Abstract / PDF [47.5M]
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, Mark D. Hill, Marcus Fontoura, Ricardo Bianchini. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. DISTINGUISHED PAPER AWARD!
Abstract / PDF [1.7M]
- GL-Cache: Group-level Learning for Efficient and High-performance Caching. Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi. 21st USENIX Conference on File and Storage Technologies (FAST '23). Feb. 21–23, 2023, Santa Clara, CA.
Abstract / PDF [1.84M]
- RAIZN: Redundant Array of Independent Zoned Namespaces. Thomas Kim, Jekyeom Jeon, Nikhil Arora, Huaicheng Li, Michael Kaminsky, David G. Andersen, Gregory R. Ganger, George Amvrosiadis, Matias Bjørling. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-22-101, January 2022.
Abstract / PDF [1.65M]
- Database Gyms. Wan Shen Lim, Matthew Butrovich, William Zhang, Andrew Crotty, Lin Ma, Peijing Xu, Johannes Gehrke, Andrew Pavlo. CIDR 2023. 13th Annual Conference on Innovative Data Systems Research (CIDR ’23). January 8-11, 2023, Amsterdam, The Netherlands.
Abstract / PDF [800K] / Slides
- Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays. Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi. ACM Transactions on Storage, Vol. 19, No. 1, Article 5. January 2023.
Abstract / PDF [1.6M]
- Realizing Value in Shared Compute Infrastructures. Andrew Chung. Carnegie Mellon University PhD Dissertation CMU-CS-22-151, December 2022.
Abstract / PDF [3M]
- PIM-tree: A Skew-resistant Index for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. Proc. VLDB Endow. 16(4): 946-958 (2022). In preprint. BEST PAPER RUNNER UP!
Abstract / PDF [1.1M]
- Rateless Sum-Recovery Codes For Distributed Non-Linear Computations. Ankur Mallick, Gauri Joshi. Information Theory Workshop (ITW), November 6-9, 2022. Mumbai, India.
Abstract / PDF [728K]
- RipTide: A Programmable, Energy-minimal Dataflow Compiler and Architecture. Graham Gobieski, Souradip Ghosh, Marijn Heule, Todd Mowry, Tony Nowatzki, Nathan Beckmann, Brandon Lucia. MICRO 2022 - 55th IEEE/ACM International Symposium on Microarchitecture, October 1–5, 2022, Chicago, Illinois, USA.
Abstract / PDF [3.7M]
- MATCHA: A Matching-Based Link Scheduling Strategy to Speed up Distributed Optimization. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. IEEE Transactions on Signal Processing, Oct 2022.
Abstract / PDF [1.85M]
- Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S Berger, Nathan Beckmann, Gregory R Ganger. ACM Transactions on Storage, Vol. 18, No. 3, Article 21. August 2022.
Abstract / PDF [1.4M]
- SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling. Nirav Atre, Hugo Sadok, Erica Chiang, Weina Wang, Justine Sherry. SIGCOMM ’22, August 22–26, 2022, Amsterdam, Netherlands.
Abstract / PDF [2M]
- Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines. Michael Kuchnik, Ana Klimovic, Jiří Šimša, Virginia Smith, George Amvrosiadis. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
Abstract / PDF [7M]
- The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
Abstract / PDF [1.3M]
- Matchmaker: Data Drift Mitigation in Machine Learning for Large-scale Systems. Ankur Mallick, Kevin Hsieh, Behnaz Arzani, Gauri Joshi. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
Abstract / PDF [500K]
- Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays. Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi. To appear, ACM Transactions on Storage, 2022, (accepted July 2022).
Abstract / PDF [656K]
- Tiger: Disk-Adaptive Redundancy Without Placement Restrictions. Saurabh Kadekodi, Francisco Maturana, Sanjith Athlur, Arif Merchant, K. V. Rashmi, Gregory R. Ganger. Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI'22), July 11–13, 2022, Carlsbad, CA, USA.
Abstract / PDF [1.25M]
- Bandwidth Cost of Code Conversions in the Split Regime. Francisco Maturana and K. V. Rashmi. 2022 IEEE International Symposium on Information Theory (ISIT22). June 26-July 1, 2022, Espoo, Finland.
Abstract / PDF [1.22M]
- RACOD: Algorithm/Hardware Co-design for Mobile Robot Path Planning. Mohammad Bakhshalipour, Seyed Borna Ehsani, Mohamad Qadri, Dominic Guri, Maxim Likhachev, Phillip B. Gibbons. ISCA ’22, June 18–22, 2022, New York, NY, USA.
Abstract / PDF [2.2M]
- Thermometer: Profile-Guided BTB Replacement for Data Center Applications. Shixin Song, Tanvir Ahmed Khan, Sara Mahdizadeh Shahri, Akshitha Sriraman, Niranjan K Soundararajan, Sreenivas Subramoney, Daniel A. Jiménez, Heiner Litz, Baris Kasikci. ISCA ’22, June 18–22, 2022, New York, NY, USA.
Abstract / PDF [1.45M]
- täkō: A Polymorphic Cache Hierarchy for General-Purpose Optimization of Data Movement. Brian C. Schwedock, Piratach Yoovidhya, Jennifer Seibert, Nathan Beckmann. ISCA ’22, June 18–22, 2022, New York, NY, USA.
Abstract / PDF [1.75M]
- Tastes Great! Less Filling! High Performance and Accurate Training Data Collection for Self-Driving Database Management Systems. Matthew Butrovich, Wan Shen Lim, Lin Ma, John Rollinson, William Zhang, Yu Xia, Andrew Pavlo. SIGMOD ’22, June 12–17, 2022, Philadelphia, PA, USA.
Abstract / PDF [1.1M]
- RTRBench: A Benchmark Suite for Real-Time Robotics. Mohammad Bakhshalipour, Maxim Likhachev, Phillip B. Gibbons. 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 22-24 May 2022, Singapore.
Abstract / PDF [1.85M]
- Adapting the RACER Architecture to Integrate Improved In-ReRAM Logic Primitives. Minh S. Q. Truong, Liting Shen, Alexander Glass, Alison Hoffmann, L. Richard Carley, James A. Bain, Saugata Ghose. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Early Access, 12 May 2022.
Abstract / PDF [4.3M]
- Varuna: Scalable, Low-cost Training of Massive Deep Learning Models. Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, Nipun Kwatra. EuroSys ’22, April 5-8, 2022, Rennes, France. BEST PAPER AWARD!
Abstract / PDF [1.5M]
- C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery. Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman. 19th USENIX Symposium on Networked Systems Design and Implementation. April 4–6, 2022 • Renton, WA, USA.
Abstract / PDF [1.9M] / Slides / Talk Video
- MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations. Nandita Vijaykumar, Ataberk Olgun, Konstantinos Kanellopoulos, F. Nisa Bostanci, Hasan Hassan, Mehrshad Lotfi, Phillip B. Gibbons, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 19, No. 2, Article 26. Publication date: March 2022.
Abstract / PDF [1.75M]
- Client Selection in Federated Learning: Convergence Analysis and Power-of-Choice Selection Strategies. Yae Jee Cho, Jianyu Wang, Gauri Joshi. International Conference on Artificial Intelligence and Statistics (AISTATS), March 2022.
Abstract / PDF [1.85M]
- TMO: Transparent Memory Offloading in Datacenters. Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos. ASPLOS ’22, February 28 – March 4, 2022, Lausanne, Switzerland. BEST PAPER AWARD AT ASPLOS '22!
Abstract / PDF [1.65M]
- FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients. Jianyu Wang, Hang Qi, Ankit Singh Rawat, Sashank Reddi, Sagar Waghmare, Felix X. Yu, Gauri Joshi. arXiv:2201.11865v2 [cs.LG], 16 Feb 2022.
Abstract / PDF [660K]
- Are You Sure You Want to Use MMAP in Your Database Management System? Andrew Crotty, Viktor Leis, Andrew Pavlo. 12th Annual Conference on Innovative Data Systems Research (CIDR ’22). January 9-12, 2022, Chaminade, USA.
Abstract / PDF [690K] / Talk Video
- Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation. Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi. 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Dec. 6-14, 2021. Virtual Event.
Abstract / PDF [650K]
- It’s Time to Talk About HPC Storage: Perspectives on the Past and Future. Bradley Settlemyer, George Amvrosiadis, Philip Carns, Robert Ross. IEEE Computer Society Computing in Science & Engineering November/December 2021.
Abstract / PDF [483K]
- Arithmetic-Intensity-Guided Fault Tolerance for Neural Network Inference on GPUs. Jack Kosaian, K. V. Rashmi. SC’21, November 14–19, 2021, St. Louis, MO, USA.
Abstract / PDF [256K] / Slides / Code
- The Most Common Queueing Theory Questions Asked by Computer Systems Practitioners. Mor Harchol-Balter and Ziv Scully. First International Workshop on Teaching Performance Analysis of Computer Systems (TeaPACS 2021), in conjunction with the IFIP Performance 2021 Conference. Milan, Italy, Nov 2021.
Abstract / PDF [286K]
- DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. SC ’21, November 14–19, 2021, St. Louis, MO, USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
Abstract / PDF [1M] / Slides / Talk Video
- The Case for Phase-Aware Scheduling of Parallelizable Jobs. Ben Berg, Justin Whitehouse, Ben Moseley, Weina Wang, Mor Harchol-Balter. IFIP Performance 2021. Milan, Italy, November 2021.
Abstract / PDF [978K]
- RACER: Bit-Pipelined Processing Using Resistive Memory. Minh S. Q. Truong, Eric Chen, Deanyone Su, Alexander Glass, Liting Shen, L. Richard Carley, James A. Bain, Saugata Ghose. 54th IEEE/ACM International Symposium on Microarchitecture, ser. MICRO 2021, Oct. 2021.
Abstract / PDF [2.2M]
- The Gittins Policy in the M/G/1 Queue. Ziv Scully, Mor Harchol-Balter. 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt 2021) Philadelphia, PA, Oct 2021.
Abstract / PDF [265K]
- WineFS: A Hugepage-aware File System for Persistent Memory that Ages Gracefully. Rohan Kadekodi, Saurabh Kadekodi, Soujanya Ponnapalli, Harshad Shirwadkar, Gregory R. Ganger, Aasheesh Kolli, Vijay Chidambaram. 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021.
Abstract / PDF [3M]
- IODA: A Host/Device Co-Design for Strong Predictability Contract on Modern Flash Storage. Huaicheng Li, Martin L. Putra, Ronald Shi, Xing Lin, Gregory R. Ganger, Haryadi S. Gunawi. SOSP ’21, October 26-29, 2021, Virtual Event, Germany.
Abstract / PDF [710K] / Talk Video
- Personalized Federated Learning for Heterogeneous Clients with Clustered Knowledge Transfer. Yae Jee Cho, Jianyu Wang, Tarun Chiruvolu, Gauri Joshi. arXiv:2109.08119v1 [cs.LG] 16 Sep 2021.
Abstract / PDF [852K]
- Rateless Codes for Distributed Non-linear Computations. Ankur Mallick, Sophie Smith, Gauri Joshi. International Symposium on Topics in Coding, Montréal, Québec, Canada, from August 30th to September 3rd, 2021.
Abstract / PDF [1.1M]
- A Novel Framework for the Analysis and Design of Heterogeneous Federated Learning. Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, H. Vincent Poor. IEEE Transactions on Signal Processing, Sept 2021.
Abstract / PDF [835K]
- Cooperative SGD: A Unified Framework for the Analysis of Local-Update SGD. Jianyu Wang, Gauri Joshi. Journal of Machine Learning Research (JMLR), 2021. September 2021.
Abstract / PDF [860K]
- Kangaroo: Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel Berger, Nathan Beckmann, Gregory R. Ganger. Proceedings of the 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021. Virtual Event. BEST PAPER AT SOSP'21!
Abstract / PDF [7.8M] / Talk Video-Short / Talk Video-Long / Blog Post
- Irregular Array Codes with Arbitrary Access Sets for Geo-Distributed Storage. Francisco Maturana, K. V. Rashmi. 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
Abstract / PDF [288K]
- Bandwidth Cost of Code Conversions in Distributed Storage: Fundamental Limits and Optimal Constructions. Francisco Maturana, K. V. Rashmi. 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
Abstract / PDF [325K]
- The Processing-in-Memory Model. Hongbo Kang, Phillip B. Gibbons, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
Abstract / PDF [1.25M]
- Progressive Compressed Records: Taking a Byte out of Deep Learning Data. Michael Kuchnik, George Amvrosiadis, Virginia Smith. Proceedings of the VLDB Endowment, Vol. 14, No. 11 ISSN 2150-8097, July 2021.
Abstract / PDF [3.86M]
- Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size. Jack Kosaian, Amar Phanishayee, Matthai Philipose, Debadeepta Dey, K. V. Rashmi. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 18-24 July 2021, Virtual Event.
Abstract / PDF [518K] / Appendix / Code / Slides and Talk Video
- DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
Abstract / PDF [1M]
- ZNS: Avoiding the Block Interface Tax for Flash-based SSDs. Matias Bjørling, Abutalib Aghayev, Hans Holmberg, Aravind Ramesh, Damien Le Moal, Gregory R. Ganger, George Amvrosiadis. USENIX Annual Technical Conference (USENIX 2021), July 14-16, 2021, Virtual Event.
Abstract / PDF [305K] / Slides / Talk Video
- Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing. 15th USENIX Symposium on Operating Systems Design and Implementation, Virtual Event, July 14–16, 2021. BEST PAPER AT OSDI'21!
Abstract / PDF [930K] / Slides / Talk Video
- Block-Granularity-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
Abstract / PDF [880K]
- Spitfire: A Three-Tier Buffer Manager for Volatile and Non-Volatile Memory. Xinjing Zhou, Joy Arulraj, Andrew Pavlo, David Cohen. SIGMOD/PODS '21: Proceedings of the 2021 International Conference on Management of Data. June 2021.
Abstract / PDF [1.28M]
- Ripple: Profile-Guided Instruction Cache Replacement for Data Center Applications. Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles A Pokam, Heiner Litz, Baris Kasikci. International Symposium on Computer Architecture (ISCA), June 2021.
Abstract / PDF [770K]
- MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems. Lin Ma, William Zhang, Jie Jiao, Wuwen Wang, Matthew Butrovich, Wan Shen Lim, Prashanth Menon, Andrew Pavlo. SIGMOD ’21, June 20–25, 2021, Virtual Event, China.
Abstract / PDF [1.25M]
- Filter Representation in Vectorized Query Execution. Amadou Ngom, Prashanth Menon, Matthew Butrovich, Lin Ma, Wan Shen Lim, Todd C. Mowry, Andrew Pavlo. International Workshop on Data Management on New Hardware, pp. 6:1–6:7, June 2021.
Abstract / PDF [720K]
- HerQules: Securing Programs via Hardware-Enforced Message Queues. Daming D. Chen, Wen Shih Lim, Mohammad Bakhshalipour, Phillip B. Gibbons, James C. Hoe, Bryan Parno. ASPLOS ’21, April 19–23, 2021, Virtual, USA.
Abstract / PDF [1.7M] / Talk Video
- Segcache: A Memory-efficient and Scalable In-memory Key-value Cache for Small Objects. Juncheng Yang, Yao Yue, K. V. Rashmi. 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI). Virtual Event, April 12–14, 2021. NSDI'21 Community Award and NSDI'21 BEST PAPER AWARD!
Abstract / PDF [517K] / Slides / Talk Video
- Learning on Distributed Traces for Data Center Storage Systems. Giulio Zhou, Martin Maas. Conference on Machine Learning and Systems '21, April 5-9, 2021. Virtual Event.
Abstract / PDF [1.3M] / Talk Video
- CORTEX: A Compiler for Recursive Deep Learning Models. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 4th MLSys Conference, San Jose, CA, USA, Apr 4-7, 2021.
Abstract / PDF [622K] / Talk Video (starts at 29:41)
- Distributed Metadata and Streaming Data Indexing as Scalable Filesystem Services. Qing Zheng. Carnegie Mellon University School of Computer Science Ph.D. Dissertation, CMU-CS-21-103. February 2021.
Abstract / PDF [2.1M]
- Open Problems in Queueing Theory Inspired by Datacenter Computing. Mor Harchol-Balter. Queueing Systems, vol. 97, no. 1, February 2021, pp. 3-37.
Abstract / PDF [690K]
- Everything is a Transaction: Unifying Logical Concurrency Control and Physical Data Structure Maintenance in Database Management Systems. Ling Zhang, Matthew Butrovich, Tianyu Li, Yash Nannapanei, Andrew Pavlo, John Rollinson, Huanchen Zhang, Ambarish Balakumar, Daniel Biales, Ziqi Dong, Emmanuel Eppinger, Jordi Gonzalez, Wan Shen Lim, Jianqiao Liu, Lin Ma, Prashanth Menon, Soumil Mukherjee, Tanuj Nayak, Amadou Ngom, Jeff Niu, Deepayan Patra, Poojita Raj, Stephanie Wang, Wuwen Wang, Yao Yu, William Zhang. Conference on Innovative Data Systems Research (CIDR) 2021. January 11-15, 2021. Virtual Event.
Abstract / PDF [352K] / Talk Video
- The Read-Only Semi-External Model. Guy E. Blelloch, Laxman Dhulipala, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. APOCS 2021, January 13, 2021 Virtual Conference, Alexandria, Virginia, U.S.
Abstract / PDF [740K]
- Unearthing Inter-job Dependencies for Better Cluster Scheduling. Andrew Chung, Subru Krishnan, Konstantinos Karanasos, Carlo Curino, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
Abstract / PDF [1.0M] / Slides / Talk Video
- PACEMAKER: Avoiding HeART Attacks in Storage Clusters with Disk-adaptive Redundancy. Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
Abstract / PDF [2.1M] / Slides / Talk Video
- The CacheLib Caching Engine: Design and Experiences at Scale. Benjamin Berg, Daniel S. Berger, Sara McAllister, Isaac Grosof, Sathya Gunasekar, Jimmy Lu, Michael Uhlar, Jim Carrig, Nathan Beckmann, Mor Harchol-Balter, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
Abstract / PDF [606K] / Slides / Talk Video
- A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. Juncheng Yang, Yao Yue, K. V. Rashmi. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
Abstract / PDF [1.6M] / Slides / Talk Video
- Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling. Prashanth Menon, Amadou Ngom, Lin Ma, Todd C. Mowry, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 14, iss. 2, pp. 101–113, October 2020.
Abstract / PDF [904K]
- Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats. Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 14, No. 4, ISSN 2150-8097, pp. 534-546, Dec. 2020.
Abstract / PDF [633K]
- GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu. MICRO’20. 53rd IEEE/ACM International Symposium on Microarchitecture, Oct 17-21, 2020. Virtual Event.
Abstract / PDF [1.3M] / Slides / Talk Video
- Jumanji: The Case for Dynamic NUCA in the Datacenter. Brian Schwedock, Nathan Beckmann. MICRO '53: Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Athens, Greece, October 17-21, 2020.
Abstract / PDF [2.3M] / Slides / Talk Video
- Challenges and Solutions for Fast Remote Persistent Memory Access. Anuj Kalia, David Andersen, Michael Kaminsky. SoCC ’20, October 19–21, 2020, Virtual Event, USA. BEST PAPER AWARD!
Abstract / PDF [710K] / Talk Video
- High Availability in Cheap Distributed Key Value Storage. Thomas Kim, Daniel Lin-Kit Wong, Gregory R. Ganger, Michael Kaminsky, David G. Andersen. SoCC ’20, October 19–21, 2020, Virtual Event, USA.
Abstract / PDF [2.6M] / Talk Video
- Streaming Data Reorganization at Scale with DeltaFS Indexed Massive Directories. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. ACM Transactions on Storage, Vol. 16, No. 4, Article 23. September 2020.
Abstract / PDF [2.1M]
- Unleashing In-network Computing on Scientific Workloads. Daehyeok Kim, Ankush Jain, Zaoxing Liu, George Amvrosiadis, Damian Hazen, Bradley Settlemyer, Vyas Sekar. arXiv:2009.02457v1 [cs.NI], 5 Sep 2020.
Abstract / PDF [1.25M]
- Accelerating Genome Analysis: A Primer on an Ongoing Journey. Mohammed Alser, Zülal Bingöl, Damla Senol Cali, Jeremie Kim, Saugata Ghose, Can Alkan, Onur Mutlu. This is an extended and updated version of a paper published in IEEE Micro, vol. 40, no. 5, pp. 65-75, 1 Sept.-Oct. 2020.
Abstract / PDF [320K]
- Caching with Delayed Hits. Nirav Atre, Justine Sherry, Weina Wang, Daniel S. Berger. SIGCOMM ’20, August 10–14, 2020, Virtual Event, NY, USA.
Abstract / PDF [2.7M] / Talk Video
- Lightweight Preemptible Functions. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
Abstract / PDF [1M] / Talk Video / Slides
- Fast Software Cache Design for Network Appliances. Dong Zhou, Huacheng Yu, Michael Kaminsky, David Andersen. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
Abstract / PDF [11M] / Talk Video / Slides
- More IOPS for Less: Exploiting Burstable Storage in Public Clouds. Hojin Park, Gregory R. Ganger, George Amvrosiadis. 12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud ’20). Virtual Boston, MA, July 13-14, 2020.
Abstract / PDF [600K] / Talk Video / Slides
- DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift.
Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phillip B. Gibbons.
arXiv:2003.06508 [cs.LG], August, 2020.
Abstract / PDF [1.2M]
- Machine Learning on Volatile Instances.
Xiaoxi Zhang, Jianyu Wang, Gauri Joshi, Carlee Joe-Wong.
IEEE Intl. Conf. on Computer Communications (INFOCOM). Virtual Toronto, Canada, July 6-9, 2020.
Abstract / PDF [516K]
- Simple Near-Optimal Scheduling for the M/G/1.
Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf.
Proceedings of the ACM Measurement and Analysis of Computer Systems - SIGMETRICS, June 2020, Boston, MA.
Abstract / PDF [885K] / Talk Video
- Access-optimal Linear MDS Convertible Codes for All Parameters. Francisco Maturana, V. S. Chaitanya Mukka, K. V. Rashmi. 2020 IEEE International Symposium on Information Theory, 21-26 June 2020 • Virtual Los Angeles, California, USA.
Abstract / PDF [287K] / Talk Video
- Order-Preserving Key Compression for In-Memory Search Trees.
Huanchen Zhang, Xiaoxuan Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo.
SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
Abstract / PDF [2.15M]
- Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. Conglong Li, Minjia Zhang, David G. Andersen, Yuxiong He. SIGMOD ’20, June 14–19, 2020, Virtual Portland, OR, USA.
Abstract / PDF [800K]
- Active Learning for ML Enhanced Database Systems. Lin Ma, Bailu Ding, Sudipto Das, Adith Swaminathan. SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
Abstract / PDF [2.4M]
- TVARAK: Software-Managed Hardware Offload for Redundancy in Direct-Access NVM Storage. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. 47th International Symposium on Computer Architecture, May 30 – June 3, 2020, Virtual Valencia, Spain.
Abstract / PDF [1.6M]
- Sage: Parallel SemiAsymmetric Graph Algorithms for NVRAMs. Laxman Dhulipala, Charles McGuffey, Hongbo Kang, Yan Gu, Guy E. Blelloch, Phillip B. Gibbons, Julian Shun. Proceedings of the VLDB Endowment, Vol. 13, No. 9. May 2020.
Abstract / PDF [630K]
- Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD.
Jianyu Wang, Hao Liang, Gauri Joshi.
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
Abstract / PDF [442K]
- Correlated Multi-armed Bandits with a Latent Random Source.
Samarth Gupta, Gauri Joshi, Osman Yağan.
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
Abstract / PDF [1.1M]
- Lookahead Converges to Stationary Points of Smooth Non-Convex Functions.
Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat.
ICASSP 2020: 45th International Conference on Acoustics, Speech, and Signal Processing. Virtual Barcelona, Spain, May 4-8, 2020.
Abstract / PDF [242K]
- SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum.
Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat.
ICLR 2020: International Conference on Learning Representations, Apr 26-May 1, 2020, Virtual Addis Ababa, Ethiopia.
Abstract / PDF [640K] / Talk Video & Slides
- Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM.
Rajat Kateja, Andy Pavlo, Gregory R. Ganger.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-20-101, April 2020.
Abstract / PDF [665K]
- Learning-Based Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. IEEE Journal on Selected Areas in Information Theory, March 2020.
Abstract / PDF [654K]
- The Case for Custom Storage Backends in Distributed Storage Systems.
Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis.
To appear in ACM Transactions on Storage, Volume 16, Issue 1, March 2020.
Abstract / PDF [2.6M]
- Livia: Data-Centric Computing Throughout the Memory Hierarchy.
Elliot Lockerman, Axel Feldmann, Mohammad Bakhshalipour, Alexandru Stanescu, Shashwat Gupta, Daniel Sanchez, Nathan Beckmann.
ASPLOS '20: Proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Lausanne, Switzerland, March 16-20, 2020.
Abstract / PDF [1.6M] / Talk Video
- Learning Relaxed Belady for Content Distribution Network Caching.
Zhenyu Song, Daniel S. Berger, Kai Li, Wyatt Lloyd.
17th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’20). February 25–27, 2020. Santa Clara, CA.
Abstract / PDF [2.25M]
- Scalable Pointer Analysis of Data Structures using Semantic Models. Pratik Fegade, Christian Wimmer. 29th Conference on Compiler Construction (CC ’20), February 22–23, 2020, San Diego, CA, USA.
Abstract / PDF [700K]
- Convertible Codes: New Class of Codes for Efficient Conversion of Coded Data in Distributed Storage.
Francisco Maturana, K. V. Rashmi.
11th Innovations in Theoretical Computer Science Conference (ITCS 2020). Seattle, WA, January 12-14, 2020.
Abstract / PDF [687K]
- Mochi: Composing Data Services for High-Performance Computing Environments. Robert B. Ross, George Amvrosiadis, Philip Carns, Charles D. Cranor, Matthieu Dorier, Kevin Harms, Gregory R. Ganger, Garth A. Gibson, Samuel K. Gutierrez, Robert Latham, Bob Robey, Dana Robinson, Bradley Settlemyer, Galen Shipman, Shane Snyder, Jerome Soumagne, Qing Zheng. Journal of Computer Science and Technology 35(1): 121–144 Jan. 2020.
Abstract / PDF [1.3M]
- Writeback-Aware Caching.
Nathan Beckmann, Phillip B. Gibbons, Bernhard Haeupler, Charles McGuffey.
Society for Industrial and Applied Mathematics. 2020.
Abstract / PDF [847K]
- MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling.
Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar.
NeurIPS workshop of Federated Learning for Data Privacy and Confidentiality, Dec 13, 2019. Vancouver, BC, Canada. Distinguished Student Paper Award.
Abstract / PDF [1.1M]
- Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-vector Multiplication.
Ankur Mallick, Malhar Chaudhari, Ganesh Palanikumar, Utsav Sheth, Gauri Joshi.
Proc. ACM Meas. Anal. Comput. Syst., Vol. 3, No. 3, Article 58. December 2019. BEST PAPER SIGMETRICS'20.
Abstract / PDF [1.9M]
- Demystifying Complex Workload–DRAM Interactions: An Experimental Study.
Saugata Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu.
Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019.
To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 3, No. 3, December 2019.
Abstract / PDF [4M]
- Processing-in-Memory: A Workload-Driven Perspective.
Saugata Ghose, Amirali Boroumand, Jeremie S. Kim, Juan Gómez-Luna, Onur Mutlu.
IBM Journal of Research and Development (JRD), Vol. 63, No. 6, November/December 2019.
Abstract / PDF [2.1M]
- MANIC: A Vector-Dataflow Architecture for Ultra-Low-Power Embedded Systems.
Graham Gobieski, Amolak Nagi, Nathan Serafin, Mehmet Meric Isgenc, Nathan Beckmann, Brandon Lucia.
MICRO '52: Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Columbus, OH, October 2019.
Abstract / PDF [1.2M]
- File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
Abstract / PDF [870K]
- Parity Models: Erasure-Coded Resilience for Prediction Serving Systems. Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
Abstract / PDF [1M]
- PipeDream: Generalized Pipeline Parallelism for DNN Training. Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil R. Devanur, Gregory R. Ganger, Phillip B. Gibbons, Matei Zaharia. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
Abstract / PDF [1M]
- Efficient Remote Procedure Calls for Datacenters. Anuj Kalia. Carnegie Mellon University PhD Dissertation CMU-CS-19-126, September 2019.
Abstract / PDF [1.7M]
- Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. Ziqi Wang, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri. 28th Parallel Architecture and Compiler Technologies 2019 (PACT'19), Sept 21-25, 2019, Seattle, WA.
Abstract / PDF [475K]
- Compact Filters for Fast Online Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. IEEE CLUSTER 2019. September 23 - 26, 2019, Albuquerque, New Mexico, USA.
Abstract / PDF [1M]
- Vantage: Optimizing Video Upload for Time-shifted Viewing of Social Livestreams.
Devdeep Ray, Jack Kosaian, K. V. Rashmi, Srinivasan Seshan.
ACM SIGCOMM, August 19-24, 2019, Beijing, China.
Abstract / PDF [6.75M]
- TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
Abstract / PDF [975K]
- STRADS-AP: Simplifying Distributed Machine Learning Programming without Introducing a New Programming Model. Jin Kyu Kim, Abutalib Aghayev, Garth A. Gibson, Eric P. Xing. Proceedings of the 2019 USENIX Annual Technical Conference, July 10–12, 2019 • Renton, WA.
Abstract / PDF [490K]
- Rateless Codes for Distributed Computations with Sparse Compressed Matrices. Ankur Mallick, Gauri Joshi. IEEE International Symposium on Information Theory (ISIT), July 7-12, 2019, Paris, France.
Abstract / PDF [672K]
- Peering through the Dark: An Owl’s View of Inter-job Dependencies and Jobs’ Impact in Shared Clusters. Andrew Chung, Carlo Curino, Subru Krishnan, Konstantinos Karanasos, Panagiotis Garefalakis, Gregory R. Ganger. SIGMOD ’19, June 30–July 5, 2019, Amsterdam, Netherlands.
Abstract / PDF [1.6M]
- Distribution-based Cluster Scheduling. Jun Woo Park. Carnegie Mellon University School of Computer Science PhD Dissertation, June 2019.
Abstract / PDF [1.47M]
- Enabling Practical Processing in and Near Memory for Data-Intensive Computing. Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun. Proc. of the Design Automation Conference (DAC), Las Vegas, NV, June 2019.
Abstract / PDF [477K]
- CROW: A Low-Cost Substrate for Improving DRAM Performance, Energy Efficiency, and Reliability. Hasan Hassan, Minesh Patel, Jeremie S. Kim, A. Giray Yaglikçi, Nandita Vijaykumar, Nika Mansouri Ghiasi, Saugata Ghose, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
Abstract / PDF [1.45M]
- CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators. Amirali Boroumand, Saugata Ghose, Minesh Patel, Hasan Hassan, Brandon Lucia, Rachata Ausavarungnirun, Kevin Hsieh, Nastaran Hajinazar, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
Abstract / PDF [1.1M]
- Understanding the Interactions of Workloads and DRAM Types: A Comprehensive Experimental Study. Saugata Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019; To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2019.
Abstract / PDF [2M]
- Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
Abstract / PDF [574K]
- Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
Abstract / PDF [1.4M]
- This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
Abstract / PDF [715K]
- Fast and Efficient Distributed Matrix-Vector Multiplication Using Rateless Fountain Codes. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 12 - 17 May, 2019 · Brighton, UK.
Abstract / PDF [485K]
- Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Gregory R. Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
Abstract / PDF [735K]
- Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems. Graham Gobieski, Brandon Lucia, Nathan Beckmann. Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’19), April 13th – April 17th, Providence, RI.
Abstract / PDF [3.35M]
- Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
Abstract / PDF [800K]
- Scaling Video Analytics on Constrained Edge Nodes. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. 2nd SysML Conference (SysML ’19). March 31-April 2, 2019, Palo Alto, CA.
Abstract / PDF [8.5M]
- Automating Dependence-Aware Parallelization of Machine Learning Training on Distributed Shared Memory.
Jinliang Wei, Garth A. Gibson, Phillip B. Gibbons, Eric P. Xing.
EuroSys '19: Proceedings of the Fourteenth EuroSys Conference, March 2019, Dresden, Germany.
Abstract / PDF [1.1M]
- Non-Volatile Memory Database Management Systems.
Joy Arulraj, Andrew Pavlo.
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, February 2019.
Abstract / PDF currently unavailable
- Datacenter RPCs can be General and Fast. Anuj Kalia, Michael Kaminsky, David G. Andersen. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Feb. 26–28, 2019, Boston, MA. Best Paper award!
Abstract / PDF [555K]
- Cluster Storage Systems Gotta Have HeART: Improving Storage Efficiency by Exploiting Disk-reliability Heterogeneity. Saurabh Kadekodi, K. V. Rashmi, Gregory R. Ganger. 17th USENIX Conference on File and Storage Technologies (FAST '19) Feb. 25–28, 2019 Boston, MA.
Abstract / PDF [1.1M]
- A Scalable Priority-Aware Approach to Managing Data Center Server Power. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu. HPCA 2019: The 25th International Symposium on High-Performance Computer Architecture, February 16 - 20, 2019, Washington D.C.
Abstract / PDF [610K]
- External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems.
Andrew Pavlo, Matthew Butrovich, Ananya Joshi, Lin Ma, Prashanth Menon, Dana Van Aken, Lisa Lee, Ruslan Salakhutdinov.
Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 42(2): 32-46 (2019).
Abstract / PDF [555K]