PARALLEL DATA LAB 

PDL Publications by Date

2024

  • Baleen: ML Admission & Prefetching for Flash Caches. Daniel Lin-Kit Wong, Hao Wu, Carson Molder, Sathya Gunasekar, Jimmy Lu, Snehal Khandkar, Abhinav Sharma, Daniel S. Berger, Nathan Beckmann, Gregory R. Ganger. 22nd USENIX Conference on File and Storage Technologies (FAST'24), Feb. 27–29, 2024, Santa Clara, CA.
    Abstract / PDF [2.7M] / Code / Traces

  • SIEVE is Simpler than LRU: An Efficient Turn-Key Eviction Algorithm for Web Caches. Yazhuo Zhang, Juncheng Yang, Yao Yue, Ymir Vigfusson, K. V. Rashmi. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI'24), April 16–18, 2024. Santa Clara, CA.
    Abstract / PDF [1M]

  • Survey and Evaluation of Database Management System Extensibility. Abigale Kim. Carnegie Mellon University School of Computer Science M.S.Thesis CMU-CS-23-144. January 2024.
    Abstract / PDF [1.25M]

  • Dear User-Defined Functions, Inlining isn’t working out so great for us. Let’s try batching to make our relationship work. Sincerely, SQL. Kai Franz, Samuel Arch, Denis Hirn, Torsten Grust, Todd C. Mowry, Andrew Pavlo. Conference on Innovative Data Systems Research (CIDR 2024), Chaminade, CA, USA, January 14-17, 2024.
    Abstract / PDF [545K]

2023

  • Agents of Autonomy: A Systematic Study of Robotics on Modern Hardware. Mohammad Bakhshalipour, Phillip B. Gibbons. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Volume 7, Issue 3, Article No.: 43, December 2023. To appear ACM SIGMETRICS / IFIP PERFORMANCE 2024, Venice, Italy, June 10-14, 2024.
    Abstract / PDF [2.1M]

  • Memento: Architectural Support for Ephemeral Memory Management in Serverless Environments. Ziqi Wang, Kaiyang Zhao, Pei Li, Andrew Jacob, Michael Kozuch, Todd Mowry, Dimitrios Skarlatos. MICRO '23: Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture. October 2023. Toronto, Canada.
    Abstract / PDF [935K]

  • FIFO Queues Are All You Need for Cache Eviction. Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, Rashmi Vinayak SOSP '23: Proceedings of the 29th Symposium on Operating Systems Principles, October 2023. Koblenz, Germany.
    Abstract / PDF [1.6M]

  • Sia: Heterogeneity-aware, Goodput-optimized ML-cluster Scheduling. Suhas Jayaram Subramanya, Daiyaan Arfeen, Shouxu Lin, Aurick Qiao, Zhihao Jia, and Gregory R. Ganger. 2023. ACM SIGOPS 29th Symposium on Operating Systems Principles (SOSP ’23), October 23–26, 2023, Koblenz, Germany.
    Abstract / PDF [1.23M]

  • Runahead A*: Speculative Parallelism for A* with Slow Expansions. Mohammad Bakhshalipour, Mohamad Qadri, Dominic Guri, Seyed Borna, Ehsani, Maxim Likhachev, Phillip B. Gibbons. ICAPS 2023, Prague, Czech Republic, July 8-13, 2023.
    Abstact / PDF [710K]

  • FIFO Can Be Better than LRU: The Power of Lazy Promotion and Quick Demotion. Juncheng Yang, Ziyue Qiu, Yazhuo Zhang*, Yao Yue^, K. V. Rashmi. HotOS ’23, June 22–24, 2023, Providence, RI, USA.
    Abstract / PDF [1.2M]

  • Contiguitas: The Pursuit of Physical Memory Contiguity in Datacenters. Kaiyang Zhao, Kaiwen Xue, Ziqi Wang, Dan Schatzberg, Leon Yang, Antonis Manousis, Johannes Weiner, Rik van Riel, Bikash Sharma, Chunqiang Tang, Dimitrios Skarlatoss. ISCA ’23, June 17–21, 2023, Orlando, FL, USA. BEST PAPER AWARD!
    Abstract / PDF [468K]

  • PIM-trie: A Skew-resistant Trie for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. SPAA '23: Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, June 2023, Orlando, FL.
    Abstract / PDF [1.23M]

  • Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud. Hojin Park, Gregory R. Ganger, George Amvrosiadis. SYSTOR '23: Proceedings of the 16th ACM International Conference on Systems and Storage, Haifa, Israel, June 5-7, 2023.
    Abstract / PDF [1.4M]

  • Validating Large Language Models with ReLM. Michael Kuchnik, Virginia Smith, George Amvrosiadis. 6th MLSys Conference, Miami Beach, FL, USA, June 4-8, 2023. OUTSTANDING PAPER AWARD AT MLSYS23!
    Abstract / PDF [1.2M]

  • FrozenHot Cache: Rethinking Cache Management for Modern Hardware. Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu. EuroSys 2023, Rome, Italy, May 8th-12th, 2023.
    Abstract / PDF [1.14M]

  • Federated Learning Under Distributed Concept Drift. Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons. International Conference on Artificial Intelligence and Statistics (AISTATS), Apr 2023. In preprint arXiv:2206.00799v1.
    Abstract / PDF [956K]

  • Design Principles for Replicated Storage Systems Built on Emerging Storage Technologies. Thomas Kim. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-23-109. March 2023.
    Abstract / PDF [47.5M]

  • Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, Mark D. Hill, Marcus Fontoura, Ricardo Bianchini. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. DISTINGUISHED PAPER AWARD!
    Abstract / PDF [1.7M]

  • GL-Cache: Group-level Learning for Efficient and High-performance Caching. Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi. 21st USENIX Conference on File and Storage Technologies (FAST '23). Feb. 21–23, 2023, Santa Clara, CA.
    Abstract / PDF [1.84M]

  • RAIZN: Redundant Array of Independent Zoned Namespaces. Thomas Kim, Jekyeom Jeon, Nikhil Arora, Huaicheng Li, Michael Kaminsky, David G. Andersen, Gregory R. Ganger, George Amvrosiadis, Matias Bjørling. ASPLOS ’23, March 25–29, 2023, Vancouver, BC, Canada. Supoercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-22-101, January 2022.
    Abstract / PDF [1.65M]

  • Database Gyms. Wan Shen Lim, Matthew Butrovich, William Zhang, Andrew Crotty, Lin Ma, Peijing Xu, Johannes Gehrke, Andrew Pavlo. CIDR 2023. 13th Annual Conference on Innovative Data Systems Research (CIDR ’23). January 8-11, 2023, Amsterdam, The Netherlands.
    Abstract / PDF [800K] / Slides

  • Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays. Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi. ACM Transactions on Storage, Vol. 19, No. 1, Article 5. January 2023.
    Abstract / PDF [1.6M]

2022

  • Realizing Value in Shared Compute Infrastructures. Andrew Chung. Carnegie Mellon University PhD Dissertation CMU-CS-22-151, December 2022.
    Abstract / PDF [3M]

  • PIM-tree: A Skew-resistant Index for Processing-in-Memory. Hongbo Kang, Yiwei Zhao, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey, Phillip B. Gibbons. Proc. VLDB Endow. 16(4): 946-958 (2022). In preprint. BEST PAPER RUNNER UP!
    Abstract / PDF [1.1M]

  • Rateless Sum-Recovery Codes For Distributed Non-Linear Computations. Ankur Mallick,Gauri Joshi. Information Theory Workshop (ITW), November 6-9, 2022. Mumbai, India.
    Abstract / PDF [728K]

  • RipTide: A Programmable, Energy-minimal Dataflow Compiler and Architecture. Graham Gobieski, Souradip Ghosh, Marijn Heule, Todd Mowry, Tony Nowatzki, Nathan Beckmann, Brandon Lucia. MICRO 2022 - 55th IEEE/ACM International Symposium on Microarchitecture, October 1–5, 2022 Chicago, Illinois, USA.
    Abstract / PDF [3.7M]

  • MATCHA: A Matching-Based Link Scheduling Strategy to Speed up Distributed Optimization. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. IEEE Transactions on Signal Processing, Oct 2022.
    Abstract / PDF [1.85M]

  • Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S Berger, Nathan Beckmann, Gregory R Ganger. ACM Transactions on Storage, Vol. 18, No. 3, Article 21. August 2022.
    Abstract / PDF [1.4M]

  • SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling. Nirav Atre, Hugo Sadok, Erica Chiang, Weina Wang, Justine Sherry. SIGCOMM ’22, August 22–26, 2022, Amsterdam, Netherlands.
    Abstract / PDF [2M]

  • Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines. Michael Kuchnik, Ana Klimovic, Jirı Simsa, Virginia Smith, George Amvrosiadis. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [7M]

  • The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [1.3M]

  • Matchmaker: Data Drift Mitigation in Machine Learning for Large-scale Systems. Ankur Mallick, Kevin Hsieh, Behnaz Arzani, Gauri Joshi. Proceedings of the 5th MLSys Conference, Santa Clara, CA, USA, August, 2022.
    Abstract / PDF [500K]

  • Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays. Huaicheng Li, Martin L Putra, Ronald Shi, Fadhil I Kurnia, Xing Lin, Jaeyoung Do, Achmad Imam Kistijantoro, Gregory R Ganger, Haryadi S Gunawi. To appear, ACM Transactions on Storage, 2022, (accepted July 2022).
    Abstract / PDF [656K]

  • Tiger: Disk-Adaptive Redundancy Without Placement Restrictions. Saurabh Kadekodi, Francisco Maturana, Sanjith Athlur, Arif Merchant, K. V. Rashmi, Gregory R. Ganger. Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI'22), July 11–13, 2022, Carlsbad, CA, USA.
    Abstract / PDF [1.25M]

  • Bandwidth Cost of Code Conversions in the Split Regime. Francisco Maturana and K. V. Rashmi. 2022 IEEE International Symposium on Information Theory (ISIT22). June 26-July 1, 2022, Espoo, Finland.
    Abstract / PDF [1.22M]

  • RACOD: Algorithm/Hardware Co-design for Mobile Robot Path Planning. Mohammad Bakhshalipour, Seyed Borna Ehsani, Mohamad Qadri, Dominic Guri, Maxim Likhachev, Phillip B. Gibbons. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [2.2M]

  • Thermometer: Profile-Guided BTB Replacement for Data Center Applications. Shixin Song, Tanvir Ahmed Khan, Sara Mahdizadeh Shahri, Akshitha Sriraman, Niranjan K Soundararajan, Sreenivas Subramoney, Daniel A. Jiménez, Heiner Litz, Baris Kasikci. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [1.45M]

  • täkō: A Polymorphic Cache Hierarchy for General-Purpose Optimization of Data Movement. Brian C. Schwedock, Piratach Yoovidhya, Jennifer Seibert, Nathan Beckmann. ISCA ’22, June 18–22, 2022, New York, NY, USA.
    Abstract / PDF [1.75M]

  • Tastes Great! Less Filling! High Performance and Accurate Training Data Collection for Self-Driving Database Management Systems. Matthew Butrovich, Wan Shen Lim, Lin Ma, John Rollinson, William Zhang, Yu Xia, Andrew Pavlo SIGMOD ’22, June 12–17, 2022, Philadelphia, PA, USA.
    Abstract / PDF [1.1M]

  • Adapting the RACER Architecture to Integrate Improved In-ReRAM Logic Primitives. Minh S. Q. Truong, Liting Shen, Alexander Glass, Alison Hoffmann, L. Richard Carley, James A. Bain, Saugata Ghose. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Early Access, 12 May 2022.
    Abstract / PDF [4.3M]

  • RTRBench: A Benchmark Suite for Real-Time Robotics. Mohammad Bakhshalipour, Maxim Likhachev, Phillip B. Gibbons 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 22-24 May 2022, Singapore.
    Abstract / PDF [1.85M]

  • Adapting the RACER Architecture to Integrate Improved In-ReRAM Logic Primitives. Minh S. Q. Truong, Liting Shen, Alexander Glass, Alison Hoffmann, L. Richard Carley, James A. Bain, Saugata Ghose. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Early Access, 12 May 2022.
    Abstract / PDF [4.3M]

  • Varuna: Scalable, Low-cost Training of Massive Deep Learning Models. Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, Nipun Kwatra. EuroSys ’22, April 5-8, 2022, Rennes, France. BEST PAPER AWARD!
    Abstract / PDF [1.5M]

  • C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery. Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman. 19th USENIX Symposium on Networked Systems Design and Implementation. April 4–6, 2022 • Renton, WA, USA.
    Abstract / PDF [1.9M] / Slides / Talk Video

  • MetaSys: A Practical Open-source Metadata Management System to Implement and Evaluate Cross-layer Optimizations. Nandita Vijaykumar, Ataberk Olgun, Konstantinos Kanellopoulos, F. Nisa Bostanci, Hasan Hassan, Mehrshad Lotfi, Phillip B. Gibbons, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 19, No. 2, Article 26. Publication date: March 2022.
    Abstract / PDF [1.75M]

  • Client Selection in Federated Learning: Convergence Analysis and Power-of-Choice Selection Strategies. Yae Jee Cho, Jianyu Wang, Gauri Joshi. International Conference on Artificial Intelligence and Statistics (AISTATS), March 2022.
    Abstract / PDF [1.85M]

  • TMO: Transparent Memory Offloading in Datacenters. Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos. ASPLOS ’22, February 28 – March 4, 2022, Lausanne, Switzerland. BEST PAPER AWARD AT ASPLOS '22!
    Abstract / PDF [1.65M]

  • FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients. Jianyu Wang, Hang Qi, Ankit Singh Rawat, Sashank Reddi, Sagar Waghmare, Felix X. Yu, Gauri Joshi. arXiv:2201.11865v2 [cs.LG], 16 Feb 2022.
    Abstract / PDF [660K]

  • Are You Sure You Want to Use MMAP in YourDatabase Management System? Andrew Crotty, Viktor Leis, Andrew Pavlo. 12th Annual Conference on Innovative Data Systems Research (CIDR ’22). January 9-12, 2022, Chaminade, USA.
    Abstract / PDF [690K] / Talk Video

2021

  • Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation. Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi. 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Dec. 6-14, 2021. Virtual Event.
    Abstract / PDF [650K]

  • It’s Time to Talk About HPC Storage: Perspectives on the Past and Future. Bradley Settlemyer, George Amvrosiadis, Philip Carns, Robert Ross. IEEE Computer Society Computing in Science & Engineering November/December 2021.
    Abstract / PDF [483K]

  • Arithmetic-Intensity-Guided Fault Tolerance for Neural Network Inference on GPUs. Jack Kosaian, K. V. Rashmi. SC’21, November 14–19, 2021, St. Louis, MO, USA.
    Abstract / PDF [256K] / Slides / Code

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. SC ’21, November 14–19, 2021, St. Louis, MO, USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M] / Slides / Talk Video

  • The Most Common Queueing Theory Questions Asked by Computer Systems Practitioners. Mor Harchol-Balter and Ziv Scully. First International Workshop on Teaching Performance Analysis of ComputerSystems (TeaPACS 2021) In conjunction with the IFIP Performance 2021 Conference. Milan, Italy, Nov 2021.
    Abstract / PDF [286K]

  • The Case for Phase-Aware Scheduling of Parallelizable Jobs. Ben Berg, Justin Whitehouse, Ben Moseley, Weina Wang, Mor Harchol-Balter. IFIP Performance 2021. Milan, Italy, November 2021.
    Abstract / PDF [978K]

  • WineFS: A Hugepage-aware File System for Persistent Memory that Ages Gracefully. Rohan Kadekodi, Saurabh Kadekodi, Soujanya Ponnapalli, Harshad Shirwadkar, Gregory R. Ganger, Aasheesh Kolli, Vijay Chidambaram. 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021.
    Abstract / PDF [3M]

  • RACER: Bit-Pipelined Processing Using Resistive Memory. Minh S. Q. Truong, Eric Chen, Deanyone Su, Alexander Glass, Liting Shen, L. Richard Carley, James A. Bain, Saugata Ghose. 54th IEEE/ACM International Symposium on Microarchitecture, ser. MICRO 2021, Oct. 2021.
    Abstract / PDF [2.2M]

  • The Gittins Policy in the M/G/1 Queue. Ziv Scully, Mor Harchol-Balter. 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt 2021) Philadelphia, PA, Oct 2021.
    Abstract / PDF [265K]

  • IODA: A Host/Device Co-Design for Strong Predictability Contract on Modern Flash Storage. Huaicheng Li, Martin L. Putra, Ronald Shi, Xing Lin, Gregory R. Ganger, Haryadi S. Gunawi. SOSP ’21, October 26-29, 2021, Virtual Event, Germany.
    Abstract / PDF [710K] / Talk Video

  • Personalized Federated Learning for Heterogeneous Clients with Clustered Knowledge Transfer. Yae Jee Cho, Jianyu Wang, Tarun Chiruvolu, Gauri Joshi. arXiv:2109.08119v1 [cs.LG] 16 Sep 2021.
    Abstract / PDF [852K]

  • Rateless Codes for Distributed Non-linear Computations. Ankur Mallick, Sophie Smith, Gauri Joshi. International Symposium on Topics in Coding, Montréal, Québec, Canada, from August 30th to September 3rd, 2021.
    Abstract / PDF [1.1M]

  • A Novel Framework for the Analysis and Design of Heterogeneous Federated Learning. Jianyu Wang, Qinghua Liu, Hao Liang, Gauri Joshi, H. Vincent Poor. IEEE Transactions on Signal Processing, Sept 2021.
    Abstract / PDF [835K]

  • Cooperative SGD: A Unified Framework for the Analysis of Local-Update. Jianyu Wang, Gauri Joshi. SGD Journal of Machine Learning Research (JMLR), 2021. September 2021.
    Abstract / PDF [860K]

  • Kangaroo: Caching Billions of Tiny Objects on Flash. Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel Berger, Nathan Beckmann, Gregory R. Ganger. Proceedings of the 28th ACM Symposium on Operating Systems Principles (SOSP '21) October 25-28, 2021. Virtual Event. BEST PAPER AT SOSP'21!
    Abstract / PDF [7.8M] / Talk Video-Short / Talk Video-Long / Blog Post

  • Irregular Array Codes with Arbitrary Access Sets for Geo-Distributed Storage. Francisco Maturana, K. V. Rashmi Carnegie Mellon University, Pittsburgh, PA, USA Email: fmaturan@cs.cmu.edu, rvinayak@cs.cmu.edu 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
    Abstract / PDF [288K]

  • Bandwidth Cost of Code Conversions in Distributed Storage: Fundamental Limits and Optimal Constructions. Francisco Maturana, K. V. Rashmi 2021 IEEE International Symposium on Information Theory (ISIT 2021) 12-20 July 2021 • Melbourne, Victoria, Australia.
    Abstract / PDF [325K]

  • Progressive Compressed Records: Taking a Byte out of Deep Learning Data. Michael Kuchnik, George Amvrosiadis, Virginia Smith. Proceedings of the VLDB Endowment, Vol. 14, No. 11 ISSN 2150-8097, July 2021.
    Abstract / PDF [3.86M]

  • Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size. Jack Kosaian, Amar Phanishayee, Matthai Philipose, Debadeepta Dey, K. V. Rashmi. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 18-24 July 2021, Virtual Event.
    Abstract / PDF [518K] / Appendix / Code / Slides and Talk Video

  • The Processing-in-Memory Model. Hongbo Kang, Phillip B. Gibbons, Guy E. Blelloch, Laxman Dhulipala, Yan Gu, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
    Abstract / PDF [1.25M]

  • DeltaFS: A Scalable No-Ground-Truth Filesystem For Massively-Parallel Computing. Qing Zheng, Chuck Cranor, Greg Ganger, Garth Gibson, George Amvrosiadis, Brad Settlemyer, Gary Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-21-101, July 2021.
    Abstract / PDF [1M]

  • ZNS: Avoiding the Block Interface Tax for Flash-based SSDs. Matias Bjørling, Abutalib Aghayev, Hans Holmberg, Aravind Ramesh, Damien Le Moal, Gregory R. Ganger, George Amvrosiadis. USENIX Annual Technical Conference (USENIX 2021), July 14-16, 2021, Virtual Event.
    Abstract / PDF [305K] / Slides / Talk Video

  • Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning. Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing. 15th USENIX Symposium on Operating Systems Design and Implementation, Virtual Event, July 14–16, 2021. BEST PAPER AT OSDI'21!
    Abstract / PDF [930K] / Slides / Talk Video

  • Block-Granularity-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Charles McGuffey. SPAA '21: Proceedings of the 33rd ACM Symposium on Parallelism in Algorithms and Architectures. July 2021.
    Abstract / PDF [880K]

  • Spitfire: A Three-Tier Buffer Manager for Volatile and Non-Volatile Memory. Xinjing Zhou, Joy Arulraj, Andrew Pavlo, David Cohen. SIGMOD/PODS '21: Proceedings of the 2021 International Conference on Management of Data. June 2021.
    Abstract / PDF [1.28M]

  • Ripple: Profile-Guided Instruction Cache Replacement for Data Center Applications. Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles A Pokam, Heiner Litz, Baris Kasikci.International Symposium on Computer Architecture (ISCA), June 2021.
    Abstract / PDF [770K]

  • MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems. Lin Ma, William Zhang, Jie Jiao, Wuwen Wang, Matthew Butrovich, Wan Shen Lim, Prashanth Menon, Andrew Pavlo. SIGMOD ’21, June 20–25, 2021, Virtual Event, China.
    Abstract / PDF [1.25M]

  • Filter Representation in Vectorized Query Execution. Amadou Ngom, Prashanth Menon, Matthew Butrovich, Lin Ma, Wan Shen Lim, Todd C. Mowry, Andrew Pavlo. International Workshop on Data Management on New Hardware, pages. 6:1—6:7, June 2021.
    Abstract / PDF [720K]

  • HerQules: Securing Programs via Hardware-Enforced Message Queues. Daming D. Chen, Wen Shih Lim, Mohammad Bakhshalipour, Phillip B. Gibbons, James C. Hoe, Bryan Parno. ASPLOS ’21, April 19–23, 2021, Virtual, USA.
    Abstract / PDF [1.7M] / Talk Video

  • Segcache: A Memory-efficient and Scalable In-memory Key-value Cache for Small Objects. Juncheng Yang, Yao Yue, K. V. Rashmi. 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI). Virtual Event, April 12–14, 2021. NSDI'21 Community Award and NSDI'21 BEST PAPER AWARD!
    Abstract / PDF [517K] / Slides / Talk Video

  • Learning on Distributed Traces for Data Center Storage Systems. Giulio Zhou, Martin Maas Conference on Machine Learning and Systems '21, April 5-9, 2021. Virtual Event.
    Abstract / PDF [1.3M] / Talk Video

  • CORTEX: A Compiler for Recursive Deep Learning Models. Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry. Proceedings of the 4th MLSys Conference, San Jose, CA, USA, Apr 4-7, 2021.
    Abstract / PDF [622K] / Talk Video (starts at 29:41)

  • Distributed Metadata and Streaming Data Indexing as Scalable Filesystem Services. Qing Zheng. Carnegie Mellon University School of Computer Science Ph.D. Dissertation, CMU-CS-21-103. February 2021.
    Abstract / PDF [2.1M]

  • Open Problems in Queueing Theory Inspired by Datacenter Computing. Mor Harchol-Balter. Queueing Systems, vol. 97, no. 1, February 2021, pp. 3-37.
    Abstract / PDF [690K]

  • Everything is a Transaction: Unifying Logical Concurrency Control and Physical Data Structure Maintenance in Database Management Systems. Ling Zhang, Matthew Butrovich, Tianyu Li, Yash Nannapanei, Andrew Pavlo, John Rollinson, Huanchen Zhang, Ambarish Balakumar, Daniel Biales, Ziqi Dong, Emmanuel Eppinger, Jordi Gonzalez, Wan Shen Lim, Jianqiao Liu, Lin Ma, Prashanth Menon, Soumil Mukherjee, Tanuj Nayak, Amadou Ngom, Jeff Niu, Deepayan Patra, Poojita Raj, Stephanie Wang, Wuwen Wang, Yao Yu, William Zhang. Conference on Innovative Data Systems Research (CIDR) 2021. January 11-15, 2021. Virtual Event.
    Abstract / PDF [352K] / Talk Video

  • The Read-Only Semi-External Model. Guy E. Blelloch, Laxman Dhulipala, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. APOCS 2021, January 13, 2021 Virtual Conference, Alexandria, Virginia, U.S.
    Abstract / PDF [740K]

2020

  • Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats. Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 14, No. 4 ISSN 2150-8097, pp. 534-546, Dec. 2020.
    Abstract / PDF [633K]

  • Unearthing Inter-job Dependencies for Better Cluster Scheduling. Andrew Chung, Subru Krishnan, Konstantinos Karanasos, Carlo Curino, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.0M] / Slides / Talk Video

  • PACEMAKER: Avoiding HeART Attacks in Storage Clusters with Disk-adaptive Redundancy. Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [2.1M] / Slides / Talk Video

  • The CacheLib Caching Engine: Design and Experiences at Scale. Benjamin Berg, Daniel S. Berger, Sara McAllister, Isaac Grosof, Sathya Gunasekar, Jimmy Lu, Michael Uhlar, Jim Carrig, Nathan Beckmann, Mor Harchol-Balter, Gregory R. Ganger. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [606K] / Slides / Talk Video

  • A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. Juncheng Yang, Yao Yue, K. V. Rashmi. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.
    Abstract / PDF [1.6M] / Slides / Talk Video

  • Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling. Prashanth Menon, Amadou Ngom, Lin Ma, Todd C. Mowry, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 14, iss. 2, pages. 101—113, October 2020.
    Abstract / PDF [904K]

  • GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu. MICRO’20. 53rd IEEE/ACM International Symposium on Microarchitecture, Oct 17-21, 2020. Virtual Event.
    Abstract / PDF [1.3M] / Slides / Talk Video

  • Jumanji: The Case for Dynamic NUCA in the Datacenter. Brian Schwedock, Nathan Beckmann. MICRO '53: Proceedings of the 53nd Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Athens, Greece, October 17-21, 2020.
    Abstract / PDF [2.3M] / Slides / Talk Video

  • Challenges and Solutions for Fast Remote Persistent Memory Access. Anuj Kalia, David Andersen, Michael Kaminsky. SoCC ’20, October 19–21, 2020, Virtual Event, USA. BEST PAPER AWARD!
    Abstract / PDF [710K] / Talk Video

  • High Availability in Cheap Distributed Key Value Storage. Thomas Kim, Daniel Lin-Kit Wong, Gregory R. Ganger, Michael Kaminsky, David G. Andersen. SoCC ’20, October 19–21, 2020, Virtual Event, USA.
    Abstract / PDF [2.6M] / Talk Video

  • Streaming Data Reorganization at Scale with DeltaFS Indexed Massive Directories. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. ACM Transactions on Storage, Vol. 16, No. 4, Article 23. September 2020.
    Abstract / PDF [2.1M]

  • Unleashing In-network Computing on Scientific Workloads. Daehyeok Kim, Ankush Jain, Zaoxing Liu, George Amvrosiadis, Damian Hazen, Bradley Settlemyer, Vyas Sekar. arXiv:2009.02457v1 [cs.NI], 5 Sep 2020.
    Abstract / PDF [1.25M]

  • Accelerating Genome Analysis: A Primer on an Ongoing Journey. Mohammed Alser, Zülal Bingöl, Damla Senol Cali, Jeremie Kim, Saugata Ghose, Can Alkan, Onur Mutlu. This is an extended and updated version of a paper published in IEEE Micro, vol. 40, no. 5, pp. 65-75, 1 Sept.-Oct. 2020.
    Abstract / PDF [320K]

  • Caching with Delayed Hits. Nirav Atre, Justine Sherry, Weina Wang, Daniel S. Berger. SIGCOMM ’20, August 10–14, 2020, Virtual Event, NY, USA.
    Abstract / PDF [2.7M] / Talk Video

  • Lightweight Preemptible Functions. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [1M] / Talk Video / Slides

  • Fast Software Cache Design for Network Appliances. Dong Zhou, Huacheng Yu, Michael Kaminsky, David Andersen. 2020 USENIX Annual Technical Conference (USENIX ATC '20). Virtual Boston, MA, July 15–17, 2020.
    Abstract / PDF [11M / Talk Video / Slides

  • More IOPS for Less: Exploiting Burstable Storage in Public Clouds. Hojin Park, Gregory R. Ganger, George Amvrosiadis. 12th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud ’20). Virtual Boston, MA, July 13-14, 2020.
    Abstract / PDF [600K] / Talk Video / Slides

  • DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift. Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phillip B. Gibbons. arXiv:2003.06508 [cs.LG], August, 2020.
    Abstract / PDF [1.2M]

  • Machine Learning on Volatile Instances. Xiaoxi Zhang, Jianyu Wang, Gauri Joshi, Carlee Joe-Wong. IEEE Intl. Conf. on Computer Communications (INFOCOM). Virtual Toronto, Canada, July 6-9, 2020.
    Abstract / PDF [516K]

  • Simple Near-Optimal Scheduling for the M/G/1. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proceedings of the ACM Measurement and Analysis of Computer Systems - SIGMETRICS, June 2020, Boston, MA.
    Abstract / PDF [885K] / Talk Video

  • Access-optimal Linear MDS Convertible Codes for All Parameters. Francisco Maturana, V. S. Chaitanya Mukka, K. V. Rashmi. 2020 IEEE International Symposium on Information Theory 21-26 June 2020 • Virtual Los Angeles, California, USA.
    Abstract / PDF[287K] / Talk Video

  • Order-Preserving Key Compression for In-Memory Search Trees. Huanchen Zhang, Xiaoxuan Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo
    SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.15M]

  • Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. Conglong Li, Minjia Zhang, David G. Andersen, Yuxiong He. SIGMOD ’20, June 14–19, 2020, Virtual Portland, OR, USA.
    Abstract / PDF [800K]

  • Active Learning for ML Enhanced Database Systems. Lin Ma, Bailu Ding, Sudipto Das, Adith Swaminathan. SIGMOD’20, June 14–19, 2020. Virtual Portland, OR.
    Abstract / PDF [2.4M]

  • TVARAK: Software-Managed Hardware Offload for Redundancy in Direct-Access NVM Storage. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. 47th International Symposium on Computer Architecture, May 30 – June 3, 2020, Virtual Valencia, Spain.
    Abstract / PDF [1.6M]

  • Sage: Parallel SemiAsymmetric Graph Algorithms for NVRAMs. Laxman Dhulipala, Charles McGuffey, Hongbo Kang, Yan Gu, Guy E. Blelloch, Phillip B. Gibbons, Julian Shun, Proceedings of the VLDB Endowment, Vol. 13, No. 9. May 2020.
    Abstract / PDF [630K]

  • Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD. Jianyu Wang, Hao Liang, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [442K]

  • Correlated Multi-armed Bandits with a Latent Random Source. Samarth Gupta, Gauri Joshi, Osman Yağan. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [1.1M]

  • Lookahead Converges to Stationary Points of Smooth Non-Convex Functions. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICASSP 2020: 45th International Conference on Acoustics, Speech, and Signal Processing. Virtual Barcelona, Spain, May 4-8, 2020.
    Abstract / PDF [242K]

  • SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum. Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael Rabbat. ICLR 2020: International Conference on Learning Representations, Apr 26-May 1, 2020, Virtual Addis Ababa, Ethiopia.
    Abstract / PDF [640K] Talk Video & Slides

  • Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM. Rajat Kateja, Andy Pavlo, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-20-101, April 2020.
    Abstract / PDF [665K]

  • Learning-Based Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. IEEE Journal on Selected Areas in Information Theory, March 2020.
    Abstract / PDF [654K]

  • The Case for Custom Storage Backends in Distributed Storage Systems. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. To appear in ACM Transactions on Storage, Volume 16, Issue 1, March 2020.
    Abstract / PDF [2.6M]

  • Livia: Data-Centric Computing Throughout the Memory Hierarchy. Elliot Lockerman, Axel Feldmann, Mohammad Bakhshalipour, Alexandru Stanescu, Shashwat Gupta, Daniel Sanchez, Nathan Beckmann. ASPLOS '20: Proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems, Virtual Lausanne, Switzerland, March 16-20, March 2020.
    Abstract / PDF [1.6M] / Talk Video

  • Learning Relaxed Belady for Content Distribution Network Caching. Zhenyu Song, Daniel S. Berger, Kai Li, Wyatt Lloyd. 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’20). February 25–27, 2020. Santa Clara, CA.
    Abstract / PDF [2.25M]

  • Scalable Pointer Analysis of Data Structures using Semantic Models. Pratik Fegade, Christian Wimmer. 29th Conference on Compiler Construction (CC ’20), February 22–23, 2020, San Diego, CA, USA.
    Abstract / PDF [700K]

  • Convertible Codes: New Class of Codes for Efficient Conversion of Coded Data in Distributed Storage. Francisco Maturana, K. V. Rashmi. 11th Innovations in Theoretical Computer Science Conference (ITCS 2020). Seattle, WA, January 12-14, 2020.
    Abstract / PDF [687K]

  • Mochi: Composing Data Services for High-Performance Computing Environments. Robert B. Ross, George Amvrosiadis, Philip Carns, Charles D. Cranor, Matthieu Dorier, Kevin Harms, Gregory R. Ganger, Garth A. Gibson, Samuel K. Gutierrez, Robert Latham, Bob Robey, Dana Robinson, Bradley Settlemyer, Galen Shipman, Shane Snyder, Jerome Soumagne, Qing Zheng. Journal of Computer Science and Technology 35(1): 121–144 Jan. 2020.
    Abstract / PDF [1.3M]

  • Writeback-Aware Caching. Nathan Beckmann, Phillip B. Gibbons, Bernhard Haeupler, Charles McGuffey. Society for Industrial and Applied Mathematics. 2020.
    Abstract / PDF [847K]

2019

  • MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling. Jianyu Wang, Anit Sahu, Gauri Joshi, Soummya Kar. NeurIPS workshop of Federated Learning for Data Privacy and Confidentiality, Dec 13, 2019. Vancouver, BC, Canada. Distinguished Student Paper Award.
    Abstract / PDF [1.1M]

  • Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-vector Multiplication. Ankur Mallick, Malhar Chaudhari, Ganesh Palanikumar, Utsav Sheth, Gauri Joshi. Proc. ACM Meas. Anal. Comput. Syst., Vol. 3, No. 3, Article 58. December 2019.
    Abstract / PDF [1.9M]

  • Demystifying Complex Workload–DRAM Interactions: An Experimental Study. Saugata. Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019. To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 3, No. 3, December 2019.
    Abstract / PDF [4M]

  • Processing-in-Memory: A Workload-Driven Perspective. Saugata Ghose, Amarali Boroumand, Jeremie. S. Kim, Juan. Gómez-Luna, Onur Mutlu. IBM Journal of Research and Development (JRD), Vol. 63, No. 6, November/December 2019.
    Abstract / PDF [2.1M]

  • MANIC: A Vector-Dataflow Architecture for Ultra-Low-Power Embedded Systems. Graham, Amolak Nagi, Nathan Serafin, Mehmet Meric Isgenc, Nathan Beckmann, Brandon Lucia. MICRO '52: Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, Columbus, OH, October 2019.
    Abstract / PDF [1.2M]

  • Efficient Remote Procedure Calls for Datacenters. Anuj Kalia. Carnegie Mellon University PhD Dissertation CMU-CS-19-126, September 2019.
    Abstract / PDF [1.7M]

  • Multiversioned Page Overlays: Enabling Faster Serializable Hardware Transactional Memory. Ziqi Wang, Michael A. Kozuch, Todd C. Mowry, Vivek Seshadri. 28th Parallel Architecture and Compiler Technologies 2019 (PACT'19), Sept 21-25, 2019, Seattle, WA.
    Abstract / PDF [475K]

  • Compact Filters for Fast Online Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. IEEE CLUSTER 2019. September 23 - 26, 2019, Albuquerque, New Mexico, USA.
    Abstract / PDF [1M]

  • File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution. Abutalib Aghayev, Sage Weil, Michael Kuchnik, Mark Nelson, Gregory R. Ganger, George Amvrosiadis. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [870K]

  • Parity Models: Erasure-Coded Resilience for Prediction Serving Systems. Jack Kosaian, K. V. Rashmi, Shivaram Venkataraman. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • PipeDream: Generalized Pipeline Parallelism for DNN Training. Deepak Narayanan, Aaron Harlap, Amar Phanishayee, Vivek Seshadri, Nikhil R. Devanur, Gregory R. Ganger, Phillip B. Gibbons, Matei Zaharia. SOSP ’19, October 27–30, 2019, Huntsville, ON, Canada.
    Abstract / PDF [1M]

  • Vantage: Optimizing Video Upload for Time-shifted Viewing of Social Livestreams. Devdeep Ray, Jack Kosaian, K. V. Rashmi, Srinivasan Seshan. ACM SIGCOMM, August 19-24, 2019, Beijing, China.
    Abstract / PDF [6.75M]

  • TVARAK: Software-Managed Hardware Offload for DAX NVM Storage Redundancy. Rajat Kateja, Nathan Beckmann, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-105, Aug 2019.
    Abstract / PDF [975K]

  • STRADS-AP: Simplifying Distributed Machine Learning Programming without Introducing a New Programming Model. Jin Kyu Kim, Abutalib Aghayev, Garth A. Gibson, Eric P. Xing. Proceedings of the 2019 USENIX Annual Technical Conference, July 10–12, 2019 • Renton, WA.
    Abstract / PDF [490K]

  • Rateless Codes for Distributed Computations with Sparse Compressed Matrices. Ankur Mallick, Gauri Joshi. IEEE International Symposium on Information Theory (ISIT), July 7-12, 2019, Paris, France.
    Abstract / PDF [672K]

  • Peering through the Dark: An Owl’s View of Inter-job Dependencies and Jobs’ Impact in Shared Clusters. Andrew Chung, Carlo Curino, Subru Krishnan, Konstantinos Karanasos, Panagiotis Garefalakis, Gregory R. Ganger. SIGMOD ’19, June 30–July 5, 2019, Amsterdam, Netherlands.
    Abstract / PDF [1.6M]

  • Distribution-based Cluster Scheduling. Jun Woo Park. Carnegie Mellon University School of Computer Science PhD Dissertation, June 2019.
    Abstract / PDF [1.47M]

  • Enabling Practical Processing in and Near Memory for Data-Intensive Computing. Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun. Proc. of the Design Automation Conference (DAC), Las Vegas, NV, June 2019.
    Abstract / PDF [477K]

  • CROW: A Low-Cost Substrate for Improving DRAM Performance, Energy Efficiency, and Reliability. Hasan Hassan, Minesh Patel, Jeremie. S. Kim, A. Giray Yaglikçi, Nandita Vijaykumar, Nika Mansouri Ghiasi, Saugata Ghose, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.45M]

  • CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators. Amarali Boroumand, Saugata Ghose, Minesh Patel, Hasan Hassan, Brandon Lucia, Rachata Ausavarungnirun, Kevin Hsieh, Nastaran Hajinazar, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Phoenix, AZ, June 2019.
    Abstract / PDF [1.1M]

  • Understanding the Interactions ofWorkloads and DRAM Types: A Comprehensive Experimental Study. Saugata Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu. Proc. of the Joint ACM SIGMETRICS/IFIP Performance Conference, Phoenix, AZ, June 2019; To appear in Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2019.
    Abstract / PDF [2M]

  • Compact Filter Structures for Fast Data Partitioning. Qing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary A. Grider. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-104, June 2019.
    Abstract / PDF[574K]

  • Improving ML Applications in Shared Computing Environments. Aaron Harlap. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, May 2019.
    Abstract / PDF [1.4M]

  • This is Why ML-driven Cluster Scheduling Remains Widely Impractical. Michael Kuchnik, Jun Woo Park, Chuck Cranor, Elisabeth Moore, Nathan DeBardeleben, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-103, May 2019.
    Abstract / PDF [715K]

  • Fast and Efficient Distributed Matrix-Vector Multiplication Using Rateless Fountain Codes. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 12 - 17 May, 2019 · Brighton, UK.
    Abstract / PDF [485K]

  • Reconciling LSM-Trees with Modern Hard Drives using BlueFS. Abutalib Aghayev, Sage Weil, Gregory R. Ganger, George Amvrosiadis. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-102, April 2019.
    Abstract / PDF [735K]

  • Intelligence Beyond the Edge: Inference on Intermittent Embedded Systems. Graham Gobieski, Brandon Lucia, Nathan Beckmann Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’19), April 13th – April 17th, 2019, Providence, RI.
    Abstract / PDF [3.35M]

  • Lazy Redundancy for NVM Storage: Handing the Performance-Reliability Tradeoff to Applications. Rajat Kateja, Andy Pavlo, Gregory R. Ganger Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-19-101, April 2019.
    Abstract / PDF [800K]

  • Scaling Video Analytics on Constrained Edge Nodes. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. 2nd SysML Conference (SysML ’19). March 31-April 2, 2019, Palo Alto, CA.
    Abstract / PDF [8.5M]

  • Automating Dependence-Aware Parallelization of Machine Learning Training on Distributed Shared Memory. Jinliang Wei, Garth A. Gibson, Phillip B. Gibbons, Eric P. Xing. EuroSys '19: Proceedings of the Fourteenth EuroSys Conference, March 2019, Dresden, Germany.
    Abstract / PDF [1.1M]

  • Datacenter RPCs can be General and Fast. Anuj Kalia, Michael Kaminsky, David G. Andersen. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Feb. 26–28, 2019, Boston, MA. Best Paper award!
    Abstract / PDF [555K]

  • Non-Volatile Memory Database Management Systems. Joy Arulraj, Andrew Pavlo. Synthesis Lectures on Data Management, Morgan & Claypool Publishers, February 2019.
    Abstract / PDF currently unavailable

  • Cluster Storage Systems Gotta Have HeART: Improving Storage Efficiency by Exploiting Disk-reliability Heterogeneity. Saurabh Kadekodi, K. V. Rashmi, Gregory R. Ganger. 17th USENIX Conference on File and Storage Technologies (FAST '19) Feb. 25–28, 2019 Boston, MA.
    Abstract / PDF [1.1M]

  • A Scalable Priority-Aware Approach to Managing Data Center Server Power. Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu. HPCA 2019: The 25th International Symposium on High-Performance Computer Architecture, February 16 - 20, 2019, Washington D.C.
    Abstract / PDF [610K]

  • External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems. Andrew Pavlo, Matthew Butrovich, Ananya Joshi, Lin Ma, Prashanth Menon, Dana Van Aken, Lisa Lee, Ruslan Salakhutdinov. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 42(2): 32-46 (2019).
    Abstract / PDF [555K]

2018

  • What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study. Suagata. Ghose, Abdullah Giray Yaglikçi, Raghav Gupta, Donghyuk Lee, Kais. Kudrolli, William. X. Liu, Hasan Hassan, Kevin K. Chang, Niladrish Chatterjee, Aditya Agrawal, Mike O'Connor, Onur Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [2.6M]

  • Improving 3D NAND Flash Memory Lifetime by Tolerating Early Retention Loss and Process Variation. Y. Luo, S. Ghose, Y. Cai, E. F. Haratsch, O. Mutlu. Proc. of the ACM SIGMETRICS Conference, Irvine, CA, June 2018; Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 2, No. 3, December 2018.
    Abstract / PDF [3.2M]

  • SRPT for Multiserver Systems. Isaac Grosof, Ziv Scully, Mor Harchol-Balter. Performance Evaluation , vol. 127-128, Nov. 2018, pp. 154-175. Also in Proc. 36th International Symposium on Computer Performance, Modeling, Measurements, and Evaluation (Performance 2018) , Toulouse, France, December 2018. Best Student Paper Award.
    Abstract / PDF [780K]

  • Towards Lightweight and Robust Machine Learning for CDN Caching. Daniel S. Berger. HotNets-XVII, November 15–16, 2018, Redmond, WA, USA.
    Abstract / PDF [610K]

  • Scaling Embedded In-Situ Indexing with DeltaFS. Qing Zheng, Charles D. Cranor, Danhao Guo, Gregory R. Ganger, George Amvrosiadis, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider, Fan Guo. SC18, November 11-16, 2018, Dallas, Texas, USA.
    Abstract / PDF [927K]

  • Stratus: Cost-aware Container Scheduling in the Public Cloud. Andrew Chung, Jun Woo Park, Gregory R. Ganger. ACM Symposium on Cloud Computing, 2018 (SoCC’18), Carlsbad, CA October 11-13, 2018.
    Abstract / PDF [1.5M]

  • Focus: Querying Large Video Datasets with Low Latency and Low Cost. Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Oct. 8–10, 2018, Carlsbad, CA.
    Abstract / PDF [1.2M]

  • RobinHood: Tail Latency Aware Caching—Dynamic Reallocation from Cache-Rich to Cache-Poor. Daniel S. Berger, Benjamin Berg, Timothy Zhu, Siddhartha Sen, Mor Harchol-Balter. 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’18). October 8–10, 2018 • Carlsbad, CA, USA.
    Abstract / PDF [2.9M]

  • SOAP Bubbles: Robust Scheduling Under Adversarial Noise. Ziv Scully, Mor Harchol-Balter. 56th Annual Allerton Conference on Communication, Control, and Computing, 2-5 Oct. 2018. Monticello, IL.
    Abstract / PDF [245K]

  • Exploiting Locality in Graph Analytics through Hardware-Accelerated Traversal Scheduling. Anurag Mukkara, Nathan Beckmann, Maleen Abeydeera, Xiaosong Ma, Daniel Sanchez. 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 20-24 Oct. 2018, Fukuoka, Japan.
    Abstract / PDF [660K]

  • The Parallel Persistent Memory Model. Guy E. Blelloch, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. SPAA ’18, July 16–18, 2018, Vienna, Austria.
    Abstract / PDF [760K]

  • Putting the “Micro” Back in Microservice. Sol Boucher, Anuj Kalia, David G. Andersen, Michael Kaminsky. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [740K]

  • Geriatrix: Aging What You See and What You Don’t See -- A File System Aging Approach for Modern Storage Systems. Saurabh Kadekodi, Vaishnavh Nagarajan, Gregory R. Ganger, Garth A. Gibson. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.44M]

  • Cavs: An Efficient Runtime System for Dynamic Neural Networks. Shizhen Xu, Hao Zhang, Graham Neubig, Wei Dai, Jin Kyu Kim, Zhijie Deng, Qirong Ho, Guangwen Yang, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [1.7M]

  • Litz: Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA.
    Abstract / PDF [298K]

  • Mainstream: Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela H. Jiang, Daniel L.K. Wong, Christopher Canel, Lilia Tang, Ishan Misra, Michael Kaminsky*, Michael A. Kozuch*, Padmanabhan Pillai*, David G. Andersen Gregory R. Ganger. 2018 USENIX Annual Technical Conference (USENIX ATC ’18). July 11–13, 2018 • Boston, MA, USA.
    Abstract / PDF [1.5M]

  • Tributary: Spot-dancing for Elastic Services with Latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. 2018 USENIX Annual Technical Conference. July 11–13, 2018 Boston, MA, USA. Supersedes Carnagie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102.
    Abstract / PDF [1.25M]

  • On the Diversity of Cluster Workloads and its Impact on Research Results. George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. 2018 USENIX Annual Technical Conference (ATC '18), Boston, MA, July 11-13, 2018.
    Abstract / PDF [285K]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson, Gregory R. Ganger. 10th USENIX Workshop on Hot Topics in Cloud Computing, July 9, 2018, Boston, MA. Supersedes CMU-PDL-17-105.
    Abstract / PDF [250K]

  • FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. A. Tavakkol, M. Sadrosadati, S. Ghose, J. Kim, Y. Luo, Y. Wang, N. M. Ghiasi, L. Orosa, J. Gómez-Luna, O. Mutlu. Proc. of the International Symposium on Computer Architecture (ISCA), Los Angeles, CA, June 2018.
    Abstract / PDF [888K]

  • Learning a Code: Machine Learning for Approximate Non-Linear Coded Computation. Jack Kosaian, K.V. Rashmi, Shivaram Venkataraman. arXiv:1806.01259v1 [cs.LG], 4 Jun 2018
    Abstract / PDF [575K]

  • Practical Bounds on Offline Caching with Variable Object Sizes. Daniel Berger, Nathan Beckmann, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 2, Article 32. June 2018. POMACS 2018.
    Abstract / PDF [1.2M]

  • Query-based Workload Forecasting for Self-Driving Database Management Systems. Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, Geoffrey J. Gordon. SIGMOD/PODS '18 International Conference on Management of Data, Houston, TX, USA, June 10 - 15, 2018.
    Abstract / PDF [1.25M]

  • Building a Bw-Tree Takes More Than Just Buzz Words. Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis, Huanchen Zhang, Michael Kaminsky, David G. Andersen. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.
    Abstract / PDF [2.2M]

  • SuRF: Practical Range Query Filtering with Fast Succinct Tries. Huanchen Zhang, Hyeontaek Lim, Viktor Leis, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo. SIGMOD’18, June 10–15, 2018, Houston, TX, USA.BEST PAPER AWARD!
    Abstract / PDF [1.9M]

  • The Locality Descriptor: A Holistic Cross-Layer Abstraction to Express Data Locality in GPUs. Nandita Vijaykumar, Eiman Ebrahimi, Kevin Hsieh, Phillip B. Gibbons, Onur Mutlu. The 45th International Symposium on Computer Architecture - June 2-6, ISCA 2018. Los Angeles, California, USA.
    Abstract / PDF [3.1M]

  • A Case for Richer Cross-layer Abstractions: Bridging the Semantic Gap with Expressive Memory. Nandita Vijaykumar, Abhilasha Jain, Diptesh Majumdar, Kevin Hsieh, Gennady Pekhimenko, Eiman Ebrahimi, Nastaran Hajinazaru, Phillip B. Gibbons, Onur Mutlu. 45th International Symposium on Computer Architecture (ISCA), Los Angeles, CA, USA, June 2018.
    Abstract / PDF [2M]

  • Practical Bounds on Optimal Caching with Variable Object Sizes. Daniel S. Berger, Nathan Beckmann, Mor Harchol-Balter. Proceedings of the ACM on Measurement and Analysis of Computing Systems. Vol. 2, No. 2, Article 32, June 2018.
    Abstract / PDF [1.2M]

  • Implicit Decomposition for Write-Efficient Connectivity Algorithms. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, and Julian Shun. 2018 International Parallel and Distributed Processing Symposium (IPDPS '18). May 21-25, 2018, Vancouver, BC, Canada.
    Abstract / PDF [716K]

  • 3Sigma: Distribution-based Cluster Scheduling for Runtime Uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. EuroSys ’18, April 23–26, 2018, Porto, Portugal. Supersedes CMU-PDL-17-107, Nov. 2017.
    Abstract / PDF [1.4M]

  • LHD: Improving Cache Hit Rate by Maximizing Hit Density. Nathan Beckmann, Haoxian Chen, Asaf Cidon. 15th USENIX Symposium on Networked Systems Design and Implementation ({NSDI} 18), April 9-11, 2018, Renton, WA..
    Abstract / PDF [1.1M]

  • Better Caching in Search Advertising Systems with Rapid Refresh Predictions. Conglong Li, David G. Andersen, Qiang Fu, Sameh Elnikety, Yuxiong He. Proceedings of the 2018 World Wide Web Conference, Lyon, France, April 23 - 27, 2018.
    Abstract / PDF [1.1M]

  • Rateless Codes for Near-Perfect Load Balancing in Distributed Matrix-Vector Multiplication. Ankur Mallick, Malhar Chaudhari, Gauri Joshi. arXiv:1804.10331v2 [cs.DC] 30 Apr 2018.
    Abstract / PDF [1.1M]

  • GoogleWorkloads for Consumer Devices: Mitigating Data Movement Bottlenecks. Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu. ASPLOS’18, March 24–28, 2018, Williamsburg, VA, USA.
    Abstract / PDF [885K]

  • LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching. Mohammad Sadrosadati, Amirhossein Mirhosseini, Seyed Borna Ehsani, Hamid Sarbazi-Azad, Mario Drumond, Babak Falsafi, Rachata Ausavarungnirun, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.M]

  • MASK: Redesigning the GPU Memory Hierarchy to Support Multi-Application Concurrency. Rachata Ausavarungnirun, Vance Miller, Joshua Landgraf, Saugata Ghose, Jayneel Gandhi, Adwait Jog, Christopher J. Rossbach, Onur Mutlu. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.1M]

  • Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability. Maciej Besta, Syed Minhaj Hassan, Sudhakar Yalamanchili, Rachata Ausavarungnirun, Onur Mutlu, Torsten Hoefler. ASPLOS2018. The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, March 24th – March 28th, Williamsburg, VA, USA.
    Abstract / PDF [1.6M]

  • SOAP: One Clean Analysis of All Age-Based Scheduling Policies. Ziv Scully, Mor Harchol-Balter, Alan Scheller-Wolf. Proc. ACM Meas. Anal. Comput. Syst., Vol. 2, No. 1, Article 16, March 2018.
    Abstract / PDF [885K]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. arXiv:1803.07445v1 [cs.LG] 20 Mar 2018.
    Abstract / PDF [1M]

  • Dynamic Stem-Sharing for Multi-Tenant Video Processing. Angela Jiang, Christopher Canel, Daniel Wong, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, David G. Andersen, Gregory R. Ganger. SysML 18, February 15–16, 2018. Stanford, CA.
    Abstract / PDF [450K]

  • MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. A. Tavakkol, J. Gómez-Luna, M. Sadrosadati, S. Ghose, and O. Mutlu. USENIX Conference on File and Storage Technologies (FAST), Oakland, CA, February 2018.
    Abstract / PDF [2.25M]

  • 3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. arXiv:1802.07389v1 [cs.LG] 21 Feb 2018.
    Abstract / PDF [586K]

  • Efficient Multi-Tenant Inference on Video using Microclassifiers. Giulio Zhou, Thomas Kim, Christopher Canel, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [1.5M]

  • PipeDream: Fast and Efficient Pipeline Parallel DNN Training. Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Gregory R. Ganger, Phil Gibbons. SysML '18, Feb. 15-16, 2018 , Stanford, CA.
    Abstract / PDF [615K]

  • Intermittent Deep Neural Network Inference. Graham Gobieski, Nathan Beckmann, Brandon Lucia. SysML 2018, February 15-16, 2018, Stanford, CA.
    Abstract / PDF [450K]

  • Picking Interesting Frames in Streaming Video. Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor. SysML’18, February 15–16, 2018, Stanford, CA.
    Abstract / PDF [913K]

  • Tributary: Spot-dancing for elastic services with latency SLOs. Aaron Harlap, Andrew Chung, Alexey Tumanov, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-102, Jan. 2018.
    Abstract / PDF [990K]

  • Addressing the Long-Lineage Bottleneck in Apache Spark. Haoran Wang, Jinliang Wei, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-18-101, January 2018.
    Abstract / PDF [250K]

2017

  • Towards Optimality in Parallel Job Scheduling. Benjamin Berg, Jan-Pieter Dorsman, Mor Harchol-Balter. Proc. ACM Meas. Anal. Comput. Syst., Vol. 1, No. 2, Article 40. Publication date: December 2017.
    Abstract / PDF [4.3M]

  • SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data. Kai Ren, Qing Zheng, Joy Arulraj, Garth A. Gibson. Proceedings of the VLDB Endowment, Vol. 10, No. 13, 2017.
    Abstract / PDF [2.15M]

  • 3Sigma: Distribution-based cluster scheduling for runtime uncertainty. Jun Woo Park, Alexey Tumanov, Angela Jiang, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-107, November 2017.
    Abstract / PDF [800K]

  • Software-Defined Storage for Fast Trajectory Queries using a DeltaFS Indexed Massive Directory. Qing Zheng, George Amvrosiadis, Saurabh Kadekodi, Garth A. Gibson, Chuck Cranor, Brad Settlemyer, Gary Grider, Fan Guo. PDSW-DISCS 2017: 2nd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems held in conjunction with SC17, Denver, CO, November 2017.
    Abstract / PDF [1.25M]

  • Aging Gracefully with Geriatrix: A File System Aging Tool. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-106, October 2017. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [560K]

  • A Case for Packing and Indexing in Cloud File Systems. Saurabh Kadekodi, Bin Fan, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-105, October 2017.
    Abstract / PDF [280K]

  • Bigger, Longer, Fewer: What do cluster jobs look like outside Google? George Amvrosiadis, Jun Woo Park, Gregory R. Ganger, Garth A. Gibson, Elisabeth Baseman, Nathan DeBardeleben. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-17-104, October 2017.
    Abstract / PDF [360K]

  • Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes. Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach & Onur Mutlu. Proc. of the International Symposium on Microarchitecture (MICRO), Cambridge, MA, October 2017.
    Abstact / PDF [1.32M]

  • Ambit: In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology. Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons & Todd C. Mowry. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [2.5M]

  • Detecting and Mitigating Data-Dependent DRAM Failures by Exploiting Current Memory Content. Samira Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee & Onur Mutlu. Proceedings of the 50th International Symposium on Microarchitecture (MICRO), Boston, MA, USA, October 2017.
    Abstact / PDF [1.5M]

  • WorkloadCompactor: Reducing datacenter cost while providing tail latency SLO guarantees. Timothy Zhu, Michael A. Kozuch & Mor Harchol-Balter. ACM Symposium on Cloud Computing (SoCC'17) , Santa Clara, Oct 2017.
    Abstact / PDF [3.25M]

  • Utility-Based Hybrid Memory Management. Yang Li, Saugata Ghose, Jongmoo Choi, Jin Sun, Hui Wang & Onur Mutlu. In Proc. of the IEEE Cluster Conference (CLUSTER), Honolulu, HI, September 2017.
    Abstact / PDF [588K]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size. Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf & Benny Van Houdt. Transactions on Networking, September 2017.
    Abstact / PDF [544K]

  • Error Characterization, Mitigation, and Recovery in Flash-Memory-Based Solid-State Drives. Yu Cai, Saugata Ghose, Erich F. Haratsch, Yixin Luo & Onur Mutlu. Proceedings of the IEEE Volume: 105, Issue: 9, Sept. 2017.
    Abstact / PDF [5.3M]

  • Workload Analysis and Caching Strategies for Search Advertising Systems. Conglong Li, David G. Andersen, Qiang Fu, Sameh Elnikety, Yuxiong He. SoCC ’17, September 24–27, 2017, Santa Clara, CA, USA.
    Abstract / PDF [650K]

  • Scheduling for Efficiency and Fairness in Systems with Redundancy. Kristen Gardner, Mor Harchol-Balter, Esa Hyyti & Rhonda Righter. Performance Evaluation, July 2017.
    Abstact / PDF [784K]

  • Litz: An Elastic Framework for High-Performance Distributed Machine Learning. Aurick Qiao, Abutalib Aghayev, Weiren Yu, Haoyang Chen, Qirong Ho, Garth A. Gibson, Eric P. Xing. Carnegie Mellon Univedrsity Parallel Data Laboratory Technical Report CMU-PDL-17-103. June 2017.
    Abstract / PDF [424K]

  • Cachier: Edge-caching for Recognition Applications. Utsav Drolia, Katherine Guo (Bell Labs), Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. The 37th IEEE International Conference on Distributed Computing Systems (ICDCS 2017), June 5 – 8, 2017, Atlanta, GA, USA.
    Abstract / PDF [5.4M]

  • Carpool: A Bufferless On-Chip Network Supporting Adaptive Multicast and Hotspot Alleviation. Xiyue Xiang, Wentao Shi, Saugata Ghose, Lu Peng, Onur Mutlu & Nian-Feng Tzeng. In Proc. of the International Conference on Supercomputing (ICS), Chicago, IL, June 2017.
    Abstact / PDF [6.7M]

  • Viyojit: Decoupling Battery and DRAM Capacities for Battery-Backed DRAM. Rajat Kateja, Anirudh Badam, Sriram Govindan, Bikash Sharma, Gregory R. Ganger. ISCA ’17, June 24-28, 2017, Toronto, ON, Canada.
    Abstract / PDF [1M]

  • Understanding Reduced-Voltage Operation in Modern DRAM Devices: Experimental Characterization, Analysis, and Mechanisms. Kevin K. Chang, A. Giray Yaglikçi, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O’Connor, Hasan Hassan & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [4M]

  • Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms. Donghyuk Lee, Samira Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri & Onur Mutlu. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 1, No. 1, June 2017.
    Abstact / PDF [2.5M]

  • Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last. Prashanth Menon, Todd C. Mowry & Andrew Pavlo. Proceedings of the VLDB Endowment, Vol. 11, No. 1, 2017.
    Abstact / PDF [970K]

  • Efficient Redundancy Techniques for Latency Reduction in Cloud Systems. Gauri Joshi, Emina Soljanin & Gregory Wornell. ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS) Volume 2 Issue 2, May 2017.
    Abstact / PDF [1.38M]

  • Automatic Database Management System Tuning Through Large-scale Machine Learning. Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, Bohan Zhang. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017. Chicago, IL, USA.
    Abstract / PDF [760K]

  • Online Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. ACM SIGMOD International Conference on Management of Data, May 14-19, 2017.
    Abstract / PDF [890K]

  • Proteus: Agile ML Elasticity through Tiered Reliability in Dynamic Resource Markets. Aaron Harlap, Alexey Tumanov, Andrew Chung, Gregory R. Ganger, Phil Gibbons. ACM European Conference on Computer Systems, 2017 (EuroSys'17), 23rd-26th April, 2017, Belgrade, Serbia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [743K]

  • An Empirical Evaluation of In-Memory Multi-Version Concurrency Control. Yingjun Wu, Joy Arulraj, Jiexi Lin, Ran Xian, Andrew Pavlo. Proceedings of the VLDB Endowment, vol. 10, iss. 7, pages. 781—792, March 2017.
    Abstract / PDF [660K]

  • AdaptSize: Orchestrating the Hot Object Memory Cache in a Content Delivery Network. Daniel S. Berger, Ramesh K. Sitaraman, Mor Harchol-Balter. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17). March 27–29, 2017, Boston, MA.
    Abstract / PDF [560K]

  • Improving the Reliability of Chip-off Forensic Analysis of NAND Flash Memory Devices. Aya Fukami, Saugata Ghose, Yixin Luo, Yu CaI, Onur Mutlu. DFRWS Digital Forensics Research Conference Europe (DFRWS EU), March 21 - 23, 2017 Lake Constance, Germany.
    Abstract / PDF [1.5M]

  • Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds. Kevin Hsieh, Aaron Harlap, Nandita Vijaykumar, Dimitris Konomis, Gregory R. Ganger, Phillip B. Gibbons, Onur Mutlu. 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI), March 27–29, 2017, Boston, MA.
    Abstract / PDF [1.5M]

  • Towards Edge-caching for Image Recognition. Utsav Drolia, Katherine Guo, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. First Workshop on Smart Edge Computing and Networking (SmartEdge) '17, held in conjunction with PerCom 2017, March 13 - 17, 2017, Hawaii, USA.
    Abstract / PDF [5.1M]

  • Evolving Ext4 for Shingled Disks. Abutalib Aghayev, Theodore Ts’o, Garth A. Gibson, Peter Desnoyers. 15th USENIX Conference on File and Storage Technologies (FAST '17), Feb 27–Mar 2, 2017. Santa Clara, CA.
    Abstract / PDF [1.4M]

  • Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques. Yu Cai, Saugata Ghose, Yixin Luo, Ken Mai, Onur Mutlu, Erich F. Haratsch. 23rd IEEE Symposium on High Performance Computer Architecture, Industrial session, February 2017.
    Abstract / PDF [8.4M]

  • SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies. Hasan Hassan,Nandita Vijaykumar, Samira Khan, Saugata Ghose, Kevin Chang, Gennady Pekhimenko, Donghyuk Lee, Oguz Ergin, Onur Mutlu. International Symposium on High-Performance Computer Architecture (HPCA), February 2017.
    Abstract / PDF [1.6M]

  • An Evaluation of Distributed Concurrency Control. Rachael Harding, Dana Van Aken, Andrew Pavlo, Michael Stonebraker. Proceedings of the VLDB Endowment, vol. 10, iss. 5, pages. 553—564, January 2017.
    Abstract / PDF [421K]

  • Self-Driving Database Management Systems. A. Pavlo, G. Angulo, J. Arulraj, H. Lin, J. Lin, L. Ma, P. Menon, T. Mowry, M. Perron, I. Quah, S. Santurkar, A. Tomasic, S. Toor, D. V. Aken, Z. Wang, Y. Wu, R. Xian, and T. Zhang. In CIDR 2017, Conference on Innovative Data Systems Research. January 8-11, 2017, Chaminade, CA.
    Abstract / PDF [680K]

 

2016

  • Write-Behind Logging. J. Arulraj, M. Perron, A. Pavlo. Proc. VLDB Endow., vol. 10, pp. 337-348, December, 2016.
    Abstract / PDF [931K]

  • Prescriptive Safety-Checks through Automated Proofs for Control-Flow Integrity. Jiaqi Tan. Carnegie Mellon University Electrical and Computer Engineering PhD Dissertation, November 2016.
    Abstract / PDF [5.75M]

  • A Survey of Security Vulnerabilities in Bluetooth Low Energy Beacons. Hui Jun Tay, Jiaqi Tan, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-109. November 2016.
    Abstract / PDF [110K]

  • AUSPICE-R: Automatic Safety-Property Proofs for Realistic Features in Machine Code. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, Priya Narasimhan.14th Asian Symposium on Programming Languages and Systems (APLAS), November 2016.
    Abstract / PDF [325K]

  • FaSST: Fast, Scalable and Simple Distributed Transactions with Two-sided (RDMA) Datagram RPCs. Anuj Kalia, Michael Kaminsky, David G. Andersen.12th USENIX Symposium on Operating Systems Design and Implementation November 2–4, 2016, Savannah, GA, USA.
    Abstract / PDF [608K]

  • EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding. K. V. Rashmi, Mosharaf Chowdhury, Jack Kosaian, Ion Stoica & Kannan Ramchandran. 12th USENIX Symposium on Operating Systems Design and Implementation, Nov. 2–4, 2016, Savannah, GA.
    Abstract / PDF [830K]

  • Stateless Model Checking with Data-Race Preemption Points. Ben Blum, Garth A. Gibson. SPLASH 2016 OOPSLA, Oct 30 - Nov 4, 2016, Amsterdam, Netherlands.
    Abstract / PDF [704K]

  • MLtuner: System Support for Automatic Machine Learning Tuning. Henggang Cui, Gregory R. Ganger, and Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-108, October 2016.
    Abstract / PDF [900K]

  • Aging Gracefully with Geriatrix: A File System Aging Suite. Saurabh Kadekodi, Vaishnavh Nagarajan, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-105. October, 2016.
    Abstract / PDF [503K]

  • Benchmarking Apache Spark with Machine Learning Applications. Jinliang Wei, Jin Kyu Kim, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-107 October 2016.
    Abstract / PDF [360K]

  • Zorua: A Holistic Approach to Resource Virtualization in GPUs. Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha,Saugata Ghose, Adwait Jogu, Phillip B. Gibbons, Onur Mutlu. 49th IEEE/ACM International Symposium on Microarchitecture (MICRO’16), October 15-19, 2016, Taipei, Taiwan.
    Abstract / PDF [1.5M]

  • Principled Workflow-centric Tracing of Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Jonathan Mace, Benjamin H. Sigelman, Rodrigo Fonseca, Gregory R. Ganger. ACM Symposium on Cloud Computing 2016 (SoCC ’16) October 5-7, 2016, Santa Clara, CA, USA.
    Abstract / PDF [590K]

  • SNC-Meister: Admitting More Tenants with Tail Latency SLOs. Timothy Zhu, Daniel S. Berger, Mor Harchol-Balter. SoCC ’16, October 05-07, 2016, Santa Clara, CA, USA.
    Abstract / PDF [500K]

  • A Model for Application Slowdown Estimation in On-Chip Networks and Its Use for Improving System Fairness and Performance. Xiyue Xiang, Saugata Ghose, Onur Mutlu, Nian-Feng Tzeng. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [399K]

  • Accelerating Pointer Chasing in 3D-Stacked Memory: Challenges, Mechanisms, Evaluation.Kevin Hsieh, Samira Khan, Nandita Vijaykumar, Kevin K. Chang, Amirali Boroumand, Saugata Ghose, Onur Mutlu. International Conference on Computer Design (ICCD), October 3-5, 2016, Phoenix, USA.
    Abstract / PDF [1.67M]

  • PCFIRE: Towards Provable Preventative Control-Flow Integrity Enforcement for Realistic Embedded Software. Jiaqi Tan, Hui Jun Tay, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. EMSOFT’16, October 01-07, 2016, Pittsburgh, PA, USA.
    Abstract / PDF [722K]

  • Poster Abstract: BUFS: Towards Bottom-Up Foundational Security for Software in the Internet-of-Things. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 1st IEEE/ACM Symposium on Edge Computing (SEC 2016), October 2016.
    Abstract / PDF [682K]

  • Addressing the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2016. Oct 5-7, Santa Clara, CA. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • μC-States: Fine-grained GPU Datapath Power Management. Onur Kayıran, Adwait Jog, Ashutosh Pattnaik, Rachata Ausavarungnirun, Xulong Tang, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das. Proceedings of the The 25th International Conference on Parallel Architectures and Compilation Techniques (PACT 2016), Haifa, Israel, September 2016.
    Abstract / PDF [823K]

  • Online Deduplication for Distributed Databases. Lianghong Xu. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, September 2016.
    Abstract / PDF [1.8M]

  • JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes. Alexey Tumanov, Angela Jiang, Jun Woo Park, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-104. September 2016.
    Abstract / PDF [1.6M]

  • A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size Kristen Gardner, Mor Harchol-Balter, Alan Scheller-Wolf. IEEE Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS 2016), London, UK, September 2016.
    Abstract / PDF [244K]

  • Soundness Proofs for Iterative Deepening. Ben Blum. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-103, September 6, 2016.
    Abstract / PDF [356K]

  • Efficient Algorithms with Asymmetric Read and Write Costs. Guy E Blelloch, Jeremy T Fineman, Phillip B Gibbons, Yan Gu, Julian Shun. 24th European Symposium on Algorithms (ESA’16). August, 2016.
    Abstract / PDF [623K]

  • Parallel Algorithms for Asymmetric Read-Write Costs. Naama Ben-David, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, Yan Gu, Charles McGuffey, Julian Shun. 28th ACM Symposium on Parallelism in Algorithms and Architectures Jul 11, 2016 - Jul 13, 2016. Asilomar State Beach, California, USA.
    Abstract / PDF [386K]

  • Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. Joy Arulraj, Andrew Pavlo, Prashanth Menon. SIGMOD’16, June 26-July 01, 2016, San Francisco, CA, USA.
    Abstract / PDF [575K]

  • PARBOR: An Efficient System-Level Technique to Detect Data-Dependent Failures in DRAM. Samira Khan, Donghyuk Lee, Onur Mutlu. Proceedings of the 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Toulouse, France, June 28 - July 1 2016.
    Abstract / PDF [630K]

  • Larger-than-Memory Data Management on Modern Storage Hardware for In-Memory OLTP Database Systems. Lin Ma, Joy Arulraj, Sam Zhao, Andrew Pavlo, Subramanya R. Dulloor, Michael J. Giardino, Jeff Parkhurst, Jason L. Gardner, Kshitij Dosh*, Col. Stanley Zdonik. DaMoN’16, June 26-July 01 2016, San Francisco, CA, USA.
    Abstract / PDF [1.25M]

  • Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems. Kevin Hsieh, Eiman Ebrahimi, Gwangsun Kim, Niladrish Chatterjee, Mike O'Connor, Nandita Vijaykumar, Onur Mutlu§, Stephen W. Keckler. Proceedings of the 43rd International Symposium on Computer Architecture (ISCA), Seoul, South Korea, June 18 - 22, 2016.
    Abstract / PDF [1M]

  • Understanding Latency Variation in Modern DRAM Chips: Experimental Characterization, Analysis, and Optimization. Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Khan, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), Antibes Juan-Les-Pins, France, June 14 - 18, 2016.
    Abstract / PDF [3M]

  • Design Guidelines for High Performance RDMA Systems. Anuj Kalia, Michael Kaminsky, David G. Andersen. 2016 USENIX Annual Technical Conference (USENIX ATC'16), June 2016.
    Abstract / PDF [553K]

  • Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes. Huanchen Zhang, Andy Pavlo, David G. Andersen, Michael Kaminsky, Lin Ma, Rui Shen. ACM SIGMOD International Conference on Management of Data 2016 (SIGMOD'16), June 2016.
    Abstract / PDF [715K]

  • Achieving One Billion Key-Value Requests Per Second on a Single Server. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. IEEE Micro's Top Picks from the Computer Architecture Conferences 2016, May/June 2016. Top Picks 2016 Award!
    Abstract / PDF [176K]

  • A Case for Hierarchical Rings with Deflection Routing: An energy-efficient on-chip communication substrate. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu, Parallel Computing, Volume 54, May 2016, Pages 29-45, ISSN 0167-8191.
    Abstract / PDF [2M]

  • TierML: Using Tiers of Reliability for Agile Elasticity in Machine Learning. Aaron Harlap, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016.
    Abstract / PDF [590K]

  • Similarity-based Deduplication for Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-101, April 2016.
    Abstract / PDF [1M]

  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [617K]

  • TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters. Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [8M]

  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth A. Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016 (EuroSys'16), 18th-21st April, 2016, London, UK.
    Abstract / PDF [1.6M]

  • Full-Stack Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. ACM Transactions on Computer Systems (TOCS), Vol. 34, No. 2, April 2016.
    Abstract / PDF [1.14M]

  • Be Fast, Cheap and in Control with SwitchKV. Xiaozhou Li, Raghav Sethi, Michael Kaminsky, David G. Andersen, Michael J. Freedman. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI'16), Santa Clara, CA, March 2016.
    Abstract / PDF [594K]

  • ChargeCache: Reducing DRAM Latency by Exploiting Row Access Locality. Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [2M]

  • Low-Cost Inter-Linked Subarrays (LISA): Enabling Fast Inter-Subarray Data Movement in DRAM. Kevin K. Chang, Prashant J. Nair, Donghyuk Lee, Saugata Ghose, Moinuddin K. Qureshi, and Onur Mutlu. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [768K]

  • A Case for Toggle-Aware Compression for GPU Systems. Gennady Pekhimenko, Evgeny Bolotin, Nandita Vijaykumar, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA), Barcelona, Spain, March 2016.
    Abstract / PDF [713K]

  • SizeCap: Efficiently Handling Power Surges in Fuel Cell Powered Data Centers. Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu. 22nd International Symposium on High Performance Computer Architecture (HPCA), March 12-16, Barcelona, Spain, 2016.
    Abstract / PDF [1.32M]

  • Achieving both High Energy Efficiency and High Performance in On-Chip Communication using Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Kai-Wei Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu. arXiv:1602.06005v1 [cs.DC], 18 Feb 2016.
    Abstract / PDF [576K]

  • A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Saugata Ghose, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu. arXiv:1602.01348v1 [cs.AR]. 3 Feb 2016.
    Abstract / PDF [1.87M]

  • Towards Accurate and Fast Evaluation of Multi-Stage Log-Structured Designs. Hyeontaek Lim, David G. Andersen, Michael Kaminsky. In 14th USENIX Conference on File and Storage Technologies (FAST'16), Santa Clara, CA, February 2016.
    Abstract / PDF [2M]

  • Simultaneous Multi-Layer Access: Improving 3D-Stacked Memory Bandwidth at Low Cost. Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Khan, Onur Mutlu. ACM Transactions on Architecture and Code Optimization (TACO), Vol. 12, January 2016. Presented at the 11th HiPEAC Conference, Prague, Czech Republic, January 2016.
    Abstract / PDF [2M]

  • Enabling Accurate and Practical Online Flash Channel Modeling for Modern MLC NAND Flash Memory. Yixin Luo, Saugata Ghose, Yu Cai, Erich F. Haratsch, Onur Mutlu JSAC Special Issue, 2016.
    Abstract / PDF [4.2M]

2015

  • ThyNVM: Enabling Software-Transparent Crash Consistency in Persistent Memory Systems. Jinglei Ren, Jishen Zhao, Samira Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [460K]

  • Scheduling Techniques for Hybrid Circuit/Packet Networks. He Liu, Matthew K. Mukerjee, Conglong Li, Nicolas Feltman, George Papen, Stefan Savage, Srinivasan Seshan, Geoffrey M. Voelker, David G. Andersen, Michael Kaminsky, George Porter, Alex C. Snoeren. In 11th International Conference on emerging Networking EXperiments and Technologies (CoNEXT 2015), Heidelberg, Germany, December 2015. Nominated for Best Paper.
    Abstract / PDF [510K]

  • The Application Slowdown Model: Quantifying and Controlling the Impact of Inter-Application Interference at Shared Caches and Main Memory. Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Khan, Onur Mutlu. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [604K]

  • Gather-Scatter DRAM: In-DRAM Address Translation to Improve the Spatial Locality of Non-unit Strided Accesses. Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 48th International Symposium on Microarchitecture (MICRO), Waikiki, Hawaii, USA, December 2015.
    Abstract / PDF [874K]

  • DeltaFS: Exascale File Systems Scale Better Without Dedicated Servers. Qing Zheng, Kai Ren, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider. PDSW2015: 10th Parallel Data Storage Workshop, held in conjunction with SC15, Austin, TX, November 16, 2015.
    Abstract / PDF [930K]

  • High-Performance and Lightweight Transaction Support in Flash-Based SSDs. Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, Onur Mutlu. IEEE Transactions on Computers (TC), October 2015.
    Abstract / PDF [1.4M]

  • Decoupled Direct Memory Access: Isolating CPU and IO Traffic by Leveraging a Dual-Data-Port DRAM. Donghyuk Lee, Lavanya Subramanian, Rachata Ausavarungnirun, Jongmoo Choi, Onur Mutlu. Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques (PACT), San Francisco, CA, USA, October 2015.
    Abstract / PDF [1.8M]

  • Tracking and Reducing Uncertainty in Dataflow Analysis-Based Dynamic Parallel Monitoring. Michelle Goodstein, Phillip Gibbons, Michael Kozuch, Todd Mowry. International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), Oct 18, 2015 - Oct 21, 2015, San Francisco, CA.
    Abstract / PDF [341K]

  • Scalable Deep Learning on Distributed GPUs with a GPU-specialized Parameter Server. Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-107, October 2015.
    Abstract / PDF [537K]

  • Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance. Rachata Ausavarungnirun, Saugata Ghose, Onur Kayiran, Gabriel H. Loh, Chita R. Das, Mahmut T. Kandemir, Onur Mutlu. Proceedings of the The 24th International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), San Francisco, October 2015.
    Abstract / PDF [556K]

  • Krowd: A Key-Value Store for Crowded Venues. Utsav Drolia, Nathan Mickulicz, Rajeev Gandhi, Priya Narasimhan.10th ACM Workshop on Mobility in the Evolving Internet Architecture (MobiArch), held in Paris, France in September 2015. Best Paper.
    Abstract / PDF [696K]

  • A Low-Overhead, Fully-Distributed, Guaranteed-Delivery Routing Algorithm for Faulty Network-on-Chips. Mohammad Fattah, Antti Airola, Rachata Ausavarungnirun, Nima Mirzaei, Pasi Liljeberg, Juha Plosila, Siamak Mohammadi, Tapio Pahikkala, Onur Mutlu, Hannu Tenhunen. Proceedings of the 9th ACM/IEEE International Symposium on Networks on Chip (NOCS), Vancouver, BC, Canada, September 2015.
    Abstract / PDF [1M]

  • Resource-Efficient Data-Intensive System Designs for High Performance and Capacity. Hyeontaek Lim. Carnegie Mellon University PhD Dissertation CMU-CS-15-132, September 2015.
    Abstract / PDF [3.1M]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth A. Gibson. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [275K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy, Krishnamurthy Viswanathan, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI. See the extended Technical Report for more information.
    Abstract / PDF [1.3M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [369K]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta, Jin Li, Gregory R. Ganger. ACM Symposium on Cloud Computing 2015. Aug. 27 - 29, 2015, Kohala Coast, HI.
    Abstract / PDF [501K]

  • Scaling Up Clustered Network Appliances with ScaleBricks. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Michael Mitzenmacher, Ren Wang, Ajaypal Singh. Proc. ACM SIGCOMM 2015, August 17-21, 2015, London, United Kingdom.
    Abstract / PDF [626K]

  • Cuckoo Linear Algebra. Li Zhou, David G. Andersen, Mu Li, Alexander J. Smola. KDD’15, August 10-13, 2015, Sydney, NSW, Australia.
    Abstract / PDF [611K]

  • AUSPICE: Automated Safety Property Verification for Unmodified Executables. Jiaqi Tan, Hui Jun Tay, Rajeev Gandhi, and Priya Narasimhan. In 7th Working Conference on Verified Software: Theories, Tools, and Experiments (VSTTE), July 2015.
    Abstract / PDF [390K]

  • WARM: Improving NAND Flash Memory Lifetime with Write-hotness Aware Retention Management. Yixin Luo, Yu Cai, Saugata Ghose, Jongmoo Choi, Onur Mutlu.MSST 2015: 31st International Conference on Massive Storage Systems and Technologies, Jun 1, 2015 - Jun 5, 2015, Santa Clara, CA.
    Abstract / PDF [1.5M]

  • Architecting to Achieve a Billion Requests Per Second Throughput on a Single Key-Value Store Server Platform. Sheng Li, Hyeontaek Lim, Victor Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey. In Proceedings of the 42nd International Symposium on Computer Architecture (ISCA 2015), Portland, OR, June 2015. Fast-tracked to Transactions on Computer Systems (TOCS).
    Abstract / PDF [350K]

  • Reducing Latency via Redundant Requests: Exact Analysis. Kristen Gardner, Sam Zbarsky, Sherwin Doroudi, Mor Harchol-Balter, Esa Hyytia, Alan Scheller-Wolf. Proceedings of ACM Sigmetrics/Performance 2015 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 15), Portland, OR. June 2015.
    Abstract / PDF [725K]

  • A Case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling Efficient Data Compression. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita Das, Mahmut Kandemir, Todd C. Mowry, Onur Mutlu. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [1M]

  • Page Overlays: An Enhanced Virtual Memory Framework to Enable Fine-grained Memory Management. Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul Chilimbi. Proceedings of the 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, June 2015.
    Abstract / PDF [2.1M]

  • SMPFRAME: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Hoy, Seunghak Lee Xun Zheng, Wei Dai, Garth A. Gibson, Eric Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-103, May 2015.
    Abstract / PDF [1.57M]

  • PocketTrend: Timely Identification and Delivery of Trending Search Content to Mobile Users. Gennady Pekhimenko, Dimitrios Lymberopoulos, Oriana Riva, Karin Strauss, Doug Burger. Proceedings of the 24th International World Wide Web Conference (WWW), Florence, Italy, May 2015.
    Abstract / PDF [504K]

  • Optimal Scheduling for Jobs with Progressive Deadlines. Kristen Gardner, Sem Borst, Mor Harchol-Balter. IEEE INFOCOM 15, Hong Kong, April, 2015.
    Abstract / PDF [558K]

  • Using Data Transformations for Low-latency Time Series Analysis. Henggang Cui, Kimberly Keeton, Indrajit Roy Krishnamurthy Viswanathan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-106. April 2015. Extended version of the 2015 SoCC paper.
    Abstract / PDF [925K]

  • Caveat-Scriptor: Write Anywhere Shingled Disks. Saurabh Kadekodi, Swapnil Pimpale, Garth A. Gibson. Proc. Of the Seventh USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’15), Santa Clara, CA, July 2015. Expanded paper available: Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-101.
    Abstract / PDF [3.4M]

  • BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed. D. Van Aken, D. E. Difallah, A. Pavlo, C. Curino, and P. Cudré-Mauroux. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, 2015, pp. 1069-1073.
    Abstract / PDF [1.2M]

  • Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems. Joy Arulraj, Andrew Pavlo, Subramanya R. Dulloor. Proceedings ACM SIGMOD, Melbourne, Victoria, Australia, May 31-June 4, 2015.
    Abstract / PDF [1M]

  • Raising the Bar for Using GPUs in Software Packet Processing. Anuj Kalia, Dong Zhou, Michael Kaminsky, David G. Andersen. 12th Usenix Symposium on Networked Systems Design (NSDI'15). May 4-6, 2015, Oakland, CA.
    Abstract / PDF [386K]

  • Efficient Hypervisor Based Malware Detection. Peter Friedrich Klemperer. Ph.D. Dissertation, Carnegie Mellon University, Electrical and Computer Engineering, May 2015.
    Abstract / PDF [1.3M]

  • Managed Communication and Consistency for Fast Data-Parallel Iterative Analytics. Jinliang Wei, Wei Dai, Aurick Qiao, Qirong Ho*, Henggang Cui, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-105. April 2015.
    Abstract / PDF [2.62M]

  • ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems. Lin Xiao, Kai Ren, Qing Zheng, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-15-104, April 2015.
    Abstract / PDF [696K]

  • Solving the Straggler Problem for Iterative Convergent Parallel ML. Aaron Harlap, Henggang Cui, Wei Dai, Jinliang Wei Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-15-102, April 2015.
    Abstract / PDF [519K]

  • A Cloud Computing Course: From Systems To Services. M. Suhail Rehman, Jason Boles, Mohammad Hammoud, Majd F. Sakr. Proceedings of the 46th ACM Special Interest Group on Computer Science Education Conference (SIGCSE 2015), Kansas City, USA, March 2015.
    Abstract / PDF [356K]

  • Exploiting Compressed Block Size as an Indicator of Future Reuse. Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip P. Gibbons, Michael A. Kozuch, and Todd C. Mowry. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [2.4M]

  • Having Your Cake and Eating It Too: Jointly Optimal Erasure Codes for I/O, Storage, and Network-bandwidth. KV Rashmi, Preetum Nakkiran, Jingyan Wang, Nihar B. Shah & Kannan Ramchandran. USENIX FAST, Feb 2015, Santa Clara, CA. Best paper.
    Abstract / PDF [560K]

  • Data Retention in MLC NAND Flash Memory: Characterization, Optimization and Recovery. Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu. HPCA-21, February 7-11, 2015 — Best Paper Runner Up.
    Abstract / PDF [1.6M]

  • Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case. Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Khan, Vivek Seshadri, Kevin Chang, Onur Mutlu. Proceedings of the 21st International Symposium on High-Performance Computer Architecture (HPCA), Bay Area, CA, February 2015.
    Abstract / PDF [1.67M]

  • High-Performance Distributed ML at Scale through Parameter Server Consistency Models. Wei Dai, Abhimanu Kumar, Jinliang Wei, Qirong Ho, Garth A. Gibson, Eric P. Xing. 29th AAAI Conf. on Artificial Intelligence (AAAI-15), Jan 25-29, 2015, Austin, Texas.
    Abstract / PDF [733K]

  • Mitigating Prefetcher-Caused Pollution Using Informed Caching Policies for Prefetched Blocks Vivek Seshadri, Samihan Yedkar, Hongyi Xin, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. ACM Transactions on Architecture and Code Optimization (TACO), Volume 11 Issue 4, January 2015, Article No. 51.
    Abstract / PDF [1.1M]

  • Research Problems and Opportunities in Memory Systems. Onur Mutlu, Lavanya Subramanian. Invited Article in Supercomputing Frontiers and Innovations (SUPERFRI), 2015.
    Abstract / PDF [1.72M]

  • The Main Memory System: Challenges and Opportunities. Onur Mutlu, Justin Meza, Lavanya Subramanian. Invited Article in Communications of the Korean Institute of Information Scientists and Engineers (KIISE), 2015.
    Abstract / PDF [813K]

  • Main Memory Scaling: Challenges and Solution Directions. Onur Mutlu. Invited Book Chapter in More than Moore Technologies for Next Generation Computer Design, pp. 127-153, Springer, 2015.
    Abstract / PDF [1.02M]

  • Toggle-Aware Compression for GPUs. Gennady Pekhimenko, Evgeny Bolotin, Mike O'Connor, Onur Mutlu, Todd C. Mowry, Stephen W. Keckler. IEEE Computer Architecture Letters (CAL).
    Abstract / PDF [346K]

2014

  • Trading Freshness for Performance in Distributed Systems. James Cipar. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-14-144. December 2014.
    Abstract / PDF [1.82M]

  • Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories. Hanbin Yoon, Justin Meza, Naveen Mural Imanohar, Norman P. Jouppi, Onur Mutlu. ACM Transactions on Architecture and Code Optimization, Vol. 11, No. 4, Article 40, December 2014.
    Abstract / PDF [1.06M]

  • A Comparative Study of Baremetal Provisioning Frameworks. Ashok Chandrasekar, Garth A. Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-14-109, December 2014.
    Abstract / PDF [447K]

  • Reducing Replication Bandwidth for Distributed Document Databases. Lianghong Xu, Andrew Pavlo, Sudipta Sengupta Jin Li, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-108. December 2014.
    Abstract / PDF [646K]

  • STOVE: Strict, Observable, Verifiable Data and Execution Models for Untrusted Applications. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014 (Doctoral Symposium), pp.644,649, 15-18 Dec. 2014.
    Abstract / PDF [541K]

  • STOVEPipe: Observable Access Control of User Data for Untrusted Applications on Mobile Devices. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. Poster at the IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), 2014, 15-18 Dec. 2014.
    Abstract / PDF [149K]

  • Cuckoo Filter: Practically Better Than Bloom. Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher. Proceedings of CoNEXT (CoNEXT’14), December 2014.
    Abstract / PDF [343K]

  • On Model Parallelization and Scheduling Strategies for Distributed Machine Learning. S. Lee, J. K. Kim, X. Zheng, Q. Ho, G. A. Gibson, E. P. Xing. Proceedings of 2014 Neural Information Processing Systems (NIPS’14), December 2014.
    Abstract / PDF [336K]

  • Managing GPU Concurrency in Heterogeneous Architectures. Onur Kayiran, Nachiappan Chidambaram Nachiappan, Adwait Jog, Rachata Ausavarungnirun, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, and Chita R. Das. Proceedings of 47th International Symposium on Microarchitecture (MICRO’14), December 2014.
    Abstract / PDF [2.38M]

  • FIRM: Fair and High-Performance Memory Control for Persistent Memory Systems. Jishen Zhao, Onur Mutlu, Yuan Xie. Proceedings of the 47th International Symposium on Microarchitecture (MICRO), Cambridge, UK, December 2014.
    Abstract / PDF [626K]

  • Exploiting Iterative-ness for Parallel ML Computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Greg R. Ganger, Phil B. Gibbons, Garth A. Gibson, Eric P. Xing. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. Supersedes Carnegie Mellon University Parallel Data Technical Report CMU-PDL-14-107.
    Abstract / PDF [609K]

  • PriorityMeister: Tail Latency QoS for Shared Networked Storage. Timothy Zhu, Alexey Tumanov, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014.
    Abstract / PDF [940K]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. BEST PAPER AWARD!
    Abstract / PDF [287K]

  • IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson. ACM/IEEE Int'l Conf. for High Performance Computing, Networking, Storage and Analysis (SC'14), November 16-21, 2014, New Orleans, LA. BEST PAPER AWARD!
    Abstract / PDF [939K] / Slides [1M]

  • BatchFS: Scaling the File System Control Plane with Client-Funded Metadata Servers. Qing Zheng, Kai Ren, Garth A. Gibson. Proceedings of the 9th international Petascale Data Storage Workshop (PDSW '14) held in conjunction with Supercomputing '14. November 16, 2014, New Orleans, LA.
    Abstract / PDF [651K]

  • Cloudlets: at the Leading Edge of Mobile-Cloud Convergence. M. Satyanarayanan, Z. Chen, K. Ha, W. Hu, W. Richter, P. Pillai. Proceedings of MobiCASE 2014: Sixth International Conference on Mobile Computing, Applications and Services, Austin, TX, November 2014.
    Abstract / PDF [859K]

  • A Brief History of Cloud Offload. M. Satyanarayanan. GetMobile, Volume 18, Issue 4, October 2014.
    Abstract / PDF [360K]

  • The Heterogeneous Block Architecture. Chris Fallin, Chris Wilkerson, Onur Mutlu. Proceedings of 32nd IEEE International Conference on Computer Design (ICCD’14), October 2014.
    Abstract / PDF [308K]

  • Fast and Accurate Mapping of Complete Genomics Reads. Donghyuk Lee, Farhad Hormozdiari, Hongyi Xin, Faraz Hach, Onur Mutlu, Can Alkan. Methods, Elsevier, October 2014.
    Abstract / PDF [1.25M]

  • Design and Evaluation of Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Chang, Greg Nazario, Reetuparna Das, Gabriel Loh, Onur Mutlu. Proceedings of the 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD’14), October 2014.
    Abstract / PDF [325K]

  • Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, And Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. ACM Transactions on Storage, Vol. 10, No. 4, Article 16, Publication date: October 2014.
    Abstact / PDF [1.34M]

  • Loose-Ordering Consistency for Persistent Memory. Youyou Lu, Jiwu Shu, Long Sun, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [389K]

  • The Blacklisting Memory Scheduler: Achieving High Performance and Fairness at Low Cost. Lavanya Subramanian, Donghyuk Lee, Vivek Seshadri, Harsha Rastogi, Onur Mutlu. Proceedings of the 32nd IEEE International Conference on Computer Design (ICCD), Seoul, South Korea, October 2014.
    Abstract / PDF [240K]

  • Value Driven Load Balancing. Sherwin Doroudi, Esa Hyytia, Mor Harchol-Balter. Performance Evaluation, vol. 79, September 2014.
    Abstract / PDF [258K]

  • Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

  • Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David G. Andersen. ACM SIGCOMM 2014. Chicago, Illinois, August 17-22, 2014. Supersedes CMU-PDL-14-106, June 2014.
    Abstract / PDF [462K]

  • Towards Secure Execution of Untrusted Code for Mobile Edge-Clouds. Jiaqi Tan, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. Poster at 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [116K]

  • CHIPS: Content-based Heuristics for Improving Photo Privacy for Smartphones. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [1.4M]

  • Will They Blend?: Exploring Big Data Computation atop Traditional HPC NAS Storage. Ellis H. Wilson III, Mahmut T. Kandemir, Garth A. Gibson. The 34th International Conference on Distributed Computing Systems, ICDCS 2014, June 30 - July 3, 2014, Madrid, Spain.
    Abstract / PDF [332K]

  • Towards Wearable Cognitive Assistance. Kiryong Ha, Zhuo Chen, Wenlu Hu, Wolfgang Richter, Padmanabhan Pillai, Mahadev Satyanarayanan. Proceedings of the 12th ACM International Conference on Mobile Computing, Systems and Services (MobiSys’14), June 2014.
    Abstract / PDF [1.54M]

  • The Efficacy of Error Mitigation Techniques for DRAM Retention Failures: A Comparative Experimental Study. Samira Khan, Donghyuk Lee, Yoongu Kim, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu. Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’14), June 2014.
    Abstract / PDF [8M]

  • Exact Analysis of the M/M/k/setup Class of Markov Chains via Recursive Renewal Reward. Anshul Gandhi, Sherwin Doroudi, Mor Harchol-Balter, Alan Scheller-Wolf. Queueing Systems: Theory and Applications vol. 77, no. 2, 2014, pp. 177-209. June 2014.
    Abstract / PDF [4K]

  • Characterizing Application Memory Error Vulnerability to Optimize Datacenter Cost via Heterogeneous- Reliability Memory. Yixin Luo, Sriram Govindan, Bikash Sharma, Mark Santaniello, Justin Meza, Aman Kansal, Jie Liu, Badriddine Khessib, Kushagra Vaid, Onur Mutlu Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Atlanta, GA, June 2014.
    Abstract / PDF [1.58]

  • Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 USENIX Annual Technical Conference (ATC'14). June 19-20, 2014. Philadelphia, PA. Supersedes CMU-PDL-14-101.
    Abstract / PDF [731K]

  • The Dirty-Block Index. Vivek Seshadri, Abhishek Bhowmick, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. 41st International Symposium on Computer Architecture, June, 2014.
    Abstract / PDF [2.32M]

  • A Tag-Based, Logical Access-Control Framework for Personal File Sharing. Michelle L. Mazurek. Carnegie Mellon University ECE Ph.D. Dissertation, May 2014.
    Abstract / PDF [1.6M]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-105. May 2014.
    Abstract / PDF [444K]

  • Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-103. May 2014.
    Abstract / PDF [763K]

  • Comparing Performance of Different Cleaning Algorithms for SMR Disks. Mukul Kumar Singh. M.S. Thesis: Master of Science in Information Networking, April 2014.
    Abstract / PDF [623K]

  • Algorithmic Improvements for Fast Concurrent Cuckoo Hashing. Xiaozhou Li, David G. Andersen, Michael Kaminsky, Michael J. Freedman. Proceedings of the European Conference on Computer Systems (EuroSys '14), April 2014.
    Abstract / PDF [4.3M]

  • MICA: A Holistic Approach to Fast In-Memory Key-Value Storage. Hyeontaek Lim, Dongsu Han, David G. Andersen, Michael Kaminsky. 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI'14), April 2014.
    Abstract / PDF [1.36M]

  • Bounding Memory Interference Delay in COTS-based Multi-Core Systems. Hyoseung Kim, Dionisio de Niz, Björn Andersson, Mark Klein, Onur Mutlu, Ragunathan (Raj) Rajkumar. Proceedings of the 20th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), Berlin, Germany, April 2014.
    Abstract / PDF [2.5M]

  • Memory Systems. Yoongu Kim, Onur Mutlu. Invited Book Chapter in Computing Handbook, Third Edition: Computer Science and Software Engineering, CRC Press, April 2014.
    Abstract / PDF [453K]

  • So, You Want To Trace Your Distributed System? Key Design Insights from Years of Practical Experience. Raja R. Sambasivan, Rodrigo Fonseca, Ilari Shafer, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-102, April 2014.
    Abstract / PDF [870K]

  • Agentless Cloud-wide Streaming of Guest File System Updates. Wolfgang Richter, Canturk Isci, Jan Harkes, Benjamin Gilbert, Vasanth Bala, Mahadev Satyanarayanan. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. The Second IEEE Conference on Cloud Engineering (IC2E'14), March 2014. Best Paper.
    Abstract / PDF [978K]

  • Improving Cache Performance by Exploiting Read-Write Disparity. Samira Khan, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu, Daniel Jimenez. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA), Orlando, FL, February 2014. Best paper session.
    Abstract / PDF [355K]

  • Improving DRAM Performance by Parallelizing Refreshes with Accesses. Kevin Chang, Donghyuk Lee, Zeshan Chishti, Chris Wilkerson, Alaa Alameldeen, Yoongu Kim, Onur Mutlu. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA'14), February 2014.
    Abstract / PDF [2.86M]

  • QuiltView: Glass-Sourced Video for Google Maps Queries. Zhuo Chen, Wenlu Hu, Kiryong Ha, Jan Harkes, Benjamin Gilbert, Jason Hong, Asim Smailagic, Dan Siewiorek, Mahadev Satyanarayanan. The 15th International Workshop on Mobile Computing Systems and Applications (HotMobile'14), February 2014.
    Abstract / PDF [4.51M]
  • SpringFS: Bridging Agility and Performance in Elastic Distributed Storage. Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. 12th USENIX Conference on File and Storage Technologies (FAST '14), Santa Clara, CA, February 17–20, 2014.
    Abstract / PDF [319K]

  • Toward Strong, Usable Access Control for Shared Distributed Data. Michelle L. Mazurek, Yuan Liang, William Melicher, Manya Sleeper, Lujo Bauer, Gregory R. Ganger, Nitin Gupta, and Michael K. Reiter. In FAST 2014: USENIX Conference on File and Storage Technologies, February 2014.
    Abstract / PDF [395K]

2013

  • Scalable, High Performance Ethernet Forwarding with CuckooSwitch. Dong Zhou, Bin Fan, Hyeontaek Lim, David G. Andersen, Michael Kaminsky. Proc. 9th International Conference on emerging Networking EXperiments and Technologies (CoNEXT), Dec. 2013.
    Abstract / PDF [479K]

  • RowClone: Fast and Energy-Efficient In-DRAM Bulk Data Copy and Initialization. Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, and Todd C. Mowry, 46th IEEE/ACM International Symposium on Microarchitecture (MICRO-46), December 2013.
    Abstract / PDF [2.42M]

  • Tetrisched: Space-Time Scheduling for Heterogeneous Datacenters. Alexey Tumanov, Timothy Zhu, Michael A. Kozuch†, Mor Harchol-Balter, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-112, December, 2013.
    Abstract / PDF [716K]

  • Linearly Compressed Pages: A Low-Complexity, Low-Latency Main Memory Compression Framework. Gennady Pekhimenko, Vivek Seshadri, Yoongu Kim, Hongyi Xin, Onur Mutlu, Philip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 46th International Symposium on Microarchitecture (MICRO), Davis, CA, December 2013.
    Abstract / PDF [525K]

  • More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server. Qirong Ho, James Cipar, Henggang Cui, Jin Kyu Kim, Seunghak Lee, Phillip B. Gibbons, Garth A. Gibson, Gregory R. Ganger, Eric P. Xing. Conference on Neural Information Processing Systems (NIPS '13). Dec 5-8, 2013, Lake Tahoe, NV.
    Abstract / PDF [2.64M] / Appendix

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. IEEE Transactions on Visualization and Computer Graphics (Proceedings Information Visualization 2013), vol. 19, no. 12, Dec. 2013.
    Abstract / PDF [1.9M] / TRAILER VIDEO [5.6M] / VIDEO [17.9M]

  • Measuring Password Guessability for an Entire University. Michelle L. Mazurek, Saranga Komanduri, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Patrick Gage Kelley, Richard Shay, Blase Ur. In CCS 2013: ACM Conference on Computer and Communications Security, November 2013.
    Abstract / PDF [2.19M]

  • There Is More Consensus in Egalitarian Parliaments. Iulian Moraru, David G. Andersen, Michael Kaminsky. Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP'13), November 3-6, 2013, Nemacolin Woodlands Resort, Farmington, PA.
    Abstract / PDF [713K]

  • Making Problem Diagnosis Work for Large-Scale, Production Storage Systems. Michael P. Kasick, Priya Narasimhan, Kevin Harms. Proceedings of the 27th Large Installation System Administration Conference (LISA '13), Washington, DC, November 2013.
    Abstract / PDF [2.23M]

  • Consistent, Durable, and Safe Memory Management for Byte-addressable Non Volatile Main Memory. Iulian Moraru, David G. Andersen, Michael Kaminsky, Niraj Tolia, Nathan Binkert, Parthasarathy Ranganathan. TRIOS: Conference on Timely Results in Operating Systems. Held in conjunction with SOSP '13. Farmington, PA, November 3, 2013.
    Abstract / PDF [967K]

  • LightTx: A Lightweight Transactional Design in Flash-based SSDs to Support Flexible Transactions. Youyou Lu, Jiwu Shuy, Jia Guo, Shuai Li, Onur Mutlu. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [262K]

  • Program Interference in MLC NAND Flash Memory: Characterization, Modeling, and Mitigation. Yu Cai, Onur Mutlu, Erich F. Haratsch, Ken Mai. The 32nd IEEE International Conference on Computer Design (ICCD13). October 6-9, 2013, Ashville, NC, USA.
    Abstract / PDF [1.18M]

  • Memory-Efficient GroupBy-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. 2013 ACM Symposium on Cloud Computing (SoCC'13), Oct. 01-03 2013, Santa Clara, CA, USA.
    Abstract / PDF [944K]

  • Challenges in Security and Privacy for Mobile Edge-Clouds. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-113. October, 2013.
    Abstract / PDF [212K]

  • Memory Scaling: A Systems Architecture Perspective. Onur Mutlu. MemCon 2013 (MEMCON), Santa Clara, CA, August 2013.
    Abstract / PDF [114K]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-111. August 2013.
    Abstract / PDF [2.3M]

  • Hadoop's Adolescence: An Analysis of Hadoop Usage in Scientific Workloads. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Very Large Data Bases (VLDB), August 2013.
    Abstract / PDF [986K]

  • Toward Strong, Usable Access Control for Shared Distributed Data. Michelle L. Mazurek, Yuan Liang, Manya Sleeper, Lujo Bauer, Gregory R. Ganger, Nitin Gupta, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-110. June 2013.
    Abstract / PDF [1M]

  • Just-in-Time Provisioning for Cyber Foraging. Kiryong Ha, Padmanabhan Pillai, Wolfgang Richter, Yoshihisa Abe, Mahadev Satyanarayanan The 11th International Conference on Mobile Systems, Applications, and Services (MobiSys'13), June 25–28, 2013, Taipei, Taiwan.
    Abstract / PDF [2.29M]

  • An Experimental Study of Data Retention Behavior in Modern DRAM Devices: Implications for Retention Time Profiling Mechanisms. Jamie Liu, Ben Jaiyen, Yoongu Kim, Chris Wilkerson, Onur Mutlu. ACM/IEEE International Symposium on Computer Architecture (ISCA '13), June 23-27, 2013, Tel-Aviv, Israel.
    Abstract / PDF [704K]

  • A Case for Efficient Hardware/Software Cooperative Management of Storage and Memory. Justin Meza, Yixin Luo, Samira Khan, Jishen Zhao, Yuan Xie, Onur Mutlu. Fifth Workshop on Energy-Efficient Design (WEED 2013). Held in conjunction with the 2013 International Symposium on Computer Architecture (ISCA-40). June 24, 2013, Tel-Aviv, Israel.
    Abstract / PDF [667K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth A. Gibson. 2013 USENIX Annual Technical Conference, June 26-28, 2013, San Jose, CA.
    Abstract / PDF [867K]

  • PRObE: A Thousand-Node Experimental Cluster for Computer Systems Research. Garth A. Gibson, Gary Grider, Andree Jacobson, Wyatt Lloyd. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.5M]

  • Shingled Magnetic Recording: Areal Density Increase Requires New Data Management. Tim Feldman, Garth A. Gibson. USENIX ;login:, v 38, n 3, June 2013.
    Abstract / PDF [1.17M]

  • I/O Acceleration with Pattern Detection. Jun He, John Bent, Aaron Torres, Gary Grider, Garth A. Gibson, Carlos Maltzahn, Xian-He Sun. The 22nd Int. ACM Symposium on High Performance Parallel and Distributed Computing (HPDC'13), New York City, June 17-21, 2013.
    Abstract / PDF [458K]

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin, Gregory R. Ganger. Proceedings of the ACM Int'l Conference on Supercomputing (ICS), Eugene, OR, June 2013.
    Abstract / PDF [677K]

  • Specialized Storage for Big Numeric Time Series. Ilari Shafer, Raja R. Sambasivan, Anthony Rowe, Gregory R. Ganger. Proceedings of the 5th Workshop on Hot Topics in Storage and File Systems, June 2013.
    Abstract / PDF [161K]

  • Automated Diagnosis of Chronic Performance Problems in Production Systems. Soila P. Kavulya. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-109, May 2013.
    Abstract / PDF [12.6M]

  • Diagnosing Performance Changes in Distributed Systems by Comparing Request Flows. Raja R. Sambasivan. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-105, May 2013.
    Abstract / PDF [3.9M]

  • Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth A. Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
    Abstract / PDF [174K]

  • Building a High-Performance Metadata Service by Reusing Scalable I/O Bandwidth. Kai Ren, Swapnil Patil, Kartik Kulkarni, Adit Madan, Garth A. Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-107, May 2013.
    Abstract / PDF [690K]

  • Evaluating STT-RAM as an Energy-Efficient Main Memory Alternative. Emre Kultursay, Mahmut Kandemir, Anand Sivasubramaniam, and Onur Mutlu. 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2013), April 21-23, 2013, Austin, TX.
    Abstract / PDF [1.83M]

  • PETAL: Preset Encoding Table Information Leakage. Jiaqi Tan, Jayvardhan Nahata. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-106, April 2013.
    Abstract / PDF [294K]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. USENIX ;login, 38(2), April 2013.
    Abstract / PDF [961K]

  • Asymmetry-aware Execution Placement on Manycore Chips. Alexey Tumanov, Joshua Wise, Onur Mutlu, Gregory R. Ganger. In Proc. of the 3rd Workshop on Systems for Future Multicore Architectures (SFMA'13), EuroSys'13, Apr. 14-17, 2013, Prague, Czech Republic.
    Abstract / PDF [703K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-104 (supersedes CMU-PDL-12-102), April 2013.
    Abstract / PDF [1.93M]

  • The Impact of Length and Mathematical Operators on the Usability and Security of System-assigned One-time PINs. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Richard Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In 2013 Workshop on Usable Security (USEC), April 2013.
    Abstract / PDF [802K]

  • MemC3: Compact and Concurrent Memcache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen and Michael Kaminsky. In Proc. 10th USENIX NSDI, Apr 2013. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012. Source code: https://github.com/efficient/libcuckoo
    Abstract / PDF [280K]

  • Threshold Voltage Distribution in MLC NAND Flash Memory: Characterization, Analysis, and Modeling. Yu Cai, Erich F. Haratsch, Onur Mutlu and Ken Mai. Design Automation and Test in Europe (DATE 2013), Mar 19-22, 2013, Grenoble, France.
    Abstract / PDF [1.44M]

  • Application-to-Core Mapping Policies to Reduce Memory System Interference in Multi-Core Systems. Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [623K]

  • MISE: Providing Performance Predictability and Improving Fairness in Shared Main Memory Systems. Lavanya Subramanian, Vivek Seshadri, Yoongu Kim, Ben Jaiyen, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
    Abstract / PDF [607K]

  • Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture. Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu. Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA), Shenzhen China, February 2013.
    Abstract / PDF [3.17M]

  • Practical Batch-Updatable External Hashing with Sorting. Hyeontaek Lim and David G. Andersen and Michael Kaminsky. In Proc. Meeting on Algorithm Engineering and Experiments (ALENEX), Jan 2013.
    Abstract / PDF [536K]

  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-102, January 2013. Revised version of CMU-PDL-12-110.
    Abstract / PDF [798K]

  • Giga+TableFS on PanFS: Scaling Metadata Performance on Cluster File Systems. Kartik Kulkarni, Kai Ren, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-101, January 2013.
    Abstract / PDF [679K]

  • Error Analysis and Retention-Aware Error Management for NAND Flash Memory. Yu Cai, Gulay Yalcin, Onur Mutlu, Erich F. Haratsch, Adrian Cristal, Osman Unsal, Ken Mai. Intel Technology Journal (ITJ) Special. Issue on Memory Resiliency, 2013.
    Abstract / PDF [270K]

2012

  • vQuery: A Platform for Connecting Configuration and Performance. Ilari Shafer, Snorri Gylfason, Gregory R. Ganger. vwWare Labs Technical Report, Palo Alto, CA. December 2012.
    Abstract / PDF [288K]

  • Helping Users Create Better Passwords. Blase Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman, and Julio López. USENIX ;login:, 37(6), December 2012.
    Abstract / PDF [970K]

  • Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. Elmer Garduno, Soila P. Kavulya, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. 26th Usenix Large Installation System Administration Conference (LISA'12), Dec. 9-14, San Diego, CA.
    Abstract / PDF [913K]

  • Failure Diagnosis of Complex Systems. Soila P. Kavulya, Kaustubh Joshi (AT&T), Felicita Di Giandomenico (ISTI-CNR, Pisa, Italy), Priya Narasimhan. Chapter in "Resilience Assessment and Evaluation". Editors. Katinka Wolter, Alberto Avritzer, Marco Vieira, Aad van Moorsel. Springer Verlag, December 2012.
    Abstract / PDF [288K]

  • Runtime Estimation and Resource Allocation for Concurrency Testing. Jiri Simsa, Randy Bryant, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-113. December 2012.
    Abstract / PDF [490K]

  • AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael Kozuch. Transactions on Computer Systems, Volume 30, Issue 4, Article 14. November 2012.
    Abstract / PDF [1.77M]

  • TABLEFS: Embedding a NoSQL Database inside the Local File System. Ren, Kai, Garth A. Gibson. 1st Storage System, Hard Disk and Solid State Technologies Summit, IEEE Asia-Pacific Magnetic Recording Conference (APMRC), November 2012, Singapore.
    Abstract / PDF [399K]

  • MemC3: Compact and Concurrent MemCache with Dumber Caching and Smarter Hashing. Bin Fan, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-116. November 2012.
    Abstract / PDF [824K]

  • HPC Computation on Hadoop Storage with PLFS. Chuck Cranor, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-115. Nov. 2012.
    Abstract / PDF [170K]

  • A Case for Scaling HPC Metadata Performance through De-specialization. Swapnil Patil, Kai Ren, Garth A. Gibson. 7th Petascale Data Storage Workshop held in conjunction with Supercomputing '12, November 12, 2012. Salt Lake City, UT. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-111, November 2012.
    Abstract / PDF [512K]

  • JackRabbit: Improved Agility In Elastic Distributed Storage. James Cipar, Lianghong Xu, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-112, October 2012.
    Abstract / PDF [395K]

  • HAT: Heterogeneous Adaptive Throttling for On-Chip Networks. Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu. SBAC-PAD 2012, New York, NY, October 24-26, 2012.
    Abstract / PDF [259K]

  • alsched: Algebraic Scheduling of Mixed Workloads in Heterogeneous Clouds. Alexey Tumanov, James Cipar, Michael A. Kozuch, Gregory R. Ganger. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA.
    Abstract / PDF [379K]

  • Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis. Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. 3rd ACM Symposium on Cloud Computing. October 14th-17th, 2012 - San Jose, CA. 2021 SoCC Test of Time Award!
    Abstract / PDF [3.1M]

  • Using Vector Interfaces to Deliver Millions of IOPS from a Networked Key-value Storage Server. Vijay Vasudevan, Michael Kaminsky, David G. Andersen. SOCC'12, October 14-17, 2012, San Jose, CA USA.
    Abstract / PDF [648K]

  • Scalable Dynamic Partial Order Reduction. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey. Third Int. Conf. on Runtime Verification (RV2012), 25-28 September 2012, Istanbul, Turkey.
    Abstract / PDF [331K]

  • The Evicted-Address Filter: A Unified Mechanism to Address Both Cache Pollution and Thrashing. Vivek Seshadri, Onur Mutlu, Michael A Kozuch, Todd C Mowry. PACT'12, September 19–23, 2012, Minneapolis, Minnesota, USA.
    Abstract / PDF [2M]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-110, September 2012.
    Abstract / PDF [1.43M]

  • Row Buffer Locality Aware Caching Policies for Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael A. Harding, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Montreal, Quebec, Canada, September 2012. Best paper award in Computer Systems and Applications track.
    Abstract / PDF [577K]

  • A Case for Small Row Buffers in Non-Volatile Main Memories. Justin Meza, Jing Li, Onur Mutlu. Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Poster Session, Montreal, Quebec, Canada, September 2012.
    Abstract / PDF [172K]

  • A Proof of Correctness for Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-109. September 2012. Superseded by CMU-PDL-13-111, August 2013.
    Abstract / PDF [2.3M]

  • Indexing and Fast Near-Matching of Billions of Astronomical Objects. Bin Fu, Eugene Fink, Garth A. Gibson and Jaime Carbonell. In Proceedings of the Fourth Workshop on Interfaces and Architecture for Scientific Data Storage, 2012 (IASDS12). September 24, 2012, Beijing, China.
    Abstract / PDF [303K]

  • SOFTScale: Stealing Opportunistically For Transient Scaling. Anshul Gandhi, Timothy Zhu, Mor Harchol-Balter, Michael Kozuchy. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-111R, August 2012.
    Abstract / PDF [477K]

  • RainMon: An Integrated Approach to Mining Bursty Timeseries Monitoring Data. Ilari Shafer, Kai Ren, Vishnu Boddeti, Yashihisa Abe, Gregory R. Ganger, Christos Faloutsos. KDD'12, August 12–16, 2012, Beijing, China.
    Abstract / PDF [1.5M]

  • How Does Your Password Measure Up? The effect of strength meters on password creation. Blaser Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In the 2012 USENIX Security Symposium, August 2012.
    Abstract / PDF [1.2M]

  • On-Chip Networks from a Networking Perspective: Congestion and Scalability in Many-core Interconnects. George Nychis, Chris Fallin, Thomas Moscibroda, Onur Mutlu, Srinivasan Seshan.
    In SIGCOMM 2012, Helsinki, Finland, Aug 2012.
    Abstract / PDF [628K]

  • Light-weight Black-box Failure Detection for Distributed Systems. Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-107. July 2012
    Abstract / PDF [300K]

  • Correct Horse Battery Staple: Exploring the usability of system-assigned passphrases. Richard Shay, Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Blase Ur, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In SOUPS 2012: Symposium on Usable Privacy and Security, July 2012.
    Abstract / PDF [549K]

  • Egalitarian Paxos. Iulian Moraru, David G. Andersen, Michael Kaminsky. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-108. July 2012.
    Abstract / PDF [363K]

  • Saving Cash by Using Less Cache. Timothy Zhu, Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. 4th USENIX Workshop of Hot Topics in Cloud Computing (Hotcloud 2012). June 12-13, 2012, Boston, MA.
    Abstract / PDF [177K]

  • Hadoop's Adolescence: A Comparative Workload Analysis from Three Research Clusters. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-106. June 2012.
    Abstract / PDF [1.76M]

  • A Case for Exploiting Subarray-level Parallelism (SALP) in DRAM. Yoongu Kim, Vivek Seshadri, Donghyuk Lee, Jamie Liu, Onur Mutlu. Proceedings of the 39th International Symposium on Computer Architecture, June 2012.
    Abstract / PDF [927K]

  • RAIDR: Retention-Aware Intelligent DRAM Refresh. Jamie Liu, Ben Jaiyen, Richard Veras, Onur Mutlu. In Proceedings of the 39th International Symposium on Computer Architecture, Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [480K]

  • Staged Memory Scheduling: Achieving High Performance and Scalability in Heterogeneous Systems. Rachata Ausavarungnirun, Kevin Kai-Wei Chang, Lavanya Subramanian, Gabriel H. Loh, Onur Mutlu. The 39th International Symposium on Computer Architecture (ISCA), Portland, Oregon, June 9-13th, 2012.
    Abstract / PDF [700K]

  • Exact and Approximate Computation of a Histogram of Pairwise Distances between Astronomical Objects. Bin Fu, Eugene Fink, Garth A. Gibson and Jaime Carbonell. First Workshop on High Performance Computing in Astronomy (AstroHPC 2012), held in conjunction with the 21st International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC 2012), June 18-19, 2012, Delft, the Netherlands.
    Abstract / PDF [309K]

  • Automated Diagnosis without Predictability is a Recipe for Failure. Raja R. Sambasivan & Gregory R. Ganger. Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '12), June 12-13, 2012, Boston, MA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-101.
    Abstract / PDF [368K]

  • Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. Soila P. Kavulya, Scott Daniels (AT&T), Kautubh Joshi (AT&T), Matti Hiltunen (AT&T), Rajeev Gandhi, Priya Narasimhan.IEEE/IFIP Conference on Dependable Systems and Networks (DSN), June 2012.
    Abstract / PDF [859K]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek , Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. ACM Transactions on Storage, Vol. 8, No. 3, Article 39, May 2012.
    Abstract / PDF [518K]

  • MinBD: Minimally-Buffered Deflection Routing for Energy-Efficient Interconnect. Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, Onur Mutlu. In NOCS 2012, Lyngby, Denmark, May 2012. (One of five papers nominated for the Best Paper Award by the Program Committee.)
    Abstract / PDF [369K]

  • Shingled Magnetic Recording for Big Data Applications. Anand Suresh, Garth A. Gibson, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-105. May 2012.
    Abstract / PDF [561K]

  • SkyeFS: Distributed Directories using Giga+ and PVFS. Anthony Chivetta, Swapnil Patil & Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-104, May 2012.
    Abstract / PDF [398K]

  • Tag, You Can See It! Using tags for access control in photo sharing. Peter F. Klemperer, Yuan Liang, Michelle L. Mazurek, Manya Sleeper, Blase Ur, Lujo Bauer, Lorrie Faith Cranor, Nitin Gupta, and Michael K. Reiter. In CHI 2012: Conference on Human Factors in Computing Systems, May 2012.
    Abstract / PDF [560K]

  • Guess Again (and Again and Again): Measuring password strength by simulating password-cracking algorithms. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Rich Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Julio López. In the 2012 IEEE Symposium on Security and Privacy, May 2012.
    Abstract / PDF [2.8M]

  • Concurrent Systematic Testing at Scale. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-101. May 2012.
    Abstract / PDF [397K]

  • Landslide: Systematic Dynamic Race Detection in Kernel Space. Ben Blum. Carnegie Mellon University School of Computer Science MS Thesis CMU-CS-12-118. May 2012.
    Abstract / PDF [1.7M]

  • Towards Understanding Heterogeneous Clouds at Scale: Google Trace Analysis Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch. Intel Science and Technology Center for Cloud Computing Technical Report ISTC-CC-TR-12-101, April 27, 2012.
    Abstract / PDF [876K]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report. CMU-PDL-12-103 May 2012.
    Abstract / PDF [339K]

  • A Statistical Study for File System Meta Data On High Performance Computing Sites. Yifan Wang. M.S. Thesis, Information Networking Institute, Carnegie Mellon University. May 2012.
    Abstract / PDF [5.3M]

  • Enabling Efficient and Scalable Hybrid Memories Using Fine-Granularity DRAM Cache Management. Justin Meza, Jichuan Chang, HanBin Yoon, Onur Mutlu, Parthasarathy Ranganathan. IEEE Computer Architecture Letters (CAL), May 2012.
    Abstract / PDF [184K]

  • LazyBase: Trading Freshness for Performance in a Scalable Database. James Cipar, Gregory R. Ganger, Kimberly Keeton, Charles B. Morrey III, Craig A. N. Soules, Alistair Veitch. EuroSys 2012 April 10-13, 2012, Bern, Switzerland.
    Abstract / PDF [236K]

  • Bottleneck Identification and Scheduling in Multithreaded Applications. José A. Joao, M. Aater Suleman, Onur Mutlu, Yale N. Patt. Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), London, UK, March 2012.
    Abstract / PDF [828K]

  • Near-Real-Time Inference of File-Level Mutations from Virtual Disk Writes. Wolfgang Richter, Mahadev Satyanarayanan, Jan Harkes, Benjamin Gilbert. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-103. February 2012.
    Abstract / PDF [343K]

  • ZZFS: A Hybrid Device and Cloud File System for Spontaneous Users. Michelle L. Mazurek, Eno Thereska, Dinan Gundawardena, Richard Harper, James Scott. FAST 2012: USENIX Conference on File and Storage Technologies, February 2012.
    Abstract / PDF [567K]

  • Memory-Efficient Group-By-Aggregate using Compressed Buffer Trees. Hrishikesh Amur, Wolfgang Richter, David G. Andersen, Michael Kaminsky, Karsten Schwan, Athula Balachandran, Erik Zawadzki. Georgia Tech Center for Experimental Research in Computer Systems Technical Report GIT-CERCS-12-08.
    Abstract / PDF [450K]

2011

  • Active Disk Meets Flash: A Case for Intelligent SSDs. Sangyeun Cho, Chanik Park , Hyunok Oh, Sungchan Kim, Youngmin Yi and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-115. Dec. 2011.
    Abstract / PDF [989K]

  • Persistent, Protected and Cached: Building Blocks for Main Memory Data Stores. Iulian Moraru, David G. Andersen, Michael Kaminsky, Nathan Binkert, Niraj Tolia, Reinhard Munz,Parthasarathy Ranganathan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-114v2, Nov. 2012. Supersedes CMU-PDL-11-114. Dec. 2011.
    Abstract / PDF [1.0M]

  • A Cyber-Physical-System Approach to Data Center Modeling and Control for Energy Efficiency. Luca Parolini, Bruno Sinopoli, Bruce H. Krogh, Zhikui Wang. Proceedings of the IEEE, Special Issue on Cyber-Physical Systems, December 2011.
    Abstract / PDF [1.76M]

  • Reducing Memory Interference in Multicore Systems via Application-Aware Memory Channel Partitioning. Sai Prashanth Muralidhara, Lavanya Subramanian, Onur Mutlu, Mahmut Kandemir, Thomas Moscibroda. Proceedings of the 44th International Symposium on Microarchitecture
    (MICRO), Porto Alegre, Brazil, December 2011.
    Abstract / PDF [232K]

  • Understanding and Improving the Diagnostic Workflow of MapReduce Users. Jason D. Campbell (Intel Labs Pittsburgh), Arun B. Ganesan, Ben Gotow, Soila P. Kavulya, James Mulholland, Priya Narasimhan, Sriram Ramasubramanian, Mark Shuster, Jiaqi Tan (DSO National Laboratories, Singapore), ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, December 2011.
    Abstract / PDF [775K]

  • Efficient Exploratory Testing of Concurrent Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey (Google). Carnegie Mellon University Parallel Data Laboratory Techical Report CMU-PDL-11-113, November 2011.
    Abstract / PDF [786K]

  • On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS. Wittawat Tantisiriroj, Swapnil Patil, Garth A. Gibson, Seung Woo Son, Samuel J. Lang, Robert B. Ross. SC11, November 12-18, 2011, Seattle, Washington USA. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-108. April 2011.
    Abstract / PDF [459K]

  • The Case for Sleep States in Servers. Anshul Gandhi, Mor Harchol-Balter, Michael A. Kozuch. HotPower'11, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [621K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. SLAML 2011, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [500K]

  • DiskReduce: Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth A. Gibson. Carnegie Mellon Univsersity Parallel Data Laboratory Technical Report CMU-PDL-11-112, October, 2011.
    Abstract / PDF [897K]

  • SILT: A Memory-Efficient, High-Performance Key-Value Store. Hyeontaek Lim, Bin Fan, David Andersen and Michael Kaminsky. ACM Symposium on Operating Systems Principles (SOSP'11), Cascais, Portugal, October 2011.
    Abstract / PDF [1.15M]

  • Small Cache, Big Effect: Provable Load Balancing for Randomly Partitioned Cluster Services. Bin Fan, Hyeontaek Lim, David Andersen and Michael Kaminsky. ACM Symposium on Cloud Computing (SOCC'11), Cascais, Portugal, October, 2011.
    Abstract / PDF [336K]

  • Switching the Optical Divide: Fundamental Challenges for Hybrid Electrical/Optical Datacenter Networks. Hamid Hajabdolali Bazzaz, Malveeka Tewari, Guohui Wang, George Porter, T. S. Eugene Ng, David G. Andersen, Michael Kaminsky, Michael A. Kozuch, Amin Vahdat. Proc. 2nd ACM Symposium on Cloud Computing (SOCC), Oct 2011.
    Abstract / PDF [190K]

  • Don't Settle for Eventual: Scalable Causal Consistency for Wide-Area Storage with COPS.
    Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, David G. Andersen. Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP), Oct 2011.
    Abstract / PDF [689K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya (CMU), Kaustubh Joshi, Matti Hiltunen , Scott Daniels (AT&T Labs, Research), Rajeev Gandhi and Priya Narasimhan (CMU). Workshop on System Logs and the Application of Machine Learning Techniques (SLAML), Cascais, Portugal, October 2011.
    Abstract / PDF [524K]

  • Performance Insulation: More Predictable Shared Storage. Matthew Wachs. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-11-134. September 2011.
    Abstract / PDF [2.65M]

  • Row Buffer Locality-Aware Data Placement in Hybrid Memories. HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael Harding, Onur Mutlu. SAFARI Technical Report, TR-SAFARI-2011-005, Carnegie Mellon University, September 2011.
    Abstract / PDF [272K]

  • Improving Cache Performance Using Victim Tag Stores. Vivek Seshadri, Onur Mutlu, Todd Mowry, Michael A. Kozuch. SAFARI Technical Report, TR-SAFARI-2011-009, Carnegie Mellon University, September 2011.
    Abstract / PDF [242K]

  • ThermoCast: A Cyber-Physical Forecasting Model for Data Centers. Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, Christos Faloutsos. In KDD '11: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 21-24, San Diego, CA.
    Abstract / PDF [1.32M]

  • YCSB++: Benchmarking and Performance Debugging Advanced Features in Scalable Table Stores. Swapnil Patil, Milo Polte, Kai Ren, Wittawat Tantisiriroj, Lin Xiao, Julio Lopez, Garth A. Gibson, Adam Fuchs, Billie Rinaldi. Proc. of the 2nd ACM Symposium on Cloud Computing (SOCC '11), October 27–28, 2011, Cascais, Portugal. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-111, August 2011.
    Abstract / PDF [1.2M]

  • Minimizing Data Center SLA Violations and Power Consumption via Hybrid Resource Provisioning. Anshul Gandhi, Yuan Chen, Daniel Gmach, Martin Arlitt, Manish Marwah. 2nd IGCC 2011 (IEEE International Green Computing Conference 2011) July 25-28, 2011 Orlando, Florida, USA. -- BEST PAPER AWARD
    Abstract / PDF [503K]

  • End-to-end Tracing in HDFS. William Wang Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
    Abstract / PDF [489K]

  • dBug: Systematic Testing of Distributed and Multi-threaded Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson.18th International Workshop on Model Checking of Software (SPIN'11), Snowbird UT, July 2011.
    Abstract / PDF [149K]

  • Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, and Garth A. Gibson. Proceedings of the Twenty-Third Scientific and Statistical Database Management Conference (SSDBM 2011), 20-22 July 2011.
    Abstract / PDF [5.5M]

  • Distributed, Robust Auto-Scaling Policies for Power Management in Compute Intensive Server Farms. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael A. Kozuch. 5th International Open Cirrus Summit. June 01 – 03, 2011, Moscow, Russia.
    Abstract / PDF [317K]

  • Applying Idealized Lower-bound Runtime Models to Understand Inefficiencies in Data-intensive Computing (Extended Abstract). Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger: SIGMETRICS 2011: 125-126, San Jose, CA, June 7-11, 2011.
    Abstract / PDF [297K]

  • Privacy-Sensitive VM Retrospection. Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode, Nilton Bila, Eyal De Lara, Vas Bala, Mahadev Satyanarayanan. HotCloud 2011 3rd USENIX Workshop on Hot Topics in Cloud Computing. Portland, OR, June 14-17, 2011.
    Abstract / PDF [1.97M]

  • Six Degrees of Scientific Data: Reading Patterns for Extreme Scale Science IO. Lofstead, Jay, Milo Polte, Garth A. Gibson, Scott A. Klasky, Karsten Schwan, Ron Oldfield, Matthew Wolf, Qing Liu. 20th ACM Int. Symp. On High-Performance Parallel and Distributed Computing (HPDC'11), June 2011.
    Abstract / PDF [595K]

  • Memory Power Management via Dynamic Voltage/Frequency Scaling. Howard David, Chris Fallin, Eugene Gorbatov, Ulf R. Hanebutte, Onur Mutlu. Proceedings of the 8th International Conference on Autonomic Computing (ICAC), Karlsruhe, Germany, June 2011.
    Abstract / PDF [463K]

  • Time Series Clustering: Complex is Simpler! Lei Li, B. Aditya Prakash. In Proceedings of the 28th International Conference on Machine learning, June 28 - July 2, 2011, Bellevue, WA.
    Abstract / PDF [631K]

  • Diagnosis in Automotive Systems: A Survey. Patrick E. Lanigan, Soila Kavulya, Priya Narasimhan, Thomas E. Fuhrman, Mutasim A. Salman. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-110. June 2011.
    Abstract / PDF [369K]

  • Exertion-based Billing for Cloud Storage Access. Matthew Wachs, Lianghong Xu, Arkady Kanevsky, Gregory R. Ganger. Proceedings of the 3rd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '11). June 14-15, 2011, Portland, OR. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-105. March 2011.
    Abstract / PDF [65K]

  • Otus: Resource Attribution in Data-Intensive Clusters. Kai Ren, Julio López, Garth A. Gibson. MapReduce'11, June 8, 2011, San Jose, California, USA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-106, April 2011.
    Abstract / PDF [2.5M]

  • Exploring Reactive Access Control. Michelle L. Mazurek, Peter F. Klemperer, Richard Shay, Hassan Takabi, Lujo Bauer, Lorrie Faith Cranor. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
    Abstract / PDF [293k]

  • Of Passwords and People: Measuring the Effect of Password-Composition Policies. Saranga Komanduri, Richard Shay, Patrick Gage Kelley, Michelle L. Mazurek, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
    Abstract / PDF [405K]

  • Disks Are Like Snowflakes: No Two Are Alike. Elie Krevat, Joseph Tucek, Gregory R. Ganger. 13th Workshop on Hot Topics in Operating Systems (HotOS 2011), Napa Valley, CA. May 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-102, February 2011.
    Abstract / PDF [1.8M]

  • The Case for VOS: The Vector Operating System. Vijay Vasudevan, David Andersen, Michael Kaminsky. In 13th Workshop on Hot Topics in Operating Systems (HotOS 2011). May 2011.
    Abstract / PDF [430K]

  • Principles of Operation for Shingled Disk Devices. Garth A. Gibson, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-107. April 2011.
    Abstract / PDF [500K]

  • WindMine: Fast and Effective Mining of Web-click Sequences. Yasushi Sakurai, Lei Li, Yasuko Matsubara, Christos Faloutsos. 2011 SIAM International Conference on Data Mining, April 28-30, 2011, Mesa, AZ.
    Abstract / PDF [968K]

  • Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-109, April 2011.
    Abstract / PDF [787K]

  • Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-104. April 2011.
    Abstract / PDF [6.5M]

  • Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI'11). March 30 - April 1, 2011. Boston, MA.
    Abstract / PDF [388K]

  • Scale and Concurrency of GIGA+: File System Directories with Millions of Files. Swapnil Patil, Garth A. Gibson. Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose CA, February 2011. Supersedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-110, Sept. 2010.
    Abstract / PDF [508K]

  • Applying Simple Performance Models to Understand Inefficiencies in Data-Intensive Computing. Elie Krevat, Tomer Shiran, Eric Anderson, Joseph Tucek, Jay J. Wylie, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-103. February 2011.
    Abstract / PDF [476K]

  • Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
    Abstract / PDF [336K]

2010

  • Storage-Based Intrusion Detection. Adam G. Pennington, John Linwood Griffin, John S. Bucy, John D. Strunk, Gregory R. Ganger. ACM Transactions on Information and System Security, Vol. 13, No. 4, Article 30, Pub. date: December 2010.
    Abstract / PDF [333K]

  • Thread Cluster Memory Scheduling: Exploiting Differences in Memory Access Behavior. Yoongu Kim, Michael Papamichael, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 43rd International Symposium on Microarchitecture (MICRO), Atlanta, GA, December 2010.
    Abstract / PDF [478K]

  • Improving Storage Bandwidth Guarantees with Performance Insulation. Matthew Wachs, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-113, October 2010.
    Abstract / PDF [285K]

  • SmartScan: Efficient Metadata Crawl for Storage Management Metadata Querying in Large File Systems. Likun Liu, Lianghong Xu, Yongwei Wu, Guangwen Yang, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-112, Oct. 2010.
    Abstract / PDF [366K]

  • Speeding Up Finite Element Wave Propagation for Large-Scale Earthquake Simulations. Ricardo Taborda, Julio López, Haydar Karaoglu, John Urbanic, Jacobo Bielak. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-109, October 2010.
    Abstract / PDF [4.4M]

  • Behavior-Based Problem Localization for Parallel File Systems. Michael P. Kasick, Rajeev Gandhi, Priya Narasimhan. HotDep '10. October 3, 2010, Vancouver, BC, Canada.
    Abstract / PDF [149K]

  • To Upgrade or Not to Upgrade: Impact of Online Upgrades across Multiple Administrative Domains. T. Dumitras, E. Tilevich, P.Narasimhan. ACM Onward! Conference, Oct. 2010.
    Abstract / PDF [425K]

  • dBug: Systematic Evaluation of Distributed Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson. 5th Int. Workshop on Systems Software Verification (SSV’10), co-located with 9th USENIX Symp. On Operating Systems Design and Implementation (OSDI’10), Vancouver BC, October 2010.
    Abstract / PDF [168K]

  • Token Attempt: The Misrepresentation of Website Privacy Policies through the Misuse of P3P Compact Policy Tokens. Pedro Giovanni Leon, Lorrie Faith Cranor, Aleecia M. McDonald, Robert McGuire. Cylab Technical Report CMU-CyLab-10-014, September 10, 2010.
    Abstract / PDF [305K]

  • Parsimonious Linear Fingerprinting for Time Series. Lei Li, B. Aditya Prakash, Christos Faloutsos. Proceedings of the VLDB Endowment, Vol. 3, No. 1, September 2010.
    Abstract / PDF [684K]

  • pWalrus: Towards Better Integration of Parallel File Systems into Cloud Storage. Yoshihisa Abe, Garth A. Gibson. Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), co-located with IEEE Int. Conference on Cluster Computing 2010 (Cluster10), Heraklion, Greece, September 2010.
    Abstract / PDF [321K]

  • The Case for Content Search of VM Clouds. Mahadev Satyanarayanan, Wolfgang Richter, Glenn Ammons, Jan Harkes, Adam Goode. 34th Annual IEEE Computer Software and Applications Conference Workshops (COMPSACW), July 19-23, 2010, Seoul, Korea.
    Abstract / PDF [831K]

  • FAWNSort: Energy-efficient Sorting of 10GB. Vijay Vasudevan Lawrence Tan, David Andersen, Michael Kaminsky, Michael A. Kozuch, Padmanabhan Pillai, Winner of 2010 10GB Joulesort, Daytona and Indy categories. http://sortbenchmark.org/. July 2010
    Abstract / PDF [90K]

  • Phase Change Memory Architecture and the Quest for Scalability. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Communications of the ACM (CACM), Research Highlight, Vol. 53, No. 7, pages 99-106, July 2010.
    Abstract / PDF [1.34M]

  • Diagnosing Performance Changes by Comparing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-107. July 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-103.
    Abstract / PDF [503K]

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • OddBall: Spotting Anomalies in Weighted Graphs. Leman Akoglu, Mary McGlohon, Christos Faloutsos. PAKDD 2010, Hyderabad, India, 21-24 June 2010. Best Paper Award!
    Abstract / PDF [3.0M]

  • A Transparently-Scalable Metadata Service for the Ursa Minor Storage System. Shafeeq Sinnamohideen, Raja R. Sambasivan, James Hendricks, Likun Liu, Gregory R. Ganger. Usenix Annual Technical Conference, Boston, MA, June 23-25, 2010. Supercedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-102. March 2010.
    Abstract / PDF [230K]

  • Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. Jiaqi Tan*, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. 30th IEEE International Conference on Distributed Computing Systems (ICDCS) 2010, Genoa, Italy, Jun 2010.
    Abstract / PDF [2.1M]

  • Zzyzx: Scalable Fault Tolerance Through Byzantine Locking. James Hendricks, Shafeeq Sinnamohideen, Gregory R. Ganger, Michael K. Reiter. Proceedings of the 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. Chicago, Illinois, June 2010.
    Abstract / PDF [231K]

  • DiscFinder: A data-intensive scalable cluster finder for astrophysics. Bin Fu, Kai Ren, Julio López, Eugene Fink, and Garth A. Gibson. In Proceedings of the ACM International Symposium on High Performance Distributed Computing (HPDC), Chicago, IL. June, 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-104..
    Abstract / PDF [372K]

  • Robust and Flexible Power-proportional Storage. Hrishikesh Amur, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch, Karsten Schwan. ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.
    Abstract / PDF [944K]

  • Reusing Migration to Simply and Efficiently Implement Multi-server Operations in Transparently Scalable Storage Systems. Shafeeq Sinnamohideen. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-10-141. May 2010.
    Abstract / PDF [926K]

  • An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010). May 17-20, 2010, Melbourne, Victoria, Australia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-107, December, 2009.
    Abstract / PDF [832K]

  • Applying Performance Models to Understand Data-intensive Computing Efficiency. Elie Krevat, Tomer Shiran, Eric Anderson†, Joseph Tucek†, Jay J. Wylie†, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-108. May 2010.
    Abstract / PDF [304K]

  • Energy-efficient Cluster Computing with FAWN: Workloads and Implications. Vijay Vasudevan David Andersen, Michael Kaminsky, Lawrence Tan, Jason Franklin, Iulian Moraru . Proceedings of 1st Int'l Conf. on Energy-Efficient Computing & Networking (e-Energy 2010), Univ. of Passau, Germany. April 13-15, 2010.
    Abstract / PDF [645K]

  • Open Cirrus: A Global Cloud Computing Testbed. Arutyun I. Avetisyan, Roy Campbell, Indranil Gupta, Michael T. Heath, Steven Y. Ko, Gregory R. Ganger, Michael A. Kozuch, David O’Hallaron, Marcel Kunze, Thomas T. Kwan, Kevin Lai, Martha Lyons, Dejan S. Milojicic, Hing Yan Lee, Ng Kwang Ming, Jing-Yuan Luke, Han Namgong, Yeng Chai Soh. IEEE Computer, April 2010.
    Abstract / PDF [1.1M]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-105, April 2010.
    Abstract / PDF [513K]

  • Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments. Jiaqi Tan, Xinghao Pan, Eugene Marinelli, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 12th IEEE/IFIP Network Operations and Management Symposium (NOMS) 2010, Osaka, Japan, Apr 2010.
    Abstract / PDF [2.8M]

  • Access Control for Home Data Sharing: Attitudes, Needs and Practices. Michelle L. Mazurek, J.P. Arsenault, Joanna Bresee, Nitin Gupta, Iulia Ion, Christina Johns, Daniel Lee, Yuan Liang, Jenny Olsen, Brandon Salmon, Richard Shay, Kami Vaniea, Lujo Bauer, Lorrie Faith Cranor, Gregory R. Ganger, Michael K. Reiter. CHI 2010, April 10 – 15, 2010, Atlanta, Georgia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-110, October 2009.
    Abstract / PDF [250K]

  • Black-Box Problem Diagnosis in Parallel File Systems. Michael P. Kasick, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST '10), San Jose, CA, February 2010.
    Abstract / PDF [533K]

  • Phase Change Technology and the Future of Main Memory. Benjamin C. Lee, Ping Zhou, Jun Yang, Youtao Zhang, Bo Zhao, Engin Ipek, Onur Mutlu, Doug Burger. IEEE Micro, Special Issue: Micro's Top Picks from 2009 Computer Architecture Conferences (MICRO TOP PICKS), Vol. 30, No. 1, pages 60-70, January/February 2010.
    Abstract / PDF [600K]

  • ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers. Yoongu Kim, Dongsu Han, Onur Mutlu, Mor Harchol-Balter. Proceedings of the 16th International Symposium on High-Performance Computer Architecture (HPCA), Bangalore, India, January 2010.
    Abstract / PDF [333K]

2009

  • Why Do Upgrades Fail And What Can We Do About It? Toward Dependable, Online Upgrades in Enterprise Systems. T. Dumitras, P. Narasimhan. ACM/IFIP/USENIX Middleware Conference, Nov-Dec. 2009.
    Abstract / PDF [835K]

  • Toward Upgrades-as-a-Service in Distributed Systems. T. Dumitras, P. Narasimhan. Poster Session at Middleware 2009. 10th International Middleware Conference Urbana Champaign, Illinois, USA.
    Abstract / PDF [602K]

  • ...And eat it too: High read performance in write-optimized HPC I/O middleware file formats. Milo Polte, Jay Lofstead, John Bent, Garth A. Gibson, Scott A. Klasky, Qing Liu, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Meghan Wingate, Matthew Wolf. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-111, November 2009.
    Abstract / PDF [388K]

  • PLFS: A Checkpoint Filesystem for Parallel Applications. John Bent, Garth A. Gibson, Gary Grider, Ben McClelland, Paul Nowoczynski, James Nunez, Milo Polte, Meghan Wingate. Supercomputing '09, November 15, 2009. Portland, Oregon.
    Abstract / PDF [388K]

  • DiskReduce: RAID for Data-Intensive Scalable Computing. Bin Fan, Wittawat Tantisiriroj, Lin Xiao, Garth A. Gibson. 4th Petascale Data Storage Workshop held in conjunction with Supercomputing '09, November 15, 2009. Portland, Oregon. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-112, November 2009.
    Abstract / PDF [304K]

  • Blind Men and the Elephant: Piecing Together Hadoop for Diagnosis. Xinghao Pan, Jiaqi Tan, Soila Kalvulya, Rajeev Gandhi, Priya Narasimhan. 20th IEEE International Symposium on Software Reliability Engineering (ISSRE), Industrial Track, Mysuru, India, Nov 2009.
    Abstract / PDF [160K]

  • Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. ;LOGIN Vol. 34, No. 5
    Abstract / PDF [403K]

  • Access Control for Home Data Sharing: Attitudes, Needs and Practices. Michelle L. Mazurek, J.P. Arsenault, Joanna Bresee, Nitin Gupta, Iulia Ion, Christina Johns, Daniel Lee, Yuan Liang, Jenny Olsen, Brandon Salmon, Richard Shay, Kami Vaniea, Lujo Bauer, Lorrie Faith Cranor, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-110. October 2009.
    Abstract / PDF [250K]

  • Understanding and Maturing the Data-Intensive Scalable Computing Storage Substrate. Garth A. Gibson, Bin Fan, Swapnil Patil, Milo Polte, Wittawat Tantisiriroj, Lin Xiao. Microsoft Research eScience Workshop 2009, Pittsburgh, PA, October 16-17, 2009.
    Abstract / PDF [520K]

  • FAWN: A Fast Array of Wimpy Nodes. David Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Proc. 22nd ACM Symposium on Operating Systems Principles (SOSP 2009), Big Sky, MT. October 2009. BEST PAPER AWARD!
    Abstract / PDF [332K]

  • Co-scheduling of Disk Head Time in Cluster-based Storage. Matthew Wachs, Gregory R. Ganger. 28th International Symposium On Reliable Distributed Systems September 27-30, 2009. Niagara Falls, New York, U.S.A. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-113. October 2008.
    Abstract / PDF [245K]

  • Putting Home Data Management into Perspective. Brandon Watts Salmon. Carnegie Mellon University ECE Ph.D. Dissertation, CMU-PDL-09-113, August 17, 2009.
    Abstract / PDF [3.6M]

  • File System Virtual Appliances. Michael Abd-El-Malek. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-109, August 2009.
    Abstract / PDF [1.15M]

  • Delayed Instantiation Bulk Operations for Management of Distributed, Object-based Storage Systems. Andrew J. Klosterman. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-108, August 2009.
    Abstract / PDF [2M]

  • Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Brian Mueller. SIGCOMM’09, August 17–21, 2009, Barcelona, Spain.
    Abstract / PDF [755K]

  • Efficient Byzantine Fault Tolerance for Scalable Storage and Services. James Hendricks. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-09-146. July 2009.
    Abstract / PDF [1.1M]

  • No Downtime for Data Conversions: Rethinking Hot Upgrades. Tudor Dumitraş, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-106. July 2009.
    Abstract / PDF [855K]

  • Architecting Phase Change Memory as a Scalable DRAM Alternative. Benjamin C. Lee, Engin Ipek, Onur Mutlu, Doug Burger. Proceedings of the 36th International Symposium on Computer Architecture (ISCA), pages 2-13, Austin, TX, June 2009.
    Abstract / PDF [2.6M]

  • Ganesha: Black-Box Fault Diagnosis for MapReduce Systems. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Workshop on Hot Topics in Measurement and Modeling of Computer Systems (HotMetrics 2009), Seattle, WA (June 2009). Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-112. September 2008.
    Abstract / PDF [180K]

  • In Search of an API for Scalable File Systems: Under the table or above it? Swapnil Patil, Garth A. Gibson, Gregory R. Ganger, Julio Lopez, Milo Polte, Wittawat Tantisiroj, and Lin Xiao. USENIX HotCloud Workshop 2009. June 2009, San Diego CA.
    Abstract / PDF [260K]

  • Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Workshop on Hot Topics in Cloud Computing (HotCloud '09), San Diego, CA, on June 15, 2009. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-103, May 2009.
    Abstract / PDF [373K]

  • System-Call Based Problem Diagnosis for PVFS. Michael P. Kasick, Keith A. Bare, Eugene E. Marinelli III, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep '09). Lisbon, Portugal. June 2009.
    Abstract / PDF [117K]

  • Tashi: Location-aware Cluster Management. Michael A. Kozuch, Michael P. Ryan, Richard Gass, Steven W. Schlosser, David O’Hallaron, James Cipar, Elie Krevat, Julio López, Michael Stroucken, Gregory R. Ganger. First Workshop on Automated Control for Datacenters and Clouds (ACDC'09), Barcelona, Spain, June 2009.
    Abstract / PDF [160K]

  • Directions for Shingled-Write and Two-Dimensional Magnetic Recording System Architectures: Synergies with Solid-State Disks. Garth A. Gibson, Milo Polte. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-104. May 2009.
    Abstract / PDF [70K]

  • File System Virtual Appliances: Portable File System Implementations. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Karan Sanghi, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-102. May 2009.
    Abstract / PDF [486K]

  • FAWNdamentally Power-efficient Clusters. Vijay Vasudevan, Jason Franklin, David Andersen, Amar Phanishayee, Lawrence Tan, Michael Kaminsky, Iulian Moraru. 12th Workshop on Hot Topics in Operating Systems (HotOS XII). May 2009.
    Abstract / PDF [236K]

  • Enabling Enterprise Solid State Disks Performance. Milo Polte, Jiri Simsa, Garth A. Gibson. 1st Workshop on Integrating Solid-state Memory into the Storage Hierarchy, March 7, 2009, Washington DC.
    Abstract / PDF [302K]

  • Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. 7th USENIX Conference on File and Storage Technologies (FAST '09). Feb. 24-27, 2009. San Francisco, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-105, May 2008.
    Abstract / PDF [275KM]

  • Relative Fitness Modeling. Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, and Gregory R. Ganger. Communications of the ACM, Vol. 52 No. 4, April 2009.
    Abstract / PDF [775K]

  • Solving TCP Incast in Cluster Storage Systems. Vijay Vasudevan, Hiral Shah, Amar Phanishayee, Elie Krevat, David Andersen, Gregory R. Ganger, Garth A. Gibson. FAST 2009 Work in Progress Report. 7th USENIX Conference on File and Storage Technologies. Feb 24-27, 2009, San Francisco, CA.
    PDF [70K]

  • A (In)Cast of Thousands: Scaling Datacenter TCP to Kiloservers and Gigabits. Vijay Vasudevan, Amar Phanishayee, Hiral Shah, Elie Krevat, David G. Andersen, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-101, Feb. 2009.
    Abstact / PDF [317K]

2008

  • A Fault Model for Upgrades in Distributed Systems. Tudor Dumitraş, Soila Kavulya, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-115, December 2008.
    Abstract / PDF [275K]

  • SALSA: Analyzing Logs as StAte Machines.SALSA: Analyzing Logs as StAte Machines. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Analysis of System Logs (WASL), San Diego, CA (December 2008). Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-111. September 2008.
    Abstract / PDF [630K]

  • Fast Log-based Concurrent Writing of Checkpoints. Milo Polte, Jiri Simsa, Wittawat Tantisiriroj, Garth A. Gibson, Shobhit Dayal, Mikhail Chainani, Dilip Kumar Uppugandla. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [262K]

  • Comparing Performance of Solid State Devices and Mechanical Disks. Milo Polte, Jiri Simsa, Garth A. Gibson. Proceedings of the 3rd Petascale Data Storage Workshop held in conjunction with Supercomputing '08, November 17, 2008, Austin, TX.
    Abstract / PDF [99K]

    Data-intensive file systems for Internet services: A rose by any other name ... Wittawat Tantisiriroj, Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-114. October 2008
    Abstract / PDF [350K]

  • Ganesha: Black-Box Fault Diagnosis for MapReduce Systems. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-112. September 2008.
    Abstract / PDF [180K]

  • GIGA+ : Scalable Directories for Shared File Systems. Swapnil Patil, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-110. October 2008.
    Abstract / PDF [400K]

  • Characterizing HEC Storage Systems at Rest. Shobhit Dayal. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-109, July 2008.
    Abstract / PDF [603K]

  • IRONModel: Robust Performance Models in the Wild. Eno Thereska, Gregory R. Ganger. SIGMETRICS’08, June 2–6, 2008, Annapolis, Maryland, USA.
    Abstract / PDF [813K]

  • FAWN: A Fast Array of Wimpy Nodes. David G. Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-108, May 2008.
    Abstract / PDF [875K]

  • User Level Implementation of Scalable Directories (GIGA+). Sanket Hase, Aditya Jayaraman, Vinay K. Perneti, Sundararaman Sridharan, Swapnil V. Patil, Milo Polte, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-107, May 2008.
    Abstract / PDF [1.67M]

  • File System Virtual Appliances: Third-party File System Implementations without the Pain. Michael Abd-El-Malek, Matthew Wachs, James Cipar, Gregory R. Ganger, Garth A. Gibson, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-106, May 2008.
    Abstract / PDF [508K]

  • Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-105, May 2008.
    Abstract / PDF [1.65M]

  • ASDF: Automated, Online Fingerpointing for Hadoop. Keith Bare, Michael P. Kasick, Soila Kavulya, Eugene Marinelli, Xinghao Pan, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-104. May 2008.
    Abstract / PDF [650K]

  • The DiskSim Simulation Environment Version 4.0 Reference Manual.John S. Bucy, Jiri Schindler, Steven W. Schlosser, Gregory R. Ganger, and Contributors. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-101, May 2008.
    Abstract / PDF [704K] / Code Release

  • Using Utility Functions to Control a Distributed Storage System. John D. Strunk. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-08-102, May 2008.
    Abstract / PDF [940K]

  • RAMS and BlackSheep: Inferring White-box Application Behavior Using Black-box Techniques. Jiaqi Tan, Priya Narasimhan. School of Computer Science Senior Honors Thesis and Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-103, May, 2008.
    Abstract / PDF [1.7M]

  • Using Utility to Provision Storage Systems. John D. Strunk, Eno Thereska, Christos Faloutsos, Gregory R. Ganger. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-106, September 2007.
    Abstract / PDF [310K]

  • Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems. Amar Phanishayee, Elie Krevat, Vijay Vasudevan, David G. Andersen, Gregory R. Ganger, Garth A. Gibson, Srinivasan Seshan. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-105, September 2007.
    Abstract / PDF [374K]

2007

  • On Modeling the Relative Fitness of Storage. Michael P. Mesnier. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-108, December 19, 2007.
    Abstract / PDF [1.16M]

  • On Application-level Approaches to Avoiding TCP Throughput Collapse in Cluster-Based Storage Systems. E. Krevat, V. Vasudevan, A. Phanishayee, D. Andersen, G. Ganger, G. Gibson, S. Seshan. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [124K]

  • GIGA+: Scalable Directories for Shared File Systems. Swapnil V. Patil, Garth A. Gibson, Sam Lang, Milo Polte. Proceedings of the 2nd international Petascale Data Storage Workshop (PDSW '07) held in conjunction with Supercomputing '07. November 11, 2007, Reno, NV.
    Abstract / PDF [114K]

  • Failure Tolerance in Petascale Computers. Garth A. Gibson, Bianca Schroeder, Joan Digney. CTWatch Quarterly, vol. 3 no. 4. Volume on Software Enabling Technologies for Petascale Science. November 2007. www.ctwatch.org
    PDF [686K]
  • Learning to Share: A Study of Sharing Among Home Storage Devices. Brandon Salmon, Frank Hady, Jay Melican. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-107, October, 2007.
    Abstract / PDF [726K]

  • Low-overhead Byzantine Fault-tolerant Storage. James Hendricks, Gregory R. Ganger, Michael K. Reiter. Proceedings of the Twenty-First ACM Symposium on Operating Systems Principles (SOSP 2007), Stevenson, WA, October 2007.
    Abstract / PDF [280K]

  • Understanding Failures in Petascale Computers. Bianca Schroeder, Garth A. Gibson. SciDAC 2007. Journal of Physics: Conf. Ser. 78.
    Abstract / PDF [712K]

  • To Share Or Not To Share? Ryan Johnson, Stavros Harizopoulos, Nikos Hardavellas, Kivanc Sabirli, Ippokratis Pandis, Anastasia Ailamaki, Naju G. Mancheril, Babak Falsafi. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
    Abstract / PDF [366K]

  • Efficient Use of the Query Optimizer for Automated Physical Design. Stratos Papadomanolakis, Debabrata Dash, Anastasia Ailamaki. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
    Abstract / PDF [2.4M]

  • Enabling What-if Explorations in Systems. Eno Thereska. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-103, August 2007.
    Abstract / PDF [2.35M]

  • Verifying Distributed Erasure-coded Data. James Hendricks, Gregory R. Ganger, Michael K. Reiter. To appear in Proceedings of the Twenty-Sixth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2007), Portland, August 2007.
    Abstract / PDF [193K]

  • An Analysis of Database System Performance on Chip Multiprocessors. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Stavros Harizopoulos, Anastasia Ailamaki and Babak Falsafi. Proceedings of the 6th Hellenic Data Management Symposium (HDMS2007), Athens, Greece, July 2007.
    Abstract / PDF [308K]

  • Lessons Learned From the Deployment of a Smartphone-Based Access-Control System. Lujo Bauer, Lorrie Faith Cranor, Michael K. Reiter, Kami Vaniea. Symposium On Usable Privacy and Security (SOUPS) 2007, July 18-20, 2007, Pittsburgh, PA, USA.
    Abstract / PDF [984K]

  • Categorizing and Differencing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Eno Thereska, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL.
    Abstract / PDF [120K]

  • Observer: Keeping System Models from Becoming Obsolete. Eno Thereska, Dushyanth Narayanan, Anastassia Ailamaki, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-101, January 2007.
    Abstract / PDF[ 75K]

  • Using Provenance to Aid in Personal File Search. Sam Shah, Craig A. N. Soules, Gregory R. Ganger, Brian D. Noble. USENIX '07 Annual Technical Conference, Santa Clara, CA, June 17–22, 2007.
    Abstract / PDF [225K]

  • Improving Mobile Database Access Over Wide-Area Networks Without Degrading Consistency. Niraj Tolia, M. Satyanarayanan, Adam Wolbach. MobiSys’07, June 11–13, 2007, San Juan, Puerto Rico, USA.
    Abstract / PDF [1.1M]

  • VMM-Independent Graphics Acceleration. H. Andres Lagar-Cavilla, Niraj Tolia, M. Satyanarayanan, Eyal de Lara. VEE’07, June 13–15, 2007, San Diego, California, USA.
    Abstract / PDF [1.4M]

  • Modeling the Relative Fitness of Storage. Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, Gregory R. Ganger. SIGMETRICS’07, June 12–16, 2007, San Diego, California, USA.
    Abstract / PDF [235K]

  • Scheduling Threads for Constructive Cache Sharing on CMPs. Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Vasileios Liaskovitis, Anastassia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Todd C. Mowry, Chris Wilkerson. SPAA'07, June 9-11, 2007, San Diego, California, USA.
    Abstract / PDF [293K]

  • Consistency-preserving Caching of Dynamic Database Content. Niraj Tolia and M. Satyanarayanan. International World Wide Web Conference (WWW 2007), May 8-12, 2007, Banff, Alberta, Canada.
    Abstract / PDF [888K]

  • Fingerpointing Correlated Failures in Replicated Systems. Soila Pertet, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), Cambridge, MA (April 2007).
    Abstract / PDF [100K]

  • Exploiting Similarity for Multi-Source Downloads Using File Handprints. Himabindu Pucha, David G. Andersen, Michael Kaminsky. Proceedings of the 4th Symposium on Networked Systems Design and Implementation (NSDI ’07), Cambridge, Massachusetts, April 2007.
    Abstract / PDF [579K]

  • MultiMap: Preserving Disk Locality for Multidimensional Datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. IEEE 23rd International Conference on Data Engineering (ICDE 2007) Istanbul, Turkey, April 2007. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005.
    Abstract / PDF [203K]

  • The Computer Failure Data Repository. Bianca Schroeder, Garth A. Gibson. Invited contribution to the Workshop on Reliability Analysis of System Failure Data (RAF'07) MSR Cambridge, UK, March 2007.
    Abstract / PDF [42K]

  • //TRACE: Parallel Trace Replay with Approximate Causal Events. Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio Lopez, James Hendricks, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-108, September 2006.
    Abstract / PDF[ 187K]

  • Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You? Bianca Schroeder, Garth A. Gibson. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-111, September 2006.
    Abstract / PDF[ 272K]

  • Argon: Performance Insulation for Shared Storage Servers. Matthew Wachs, Michael Abd-El-Malek, Eno Thereska, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-106, May 2006.
    Abstract / PDF [ 167K]

  • Database Servers on Chip Multiprocessors: Limitations and Opportunities. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Anastassia Ailamaki and Babak Falsafi. 3rd Biennial Conference on Innovative Data Systems Research (CIDR), January 7-10, 2007, Asilomar, California, USA.
    Abstract / PDF [111K]

2006

  • Static Analysis Meets Distributed Fault-Tolerance: Enabling State-Machine Replication with Nondeterminism. Joseph Slember, Priya Narasimhan. Proceedings of the 2nd Workshop on Hot Topics in System Dependability (HotDep '06), Seattle, WA. Nov. 8, 2006.
    Abstract / PDF [98K]

  • Living with Nondeterminism in Replicated Middleware Applications. Joseph Slember, Priya Narasimhan. Middleware 2006, ACM/IFIP/USENIX, 6th International Middleware Conference, Melbourne, Australia, November 27 - December 1, 2006, Proceedings. Lecture Notes in Computer Science 4290 Springer 2006.
    Abstract / PDF [387K]

  • Towards Fingerpointing in the Emulab Dynamic Distributed System. Michael P. Kasick, Priya Narasimhan, Kevin Atkinson, Jay Lepreau. Proceedings of the 3rd USENIX Workshop on Real, Large Distributed Systems (WORLDS '06), Seattle, WA. Nov. 5, 2006.
    Abstract / PDF [311K]

  • Putting Home Storage Management into Perspective. Brandon Salmon, Steven W. Schlosser, Lily B. Mummert, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-110, September 2006.
    Abstract / PDF [382K]

  • Early Experiences on the Journey Towards Self-* Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad, Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, September 2006.
    Abstract / PDF [113K] / Postscript [745K]

  • Brief Announcement: Parallel Depth First vs. Work Stealing Schedulers on CMP Architectures. Vasileios Liaskovitis, Shimin Chen, Phillip B. Gibbons, Anastassia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Michael Kozuch, Todd C. Mowry, Chris Wilkerson. SPAA’06, July 30-August 2, 2006, Cambridge, Massachusetts, USA.
    Abstract / PDF [52K]

  • Routing, Disjoint Paths, and Classification. Shuheng Zhou. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-06-109, August 2006.
    Abstract / PDF [971K] / Postscript [1.1M]

  • Using Context to Assist in Personal File Retrieval. Craig A. N. Soules. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-06-147, August 25, 2006.
    Abstract / PDF [ 681K]

  • InteMon: Continuous Mining of Sensor Data in Large-scale Self-* Infrastructures. Evan Hoke, Jimeng Sun, John D. Strunk, Gregory R. Ganger, and Christos Faloutsos. ACM SIGOPS Operating Systems Review. Vol 40 Issue 3. July, 2006. ACM Press.
    Abstract / PDF [573K]

  • Group Communication: Helping or Obscuring Failure Diagnosis? Soila Pertet, Rajeev Gandhi and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-107, June, 2006.
    Abstract / PDF [ 591K]

  • Stardust: Tracking Activity in a Distributed Storage System. Eno Thereska, Brandon Salmon, John Strunk, Matthew Wachs, Michael Abd-El-Malek, Julio Lopez, Gregory R. Ganger. Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems, (SIGMETRICS'06). June 26th-30th 2006, Saint-Malo, France.
    Abstract / PDF [578K]

  • A Large-scale Study of Failures in High-performance-computing Systems. Bianca Schroeder, Garth A. Gibson. Proceedings of the International Conference on Dependable Systems and Networks (DSN2006), Philadelphia, PA, USA, June 25-28, 2006. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-112, December, 2005.
    Abstract / PDF [570K]

  • Informed Data Distribution Selection in a Self-predicting Storage System. Eno Thereska, Michael Abd-El-Malek, Jay J. Wylie, Dushyanth Narayanan, Gregory R. Ganger. Proceedings of the International Conference on Autonomic Computing (ICAC-06), Dublin, Ireland. June 12th-16th 2006. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-101, January 2006.
    Abstract / PDF [196K]

  • Design Tradeoffs in Applying Content Addressable Storage to Enterprise-scale Systems Based on Virtual Machines. Partho Nath, Michael Kozuch, David O'Hallaron, Jan Harkes, M. Satyanarayanan, Niraj Tolia, and Matt Toups. Proceedings of the 2006 USENIX Annual Technical Conference (USENIX '06), Boston, Massachusetts, May-June 2006.
    Abstract / PDF [912K]
  • Eliminating Cross-server Operations in Scalable File Systems. James Hendricks, Shafeeq Sinnamohideen, Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-105, May 2006.
    Abstract / PDF [ 254K]

  • Improving Small File Performance in Object-based Storage. James Hendricks, Raja R. Sambasivan, Shafeeq Sinnamohideen, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-104, May 2006.
    Abstract / PDF [ 1.45M]

  • Towards Efficient Semantic Object Storage for the Home. Brandon Salmon, Steven W. Schlosser, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-103, May 2006.
    Abstract / PDF [ 297K]

  • An Architecture for Internet Data Transfer. Niraj Tolia, Michael Kaminsky, David G. Andersen, Swapnil Patil. Proceedings of the 3rd Symposium on Networked Systems Design and Implementation (NSDI'06), San Jose, California, May 2006.
    Abstract / PDF [211K]

  • Dimorphic Computing. H. Andres Lagar-Cavilla, Niraj Tolia, Rajesh Balan, Eyal de Lara, M. Satyanarananan, David O'Hallaron. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-06-123, April 2006.
    Abstract / PDF [1.7M]

  • Quantifying Interactive User Experience on Thin Clients. Niraj Tolia, David G. Andersen, and M. Satyanarayanan. IEEE Computer. March, 2006.
    Abstract / PDF [690K]

  • Challenges and Opportunities in Internet Data Mining. David G. Andersen, Nick Feamster. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-06-102, Jan 2006.
    Abstract / PDF [104K]

2005

  • Ursa Minor: Versatile Cluster-based Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad, Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-104, April, 2005.
    Abstract / PDF [490K]

  • Design and Implementation of Self-Securing Network Interface Applications. Stanley M. Bielski. M.S. Thesis. Electrical and Computer Engineering, Carnegie Mellon University. December 2005.
    Abstract / PDF [211K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • Causes of Failure in Web Applications. Soila Pertet and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-109. December 2005.
    Abstract / PDF [367K]

  • On Multidimensional Data and Modern Disks. Steven W. Schlosser, Jiri Schindler, Stratos Papadomanolakis, Minglong Shao Anastassia Ailamaki, Christos Faloutsos, Gregory R. Ganger. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005.
    Abstract / PDF [220K]

  • Scheduling Speculative Tasks in a Compute Farm. David Petrou, Garth A. Gibson, Gregory R. Ganger. Proceedings of the ACM/IEEE Supercomputing 2005 Conference, Seattle, Washington, November, 2005.
    Abstract / PDF [ 569K]

  • A read/write protocol family for versatile storage infrastructures. Jay J. Wylie. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-05-108, October 2005.
    Abstract / Postscript [1.95M] / PDF [1.5M]

  • Connections: Using Context to Enhance File Search. Craig A. N. Soules, Gregory R. Ganger. SOSP'05, October 23–26, 2005, Brighton, United Kingdom. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-105, April 2005.
    Abstract / PDF [300K]

  • Fault-Scalable Byzantine Fault-Tolerant Services. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. SOSP’05, October 23-26, 2005, Brighton, United Kingdom.
    Abstract / PDF [299K]

  • Lazy Verification in Fault-Tolerant Distributed Storage Systems. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. 24th IEEE Symposium on Reliable Distributed Systems (SRDS 2005), October 26-28, 2005, Orlando, Florida.
    Abstract / PDF [ 228K]

  • Correctness of the Read/Conditional-Write and Query/Update Protocols. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-107, September, 2005.
    Abstract / PDF [392K]

  • Replication Policies for Layered Clustering of NFS Servers. Raja R. Sambasivan, Andrew J. Klosterman, Gregory R. Ganger. 13th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). September 26 - 29, Atlanta, GA.
    Abstract / PDF [199K]

  • Continuous Resource Monitoring for Self-predicting DBMS. Dushyanth Narayanan, Eno Thereska, Anastassia Ailamaki. Proceedings of the 13th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2005), Atlanta, GA, September 25-27, 2005.
    Abstract / PDF [333K]

  • Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations. Jurij Leskovec, Jon Kleinberg*, Christos Faloutsos. KDD’05, August 21–24, 2005, Chicago, Illinois, USA.
    Abstract / PDF [ 318K]

  • Modeling the Relative Fitness of Storage Devices. Michael Mesnier, Matthew Wachs, Gregory Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-106, August, 2005.
    Abstract / PDF [190K]

  • Explicit Compiler-based Memory Management for Out-of-core Applications. Angela Demke Brown. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-05-140, May 2005.
    Abstract / PDF [1.3M]

  • The Seductive Appeal of Thin Clients. Niraj Tolia, David G. Andersen, M. Satyanarayanan. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-05-151. February 2005.
    Abstract / Postscript [250K] / PDF [300K]

  • Towards self-predicting systems: What if you could ask “what-if”? Eno Thereska, Dushyanth Narayanan, Gregory R. Ganger. 3rd International Workshop on Self-adaptive and Autonomic Computing Systems. Copenhagen, Denmark, August 2005. Supercedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-05-10, February 2005.
    Abstract / PDF [110K]

  • QPipe: A Simultaneously Pipelined Relational Query Engine. Stavros Harizopoulos, Vladislav Shkapenyuk, Anastassia Ailamaki. SIGMOD 2005, June 14-16, 2005, Baltimore, Maryland, USA.
    Abstract / PDF [288K]

  • Empirical Analysis of Rate Limiting Mechanisms. Cynthia Wong, Stan Bielski, Ahren Studer, Chenxi Wang. 8th International Symposium on Recent Advances in Intrusion Detection (RAID 2005), September 7-9, 2005, Seattle, Washington. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-103, March 2005.
    Abstract / PDF [207K]

  • MultiMap: Preserving disk locality for multidimensional datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Christos Faloutsos, and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005.
    Abstract / PDF [318K]

  • Comparison-based File Server Verification. Yuen-Lin Tan, Terrence Wong, John D. Strunk, Gregory R. Ganger. USENIX '05 Annual Technical Conference, April 10-15, 2005. Anaheim, CA.
    Abstract / Postscript [900K] / PDF [130K]

2004

  • A Study of Mass-mailing Worms. Cynthia Wong, Stan Bielski, Jonathan M. McCune, Chenxi Wang. WORM’04, October 29, 2004, Washington, DC, USA.
    Abstract / PDF [192K]

  • Cluster Scheduling for Explicitly-speculative Tasks. David Petrou. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-112, September 2004.
    Abstract / PDF [4.2M] / Simulator Code

  • Efficient, scalable consistency for highly fault-tolerant storage. Garth Goodson. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-111, August 2004.
    Abstract / PDF [1.5M]

  • Challenges in Building a Two-Tiered Learning Architecture for Disk Layout. Brandon Salmon, Eno Thereska, Craig A.N. Soules, John D. Strunk, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-109. August, 2004.
    Abstract / Postscript [6.8M] / PDF [150K]

  • On Hierarchical Routing in Doubling Metrics. Anupam Gupta, Bruce M. Maggs, Shuheng Zhou. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-106. December, 2004.
    Abstract / Postscript [914K] / PDF [344K]

  • A Computational Database System for Generating Unstructured Hexahedral Meshes with Billions of Elements. Tiankai Tu, David R. O’Hallaron. SC2004, November 6-12, 2004, Pittsburgh, PA USA.
    Abstract / PDF [ 171K]

  • Timing-accurate Storage Emulation: Evaluating hypothetical storage components in real computer systems. John Linwood Griffin. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-108, September 2004.
    Abstract / PDF [16M]

  • Storage Device Performance Prediction with CART Models. Mengzhi Wang, Kinman Au, Anastassia Ailamaki, Anthony Brockwell, Christos Faloutsos, and Gregory R. Ganger. Proc. 12th Annual Meeting of the IEEE/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). Volendam, The Netherlands. October 5-7, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-103, March 2004.
    Abstract / Postscript [908K] / PDF [122K]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Secure Bootstrap is Not Enough: Shoring up the Trusted Computing Base. James Hendricks and Leendert van Doorn. Proceedings of the Eleventh SIGOPS European Workshop, ACM SIGOPS, Leuven, Belgium, September 2004.
    Abstract / PDF [175K]

  • Cluster Scheduling for Explicitly Speculative Tasks. David Petrou Gregory R. Ganger Garth A. Gibson. Proceedings 18th Annual ACM International Conference on Supercomputing (ICS’04), June 26-July 1, 2004, Malo, France.
    Abstract / PDF [443K]

  • STEPS Towards Cache-Resident Transaction Processing. Stavros Harizopoulos, Anastassia Ailamaki. Proceedings of the 30th VLDB Conference, Toronto, Canada, 29 August - 3 September 2004.
    Abstract / PDF [332K]

  • Toward Automatic Context-based Attribute Assignment for Semantic File Systems. Craig A. N. Soules, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-105. June 2004.
    Abstract / PDF [290K]

  • SIMFLEX: A Fast, Accurate, Flexible Full-System Simulation Framework for Performance Evaluation of Server Architecture. Nikolaos Hardavellas, Stephen Somogyi, Thomas F. Wenisch, Roland E. Wunderlich, Shelley Chen, Jangwoo Kim, Babak Falsafi, James C. Hoe, and Andreas G. Nowatzyk. ACM SIGMETRICS Performance Evaluation Review (PER) Special Issue on Tools for Computer Architecture Research, Volume 31, Number 4, pages 31-35, March 2004.
    Abstract / PDF [96K]

  • Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • AutoPart: Automating Schema Design for Large Scientific Databases Using Data Partitioning. Stratos Papadomanolakis, Anastassia Ailamaki. 16th International Conference on Scientific and Statistical Database Management (SSDBM). Santorini Island, Greece. June 21-23, 2004.
    Abstract / Postscript [1M] / PDF [150K]

  • Efficient Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-104, December 2003.
    Abstract / Postscript [435K] / PDF [253K]

  • Dynamic Quarantine of Internet Worms. Cynthia Wong, Chenxi Wang, Dawn Song, Stan Bielski, Gregory R. Ganger. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-108, December 2003.
    Abstract / Postscript [1.4M] / PDF [224K]

  • A Protocol Family Approach to Survivable Storage Infrastructures. Jay J. Wylie, Garth R. Goodson, Gregory R. Ganger, Michael K. Reiter. FuDiCo II: S.O.S. (Survivability: Obstacles and Solutions), 2nd Bertinoro Workshop on Future Directions in Distributed Computing, 23-25 June 2004, University of Bologna Residential Center, Bertinoro (Forlì), Italy.
    Abstract / PDF [56K]

  • Storage Device Performance Prediction with CART Models [Extended Abstract]. Mengzhi Wang, Kinman Au, Anastassia Ailamaki, Anthony Brockwell, Christos Faloutsos, and Gregory R. Ganger. Proceedings: Poster Session. Joint International Conference on Measurement and Modeling of Computer Systems. ACM SIGMETRICS/Performance 2004. June 12th-16th 2004, Columbia University, New York.
    Abstract / Postscript [400K] / PDF [64K]

  • Decentralized Recovery for Survivable Storage Systems. Theodore Ming-Tao Wong. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-04-119. May 2004.
    Abstract / Postscript [1.6M] / PDF [715K]

  • Using MEMS-based Storage Devices in Computer Systems. Steven W. Schlosser. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-104, May 2004.
    Abstract / PDF [1.1M]

  • Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • The Safety and Liveness Properties of a Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-03-105. March 2004.
    Abstract / Postscript [922K] / PDF [227K]

  • Improving Hash Join Performance through Prefetching. Shimin Chen, Anastassia Ailamaki, Philip B. Gibbons, Todd C. Mowry. Proceedings of the 20th International Conference on Data Engineering (ICDE 2004). Boston, MA. March 30 to April 2, 2004.
    Abstract / Postscript [ 2.2M] / PDF [330K]

  • Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003.
    Abstract / PDF [281K]

  • MEMS-based storage devices and standard disk interfaces: A square peg in a round hole? Steven W. Schlosser, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-102, December, 2003.
    Abstract / Postscript [2.8M] / PDF [156K]

  • A Framework for Building Unobtrusive Disk Maintenance Applications. Eno Thereska, Jiri Schindler, John Bucy, Brandon Salmon, Christopher R. Lumb, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-192, October 2003.
    Abstract / Postscript [5.1M] / PDF [148K]

  • Integrating Portable and Distributed Storage. Niraj Tolia, Jan Harkes, Michael Kozuch, and M. Satyanarayanan. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [881K] / PDF [211K]

  • Diamond: A Storage Architecture for Early Discard in Interactive Search. Larry Huston, Rahul Sukthankar, Rajiv Wickremesinghe, M. Satyanarayanan, Gregory R. Ganger, Erik Riedel, Anastassia Ailamaki. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [2.2M] / PDF [266K]

  • File Classification in Self-* Storage Systems. Michael Mesnier, Eno Thereska, Daniel Ellard, Gregory R. Ganger, Margo Seltzer. Proceedings of the First International Conference on Autonomic Computing (ICAC-04). New York, NY. May 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-101, January 2004.
    Abstract / Postscript [1.6M] / PDF [80K]

2003

  • Design and Implementation of a Freeblock Subsystem. Eno Thereska, Jiri Schindler, Christopher R. Lumb, John Bucy, Brandon Salmon, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-107, December, 2003.
    Abstract / Postscript [6.5M] / PDF [165K]

  • On the Feasibility of Intrusion Detection Inside Workstation Disks. John Linwood Griffin, Adam Pennington, John S. Bucy, Deepa Choundappan, Nithya Muralidharan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-106. December, 2003.
    Abstract / Postscript [1.12M] / PDF [215K]

  • A Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-103, December 2003.
    Abstract / Postscript [925K] / PDF [321K]

  • Scheduling Explicitly-speculative Tasks. David Petrou, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Technical Report CMU-CS-03-204, November 2003.
    Abstract / Postscript [2.0M] / PDF [400K]

  • Balancing Locality and Randomness in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-203, November 2003.
    Abstract / Postscript [489K] / PDF [511K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • A Prototype User Interface for Coarse-Grained Desktop Access Control. A. Chris Long, Courtney Moskowitz, and Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-200, November 2003.
    Abstract / Postscript [900K] / PDF [535K]

  • Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-187, September 2003.
    Abstract / Postscript [272K] / PDF [479K]

  • Attribute-Based Prediction of File Properties. Daniel Ellard, Michael Mesnier, Eno Thereska, Gregory R. Ganger, Margo Seltzer. Harvard Computer Science Group Technical Report TR-14-03, December 2003.
    Abstract / Postscript [850K] / PDF [127K]

  • GEM: Graph EMbedding for Routing and Data-Centric Storage in Sensor Networks Without Geographic Information. James Newsome and Dawn Song. Proceedings of the First ACM Conference on Embedded Networked Sensor Systems (SenSys 2003). November 5-7, 2003, Redwood, CA.
    Abstract / Postscript [1.5M] / PDF [266K]

  • Location-based Node IDs: Enabling Explicit Locality in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-171, September 2003.
    Abstract / Postscript [282K] / PDF [160K]

  • Self-* Storage: Brick-based Storage with Automated Administration. Gregory R. Ganger, John D. Strunk, Andrew J. Klosterman. Published as Carnegie Mellon University Technical Report, CMU-CS-03-178, August 2003.
    Abstract / Postscript [2.2M] / PDF [558K]

  • Storage-based Intrusion Detection: Watching Storage Activity For Suspicious Behavior. Adam Pennington, John Strunk, John Griffin, Craig Soules, Garth Goodson & Gregory R. Ganger. 12th USENIX Security Symposium, Washington, D.C., Aug 4-8, 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-02-179, September 2002.
    Abstract / Postscript [727K] / PDF [138K]

  • Object-Based Storage. Mike Mesnier, Gregory R. Ganger, Erik Riedel. IEEE Communications Magazine, v.41 n.8 pp 84-90, August 2003.
    Abstract / PDF [85K]

  • Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. VLDB 03, Berlin, Germany, Sept 9-12, 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003.
    Abstract / Postscript [510K] / PDF [152K]

  • Adaptive, Hands-Off Stream Mining. Spiros Papadimitriou, Anthony Brockwell, Christos Faloutsos. Carnegie Mellon University SCS Technical Report CMU-CS-02-205. Also published in Proceedings VLDB 03, Berlin, Germany, Sept 9-12, 2003.
    Abstract / Postscript [3.6M] / PDF [767K]

  • Time Management for New Faculty. Anastassia Ailamaki, Johannes Gehrke. SIGMOD Record, 32(2): 102-106, June 2003.
    Abstract / PDF [110K]

  • Opportunistic Use of Content Addressable Storage for Distributed File Systems. Niraj Tolia, Michael Kozuch, Mahadev Satyanarayanan, Brad Karp, Thomas Bressoud, and Adrian Perrig. Proceedinge USENIX Annual Technical Conference, General Track 2003: 127-140, June 9-14, San Antonio, TX.
    Abstract / Postscript [1M] / PDF [284K]

  • Metadata Efficiency in a Comprehensive Versioning File System. Craig A. N. Soules, Garth R. Goodson, John D. Strunk, Gregory R. Ganger. 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, Mar 31 - Apr 2, 2003. Supercedes CMU SCS Technical Report CMU-CS-02-145, May 2002.
    Abstract / Postscript [817K] / PDF [178K]

  • Data Staging on Untrusted Surrogates. Jason Flinn, Shafeeq Sinnamohideen, Niraj Tolia, M. Satyanarayanan. Proceedings 2nd USENIX Conference on File and Storage Technologies (FAST03), Mar31-Apr2, 2003, San Francisco, CA.
    Abstract / Postscript [1.5M] / PDF [325K]

  • A Human Organization Analogy for Self-* Systems. John D. Strunk, Gregory R. Ganger. First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003. Supercedes Carnegie Mellon University SCS Technical Report CMU-CS-03-129.
    Abstract / Postscript [273K] / PDF [68K]

  • A Two-Tiered Software Architecture for Automated Tuning of Disk Layouts. Brandon Salmon, Eno Thereska, Craig A.N. Soules, Gregory R. Ganger. First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003. Supercedes Carnegie Mellon University SCS Technical Report CMU-CS-03-130.
    Abstract / Postscript [718K] / PDF [405K]
  • Efficient Consistency for Erasure-coded Data via Versioning Servers. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-127, April 2003.
    Abstract / Postscript [290K] / PDF [160K]

  • Exposing and Exploiting Internal Parallelism in MEMS-based Storage. Steven W. Schlosser, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-125, March 2003.
    Abstract / Postscript [1.67M] / PDF [136K]

  • Finding and Containing Enemies Within the Walls with Self-securing Network Interfaces. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. Carnegie Mellon University Technical Report CMU-CS-03-109, January 2003.
    Abstract / Postscript [963K] / PDF [118K]

  • Why Can’t I Find My Files? New methods for automating attribute assignment. Craig A.N. Soules, Gregory R. Ganger. Proceedings of the Ninth Workshop on Hot Topics in Operating systems, USENIX Association, May 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-116, February 2003.
    Abstract / PDF [75K]

  • A Case for Staged Database Systems. S. Harizopoulos and A. Ailamaki. In proceedings of the First International Conference on Innovative Data Systems Research (CIDR), Asilomar, CA, January 2003.
    Abstract / Postscript [820K] / PDF [153K]

2002

  • Data Page Layouts for Relational Databases on Deep Memory Hierarchies. A. Ailamaki, D.J. DeWitt, and M.D. Hill. The VLDB Journal 11(3), 2002.
    Abstract / Postscript [977K] / PDF [177K]

  • Verifiable Secret Redistribution for Archive Systems. Theodore M. Wong, Chenxi Wang, Jeannette M. Wing. Proceedings of the First International IEEE Security in Storage Workshop, December 2002.
    Abstract / PDF [215K]

  • Cuckoo: Layered clustering for NFS. Andrew J. Klosterman, Gregory Ganger. Carnegie Mellon University Technical Report CMU-CS-02-183, October 2002.
    Abstract / Postscript [370K] / PDF [86K]

  • Decentralized Storage Consistency via Versioning Servers. Garth Goodson, Jay Wylie, Gregory R. Ganger & Mike Reiter. Carnegie Mellon University Technical Report CMU-CS-02-180, September 2002.
    Abstract / PDF [210K]

  • My Cache or Yours? Making Storage More Exclusive. Theodore M. Wong, John Wilkes. USENIX Annual Technical Conference (USENIX 2002), pp. 161-175, 10-15 June 2002, Monterey, CA. Supercedes CMU SCS Tech. Report CMU-CS-02-186, which supercedes CMU-CS-00-157, originally published in November 2000.
    Abstract / Postscript [759K] / PDF [253K]

  • Robustness Hinting for Improving End-to-End Dependability. Michael W. Bigrigg. Second Workshop on Evaluating and Architecting System Dependability (EASY). In conjunction with ASPLOS-X. Sunday, 6 October 2002, San Jose, California, U.S.A.
    Abstract / PDF [25K]

  • Capturing the Spatio-Temporal Behavior of Real Traffic Data. Mengzhi Wang, Anastassia Ailamaki & Christos Faloutsos. Performance 2002 (IFIP Int. Symp. on Computer Performance Modeling, Measurement and Evaluation), Rome, Italy, Sept. 2002.
    Abstract / PDF [1.9M]

  • Analysis of Methods for Scheduling Low Priority Disk Drive Tasks. Jiri Schindler, Eitan Bachmat. Proceedings of SIGMETRICS 2002 Conference, June 15-19, 2002, Marina Del Rey, California.
    Abstract / Postscript [237K] / PDF [132K]

  • Fractal Prefetching B+trees: Optimizing Both Cache and Disk Performance. Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, and Gary Valentin. SIGMOD 2002, June 2002, Madison, Wisc. Supercedes CMU SCS Technical Report CMU-CS-02-115.
    Abstract / Postscript [2.2M] / PDF [335K]

  • The Set-Check-Use Methodology for Detecting Error Propagation Failures in I/O Routines. Michael W. Bigrigg, Jacob J. Vos. Workshop on Dependability Benchmarking, in conjunction with the International Conference on Dependable Systems and Networks, DSN-2002; June 23rd-26th, 2002, Washington, D.C., USA.
    Abstract / Postscript [111K] / PDF [77K]

  • Intrusion Detection, Diagnosis, and Recovery with Self-Securing Storage. John D. Strunk, Garth R. Goodson, Adam G. Pennington, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-140, May 2002.
    Abstract / Postscript [1.1M] / PDF [119K]

  • Self-Securing Network Interfaces: What, Why and How. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. CMU SCS Technical Report CMU-CS-02-144, May 2002.
    Abstract / Postscript [952K] / PDF [472K]

  • On Correlated Failures in Survivable Storage Systems. Mehmet Bakkaloglu, Jay J. Wylie, Chenxi Wang, Gregory R. Ganger CMU SCS Technical Report CMU-CS-02-129. May 2002
    Abstract / Postscript [1.8M] / PDF [475K]

  • Affinity Scheduling in Staged Server Architectures. Stavros Harizopoulos and Anastassia Ailamaki. Carnegie Mellon University Technical Report CMU-CS-02-113, March 2002.
    Abstract / Postscript [982K] / PDF [186K]

  • Verifiable Secret Redistribution for Threshold Sharing Schemes. Theodore M. Wong, Chenxi Wang, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-02-114, February 2002 (revised).
    Abstract / Postscript [156K] / PDF [210K]

  • Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic. M. Wang, T. Madhyastha, N.H. Chan, S. Papadimitriou, C. Faloutsos. 18th International Conference on Data Engineering, February 26-March 1, 2002 San Jose, California. Supercedes CMU SCS technical report CMU-CS-01-101.
    Abstract / Postscript [2.25M] / PDF [358K]

  • Examining Semantics In Multi-Protocol Network File Systems. Edward P. A. Hogan, Garth A. Gibson, and Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-103, January 2002.
    Abstract / Postscript [981K] / PDF [408K]

  • Timing-accurate Storage Emulation. John Linwood Griffin, Jiri Schindler, Steven W. Schlosser, John S. Bucy, Gregory R. Ganger. Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-146.
    Abstract / Postscript [464K] / PDF [102K]

  • Freeblock Scheduling Outside of Disk Firmware. Christopher R. Lumb, Jiri Schindler, Gregory R. Ganger. Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-149.
    Abstract / Postscript [643K] / PDF [150K]

  • Track-aligned Extents: Matching Access Patterns to Disk Drive Characteristics. Jiri Schindler, John Linwood Griffin, Christopher R. Lumb, Gregory R. Ganger. Conference on File and Storage Technologies (FAST) January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-119.
    Abstract / Postscript [682K] / PDF [159K]

2001

  • Using Speculative Execution to Automatically Hide I/O Latency. Fay W. Chang. Carnegie Mellon Ph.D Dissertation CMU-CS-01-172, December 2001.
    Abstract / Postscript [4.9M] / PDF [945K]

  • Blurring the Line Between Oses and Storage Devices. Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-166, December 2001.
    Abstract / Postscript [2.3M] / PDF [974K]

  • MEMS Memory Elements. L. Richard Carley, Rany Tawfik El-Sayed, David F. Guillou, Fernando Alfaro, Gary K. Fedder, Stephen Schlosser, David Nagle, Gregory R. Ganger, James Bain. Non-Volatile Memory Technology Symposium, November 2001, San Diego, CA.
    Abstract / PDF [908K]

  • Testing the Portability of Desktop Applications to a Networked Embedded System. Michael W. Bigrigg, and Joseph G. Slember. Workshop on Reliability in Embedded Systems at the 20th IEEE Symposium on Reliable Distributed Systems, New Orleans, October 2001.
    Abstract / Postscript [198K] / PDF [40K]

  • Verifiable Secret Redistribution Theodore M. Wong, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-01-155, October 2001.
    Abstract / Postscript [198K] / PDF [168K]

  • Active Disks for Large-Scale Data Processing. Riedel, E., Faloutsos, C., Gibson, G.A. and Nagle, D.F. IEEE Computer, June 2001.
    Abstract / PDF [722K]

  • Survivable Storage Systems. Gregory R. Ganger, Pradeep K. Khosla, Mehmet Bakkaloglu, Michael W. Bigrigg, Garth R. Goodson, Semih Oguz, Vijay Pandurangan, Craig A. N. Soules, John D. Strunk, Jay J. Wylie. DARPA Information Survivability Conference and Exposition (Anaheim, CA, 12-14 June 2001), pages 184-195 vol 2. IEEE, 2001.
    Abstract / PDF [128K]

  • Selecting the Right Data Distribution Scheme for a Survivable Storage System. Jay J. Wylie, Mehmet Bakkaloglu, Vijay Pandurangan, Michael W. Bigrigg, Semih Oguz, Ken Tew, Cory Williams, Gregory R. Ganger, Pradeep K. Khosla. CMU SCS Technical Report CMU-CS-01-120, May 2001.
    Abstract / Postscript [10.9M] / PDF [1.4M]

  • Better Security via Smarter Devices. Gregory R. Ganger and David F. Nagle. Appears in HotOS-VIII (IEEE Workshop on Hot Topics in Operating Systems), May 2001.
    Abstract / Postscript [1.1M] / PDF [245K]

  • Position Summary: Authentication Confidences. Gregory R. Ganger. Appears in HotOS-VIII (IEEE Workshop on Hot Topics in Operating Systems), May 2001.
    Abstract / Postscript [66K] / PDF [16K]

  • Authentication Confidences Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-123, May 2001.
    Abstract / Postscript [335K] / PDF [42K]

  • Building Firewalls with Intelligent Network Interface Cards. David Friedman and David Nagle. CMU SCS Technical Report CMU-CS-00-173, May 2001.
    Abstract / Postscript [540K] / PDF [229K]

  • Improving Index Performance through Prefetching. Shimin Chen, Phillip B. Gibbons, and Todd C. Mowry. In proceedings of SIGMOD 2001, Santa Barbara, CA, May 2001. Supercedes a technical report CMU-CS-00-177.
    Abstract / Postscript [1.8M] / PDF [323K]

  • Compiler-Based I/O Prefetching for Out-of-Core Applications. Angela Demke Brown, Todd C. Mowry and Orran Krieger. In ACM Transactions on Computer Systems,19(2):111-170, May 2001.
    Abstract / Postscript [1.2M] / PDF [487K]

  • PASTENSE: a Fast Start-up Algorithm for Scalable Video Libraries. Stavros Harizopoulos and Garth A. Gibson. CMU SCS Technical Report CMU-CS-01-105, March 2001.
    Abstract / Postscript [784K] / PDF [184K]

2000

  • Scalable and Manageable Storage Systems. Khalil S. Amiri. Carnegie Mellon Ph.D. Dissertation, CMU-CS-00-178, December, 2000.
    Abstract / Postscript [M] / PDF [1.7M]

  • Enabling Dynamic Security Management of via Device-Embedded Security. Gregory R. Ganger and David F. Nagle. CMU SCS Technical Report CMU-CS-00-174, December 2000.
    Abstract / PDF [607K]

  • Network Attached Storage Architecture. Garth A. Gibson and Rodney Van Meter. Communications of the ACM, November 2000, Vol.43, No.11
    Abstract / PDF [219K]

  • MEMS-Based Integrated-Circuit Mass-Storage Systems. L. Richard Carley, Gregory R. Ganger and David F. Nagle. COMMUNICATIONS OF THE ACM November 2000, Vol.43, No.11.
    Abstract / PDF [564K]

  • Designing Computer Systems with MEMS-based Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. Proceedings 9th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-IX), Cambridge, MA, November 13-15, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-137, May 2000.
    Abstract / Postscript [2.7M] / PDF [485K]

  • Self-Securing Storage: Protecting Data in Compromised Systems. Strunk, J.D., Goodson, G.R., Scheinholtz, M.L., Soules, C.A.N. and Ganger, G.R. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, October, 2000.
    Abstract / Postscript [345K] / PDF [294K]

  • Journaling versus Soft Updates: Asynchronous Meta-data Protection in File Systems. Margo I. Seltzer, Gregory R. Ganger, M. Kirk McKusick, Keith A. Smith, Craig A. N. Soules, Christopher A. Stein. Proceedings of the USENIX Technical Conference, June, 2000.
    Abstract / PDF [120K]

  • Operating System Management of MEMS-based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. Proceedings 4th Symposium on Operating Systems Design & Implementation (OSDI 2000), San Diego, CA, October 23-25, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-136, May 2000.
    Abstract / Postscript [2.2M] / PDF [336K]

  • Information Technology - SCSI Object Based Storage Device Commands (OSD). Gene Milligan, T10 Technical Editor. T10 - Working draft NCITS TBD-200X Project 1355D, Revision 3. October 1, 2000.
    Abstract / PDF [367K]

  • Towards Higher Disk Head Utilization: Extracting "Free" Bandwidth From Busy Disk Drives. Lumb, C., Schindler, J., Ganger, G.R., Nagle, D.F. and Riedel, E. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-130, May 2000.
    Abstract / Postscript [2.3M] / PDF [422K]

  • Taming the Memory Hogs: Using Compiler Inserted Releases to Manage Physical Memory Intelligently. Angela Demke Brown and Todd C. Mowry. Proceedings of the Fourth Symposium on Operating Systems Design and Implementation (OSDI 2000), pages 31-44, October 2000.
    Abstract / Postscript [670K] / PDF [129K]

  • Easing the Management of Data-parallel Systems via Adaptation Petrou, D., Amiri, K., Ganger, G.R. and Gibson, G.A. Appears in the Proceedings of the 9th ACM SIGOPS European Workshop, Kolding, Denmark, September 17-20, 2000.
    Abstract / Postscript [622K] / PDF [122K]

  • Survivable Information Storage Systems. Jay Wylie, Michael Bigrigg, John Strunk, Gregory Ganger, Han Kiliccote and Pradeep Khosla. IEEE Computer, August 2000.
    Abstract / PDF [192K]

  • Dynamic Function Placement for Data-intensive Cluster Computing. Amiri, K., Petrou, D., Ganger, G.R. and Gibson, G.A. Proceedings of the USENIX Annual Technical Conference, San Diego, CA, June 2000.
    Abstract / Postscript [386K] / PDF [182k]

  • Highly Concurrent Shared Storage. Amiri, K., Gibson, G.A. and Golding, R. Proceedings of the International Conference on Distributed Computing Systems, Taipei, April 2000.
    Abstract / Postscript [908K] / PDF [104K]

  • Modeling and Performance of MEMS-Based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. Proceedings of ACM SIGMETRICS 2000, Santa Clara, California, June 17-21, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-100.
    Abstract / Postscript [3.4M] / PDF [311K]

  • Data Mining on an OLTP System (Nearly) for Free. Riedel, E., Faloutsos, C., Ganger, G.R. and Nagle, D.F. Proc. of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, May 14-19, 2000. Supercedes CMU SCS Technical Report CMU-CS-99-151.
    Abstract / Postscript [1.0M] / PDF [171K]

  • Active Disk Architecture for Databases. Riedel, E., Faloutsos, C., Nagle, D.F. Carnegie Mellon Technical Report CMU-CS-00-145, May 2000.
    Abstract / Postscript [948K] / PDF [201K]

  • Secure Continuous Biometric-Enhanced Authentication Andrew J. Klosterman and Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-134, May 2000.
    Abstract / Postscript [1.1M] / PDF [245K]

  • Design and Implementation of a Self-Securing Storage Device. John D. Strunk, Garth R. Goodson, Michael L. Scheinholtz, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-129, May 2000.
    Abstract / Postscript [401K] / PDF [330K]

  • Fast and Flexible Application-level Networking on Exokernel Systems. Gregory R. Ganger, Dawson R. Engler, M.Frans Kaashoek, Héctor M. Briceño, Russell Hunt, Thomas Pinckney. CMU SCS Technical Report CMU-CS-00-117, March 2000.
    Abstract / PDF [363K]

1999

  • Filling the Memory Access Gap: A Case for On-Chip Magnetic Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-174, December 1999.
    Abstract / Postscript [3.1M] / PDF [398K]

  • Automated Disk Drive Characterization. Schindler, J. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-176, December 1999.
    Abstract / Postscript [341K] / PDF [282K]

  • Active Disks - Remote Execution for Network-Attached Storage. Erik Riedel. Carnegie Mellon Ph.D. Dissertation CMU-CS-99-177, November 1999.
    Abstract / Postscript.Z [5.7M] / PDF [2.7M]

  • Network Support for Network-Attached Storage. Nagle, D.F., Ganger, G.R., Butler, J., Goodson, G. and Sabol, C. Hot Interconnects 1999, August 18 - 20, 1999, Stanford University, Stanford, California.
    Abstract / Postscript [861K] / PDF [205K]

  • Security for a High Performance Commodity Storage Subsystem. Howard Gobioff. Carnegie Mellon Ph.D. Dissertation, CMU-CS-99-160, July 1999.
    Abstract / Postscript [8.4M] / PDF [1.8M]

  • Dynamic Function Placement in Active Storage Clusters. Amiri, K., Petrou, D., Ganger, G.R. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-140, June 1999.
    Abstract / Postscript [730K] / PDF [220K]

  • NASD Scalable Storage Systems. Gibson, G.A., Nagle, D.F., Courtright II, W., Lanza, N., Mazaitis, P., Unangst, M. and Zelenka, J. USENIX99, Extreme Linux Workshop, Monterey, CA, June 1999.
    Abstract / Postscript [1.6M] / PDF [270K]

  • Embedded Security for Network-Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-154, June 1999.
    Abstract / Postscript [5.1M] / PDF [761K]

  • Implementing Lottery Scheduling: Matching the Specializations in Traditional Schedulers. Petrou, D., Milford, J.W. and Gibson, G.A. Proceedings, USENIX 1999, Monterey, CA, June 9-11, 1999.
    Abstract / Postscript [532K] / PDF [235K]

  • Scalable Concurrency Control and Recovery for Shared Storage Arrays. Amiri, K., Gibson, G.A. and Golding, R. CMU SCS Technical Report CMU-CS-99-111, February 1999.
    Abstract / Postscript [480K] / PDF [108K]

  • Automatic I/O Hint Generation through Speculative Execution. Chang, F.W. and Gibson, G.A. Proceedings of the 3rd Symposium on Operating Systems Design and Implementation, February 1999.
    Abstract / Postscript [197K] / PDF [121K]

1998

  • A Cost-Effective, High-Bandwidth Storage Architecture. Gibson, G.A., Nagle, D.F., Amiri, K., Butler, J., Chang, F.W., Gobioff, H., Hardin, C., Riedel, E., Rochberg, D. and Zelenka, J. Proceedings of the 8th Conference on Architectural Support for Programming Languages and Operating Systems.
    Abstract / Postscript [5.3M] / PDF [501K]

  • Integrity and Performance in Network Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-98-182, December 1998.
    Abstract / Postscript [534K] / PDF [142K]

  • Active Storage For Large-Scale Data Mining and Multimedia. Riedel, E., Gibson, G.A and Faloutsos, C. Proc. of the 24th International Conference on Very large Databases (VLDB '98), New York, New York, August 24-27, 1998.
    Abstract / Postscript [3.5M] / PDF [231K]

  • A Performance Study of Sequential I/O on Windows NT. Riedel, E., van Ingen, C. and Gray, J. Proceedings of the Second Usenix Windows/NT Symposium, Seattle, WA. August 1998.
    Abstract / Postscript [720K] / PDF [114K]

  • Selected Reports: Fall 1997 Software Systems Course. CMU SCS Technical Report CMU-CS-98-103, April 4, 1998.
    Abstract / Postscript [5.5M] / PDF [3.3M]

1997

  • Active Disks: Remote Execution for Network-Attached Storage. Riedel, E. and Gibson, G.A. CMU SCS Technical Report CMU-CS-97-198, December 1997.
    Abstract / Postscript [2.1M] / PDF [123K]

  • Informed Prefetching and Caching. Hugo Patterson. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-204: December 1997.
    Abstract / Postscript [4.8M] / PDF [2.3M]

  • Prefetching Over a Network: Early Experience with CTIP. Rochberg, D. and Gibson, G.A. SIGMETRICS Performance Evaluation Review, volume 25, number 3, (pp. 29-36), December, 1997.
    Abstract / Postscript [314K] / PDF [75K]

  • Security for Network Attached Storage Devices. Gobioff, H., Gibson, G.A. and Tygar, D. CMU SCS Technical Report CMU-CS-97-185, October, 1997.
    Abstract / Postscript [143K] / PDF [135K]

  • Practical and Theoretical Issues in Prefetching and Caching. Andrew Tomkins. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-181: October 1997.
    Abstract / GZ Postscript [827K] / PDF [918K]
  • Filesystems for Network-Attached Secure Disks. Gibson, G., Nagle, D.F., Amiri, K., Chang, F.W., Gobioff, H., Riedel, E., Rochberg, D. and Zelenka, J. CMU SCS Technical Report CMU-CS-97-118, July, 1997.
    Abstract / Postscript [220K] / PDF [94K]

  • RAIDframe: A Rapid Prototyping Tool for RAID Systems. William V. Courtright II, Garth A. Gibson, Mark Holland, LeAnn Neal Reilly, Jim Zelenka. Carnegie Mellon University Technical Report CMU-CS-97-142. June 4, 1997.
    Abstract / Postscript [980K] / PDF [620K]

  • File Server Scaling with Network-Attached Secure Disks. Gibson, G.A., Nagle, D.F., Amiri, K., Chang, F.W., Feinberg, E.M., Gobioff, H., Lee, C., Ozceri, B., Riedel, E., Rochberg, D. and Zelenka, J. Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems (Sigmetrics '97), Seattle, Washington, June 15-18, 1997.
    Abstract / Postscript [573K] / PDF [258K]

  • Informed Multi-Process Prefetching and Caching. Tomkins, A., Patterson, R.H. and Gibson, G.A Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems, (Sigmetrics '97), Seattle, Washington, June 15-18, 1997.
    Abstract / Postscript [1.2M] / PDF [319K]

  • A Transactional Approach to Redundant Disk Array Implementation. William Courtright, II. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-141, April 1997.
    Abstract / Postscript [1.7M] / PDF [1.1M]

1996

  • A Trace-Driven Comparison of Algorithms for Parallel Prefetching and Caching. Kimbrel, T., Tomkins, A., Patterson, R.H., Bershad, B., Cao, P., Felten, E.W., Gibson, G.A., Karlin, A.R. and Li, K. Proc. of the 2nd USENIX Symposium on Operating Systems Design and Implementation (OSDI), Seattle, WA, October 28-31, 1996, pp. 19-34. Supercedes CMU SCS Technical Report CMU-CS-96-174.
    Abstract / Postscript [670K] / PDF [344K]

  • Automatic Compiler-Inserted I/O Prefetching for Out-of-Core Applications. Todd C. Mowry, Angela K. Demke and Orran Krieger. Proceedings of the Second Symposium on Operating Systems Design and Implementation (OSDI '96), pages 3-17, October 1996.
    Abstract / Postscript [1.15M] / PDF [120K]

  • Understanding Customer Dissatisfaction with Underutilized Distributed File Servers. Riedel, E. and Gibson, G.A. Proc. of the 5th NASA Conference on Mass Storage Systems and Technologies, College Park, MD. September 1996. Supercedes CMU SCS Technical Report CMU-CS-96-158.
    Abstract / Postscript [898K] / PDF [258K]

  • A Case for Network-Attached Secure Disks. Gibson, G.A., Nagle, D.F., Amiri, K., Chang, F.W., Feinberg, E.M., Gobioff, H., Lee, C., Ozceri, B., Riedel, E. and Rochberg, D. CMU SCS Technical Report CMU-CS-96-142, September 1996.
    Abstract / Postcript [380K] / PDF [151K]

  • A Structured Approach to Redundant Disk Array Implementation. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. Proc. of the International Computer Performance and Dependability Symposium (IPDS), Sept. 4-6, 1996. Supercedes CMU SCS Technical Report CMU-CS-96-137.
    Abstract / Postscript [212K] / PDF [85K]

  • RAIDframe: Rapid Prototyping for Disk Arrays. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. Proc. of the 1996 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), May 1996, Vol. 24 No. 1, pp. 268-269. Supercedes CMU SCS Technical Report CMU-CS-95-200.
    Abstract / Postscript [74K] / PDF [25K]

1995

  • Informed Prefetching and Caching. Patterson, R.H. Gibson, G.A., Ginting, E., Stodolsky, D. and Zelenka, J. Proc. of the 15th Symposium of Operating Systems Principles, Copper Mountain Resort, CO, December 3-6, 1995, pp. 79-95.
    Abstract / Postscript [440K] / PDF [232K]

  • RAID for Mobile Computers. Rachad Youssef. Carnegie Mellon M.S. Thesis INI-TR 1995-3, 1995, 35 pages. August, 1995.
    Abstract / Postscript [245K] / PDF [106K]

  • The Scotch Parallel Storage Systems. Gibson, G.A., Stodolsky, D., Chang, F.W., Courtright II, W.V., Demetriou, C.G., Ginting, E., Holland, M., Ma, Q., Neal, L., Patterson, R.H., Su, J., Youssef, R. and Zelenka, J. Proc. of the IEEE CompCon Conference, March 5-8, 1995. Supercedes CMU SCS Technical Report CMU-CS-95-107.
    Abstract / Postscript [112K] / PDF [64K]

1994

  • Backward Error Recovery in Redundant Disk Arrays. Courtright II, W.V. and Gibson, G.A. Proc. of the 1994 Computer Measurement Group (CMG) Conference, Dec. 1994, Vol. 1, pp. 63-74. Supercedes CMU SCS Technical Report CMU-CS-94-193.
    Abstract / HTML / Postscript [134K] / PDF [69K]

  • Exposing I/O Concurrency with Informed Prefetching. Patterson, R.H. and Gibson, G.A Proc. of Third Int'l. Conf. on Parallel and Distributed Information Systems, Austin, TX, September 28-30, 1994, pp.7-16.
    Abstract / HTML / Postscript [103K] / PDF [51K]

  • Parity-Logging Disk Arrays. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. ACM Transactions on Computer Systems, 12(3):206-235, August 1994. Supercedes A Redundant Disk Array Architecture for Efficient Small Writes. CMU SCS Technical Report CMU-CS-94-170, 1994.
    Abstract / Postscript [313K] / PDF [164K]

  • Architectures and Algorithms for On-Line Failure Recovery in Redundant Disk Arrays. Holland, M., Gibson, G.A. and Siewiorek, D.P. Journal of Distributed and Parallel Databases, Vol. 2, No. 3, pp. 295-335, July 1994.
    Abstract / Postscript [374K] / PDF [206K]

  • On-Line Data Reconstruction in Redundant Disk Arrays. Mark Holland. Carnegie Mellon Ph.D. Dissertation CMU-CS-94-164, April 1994, 232 pages.
    Abstract / Table of Contents / Postscript [1.3M] / PDF [769K]

1993

  • A Redundant Disk Array Architecture for Efficient Small Writes. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. CMU SCS Technical Report CMU-CS-93-200, October, 1993.
    Abstract / Postscript [356K] / PDF [196K]

  • Parity Logging: Overcoming the Small Write Problem in Redundant Disk Arrays. Stodolsky, D., Holland, M. and Gibson, G.A. Proc. of the 21th Annual International Symposium on Computer Architectures, May, 1993, pp. 64-75.
    Abstract / Postscript [190K] / PDF [89K]

  • A Status Report on Research in Transparent Informed Prefetching. Patterson, R.H., Gibson, G.A and Satyanarayanan, M. ACM Operating Systems Review, V 27(2), April, 1993, pp.21-34. Supercedes CMU SCS Technical Report CMU-CS-93-113.
    Abstract / Postscript [94K] / PDF [50K]

  • Fast, On-Line Failure Recovery in Redundant Disk Arrays. Holland, M., Gibson, G.A. and Siewiorek, D.P. Proc. of the 23rd Annual International Symposium on Fault-Tolerant Computing, pp. 421-433, 1993.
    Abstract / Postscript [137K] / PDF [68K]

1992

  • Using Transparent Informed Prefetching to Reduce File Read Latency. Patterson, R.H., Gibson, G.A and Satyanarayanan, M. Annotated slides for the conference talk appearing in the proceedings of the Goddard Conference on Mass Storage Systems and Technologies, September, 1992.
    Postscript [71K] / PDF [40K]

  • Disk Reads with DRAM Latency. Gibson, G.A, Patterson, R.H. and Satyanarayanan, M. Third Workshop on Workstation Operating Systems, April, 1992, pp. 126-131.
    Abstract / Postscript [43K] / PDF [18K]

  • Parity Declustering for Continuous Operation in Redundant Disk Arrays. Holland, M. and Gibson, G.A. Proc. of the 5th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS V), 1992.
    Abstract / Postscript [239K] / PDF [101K]

  • This directory contains the block design reffered to in Parity Declustering for Continuous Operation in Redundant Disk Arrays, Proceedings, ASPLOS V, 1992.
    Database [2.7M]