PARALLEL DATA LAB 

Past Project Publications

Abacus

  • Diamond: A Storage Architecture for Early Discard in Interactive Search. Larry Huston, Rahul Sukthankar, Rajiv Wickremesinghe, M. Satyanarayanan, Gregory R. Ganger, Erik Riedel, Anastassia Ailamaki. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [2.2M] / PDF [266K]

  • Easing the Management of Data-parallel Systems via Adaptation Petrou, D., Amiri, K., Ganger, G.R. and Gibson, G.A. Appears in the Proceedings of the 9th ACM SIGOPS European Workshop, Kolding, Denmark, September 17-20, 2000.
    Abstract / Postscript [622K] / PDF [122k]

  • Dynamic Function Placement for Data-intensive Cluster Computing. Amiri, K., Petrou, D., Ganger, G.R. and Gibson, G.A. Proceedings of the USENIX Annual Technical Conference, San Diego, CA, June 2000.
    Abstract / Postscript [386K] / PDF [182k]

  • Dynamic Function Placement in Active Storage Clusters. Amiri, K., Petrou, D., Ganger, G.R. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-140, June 1999.
    Abstract / Postscript [730K] / PDF [220K]

AstroDISC

  • Indexing and Fast Near-Matching of Billions of Astronomical Objects. Bin Fu, Eugene Fink, Garth Gibson and Jaime Carbonell. In Proceedings of the Fourth Workshop on Interfaces and Architecture for Scientific Data Storage, 2012 (IASDS12). September 24, 2012, Beijing, China.
    Abstract / PDF [303K]

  • Exact and Approximate Computation of a Histogram of Pairwise Distances between Astronomical Objects. Bin Fu, Eugene Fink, Garth Gibson and Jaime Carbonell. First Workshop on High Performance Computing in Astronomy (AstroHPC 2012), held in conjunction with the 21st International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC 2012), June 18 or 19, 2012, Delft, the Netherlands.
    Abstract / PDF [309K]

  • Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, and Garth Gibson. Proceedings of the Twenty-Third Scientific and Statistical Database Management Conference (SSDBM 2011), 20-22 July 2011.
    Abstract / PDF [5.5M]

  • Recipes for Baking Black Forest Databases: Building and Querying Black Hole Merger Trees from Cosmological Simulations. Julio Lopez, Colin Degraf, Tiziana DiMatteo, Bin Fu, Eugene Fink, Garth Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-11-104. April 2011.
    Abstract / PDF [6.5M]

  • Speeding Up Finite Element Wave Propagation for Large-Scale Earthquake Simulations. Ricardo Taborda, Julio López, Haydar Karaoglu, John Urbanic, Jacobo Bielak. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-109, October 2010.
    Abstract / PDF [4.4M]

  • DiscFinder: A Data-intensive Scalable Cluster Finder for Astrophysics. Bin Fu, Kai Ren, Julio López, Eugene Fink, and Garth Gibson. In Proceedings of the ACM International Symposium on High Performance Distributed Computing (HPDC), Chicago, IL. June, 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-104..
    Abstract / PDF [372K]

Attribute-Based Learning Environments (ABLE)

  • File Classification in Self-* Storage Systems. Michael Mesnier, Eno Thereska, Daniel Ellard, Gregory R. Ganger, Margo Seltzer. Proceedings of the First International Conference on Autonomic Computing (ICAC-04). New York, NY. May 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-101, January 2004.
    Abstract / Postscript [1.6M] / PDF [80K]

  • Attribute-Based Prediction of File Properties. Daniel Ellard, Michael Mesnier, Eno Thereska, Gregory R. Ganger, Margo Seltzer. Harvard Computer Science Group Technical Report TR-14-03, December 2003.
    Abstract / Postscript [850K] / PDF [127K]

Active Disks

  • Diamond: A Storage Architecture for Early Discard in Interactive Search. Larry Huston, Rahul Sukthankar, Rajiv Wickremesinghe, M. Satyanarayanan, Gregory R. Ganger, Erik Riedel, Anastassia Ailamaki. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [2.2M] / PDF [266K]

  • Active Disks for Large-Scale Data Processing. Riedel, E., Faloutsos, C., Gibson, G.A. and Nagle, D.F. IEEE Computer, June 2001.
    Abstract / PDF [722K]

  • Data Mining on an OLTP System (Nearly) for Free. Riedel, E., Faloutsos, C., Ganger, G.R. and Nagle, D.F. Proc. of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, May 14-19, 2000. Supercedes CMU SCS Technical Report CMU-CS-99-151.
    Abstract / Postscript [1.0M] / PDF [171K]

  • Active Disk Architecture for Databases. Riedel, E., Faloutsos, C., Nagle, D.F. Carnegie Mellon Technical Report CMU-CS-00-145, May 2000.
    Abstract / Postscript [948K] / PDF [201K]

  • Active Disks - Remote Execution for Network-Attached Storage. Erik Riedel. Carnegie Mellon Ph.D. Dissertation CMU-CS-99-177, November 1999.
    Abstract / Postscript.Z [5.7M] / PDF [2.7M]

  • Active Storage For Large-Scale Data Mining and Multimedia. Riedel, E., Gibson, G.A and Faloutsos, C. Proc. of the 24th International Conference on Very large Databases (VLDB '98), New York, New York, August 24-27, 1998.
    Abstract / Postscript [3.5M] / PDF [231K]

  • Active Disks: Remote Execution for Network-Attached Storage. Riedel, E. and Gibson, G.A. CMU SCS Technical Report CMU-CS-97-198, December 1997.
    Abstract / Postscript [2.1M] / PDF [123K]

Attribute-Based Naming

  • Using Provenance to Aid in Personal File Search. Sam Shah, Craig A. N. Soules, Gregory R. Ganger, Brian D. Noble. USENIX '07 Annual Technical Conference, Santa Clara, CA, June 17–22, 2007.
    Abstract / PDF [225K]

  • Using Context to Assist in Personal File Retrieval. Craig A. N. Soules. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-06-147, August 25, 2006.
    Abstract / PDF [ 681K]

  • Connections: Using Context to Enhance File Search. Craig A. N. Soules, Gregory R. Ganger. SOSP'05, October 23–26, 2005, Brighton, United Kingdom. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-105, April 2005.
    Abstract / PDF [300K]

  • Toward Automatic Context-based Attribute Assignment for Semantic File Systems. Craig A. N. Soules, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-105. June 2004.
    Abstract / PDF [290K]

  • Why Can’t I Find My Files? New methods for automating attribute assignment. Craig A.N. Soules, Greg Ganger. Proceedings of the Ninth Workshop on Hot Topics in Operating systems, USENIX Association, May 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-116, February 2003.
    Abstract / PDF [75K]

Continuous Reorganization

  • Challenges in Building a Two-Tiered Learning Architecture for Disk Layout. Brandon Salmon, Eno Thereska, Craig A.N. Soules, John D. Strunk, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-109. August, 2004.
    Abstract / Postscript [6.8M] / PDF [150K]

  • A Two-Tiered Software Architecture for Automated Tuning of Disk Layouts. Brandon Salmon, Eno Thereska, Craig A.N. Soules, Gregory R. Ganger. First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003. Supercedes Carnegie Mellon University SCS Technical Report CMU-CS-03-130.
    Abstract / Postscript [718K] / PDF [405K]

Database I/O

  • BEMC: A Searchable, Compressed Representation for Large Seismic Wavefields. Julio López, Leonardo Ramírez-Guzmán, Jacobo Bielak, David O’Hallaron. 22nd Int. Conf on Scientific and Statistical Database Management (SSDBM'10), Heidelberg, Germany, June 30 - July 2, 2010.
    Abstract / PDF [311K]

  • To Share Or Not To Share? Ryan Johnson, Stavros Harizopoulos, Nikos Hardavellas, Kivanc Sabirli, Ippokratis Pandis, Anastasia Ailamaki, Naju G. Mancheril, Babak Falsafi. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
    Abstract / PDF [366K]

  • Efficient Use of the Query Optimizer for Automated Physical Design. Stratos Papadomanolakis, Debabrata Dash, Anastasia Ailamaki. Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07), Vienna, Austria, September 2007.
    Abstract / PDF [2.4M]

  • An Analysis of Database System Performance on Chip Multiprocessors. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Stavros Harizopoulos, Anastasia Ailamaki and Babak Falsafi. Proceedings of the 6th Hellenic Data Management Symposium (HDMS2007), Athens, Greece, July 2007.
    Abstract / PDF [308K]

  • Scheduling Threads for Constructive Cache Sharing on CMPs. Shimin Chen, Phillip B. Gibbons, Michael Kozuch, Vasileios Liaskovitis, Anastassia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Todd C. Mowry, Chris Wilkerson. SPAA'07, June 9-11, 2007, San Diego, California, USA.
    Abstract / PDF [293K]

  • MultiMap: Preserving Disk Locality for Multidimensional Datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. IEEE 23rd International Conference on Data Engineering (ICDE 2007) Istanbul, Turkey, April 2007. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005.
    Abstract / PDF [203K]

  • Database Servers on Chip Multiprocessors: Limitations and Opportunities. Nikos Hardavellas, Ippokratis Pandis, Ryan Johnson, Naju G. Mancheril, Anastassia Ailamaki and Babak Falsafi. 3rd Biennial Conference on Innovative Data Systems Research (CIDR), January 7-10, 2007, Asilomar, California, USA.
    Abstract / PDF [111K]

  • Brief Announcement: Parallel Depth First vs. Work Stealing Schedulers on CMP Architectures. Vasileios Liaskovitis, Shimin Chen, Phillip B. Gibbons, Anastassia Ailamaki, Guy E. Blelloch, Babak Falsafi, Limor Fix, Nikos Hardavellas, Michael Kozuch, Todd C. Mowry, Chris Wilkerson. SPAA’06, July 30-August 2, 2006, Cambridge, Massachusetts, USA.
    Abstract / PDF [52K]

  • On Multidimensional Data and Modern Disks. Steven W. Schlosser, Jiri Schindler, Stratos Papadomanolakis , Minglong Shao Anastassia Ailamaki, Christos Faloutsos, Gregory R. Ganger. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005.
    Abstract / PDF [220K]

  • Continuous Resource Monitoring for Self-predicting DBMS. Dushyanth Narayanan, Eno Thereska, Anastassia Ailamaki. Proceedings of the 13th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2005), Atlanta, GA, September 25-27, 2005.
    Abstract / PDF [333K]

  • QPipe: A Simultaneously Pipelined Relational Query Engine. Stavros Harizopoulos, Vladislav Shkapenyuk, Anastassia Ailamaki. SIGMOD 2005, June 14-16, 2005, Baltimore, Maryland, USA.
    Abstract / PDF [288K]

  • MultiMap: Preserving disk locality for multidimensional datasets. Minglong Shao, Steven W. Schlosser, Stratos Papadomanolakis, Jiri Schindler, Anastassia Ailamaki, Christos Faloutsos, and Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-102. March 2005.
    Abstract / PDF [318K]

  • A Computational Database System for Generating Unstructured Hexahedral Meshes with Billions of Elements. Tiankai Tu, David R. O’Hallaron. SC2004, November 6-12, 2004, Pittsburgh, PA USA.
    Abstract / PDF [ 171K]

  • STEPS Towards Cache-Resident Transaction Processing. Stavros Harizopoulos, Anastassia Ailamaki. Proceedings of the 30th VLDB Conference, Toronto, Canada, 29 August - 3 September 2004.
    Abstract / PDF [332K]

  • Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • AutoPart: Automating Schema Design for Large Scientific Databases Using Data Partitioning. Stratos Papadomanolakis, Anastassia Ailamaki. 16th International Conference on Scientific and Statistical Database Management (SSDBM). Santorini Island, Greece. June 21-23, 2004.
    Abstract / Postscript [1M] / PDF [150K]

  • Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003.
    Abstract / PDF [281K]

  • Improving Hash Join Performance through Prefetching. Shimin Chen, Anastassia Ailamaki, Philip B. Gibbons, Todd C. Mowry. Proceedings of the 20th International Conference on Data Engineering (ICDE 2004). Boston, MA. March 30 to April 2, 2004.
    Abstract / Postscript [ 2.2M] / PDF [330K]

  • Diamond: A Storage Architecture for Early Discard in Interactive Search. Larry Huston, Rahul Sukthankar, Rajiv Wickremesinghe, M. Satyanarayanan, Gregory R. Ganger, Erik Riedel, Anastassia Ailamaki. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004.
    Abstract / Postscript [2.2M] / PDF [266K]

  • Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. VLDB 03, Berlin, Germany, Sept 9-12, 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003.
    Abstract / Postscript [510K] / PDF [152K]

  • Adaptive, Hands-Off Stream Mining. Spiros Papadimitriou, Anthony Brockwell, Christos Faloutsos. Carnegie Mellon University SCS Technical Report CMU-CS-02-205. Also published in Proceedings VLDB 03, Berlin, Germany, Sept 9-12, 2003.
    Abstract / Postscript [3.6M] / PDF [767K]

  • A Case for Staged Database Systems. S. Harizopoulos and A. Ailamaki. In proceedings of the First International Conference on Innovative Data Systems Research (CIDR), Asilomar, CA, January 2003.
    Abstract / Postscript [820K] / PDF [153K]

  • Data Page Layouts for Relational Databases on Deep Memory Hierarchies. A. Ailamaki, D.J. DeWitt, and M.D. Hill. The VLDB Journal 11(3), 2002.
    Abstract / Postscript [977K] / PDF [177K]

  • Fractal Prefetching B+trees: Optimizing Both Cache and Disk Performance. Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, and Gary Valentin. SIGMOD 2002, June 2002, Madison, Wisc. Supercedes CMU SCS Technical Report CMU-CS-02-115.
    Abstract / Postscript [2.2M] / PDF [335K]

  • Affinity Scheduling in Staged Server Architectures. Stavros Harizopoulos and Anastassia Ailamaki. Carnegie Mellon University Technical Report CMU-CS-02-113, March 2002.
    Abstract / Postscript [982K] / PDF [186K]

  • Improving Index Performance through Prefetching. Shimin Chen, Phillip B. Gibbons, and Todd C. Mowry. In proceedings of SIGMOD 2001, Santa Barbara, CA, May 2001. Supercedes CMU SCS technical report CMU-CS-00-177.
    Abstract / Postscript [1.8M] / PDF [323K]

  • Active Disks for Large-Scale Data Processing. Riedel, E., Faloutsos, C., Gibson, G.A. and Nagle, D.F. IEEE Computer, June 2001.
    Abstract / PDF [722K]

  • Data Mining on an OLTP System (Nearly) for Free. Riedel, E., Faloutsos, C., Ganger, G.R. and Nagle, D.F. Proc. of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, May 14-19, 2000. Supercedes CMU SCS Technical Report CMU-CS-99-151.
    Abstract / Postscript [1.0M] / PDF [171K]

  • Active Storage For Large-Scale Data Mining and Multimedia. Riedel, E., Gibson, G.A and Faloutsos, C. Proc. of the 24th International Conference on Very large Databases (VLDB '98), New York, New York, August 24-27, 1998.
    Abstract / Postscript [3.5M] / PDF [231K]

DiskSim

  • The DiskSim Simulation Environment Version 4.0 Reference Manual. John S. Bucy, Jiri Schindler, Steven W. Schlosser, Gregory R. Ganger, and Contributors. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-101, May 2008.
    Abstract / PDF [704K] / Code Release

DIXtrac

  • Analysis of Methods for Scheduling Low Priority Disk Drive Tasks. Jiri Schindler, Eitan Bachmat. Proceedings of SIGMETRICS 2002 Conference, June 15-19, 2002, Marina Del Rey, California.
    Abstract / Postscript [237K] / PDF [132K]

  • Track-aligned Extents: Matching Access Patterns to Disk Drive Characteristics. Jiri Schindler, John Linwood Griffin, Christopher R. Lumb, Gregory R. Ganger. Conference on File and Storage Technologies (FAST), January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-119.
    Abstract / Postscript [682K] / PDF [159K]

  • Automated Disk Drive Characterization. Schindler, J. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-176, December 1999.
    Abstract / Postscript [341K] / PDF [282K]

Failure Data Analysis

  • Failure Tolerance in Petascale Computers. Garth Gibson, Bianca Schroeder, Joan Digney. CTWatch Quarterly, vol. 3 no. 4. Volume on Software Enabling Technologies for Petascale Science. November 2007. www.ctwatch.org
    PDF [686K]

  • Understanding Failures in Petascale Computers. Bianca Schroeder, Garth A. Gibson. SciDAC 2007. Journal of Physics: Conf. Ser. 78.
    Abstract / PDF [712K]

  • The Computer Failure Data Repository. Bianca Schroeder, Garth Gibson. Invited contribution to the Workshop on Reliability Analysis of System Failure Data (RAF'07) MSR Cambridge, UK, March 2007.
    Abstract / PDF [42K]

  • Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You? Bianca Schroeder, Garth A. Gibson. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-111, September 2006.
    Abstract / PDF[ 272K]

  • A Large-scale Study of Failures in High-performance-computing Systems. Bianca Schroeder, Garth Gibson. Proceedings of the International Conference on Dependable Systems and Networks (DSN2006), Philadelphia, PA, USA, June 25-28, 2006. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-112, December, 2005.
    Abstract / PDF [570K]

Fates

  • Clotho: Decoupling Page Layout from Storage Organization. Minglong Shao, Jiri Schindler, Steven W. Schlosser, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 30th VLDB Conference. Toronto, Canada, 29 August - 3 September 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-102, March 2004.
    Abstract / PDF [203K]

  • Matching Application Access Patterns to Storage Device Characteristics. Jiri Schindler. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-03-109, May 2004.
    Abstract / PDF [1.14M]

  • Atropos: A Disk Array Volume Manager for Orchestrated Use of Disks. Jiri Schindler, Steven W. Schlosser, Minglong Shao, Anastassia Ailamaki, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-101, December, 2003.
    Abstract / PDF [281K]

  • Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics. Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. VLDB 03, Berlin, Germany, Sept 9-12, 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-124, April 2003.
    Abstract / Postscript [510K] / PDF [152K]

Freeblock Scheduling

  • Design and Implementation of a Freeblock Subsystem. Eno Thereska, Jiri Schindler, Christopher R. Lumb, John Bucy, Brandon Salmon, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-107, December, 2003.
    Abstract / Postscript [6.5M] / PDF [165K]

  • A Framework for Building Unobtrusive Disk Maintenance Applications. Eno Thereska, Jiri Schindler, John Bucy, Brandon Salmon, Christopher R. Lumb, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Technical Report CMU-CS-03-192, October 2003.
    Abstract / Postscript [5.1M] / PDF [148K]

  • Freeblock Scheduling Outside of Disk Firmware. Christopher R. Lumb, Jiri Schindler, Gregory R. Ganger. Conference on File and Storage Technologies (FAST), January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-149.
    Abstract / Postscript [643K] / PDF [150K]

  • Towards Higher Disk Head Utilization: Extracting "Free" Bandwidth From Busy Disk Drives. Lumb, C., Schindler, J., Ganger, G.R., Nagle, D.F. and Riedel, E. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-130, May 2000.
    Abstract / Postscript [2.3M] / PDF [422K]

Home Storage

  • A Tag-Based, Logical Access-Control Framework for Personal File Sharing. Michelle L. Mazurek. Carnegie Mellon University ECE Ph.D. Dissertation, May 2014.
    Abstract / PDF [1.6M]

  • Toward Strong, Usable Access Control for Shared Distributed Data. Michelle L. Mazurek, Yuan Liang, William Melicher, Manya Sleeper, Lujo Bauer, Gregory R. Ganger, Nitin Gupta, and Michael K. Reiter. In FAST 2014: USENIX Conference on File and Storage Technologies, February 2014.
    Abstract / PDF [395K]

  • Tag, You Can See It! Using tags for access control in photo sharing. Peter F. Klemperer, Yuan Liang, Michelle L. Mazurek, Manya Sleeper, Blase Ur, Lujo Bauer, Lorrie Faith Cranor, Nitin Gupta, and Michael K. Reiter. In CHI 2012: Conference on Human Factors in Computing Systems, May 2012.
    Abstract / PDF [560K]

  • Exploring Reactive Access Control. Michelle L. Mazurek, Peter F. Klemperer, Richard Shay, Hassan Takabi, Lujo Bauer, Lorrie Faith Cranor. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
    Abstract / PDF [293k]

  • Of Passwords and People: Measuring the Effect of Password-Composition Policies. Saranga Komanduri, Richard Shay, Patrick Gage Kelley, Michelle L. Mazurek, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman. CHI 2011, May 7–12, 2011, Vancouver, BC, Canada.
    Abstract / PDF [405K]

  • Access Control for Home Data Sharing: Attitudes, Needs and Practices. Michelle L. Mazurek, J.P. Arsenault, Joanna Bresee, Nitin Gupta, Iulia Ion, Christina Johns, Daniel Lee, Yuan Liang, Jenny Olsen, Brandon Salmon, Richard Shay, Kami Vaniea, Lujo Bauer, Lorrie Faith Cranor, Gregory R. Ganger, Michael K. Reiter. CHI 2010, April 10 – 15, 2010, Atlanta, Georgia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-110, October 2009.
    Abstract / PDF [250K]

  • Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. ;LOGIN Vol. 34, No. 5
    Abstract / PDF [403K]

  • Putting Home Data Management into Perspective. Brandon Watts Salmon. Carnegie Mellon University ECE Ph.D. Dissertation, CMU-PDL-09-113, August 17, 2009.
    Abstract / PDF [3.6M]

  • Perspective: Semantic Data Management for the Home. Brandon Salmon, Steven W. Schlosser, Lorrie Faith Cranor, Gregory R. Ganger. 7th USENIX Conference on File and Storage Technologies (FAST '09). Feb. 24-27, 2009. San Francisco, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-105, May 2008.
    Abstract / PDF [275KM]

  • Learning to Share: A Study of Sharing Among Home Storage Devices. Brandon Salmon, Frank Hady, Jay Melican. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-107, October, 2007.
    Abstract / PDF [726K]

  • Putting Home Storage Management into Perspective. Brandon Salmon, Steven W. Schlosser, Lily B. Mummert, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-110, September 2006.
    Abstract / PDF [382K]

  • Towards Efficient Semantic Object Storage for the Home. Brandon Salmon, Steven W. Schlosser, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-103, May 2006.
    Abstract / PDF [ 297K]

MEMS-Based Storage

  • Timing-accurate Storage Emulation: Evaluating hypothetical storage components in real computer systems. John Linwood Griffin. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-108, September 2004.
    Abstract / PDF [16M]

  • Using MEMS-based Storage Devices in Computer Systems. Steven W. Schlosser. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-104, May 2004.
    Abstract / PDF [1.1M]

  • MEMS-based storage devices and standard disk interfaces: A square peg in a round hole? Steven W. Schlosser, Gregory R. Ganger. Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST '04). San Francisco, CA. March 31, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-102, December, 2003.
    Abstract / Postscript [2.8M] / PDF [156K]

  • Exposing and Exploiting Internal Parallelism in MEMS-based Storage. Steven W. Schlosser, Jiri Schindler, Anastassia Ailamaki, Gregory R. Ganger. Carnegie Mellon University Technical Report CMU-CS-03-125, March 2003.
    Abstract / Postscript [1.67M] / PDF [136K]

  • Timing-accurate Storage Emulation. John Linwood Griffin, Jiri Schindler, Steven W. Schlosser, John S. Bucy, Gregory R. Ganger. Conference on File and Storage Technologies (FAST), January 28-30, 2002. Monterey, CA. Supercedes CMU SCS Technical Report CMU-CS-01-146.
    Abstract / Postscript [464K] / PDF [102K]

  • MEMS Memory Elements. L. Richard Carley, Rany Tawfik El-Sayed, David F. Guillou, Fernando Alfaro, Gary K. Fedder, Stephen Schlosser, David Nagle, Greg Ganger, James Bain. Non-Volatile Memory Technology Symposium, November 2001, San Diego, CA.
    Abstract / PDF [908K]

  • MEMS-Based Integrated-Circuit Mass-Storage Systems. L. Richard Carley, Gregory R. Ganger and David F. Nagle. COMMUNICATIONS OF THE ACM November 2000, Vol.43, No.11.
    Abstract / PDF [564K]

  • Operating System Management of MEMS-based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. Proceedings 4th Symposium on Operating Systems Design & Implementation (OSDI 2000), San Diego, CA, October 23-25, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-136, May 2000.
    Abstract / Postscript [2.2M] / PDF [336K]

  • Designing Computer Systems with MEMS-based Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. Proceedings 9th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-IX), Cambridge, MA, November 13-15, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-137, May 2000.
    Abstract / Postscript [2.7M] / PDF [485K]

  • Modeling and Performance of MEMS-Based Storage Devices. Griffin, J.L., Schlosser, S.W., Ganger, G.R. and Nagle, D.F. Proceedings of ACM SIGMETRICS 2000, Santa Clara, California, June 17-21, 2000. Supercedes CMU SCS Technical Report CMU-CS-00-100.
    Abstract / Postscript [3.4M] / PDF [311K]

  • Filling the Memory Access Gap: A Case for On-Chip Magnetic Storage. Schlosser, S.W., Griffin, J.L., Nagle, D.F. and Ganger, G.R. CMU SCS Technical Report CMU-CS-99-174, December 1999.
    Abstract / Postscript [3.1M] / PDF [398K]

Network-Attached Secure Disks (NASD)

  • PASTENSE: a Fast Start-up Algorithm for Scalable Video Libraries. Stavros Harizopoulos and Garth A. Gibson. CMU SCS Technical Report CMU-CS-01-105, March 2001.
    Abstract / Postscript [784K] / PDF [184K]

  • Scalable and Manageable Storage Systems. Khalil S. Amiri. Carnegie Mellon Ph.D. Dissertation, CMU-CS-00-178, December, 2000.
    Abstract / Postscript [M] / PDF [1.7M]

  • Network Attached Storage Architecture. Garth A. Gibson and Rodney Van Meter. COMMUNICATIONS OF THE ACM November 2000, Vol.43, No.11
    Abstract / PDF [219K]

  • Information Technology - SCSI Object Based Storage Device Commands (OSD). Gene Milligan, T10 Technical Editor. T10 - Working draft NCITS TBD-200X Project 1355D, Revision 3. October 1, 2000.
    Abstract / PDF [367K]

  • Security for a High Performance Commodity Storage Subsystem. Howard Gobioff. Carnegie Mellon Ph.D. Dissertation, CMU-CS-99-160, July 1999.
    Abstract / Postscript [8.4M] / PDF [1.8M]

  • Highly Concurrent Shared Storage. Amiri, K., Gibson, G.A. and Golding, R. Proceedings of the International Conference on Distributed Computing Systems, Taipei, April 2000.
    Abstract / Postscript [908K] / PDF [104K]

  • Network Support for Network-Attached Storage. Nagle, D.F., Ganger, G.R., Butler, J., Goodson, G. and Sabol, C. Hot Interconnects 1999, August 18 - 20, 1999, Stanford University, Stanford, California.
    Abstract / Postscript [861K] / PDF [205K]

  • NASD Scalable Storage Systems. Gibson, G.A., Nagle, D.F., Courtright II, W., Lanza, N., Mazaitis, P., Unangst, M. and Zelenka, J. USENIX99, Extreme Linux Workshop, Monterey, CA, June 1999.
    Abstract / Postscript [1.6M] / PDF [270K]

  • Embedded Security for Network-Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-99-154, June 1999.
    Abstract / Postscript [5.1M] / PDF [761K]

  • Integrity and Performance in Network Attached Storage. Gobioff, H., Nagle, D.F. and Gibson, G.A. CMU SCS Technical Report CMU-CS-98-182, December 1998.
    Abstract / Postscript [534K] / PDF [142K]

  • Scalable Concurrency Control and Recovery for Shared Storage Arrays. Amiri, K., Gibson, G.A. and Golding, R. CMU SCS Technical Report CMU-CS-99-111, February 1999.
    Abstract / Postscript [480K] / PDF [108K]

  • A Cost-Effective, High-Bandwidth Storage Architecture. Gibson, G.A., Nagle, D.F., Amiri, K., Butler, J., Chang, F.W., Gobioff, H., Hardin, C., Riedel, E., Rochberg, D. and Zelenka, J. Proceedings of the 8th Conference on Architectural Support for Programming Languages and Operating Systems.
    Abstract / Postscript [5.3M] / PDF [501K]

  • Filesystems for Network-Attached Secure Disks. Gibson, G., Nagle, D.F., Amiri, K., Chang, F.W., Gobioff, H., Riedel, E., Rochberg, D. and Zelenka, J. CMU SCS Technical Report CMU-CS-97-118, 1997.
    Abstract / Postscript [220K] / PDF [94K]

  • Security for Network Attached Storage Devices. Gobioff, H., Gibson, G.A. and Tygar, D. CMU SCS Technical Report CMU-CS-97-185, 1997.
    Abstract / Postscript [143K] / PDF [135K]

  • File Server Scaling with Network-Attached Secure Disks. Gibson, G.A., Nagle, D.F., Amiri, K., Chang, F.W., Feinberg, E.M., Gobioff, H., Lee, C., Ozceri, B., Riedel, E., Rochberg, D. and Zelenka, J. Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems (Sigmetrics '97), Seattle, Washington, June 15-18, 1997.
    Abstract / Postscript [573K] / PDF [258K]

  • Understanding Customer Dissatisfaction with Underutilized Distributed File Servers. Riedel, E. and Gibson, G.A. Proc. of the 5th NASA Conference on Mass Storage Systems and Technologies, College Park, MD. September 1996. Supercedes CMU SCS Technical Report CMU-CS-96-158.
    Abstract / Postscript [898K] / PDF [258K]

  • A Case for Network-Attached Secure Disks. Gibson, G.A., Nagle, D.F., Amiri, K., Chang, F.W., Feinberg, E.M., Gobioff, H., Lee, C., Ozceri, B., Riedel, E. and Rochberg, D. CMU SCS Technical Report CMU-CS-96-142, September 1996.
    Abstract / Postcript [380K] / PDF [151K]

Parity Declustering: On-line Failure Recovery for RAID

  • Architectures and Algorithms for On-Line Failure Recovery in Redundant Disk Arrays. Holland, M., Gibson, G.A. and Siewiorek, D.P. Journal of Distributed and Parallel Databases, Vol. 2, No. 3, pp. 295-335, July 1994.
    Abstract / Postscript [374K] / PDF [206K]

  • On-Line Data Reconstruction in Redundant Disk Arrays. Mark Holland. Carnegie Mellon Ph.D. Dissertation CMU-CS-94-164, 1994, 232 pages.
    Abstract / Table of Contents / Postscript [1.3M] / PDF [769K]

  • Fast, On-Line Failure Recovery in Redundant Disk Arrays. Holland, M., Gibson, G.A. and Siewiorek, D.P. Proc. of the 23rd Annual International Symposium on Fault-Tolerant Computing, pp. 421-433, 1993.
    Abstract / Postscript [137K] / PDF [68K]

  • Parity Declustering for Continuous Operation in Redundant Disk Arrays. Holland, M. and Gibson, G.A. Proc. of the 5th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS V), 1992.
    Abstract / Postscript [239K] / PDF [101K]

  • This directory contains the block design reffered to in Parity Declustering for Continuous Operation in Redundant Disk Arrays, Proceedings, ASPLOS V, 1992.
    Database [2.7M]

Parity Logging: Overcoming the Small-Write Bottleneck

  • Parity-Logging Disk Arrays. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. ACM Transactions on Computer Systems, 12(3):206-235, August 1994. Supercedes A Redundant Disk Array Architecture for Efficient Small Writes. CMU SCS Technical Report CMU-CS-94-170, 1994.
    Abstract / Postscript [313K] / PDF [164K]

  • Parity Logging: Overcoming the Small Write Problem in Redundant Disk Arrays. Stodolsky, D., Holland, M. and Gibson, G.A. Proc. of the 21th Annual International Symposium on Computer Architectures, 1993, pp. 64-75.
    Abstract / Postscript [190K] / PDF [89K]

  • A Redundant Disk Array Architecture for Efficient Small Writes. Stodolsky, D., Holland, M., Courtright II, W.V. and Gibson, G.A. CMU SCS Technical Report CMU-CS-93-200, 1993.
    Abstract / Postscript [356K] / PDF [196K]

PASIS

  • Zzyzx: Scalable Fault Tolerance Through Byzantine Locking. James Hendricks, Shafeeq Sinnamohideen, Gregory R. Ganger, Michael K. Reiter. Proceedings of the 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. Chicago, Illinois, June 2010.
    Abstract / PDF [231K]

  • Efficient Byzantine Fault Tolerance for Scalable Storage and Services. James Hendricks. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-09-146. July 2009.
    Abstract / PDF [1.1M]

  • Low-overhead Byzantine Fault-tolerant Storage. James Hendricks, Gregory R. Ganger, Michael K. Reiter. Proceedings of the Twenty-First ACM Symposium on Operating Systems Principles (SOSP 2007), Stevenson, WA, October 2007.
    Abstract / PDF [280K]

  • Verifying Distributed Erasure-coded Data. James Hendricks, Gregory R. Ganger, Michael K. Reiter. To appear in Proceedings of the Twenty-Sixth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2007), Portland, August 2007.
    Abstract / PDF [193K]

  • A read/write protocol family for versatile storage infrastructures. Jay J. Wylie. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-05-108, October 2005.
    Abstract / Postscript [1.95M] / PDF [1.5M]

  • Correctness of the Read/Conditional-Write and Query/Update Protocols. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-107, September, 2005.
    Abstract / PDF [392K]

  • Fault-Scalable Byzantine Fault-Tolerant Services. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. SOSP’05, October 23-26, 2005, Brighton, United Kingdom.
    Abstract / PDF [299K]

  • Lazy Verification in Fault-Tolerant Distributed Storage Systems. Michael Abd-El-Malek, Gregory R. Ganger, Garth R. Goodson, Michael K. Reiter, Jay J. Wylie. 24th IEEE Symposium on Reliable Distributed Systems (SRDS 2005), October 26-28, 2005, Orlando, Florida.
    Abstract / PDF [ 228K]

  • Efficient, scalable consistency for highly fault-tolerant storage. Garth Goodson. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-111, August 2004.
    Abstract / PDF [1.5M]

  • Efficient Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-104, December 2003.
    Abstract / Postscript [435K] / PDF [253K]

  • A Protocol Family Approach to Survivable Storage Infrastructures. Jay J. Wylie, Garth R. Goodson, Gregory R. Ganger, Michael K. Reiter. FuDiCo II: S.O.S. (Survivability: Obstacles and Solutions), 2nd Bertinoro Workshop on Future Directions in Distributed Computing, 23-25 June 2004, University of Bologna Residential Center, Bertinoro (Forlì), Italy.
    Abstract / PDF [56K]

  • The Safety and Liveness Properties of a Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-03-105. March 2004.
    Abstract / Postscript [922K] / PDF [227K]

  • Decentralized Recovery for Survivable Storage Systems. Theodore Ming-Tao Wong. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-04-119. May 2004.
    Abstract / Postscript [1.6M] / PDF [715K]

  • A Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-103, December 2003.
    Abstract / Postscript [925K] / PDF [321K]

  • Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-187, September 2003.
    Abstract / Postscript [272K] / PDF [479K]

  • Efficient Consistency for Erasure-coded Data via Versioning Servers. Garth Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-127, April 2003.
    Abstract / Postscript [290K] / PDF [160K]

  • Verifiable Secret Redistribution for Archive Systems. Theodore M. Wong, Chenxi Wang, Jeannette M. Wing. Proceedings of the First International IEEE Security in Storage Workshop, December 2002.
    Abstract / PDF [215K]

  • Decentralized Storage Consistency via Versioning Servers. Garth Goodson, Jay Wylie, Greg Ganger, Mike Reiter. Carnegie Mellon University Technical Report CMU-CS-02-180, September 2002.
    Abstract / PDF [210K]

  • Robustness Hinting for Improving End-to-End Dependability. Michael W. Bigrigg. Second Workshop on Evaluating and Architecting System Dependability (EASY). In conjunction with ASPLOS-X. Sunday, 6 October 2002, San Jose, California, U.S.A.
    Abstract / PDF [25K]

  • On Correlated Failures in Survivable Storage Systems. Mehmet Bakkaloglu, Jay J. Wylie, Chenxi Wang, Gregory R. Ganger CMU SCS Technical Report CMU-CS-02-129. May 2002
    Abstract / Postscript [1.8M] / PDF [475K]

  • The Set-Check-Use Methodology for Detecting Error Propagation Failures in I/O Routines. Michael W. Bigrigg, Jacob J. Vos. Workshop on Dependability Benchmarking, in conjunction with the International Conference on Dependable Systems and Networks, DSN-2002; June 23rd-26th, 2002, Washington, D.C., USA.
    Abstract / Postscript [111K] / PDF [77K]

  • Verifiable Secret Redistribution for Threshold Sharing Schemes. Theodore M. Wong, Chenxi Wang, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-02-114, February 2002 (revised).
    Abstract / Postscript [156K] / PDF [210K]

  • Testing the Portability of Desktop Applications to a Networked Embedded System. Michael W. Bigrigg, and Joseph G. Slember. Workshop on Reliability in Embedded Systems at the 20th IEEE Symposium on Reliable Distributed Systems, New Orleans, October 2001.
    Abstract / Postscript [198K] / PDF [40K]

  • Verifiable Secret Redistribution Theodore M. Wong, Jeannette M. Wing. CMU SCS Technical Report CMU-CS-01-155, October 2001.
    Abstract / Postscript [198K] / PDF [168K]

  • Survivable Storage Systems. Gregory R. Ganger, Pradeep K. Khosla, Mehmet Bakkaloglu, Michael W. Bigrigg, Garth R. Goodson, Semih Oguz, Vijay Pandurangan, Craig A. N. Soules, John D. Strunk, Jay J. Wylie. DARPA Information Survivability Conference and Exposition (Anaheim, CA, 12-14 June 2001), pages 184-195 vol 2. IEEE, 2001.
    Abstract / PDF [128K]

  • Selecting the Right Data Distribution Scheme for a Survivable Storage System. Jay J. Wylie, Mehmet Bakkaloglu, Vijay Pandurangan, Michael W. Bigrigg, Semih Oguz, Ken Tew, Cory Williams, Gregory R. Ganger, Pradeep K. Khosla. CMU SCS Technical Report CMU-CS-01-120, May 2001.
    Abstract / Postscript [10.9M] / PDF [1.4M]

  • Survivable Information Storage Systems. Jay Wylie, Michael Bigrigg, John Strunk, Gregory Ganger, Han Kiliccote and Pradeep Khosla. IEEE Computer, August 2000.
    Abstract / PDF [192K]

Process Scheduling

  • Scheduling Speculative Tasks in a Compute Farm. David Petrou, Garth A. Gibson, Gregory R. Ganger. Proceedings of the ACM/IEEE Supercomputing 2005 Conference, Seattle, Washington, November, 2005.
    Abstract / PDF [ 569K]

  • Cluster Scheduling for Explicitly-speculative Tasks. David Petrou. Carnegie Mellon University Ph.D Dissertation. CMU-PDL-04-112, September 2004.
    Abstract / PDF [4.2M] / Simulator Code

  • Cluster Scheduling for Explicitly Speculative Tasks. David Petrou Gregory R. Ganger Garth A. Gibson. Proceedings 18th Annual ACM International Conference on Supercomputing (ICS’04), June 26-July 1, 2004, Malo, France.
    Abstract / PDF [443K]

  • Scheduling Explicitly-speculative Tasks. David Petrou, Gregory R. Ganger, Garth A. Gibson. Carnegie Mellon University Technical Report CMU-CS-03-204, November 2003.
    Abstract / Postscript [2.0M] / PDF [400K]

  • Implementing Lottery Scheduling: Matching the Specializations in Traditional Schedulers. Petrou, D., Milford, J.W. and Gibson, G.A. Proceedings, USENIX 1999, Monterey, CA, June 9-11, 1999.
    Abstract / Postscript [532K] / PDF [235K]

RAID: Implementing and Prototyping Redundant Disk

  • A Transactional Approach to Redundant Disk Array Implementation. William Courtright, II. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-141, April 1997.
    Abstract / Postscript [1.7M] / PDF [1.1M]

  • RAIDframe: A Rapid Prototyping Tool for RAID Systems. William V. Courtright II, Garth Gibson, Mark Holland, LeAnn Neal Reilly, Jim Zelenka. Carnegie Mellon University Technical Report CMU-CS-97-142. June 4, 1997.
    Abstract / Postscript [980K] / PDF [620K]

  • A Structured Approach to Redundant Disk Array Implementation. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. Proc. of the International Computer Performance and Dependability Symposium (IPDS), Sept. 4-6, 1996. Supercedes CMU SCS Technical Report CMU-CS-96-137.
    Abstract / Postscript [212K] / PDF [85K]

  • RAIDframe: Rapid Prototyping for Disk Arrays. Courtright II, W.V., Gibson, G.A., Holland, M. and Zelenka, J. Proc. of the 1996 Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), May 1996, Vol. 24 No. 1, pp. 268-269. Supercedes CMU SCS Technical Report CMU-CS-95-200.
    Abstract / Postscript [74K] / PDF [25K]

  • Backward Error Recovery in Redundant Disk Arrays. Courtright II, W.V. and Gibson, G.A. Proc. of the 1994 Computer Measurement Group (CMG) Conference, Dec. 1994, Vol. 1, pp. 63-74. Supercedes CMU SCS Technical Report CMU-CS-94-193.
    Abstract / HTML / Postscript [134K] / PDF [69K]

RAID for Mobile Computers

  • RAID for Mobile Computers. Rachad Youssef. Carnegie Mellon M.S. Thesis INI-TR 1995-3, 1995, 35 pages.
    Abstract / Postscript [245K] / PDF [106K]

Scotch Parallel Storage Systems

  • The Scotch Parallel Storage Systems. Gibson, G.A., Stodolsky, D., Chang, F.W., Courtright II, W.V., Demetriou, C.G., Ginting, E., Holland, M., Ma, Q., Neal, L., Patterson, R.H., Su, J., Youssef, R. and Zelenka, J. Proc. of the IEEE CompCon Conference, March 5-8, 1995. Supercedes CMU SCS Technical Report CMU-CS-95-107.
    Abstract / Postscript [112K] / PDF [64K]

Self-* Storage

  • So, You Want To Trace Your Distributed System? Key Design Insights from Years of Practical Experience. Raja R. Sambasivan, Rodrigo Fonseca, Ilari Shafer, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-14-102, April 2014.
    Abstract / PDF [870K]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. IEEE Transactions on Visualization and Computer Graphics (Proceedings Information Visualization 2013), vol. 19, no. 12, Dec. 2013.
    Abstract / PDF [1.9M] / TRAILER VIDEO [5.6M] / VIDEO [17.9M]

  • Diagnosing Performance Changes in Distributed Systems by Comparing Request Flows. Raja R. Sambasivan. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation. CMU-PDL-13-105, May 2013.
    Abstract / PDF [3.9M]

  • Visualizing Request-flow Comparison to Aid Performance Diagnosis in Distributed Systems. Raja R. Sambasivan, Ilari Shafer, Michelle L. Mazurek, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-104 (supersedes CMU-PDL-12-102), April 2013.
    Abstract / PDF [1.93M]

  • Performance Insulation: More Predictable Shared Storage. Matthew Wachs. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-11-134. September 2011.
    Abstract / PDF [2.65M]

  • End-to-end Tracing in HDFS. William Wang Carnegie Mellon University School of Computer Science Technical Report (Masters Thesis) CMU-CS-11-120, July 2011.
    Abstract / PDF [489K]

  • Automation Without Predictability is a Recipe for Failure. Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-11-101, January 2011.
    Abstract / PDF [336K]

  • Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI'11). March 30 - April 1, 2011. Boston, MA.
    Abstract / PDF [388K]

  • Improving Storage Bandwidth Guarantees with Performance Insulation. Matthew Wachs, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-113, October 2010.
    Abstract / PDF [285K]

  • Diagnosing Performance Changes by Comparing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-107. July 2010. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-103.
    Abstract / PDF [503K]

  • A Transparently-Scalable Metadata Service for the Ursa Minor Storage System. Shafeeq Sinnamohideen, Raja R. Sambasivan, James Hendricks, Likun Liu, Gregory R. Ganger. Usenix Annual Technical Conference, Boston, MA, June 23-25, 2010. Supercedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-10-102. March 2010.
    Abstract / PDF [230K]

  • Zzyzx: Scalable Fault Tolerance Through Byzantine Locking. James Hendricks, Shafeeq Sinnamohideen, Gregory R. Ganger, Michael K. Reiter. Proceedings of the 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks. Chicago, Illinois, June 2010.
    Abstract / PDF [231K]

  • Reusing Migration to Simply and Efficiently Implement Multi-server Operations in Transparently Scalable Storage Systems. Shafeeq Sinnamohideen. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-10-141. May 2010.
    Abstract / PDF [926K]

  • Delayed Instantiation Bulk Operations for Management of Distributed, Object-based Storage Systems. Andrew J. Klosterman. Ph.D. Dissertation. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-108, August 2009.
    Abstract / PDF [2M]

  • Efficient Byzantine Fault Tolerance for Scalable Storage and Services. James Hendricks. Carnegie Mellon School of Computer Science Ph.D. Dissertation CMU-CS-09-146. July 2009.
    Abstract / PDF [1.1M]

  • Co-scheduling of Disk Head Time in Cluster-based Storage. Matthew Wachs, Gregory R. Ganger. 28th International Symposium On Reliable Distributed Systems September 27-30, 2009. Niagara Falls, New York, U.S.A. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-113. October 2008.
    Abstract / PDF [245K]

  • Relative Fitness Modeling. Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, and Gregory R. Ganger. Communications of the ACM, Vol. 52 No. 4, April 2009.
    Abstract / PDF [775K]

  • IRONModel: Robust Performance Models in the Wild. Eno Thereska, Gregory R. Ganger. SIGMETRICS’08, June 2–6, 2008, Annapolis, Maryland, USA.
    Abstract / PDF [813K]

  • Using Utility Functions to Control a Distributed Storage System. John D. Strunk. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-08-102, May 2008.
    Abstract / PDF [940K]

  • Using Utility to Provision Storage Systems. John D. Strunk, Eno Thereska, Christos Faloutsos, Gregory R. Ganger. 6th USENIX Conference on File and Storage Technologies (FAST '08). Feb. 26-29, 2008. San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-106, September 2007.
    Abstract / PDF [310K]

  • On Modeling the Relative Fitness of Storage. Michael P. Mesnier. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-108, December 19, 2007.
    Abstract / PDF [1.16M]

  • Low-overhead Byzantine Fault-tolerant Storage. James Hendricks, Gregory R. Ganger, Michael K. Reiter. Proceedings of the Twenty-First ACM Symposium on Operating Systems Principles (SOSP 2007), Stevenson, WA, October 2007.
    Abstract / PDF [280K]

  • Enabling What-if Explorations in Systems. Eno Thereska. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-103, August 2007.
    Abstract / PDF [2.35M]

  • Verifying Distributed Erasure-coded Data. James Hendricks, Gregory R. Ganger, Michael K. Reiter. To appear in Proceedings of the Twenty-Sixth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2007), Portland, August 2007.
    Abstract / PDF [193K]

  • Categorizing and Differencing System Behaviours. Raja R. Sambasivan, Alice X. Zheng, Eno Thereska, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL.
    Abstract / PDF [120K]

  • Observer: Keeping System Models from Becoming Obsolete. Eno Thereska, Dushyanth Narayanan, Anastassia Ailamaki, Gregory R. Ganger. Second Workshop on Hot Topics in Autonomic Computing. June 15, 2007. Jacksonville, FL. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-07-101, January 2007.
    Abstract / PDF[ 75K]

  • Modeling the Relative Fitness of Storage. Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, Gregory R. Ganger. SIGMETRICS’07, June 12–16, 2007, San Diego, California, USA.
    Abstract / PDF [235K]

  • Eliminating Cross-server Operations in Scalable File Systems. James Hendricks, Shafeeq Sinnamohideen, Raja R. Sambasivan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-105, May 2006.
    Abstract / PDF [ 254K]

  • Improving Small File Performance in Object-based Storage. James Hendricks, Raja R. Sambasivan, Shafeeq Sinnamohideen, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-104, May 2006.
    Abstract / PDF [ 1.45M]

  • Early Experiences on the Journey Towards Self-* Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad, Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, September 2006.
    Abstract / PDF [113K] / Postscript [745K]

  • InteMon: Continuous Mining of Sensor Data in Large-scale Self-* Infrastructures. Evan Hoke, Jimeng Sun, John D. Strunk, Gregory R. Ganger, and Christos Faloutsos. ACM SIGOPS Operating Systems Review. Vol 40 Issue 3. July, 2006. ACM Press.
    Abstract / PDF [573K]

  • Stardust: Tracking Activity in a Distributed Storage System. Eno Thereska, Brandon Salmon, John Strunk, Matthew Wachs, Michael Abd-El-Malek, Julio Lopez, Gregory R. Ganger. Proceedings of the Joint International Conference on Measurement and Modeling of Computer Systems, (SIGMETRICS'06). June 26th-30th 2006, Saint-Malo, France.
    Abstract / PDF [578K]

  • Informed Data Distribution Selection in a Self-predicting Storage System. Eno Thereska, Michael Abd-El-Malek, Jay J. Wylie, Dushyanth Narayanan, Gregory R. Ganger. Proceedings of the International Conference on Autonomic Computing (ICAC-06), Dublin, Ireland. June 12th-16th 2006. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-101, January 2006.
    Abstract / PDF [196K]

  • Ursa Minor: Versatile Cluster-based Storage. Michael Abd-El-Malek, William V. Courtright II, Chuck Cranor, Gregory R. Ganger, James Hendricks, Andrew J. Klosterman, Michael Mesnier, Manish Prasad, Brandon Salmon, Raja R. Sambasivan, Shafeeq Sinnamohideen, John D. Strunk, Eno Thereska, Matthew Wachs, Jay J. Wylie. Proceedings of the 4th USENIX Conference on File and Storage Technology (FAST '05). San Francisco, CA. December 13-16, 2005. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-104, April, 2005.
    Abstract / PDF [490K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage Systems. Christopher R. Lumb. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-05-111, December, 2005.
    Abstract / PDF [1.2M]

  • Modeling the Relative Fitness of Storage Devices. Michael Mesnier, Matthew Wachs, Gregory Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-106, August, 2005.
    Abstract / PDF [190K]

  • Towards self-predicting systems: What if you could ask “what-if”? Eno Thereska, Dushyanth Narayanan, Gregory R. Ganger. 3rd International Workshop on Self-adaptive and Autonomic Computing Systems. Copenhagen, Denmark, August 2005. Supercedes Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-05-10, February 2005.
    Abstract / PDF [110K]

  • Comparison-based File Server Verification. Yuen-Lin Tan, Terrence Wong, John D. Strunk, Gregory R. Ganger. USENIX '05 Annual Technical Conference, April 10-15, 2005. Anaheim, CA.
    Abstract / Postscript [900K] / PDF [130K]

  • Challenges in Building a Two-Tiered Learning Architecture for Disk Layout. Brandon Salmon, Eno Thereska, Craig A.N. Soules, John D. Strunk, Gregory R. Ganger. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-109. August, 2004.
    Abstract / Postscript [6.8M] / PDF [150K]

  • DSPTF: Decentralized Request Distribution in Brickbased Storage Systems. Christopher R. Lumb, Richard Golding, Gregory R. Ganger. Proceedings of ASPLOS’04, October 7–13 ,2004, Boston, Massachusetts, USA.
    Abstract / PDF [281K]

  • Efficient Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-104, December 2003.
    Abstract / Postscript [435K] / PDF [253K]

  • The Safety and Liveness Properties of a Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-03-105. March 2004.
    Abstract / Postscript [922K] / PDF [227K]

  • A Protocol Family for Versatile Survivable Storage Infrastructures. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-103, December 2003.
    Abstract / Postscript [925K] / PDF [321K]

  • D-SPTF: Decentralized Request Distribution in Brick-based Storage. Christopher R. Lumb, Gregory R. Ganger, Richard Golding. Carnegie Mellon University School of Computer Science Tecnical Report CMU-CS-03-202, November, 2003.
    Abstract / PDF [475K]

  • File Classification in Self-* Storage Systems. Michael Mesnier, Eno Thereska, Daniel Ellard, Gregory R. Ganger, Margo Seltzer. Proceedings of the First International Conference on Autonomic Computing (ICAC-04). New York, NY. May 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-101, January 2004.
    Abstract / Postscript [1.6M] / PDF [80K]

  • Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-187, September 2003.
    Abstract / Postscript [272K] / PDF [479K]

  • Attribute-Based Prediction of File Properties. Daniel Ellard, Michael Mesnier, Eno Thereska, Gregory R. Ganger, Margo Seltzer. Harvard Computer Science Group Technical Report TR-14-03, December 2003.
    Abstract / Postscript [850K] / PDF [127K]

  • Self-* Storage: Brick-based Storage with Automated Administration. Gregory R. Ganger, John D. Strunk, Andrew J. Klosterman. Published as Carnegie Mellon University Technical Report, CMU-CS-03-178, August 2003.
    Abstract / Postscript [2.2M] / PDF [558K]

  • A Human Organization Analogy for Self-* Systems. John D. Strunk, Gregory R. Ganger. First Workshop on Algorithms and Architectures for Self-Managing Systems. In conjunction with Federated Computing Research Conference (FCRC). San Diego, CA. June 11, 2003. Supercedes Carnegie Mellon University SCS Technical Report CMU-CS-03-129.
    Abstract / Postscript [273K] / PDF [68K]

Self-Securing Devices

  • Storage-Based Intrusion Detection. Adam G. Pennington, John Linwood Griffin, John S. Bucy, John D. Strunk, Gregory R. Ganger. ACM Transactions on Information and System Security, Vol. 13, No. 4, Article 30, Pub. date: December 2010.
    Abstract / PDF [333K]

  • Design and Implementation of Self-Securing Network Interface Applications. Stanley M. Bielski. M.S. Thesis. Electrical and Computer Engineering, Carnegie Mellon University. December 2005.
    Abstract / PDF [211K]

  • Empirical Analysis of Rate Limiting Mechanisms. Cynthia Wong, Stan Bielski, Ahren Studer, Chenxi Wang. 8th International Symposium on Recent Advances in Intrusion Detection (RAID 2005), September 7-9, 2005, Seattle, Washington. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-103, March 2005.
    Abstract / PDF [207K]

  • A Study of Mass-mailing Worms. Cynthia Wong, Stan Bielski, Jonathan M. McCune, Chenxi Wang. WORM’04, October 29, 2004, Washington, DC, USA.
    Abstract / PDF [192K]

  • Dynamic Quarantine of Internet Worms. Cynthia Wong, Chenxi Wang, Dawn Song, Stan Bielski, Gregory R. Ganger. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-108, December 2003.
    Abstract / Postscript [1.4M] / PDF [224K]

  • Finding and Containing Enemies Within the Walls with Self-securing Network Interfaces. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. Carnegie Mellon University Technical Report CMU-CS-03-109, January 2003.
    Abstract / Postscript [963K] / PDF [118K]

  • Self-Securing Network Interfaces: What, Why and How. Gregory R. Ganger, Gregg Economou, Stanley M. Bielski. CMU SCS Technical Report CMU-CS-02-144, May 2002.
    Abstract / Postscript [952K] / PDF [472K]

  • Better Security via Smarter Devices. Gregory R. Ganger and David F. Nagle. Appears in HotOS-VIII (IEEE Workshop on Hot Topics in Operating Systems), May 2001.
    Abstract / Postscript [1.1M] / PDF [245K]

  • Position Summary: Authentication Confidences. Gregory R. Ganger. Appears in HotOS-VIII (IEEE Workshop on Hot Topics in Operating Systems), May 2001.
    Abstract / Postscript [66K] / PDF [16K]

  • Authentication Confidences Gregory R. Ganger. CMU SCS Technical Report CMU-CS-01-123, May 2001.
    Abstract / Postscript [335K] / PDF [42K]

  • Building Firewalls with Intelligent Network Interface Cards. David Friedman and David Nagle. CMU SCS Technical Report CMU-CS-00-173, May 2001.
    Abstract / Postscript [540K] / PDF [229K]

  • Enabling Dynamic Security Management of via Device-Embedded Security. Gregory R. Ganger and David F. Nagle. CMU SCS Technical Report CMU-CS-00-174, December 2000.
    Abstract / PDF [607K]

  • Secure Continuous Biometric-Enhanced Authentication Andrew J. Klosterman and Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-134, May 2000.
    Abstract / Postscript [1.1M] / PDF [245K]

  • Design and Implementation of a Self-Securing Storage Device. John D. Strunk, Garth R. Goodson, Michael L. Scheinholtz, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-00-129, May 2000.
    Abstract / Postscript [401K] / PDF [330K]

Self-Securing Storage

  • Storage-Based Intrusion Detection. Adam G. Pennington, John Linwood Griffin, John S. Bucy, John D. Strunk, Gregory R. Ganger. ACM Transactions on Information and System Security, Vol. 13, No. 4, Article 30, Pub. date: December 2010.
    Abstract / PDF [333K]

  • On the Feasibility of Intrusion Detection Inside Workstation Disks. John Linwood Griffin, Adam Pennington, John S. Bucy, Deepa Choundappan, Nithya Muralidharan, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-106. December, 2003.
    Abstract / Postscript [1.12M] / PDF [215K]

  • Byzantine-tolerant Erasure-coded Storage. Garth R. Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-187, September 2003.
    Abstract / Postscript [272K] / PDF [479K]

  • Storage-based Intrusion Detection: Watching Storage Activity For Suspicious Behavior. Adam Pennington, John Strunk, John Griffin, Craig Soules, Garth Goodson, Greg Ganger. 12th USENIX Security Symposium, Washington, D.C., Aug 4-8, 2003. Supercedes Carnegie Mellon University Technical Report CMU-CS-02-179, September 2002.
    Abstract / Postscript [727K] / PDF [138K]

  • Efficient Consistency for Erasure-coded Data via Versioning Servers. Garth Goodson, Jay J. Wylie, Gregory R. Ganger, Michael K. Reiter. Carnegie Mellon University Technical Report CMU-CS-03-127, April 2003.
    Abstract / Postscript [290K] / PDF [160K]

  • Intrusion Detection, Diagnosis, and Recovery with Self-Securing Storage. John D. Strunk, Garth R. Goodson, Adam G. Pennington, Craig A.N. Soules, Gregory R. Ganger. CMU SCS Technical Report CMU-CS-02-140, May 2002.
    Abstract / Postscript [1.1M] / PDF [119K]

  • Metadata Efficiency in a Comprehensive Versioning File System. Craig A. N. Soules, Garth R. Goodson, John D. Strunk, Gregory R. Ganger. 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, Mar 31 - Apr 2, 2003. Supercedes CMU SCS Technical Report CMU-CS-02-145, May 2002.
    Abstract / Postscript [817K] / PDF [178K]

  • Self-Securing Storage: Protecting Data in Compromised Systems. Strunk, J.D., Goodson, G.R., Scheinholtz, M.L., Soules, C.A.N. and Ganger, G.R. Appears in Proc. of the 4th Symposium on Operating Systems Design and Implementation, 2000.
    Abstract / Postscript [345K] / PDF [294K]

//Trace

  • Relative Fitness Modeling. Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Alice X. Zheng, and Gregory R. Ganger. Communications of the ACM, Vol. 52 No. 4, April 2009.
    Abstract / PDF [775K]

  • On Modeling the Relative Fitness of Storage. Michael P. Mesnier. Carnegie Mellon University, Dept. ECE Ph.D Dissertation CMU-PDL-07-108, December 19, 2007.
    Abstract / PDF [1.16M]

  • //TRACE: Parallel Trace Replay with Approximate Causal Events. Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio Lopez, James Hendricks, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-108, September 2006.
    Abstract / PDF[ 187K]

Transparent Informed Prefetching (TIP) and Caching

  • Explicit Compiler-based Memory Management for Out-of-core Applications. Angela Demke Brown. Carnegie Mellon University School of Computer Science Ph.D. Dissertation CMU-CS-05-140, May 2005.
    Abstract / PDF [1.3M]

  • Using Speculative Execution to Automatically Hide I/O Latency. Fay W. Chang. Carnegie Mellon Ph.D Dissertation CMU-CS-01-172, December 2001.
    Abstract / Postscript [4.9M] / PDF [945K]

  • Compiler-Based I/O Prefetching for Out-of-Core Applications. Angela Demke Brown, Todd C. Mowry and Orran Krieger. In ACM Transactions on Computer Systems,19(2):111-170, May 2001.
    Abstract / Postscript [1.2M] / PDF [487K]

  • Taming the Memory Hogs: Using Compiler Inserted Releases to Manage Physical Memory Intelligently. Angela Demke Brown and Todd C. Mowry. Proceedings of the 4th Symposium on Operating Systems Design and Implementation (OSDI 2000), pp. 31-44, Oct 2000.
    Abstract / Postscript [670K] / PDF [129K]

  • Automatic Compiler-Inserted I/O Prefetching for Out-of-Core Applications. Todd C. Mowry, Angela K. Demke and Orran Krieger. Proceedings of the Second Symposium on Operating Systems Design and Implementation (OSDI '96), pages 3-17, October 1996.
    Abstract / Postscript [1.15M] / PDF [120K]

  • Automatic I/O Hint Generation through Speculative Execution. Chang, F.W. and Gibson, G.A. Proceedings of the 3rd Symposium on Operating Systems Design and Implementation, February 1999.
    Abstract / Postscript [197K] / PDF [121K]

  • Informed Prefetching and Caching. Hugo Patterson. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-204: December 1997.
    Abstract / Postscript [4.8M] / PDF [2.3M]

  • Practical and Theoretical Issues in Prefetching and Caching. Andrew Tomkins. Carnegie Mellon Ph.D. Dissertation CMU-CS-97-181: October 1997.
    Abstract / GZ Postscript [827K] / PDF [918K]

  • Prefetching Over a Network: Early Experience with CTIP. Rochberg, D. and Gibson, G.A. SIGMETRICS Performance Evaluation Review, volume 25, number 3, (pp. 29-36), December, 1997.
    Abstract / Postscript [314K] / PDF [75K]

  • Informed Multi-Process Prefetching and Caching. Tomkins, A., Patterson, R.H. and Gibson, G.A Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems, (Sigmetrics '97), Seattle, Washington, June 15-18, 1997.
    Abstract / Postscript [1.2M] / PDF [319K]

  • A Trace-Driven Comparison of Algorithms for Parallel Prefetching and Caching. Kimbrel, T., Tomkins, A., Patterson, R.H., Bershad, B., Cao, P., Felten, E.W., Gibson, G.A., Karlin, A.R. and Li, K. Proc. of the 2nd USENIX Symposium on Operating Systems Design and Implementation (OSDI), Seattle, WA, October 28-31, 1996, pp. 19-34. Supercedes CMU SCS Technical Report CMU-CS-96-174.
    Abstract / Postscript [670K] / PDF [344K]

  • Informed Prefetching and Caching. Patterson, R.H. Gibson, G.A., Ginting, E., Stodolsky, D. and Zelenka, J. Proc. of the 15th Symposium of Operating Systems Principles, Copper Mountain Resort, CO, December 3-6, 1995, pp. 79-95.
    Abstract / Postscript [440K] / PDF [232K]

  • Exposing I/O Concurrency with Informed Prefetching. Patterson, R.H. and Gibson, G.A Proc. of Third Int'l. Conf. on Parallel and Distributed Information Systems, Austin, TX, September 28-30, 1994, pp.7-16.
    Abstract / HTML / Postscript [103K] / PDF [51K]

  • A Status Report on Research in Transparent Informed Prefetching. Patterson, R.H., Gibson, G.A and Satyanarayanan, M. ACM Operating Systems Review, 1993. Supercedes CMU SCS Technical Report CMU-CS-93-113.
    Abstract / Postscript [94K] / PDF [50K]

  • Using Transparent Informed Prefetching to Reduce File Read Latency. Patterson, R.H., Gibson, G.A and Satyanarayanan, M. Annotated slides for the conference talk appearing in the proceedings of the Goddard Conference on Mass Storage Systems and Technologies, 1992.
    Postscript [71K] / PDF [40K]

  • Disk Reads with DRAM Latency. Gibson, G.A, Patterson, R.H. and Satyanarayanan, M. Third Workshop on Workstation Operating Systems, April, 1992, pp. 126-131.
    Abstract / Postscript [43K] / PDF [18K]

Workload Characterization

  • //TRACE: Parallel Trace Replay with Approximate Causal Events. Michael Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio Lopez, James Hendricks, Gregory R. Ganger. Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07), February 13–16, 2007, San Jose, CA. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-108, September 2006.
    Abstract / PDF[ 187K]

  • SIMFLEX: A Fast, Accurate, Flexible Full-System Simulation Framework for Performance Evaluation of Server Architecture. Nikolaos Hardavellas, Stephen Somogyi, Thomas F. Wenisch, Roland E. Wunderlich, Shelley Chen, Jangwoo Kim, Babak Falsafi, James C. Hoe, and Andreas G. Nowatzyk. ACM SIGMETRICS Performance Evaluation Review (PER) Special Issue on Tools for Computer Architecture Research, Volume 31, Number 4, pages 31-35, March 2004.
    Abstract / PDF [96K]

  • Storage Device Performance Prediction with CART Models. Mengzhi Wang, Kinman Au, Anastassia Ailamaki, Anthony Brockwell, Christos Faloutsos, and Gregory R. Ganger. Proc. 12th Annual Meeting of the IEEE/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). Volendam, The Netherlands. October 5-7, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-04-103, March 2004.
    Abstract / Postscript [908K] / PDF [122K]

  • Storage Device Performance Prediction with CART Models [Extended Abstract]. Mengzhi Wang, Kinman Au, Anastassia Ailamaki, Anthony Brockwell, Christos Faloutsos, and Gregory R. Ganger. Proceedings: Poster Session. Joint International Conference on Measurement and Modeling of Computer Systems. ACM SIGMETRICS/Performance 2004. June 12th-16th 2004, Columbia University, New York.
    Abstract / Postscript [400K] / PDF [64K]

  • Capturing the Spatio-Temporal Behavior of Real Traffic Data. Mengzhi Wang, Anastassia Ailamaki, Christos Faloutsos. Performance 2002 (IFIP Int. Symp. on Computer Performance Modeling, Measurement and Evaluation), Rome, Italy, Sept. 2002.
    Abstract / PDF [1.9M]

  • Data Mining Meets Performance Evaluation: Fast Algorithms for Modeling Bursty Traffic. M. Wang, T. Madhyastha, N.H. Chan, S. Papadimitriou, C. Faloutsos. 18th International Conference on Data Engineering, February 26-March 1, 2002 San Jose, California. Supercedes a technical report CMU-CS-01-101.
    Abstract / Postscript [2.25M] / PDF [358K]

Past Associated Publications


  • A Comparative Study of Baremetal Provisioning Frameworks. Ashok Chandrasekar, Garth A. Gibson. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-14-109, December 2014.
    Abstract / PDF [447K]

  • Managing GPU Concurrency in Heterogeneous Architectures. Onur Kayiran, Nachiappan Chidambaram Nachiappan, Adwait Jog, Rachata Ausavarungnirun, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, and Chita R. Das. Proceedings of 47th International Symposium on Microarchitecture (MICRO’14), December 2014.
    Abstract / PDF [2.38M]

  • Paxos Quorum Leases: Fast Reads Without Sacrificing Writes. Iulian Moraru, David G. Andersen, Michael Kaminsky. ACM Symposium on Cloud Computing 2014 (SoCC'14), Seattle, WA, Nov 2014. BEST PAPER AWARD!
    Abstract / PDF [287K]

  • The Heterogeneous Block Architecture. Chris Fallin, Chris Wilkerson, Onur Mutlu. Proceedings of 32nd IEEE International Conference on Computer Design (ICCD’14), October 2014.
    Abstract / PDF [308K]

  • Design and Evaluation of Hierarchical Rings with Deflection Routing. Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Chang, Greg Nazario, Reetuparna Das, Gabriel Loh, Onur Mutlu. Proceedings of the 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD’14), October 2014.
    Abstract / PDF [325K]

  • Fast and Accurate Mapping of Complete Genomics Reads. Donghyuk Lee, Farhad Hormozdiari, Hongyi Xin, Faraz Hach, Onur Mutlu, Can Alkan. Methods, Elsevier, October 2014.
    Abstract / PDF [1.25M]

  • Value Driven Load Balancing. Sherwin Doroudi, Esa Hyytia, Mor Harchol-Balter. Performance Evaluation, vol. 79, September 2014.
    Abstract / PDF [258K]

  • Egalitarian Distributed Consensus. Iulian Moraru. Carnegie Mellon University Ph.D. Dissertation CMU-CS-14-133. August 2014.
    Abstract / PDF [1.95M]

  • Towards Secure Execution of Untrusted Code for Mobile Edge-Clouds. Jiaqi Tan, Utsav Drolia, Rajeev Gandhi, Priya Narasimhan. Poster at 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [116K]

  • CHIPS: Content-based Heuristics for Improving Photo Privacy for Smartphones. Jiaqi Tan, Utsav Drolia, Rolando Martins, Rajeev Gandhi, Priya Narasimhan. 7th ACM Conference on Security and Privacy in Wireless and Mobile Networks (WiSec), July 2014.
    Abstract / PDF [1.4M]

  • Exact Analysis of the M/M/k/setup Class of Markov Chains via Recursive Renewal Reward. Anshul Gandhi, Sherwin Doroudi, Mor Harchol-Balter, Alan Scheller-Wolf. Queueing Systems: Theory and Applications vol. 77, no. 2, 2014, pp. 177-209. June 2014.
    Abstract / PDF [4K]

  • The Dirty-Block Index. Vivek Seshadri, Abhishek Bhowmick, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. 41st International Symposium on Computer Architecture, June, 2014.
    Abstract / PDF [2.32M]

  • Improving Cache Performance by Exploiting Read-Write Disparity. Samira Khan, Alaa Alameldeen, Chris Wilkerson, Onur Mutlu, Daniel Jimenez. Proceedings of the 20th International Symposium on High-Performance Computer Architecture (HPCA), Orlando, FL, February 2014. Best paper session.
    Abstract / PDF [355K]

  • Linearly Compressed Pages: A Low-Complexity, Low-Latency Main Memory Compression Framework. Gennady Pekhimenko, Vivek Seshadri, Yoongu Kim, Hongyi Xin, Onur Mutlu, Philip B. Gibbons, Michael A. Kozuch, Todd C. Mowry. Proceedings of the 46th International Symposium on Microarchitecture (MICRO), Davis, CA, December 2013.
    Abstract / PDF [525K]

  • Measuring Password Guessability for an Entire University. Michelle L. Mazurek, Saranga Komanduri, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Patrick Gage Kelley, Richard Shay, Blase Ur. In CCS 2013: ACM Conference on Computer and Communications Security, November 2013.
    Abstract / PDF [2.19M]

  • Challenges in Security and Privacy for Mobile Edge-Clouds. Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-13-113. October, 2013.
    Abstract / PDF [212K]

  • Hadoop's Adolescence: An Analysis of Hadoop Usage in Scientific Workloads. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Very Large Data Bases (VLDB), August, 2013.
    Abstract / PDF [986K]

  • Solving the Straggler Problem with Bounded Staleness. James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Gregory R. Ganger, Garth A. Gibson, Kimberly Keeton, Eric Xing. 14th USENIX HotOS Workshop, Santa Ana Pueblo, NM, May 13-15, 2013.
    Abstract / PDF [174K]

  • The Impact of Length and Mathematical Operators on the Usability and Security of System-assigned One-time PINs. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Richard Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In 2013 Workshop on Usable Security (USEC), April 2013.
    Abstract / PDF [802K]

  • PETAL: Preset Encoding Table Information Leakage. Jiaqi Tan, Jayvardhan Nahata. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-13-106, April 2013.
    Abstract / PDF [291K]

  • Helping Users Create Better Passwords. Blase Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Serge Egelman, and Julio López. USENIX ;login:, 37(6), December 2012.
    Abstract / PDF [970K]

  • AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers. Anshul Gandhi, Mor Harchol-Balter, Ram Raghunathan, Michael Kozuch. Transactions on Computer Systems, Volume 30, Issue 4, Article 14. November 2012.
    Abstract / PDF [1.77M]

  • TABLEFS: Embedding a NoSQL Database inside the Local File System. Ren, Kai, Garth A. Gibson. 1st Storage System, Hard Disk and Solid State Technologies Summit, IEEE Asia-Pacific Magnetic Recording Conference (APMRC), November 2012, Singapore.
    Abstract / PDF [399K]

  • HAT: Heterogeneous Adaptive Throttling for On-Chip Networks. Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu. SBAC-PAD 2012, New York, NY, October 24-26, 2012.
    Abstract / PDF [259K]

  • Scalable Dynamic Partial Order Reduction. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey. Third Int. Conf. on Runtime Verification (RV2012), 25-28 September 2012, Istanbul, Turkey.
    Abstract / PDF [331K]

  • The Evicted-Address Filter: A Unified Mechanism to Address Both Cache Pollution and Thrashing. Vivek Seshadri, Onur Mutlu, Michael A Kozuch, Todd C Mowry. PACT'12, September 19–23, 2012, Minneapolis, Minnesota, USA.
    Abstract / PDF [2M]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-110, September 2012.
    Abstract / PDF [1.43M]

  • SOFTScale: Stealing Opportunistically For Transient Scaling. Anshul Gandhi, Timothy Zhu, Mor Harchol-Balter, Michael Kozuchy. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-12-111R, August 2012.
    Abstract / PDF [477K]

  • Hadoop's Adolescence: A Comparative Workload Analysis from Three Research Clusters. Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-106. June 2012.
    Abstract / PDF [1.76M]

  • MinBD: Minimally-Buffered Deflection Routing for Energy-Efficient Interconnect. Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, Onur Mutlu. In NOCS 2012, Lyngby, Denmark, May 2012. (One of five papers nominated for the Best Paper Award by the Program Committee.)
    Abstract / PDF [369K]

  • Guess Again (and Again and Again): Measuring password strength by simulating password-cracking algorithms. Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Rich Shay, Tim Vidas, Lujo Bauer, Nicolas Christin, Lorrie Faith Cranor, Julio López. In the 2012 IEEE Symposium on Security and Privacy, May 2012.
    Abstract / PDF [2.8M]

  • Concurrent Systematic Testing at Scale. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-12-101. May 2012.
    Abstract / PDF [397K]

  • Landslide: Systematic Dynamic Race Detection in Kernel Space. Ben Blum. Carnegie Mellon University School of Computer Science MS Thesis CMU-CS-12-118. May 2012.
    Abstract / PDF [1.7M]

  • TABLEFS: Embedding a NoSQL Database Inside the Local File System. Kai Ren, Garth A. Gibson. Carnegie Mellon University Parallel Data Lab Technical Report. CMU-PDL-12-103 May 2012.
    Abstract / PDF [339K]

  • Efficient Exploratory Testing of Concurrent Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson, Jason Hickey (Google). Carnegie Mellon University Parallel Data Laboratory Techical Report CMU-PDL-11-113, November 2011.
    Abstract / PDF [786K]

  • A Cyber-Physical-System Approach to Data Center Modeling and Control for Energy Efficiency. Luca Parolini, Bruno Sinopoli, Bruce H. Krogh, Zhikui Wang. Proceedings of the IEEE, Special Issue on Cyber-Physical Systems, December 2011.
    Abstract / PDF [1.76M]

  • Reducing Memory Interference in Multicore Systems via Application-Aware Memory Channel Partitioning. Sai Prashanth Muralidhara, Lavanya Subramanian, Onur Mutlu, Mahmut Kandemir, Thomas Moscibroda. Proceedings of the 44th International Symposium on Microarchitecture
    (MICRO), Porto Alegre, Brazil, December 2011.
    Abstract / PDF [232K]

  • Understanding and Improving the Diagnostic Workflow of MapReduce Users. Jason D. Campbell (Intel Labs Pittsburgh), Arun B. Ganesan, Ben Gotow, Soila P. Kavulya, James Mulholland, Priya Narasimhan, Sriram Ramasubramanian, Mark Shuster, Jiaqi Tan (DSO National Laboratories, Singapore), ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, December 2011.
    Abstract / PDF [775K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi, Priya Narasimhan. SLAML 2011, October 23, 2011, Cascais, Portugal.
    Abstract / PDF [500K]

  • Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Soila P. Kavulya (CMU), Kaustubh Joshi, Matti Hiltunen , Scott Daniels (AT&T Labs, Research), Rajeev Gandhi and Priya Narasimhan (CMU). Workshop on System Logs and the Application of Machine Learning Techniques (SLAML), Cascais, Portugal, October 2011.
    Abstract / PDF [524K]

  • Improving Cache Performance Using Victim Tag Stores. Vivek Seshadri, Onur Mutlu, Todd Mowry, Michael A. Kozuch. SAFARI Technical Report, TR-SAFARI-2011-009, Carnegie Mellon University, September 2011.
    Abstract / PDF [242K]

  • How Does Your Password Measure Up? The effect of strength meters on password creation. Blaser Ur, Patrick Gage Kelley, Saranga Komanduri, Joel Lee, Michael Maass, Michelle L. Mazurek, Timothy Passaro, Richard Shay, Timothy Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In the 2012 USENIX Security Symposium, August 2012.
    Abstract / PDF [1.2M]

  • On-Chip Networks from a Networking Perspective: Congestion and Scalability in Many-core Interconnects. George Nychis, Chris Fallin, Thomas Moscibroda, Onur Mutlu, Srinivasan Seshan.
    In SIGCOMM 2012, Helsinki, Finland, Aug 2012.
    Abstract / PDF [628K]

  • ThermoCast: A Cyber-Physical Forecasting Model for Data Centers. Lei Li, Chieh-Jan Mike Liang, Jie Liu, Suman Nath, Andreas Terzis, Christos Faloutsos. In KDD '11: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 21-24, San Diego, CA.
    Abstract / PDF [1.32M]

  • dBug: Systematic Testing of Distributed and Multi-threaded Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson.18th International Workshop on Model Checking of Software (SPIN'11), Snowbird UT, July 2011.
    Abstract / PDF [149K]

  • Time Series Clustering: Complex is Simpler! Lei Li, B. Aditya Prakash. In Proceedings of the 28th International Conference on Machine learning, June 28 - July 2, 2011, Bellevue, WA.
    Abstract / PDF [631K]

  • WindMine: Fast and Effective Mining of Web-click Sequences. Yasushi Sakurai, Lei Li, Yasuko Matsubara, Christos Faloutsos. 2011 SIAM International Conference on Data Mining, April 28-30, 2011, Mesa, AZ.
    Abstract / PDF [968K]

  • SmartScan: Efficient Metadata Crawl for Storage Management Metadata Querying in Large File Systems. Likun Liu, Lianghong Xu, Yongwei Wu, Guangwen Yang, Gregory R. Ganger. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-112, Oct. 2010.
    Abstract / PDF [366K]

  • dBug: Systematic Evaluation of Distributed Systems. Jiri Simsa, Randy Bryant, Garth A. Gibson. 5th Int. Workshop on Systems Software Verification (SSV’10), co-located with 9th USENIX Symp. On Operating Systems Design and Implementation (OSDI’10), Vancouver BC, October 2010.
    Abstract / PDF [168K]

  • Token Attempt: The Misrepresentation of Website Privacy Policies through the Misuse of P3P Compact Policy Tokens. Pedro Giovanni Leon, Lorrie Faith Cranor, Aleecia M. McDonald, Robert McGuire. Cylab Technical Report CMU-CyLab-10-014, September 10, 2010.
    Abstract / PDF [305K]

  • Parsimonious Linear Fingerprinting for Time Series. Lei Li, B. Aditya Prakash, Christos Faloutsos. Proceedings of the VLDB Endowment, Vol. 3, No. 1, September 2010.
    Abstract / PDF [684K]

  • Correct Horse Battery Staple: Exploring the usability of system-assigned passphrases. Richard Shay, Patrick Gage Kelley, Saranga Komanduri, Michelle L. Mazurek, Blase Ur, Tim Vidas, Lujo Bauer, Nicolas Christin, and Lorrie Faith Cranor. In SOUPS 2012: Symposium on Usable Privacy and Security, July 2012.
    Abstract / PDF [549K]

  • OddBall: Spotting Anomalies in Weighted Graphs. Leman Akoglu, Mary McGlohon, Christos Faloutsos. PAKDD 2010, Hyderabad, India, 21-24 June 2010. Best Paper Award!
    Abstract / PDF [3.0M]

  • Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. Jiaqi Tan*, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. 30th IEEE International Conference on Distributed Computing Systems (ICDCS) 2010, Genoa, Italy, Jun 2010.
    Abstract / PDF [2.1M]

  • An Analysis of Traces from a Production MapReduce Cluster. Soila Kavulya, Jiaqi Tan, Rajeev Gandhi and Priya Narasimhan. 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2010). May 17-20, 2010, Melbourne, Victoria, Australia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-107, December, 2009.
    Abstract / PDF [832K]

  • Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments. Jiaqi Tan, Xinghao Pan, Eugene Marinelli, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 12th IEEE/IFIP Network Operations and Management Symposium (NOMS) 2010, Osaka, Japan, Apr 2010.
    Abstract / PDF [2.8M]

  • Black-Box Problem Diagnosis in Parallel File Systems. Michael P. Kasick, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST '10), San Jose, CA, February 2010.
    Abstract / PDF [533K]

  • Journaling versus Soft Updates: Asynchronous Meta-data Protection in File Systems. Margo I. Seltzer, Gregory R. Ganger, M. Kirk McKusick, Keith A. Smith, Craig A. N. Soules, Christopher A. Stein. Proceedings of the USENIX Technical Conference, June, 2000.
    Abstract / PDF [120K]

  • Blind Men and the Elephant: Piecing Together Hadoop for Diagnosis. Xinghao Pan, Jiaqi Tan, Soila Kalvulya, Rajeev Gandhi, Priya Narasimhan. 20th IEEE International Symposium on Software Reliability Engineering (ISSRE), Industrial Track, Mysuru, India, Nov 2009.
    Abstract / PDF [160K]

  • No Downtime for Data Conversions: Rethinking Hot Upgrades. Tudor Dumitraş, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-106. July 2009.
    Abstract / PDF [855K]

  • Ganesha: Black-Box Fault Diagnosis for MapReduce Systems. Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Workshop on Hot Topics in Measurement and Modeling of Computer Systems (HotMetrics 2009), Seattle, WA (June 2009). Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-112. September 2008.
    Abstract / PDF [180K]

  • Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan. Workshop on Hot Topics in Cloud Computing (HotCloud '09), San Diego, CA, on June 15, 2009. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-09-103, May 2009.
    Abstract / PDF [373K]

  • SALSA: Analyzing Logs as StAte Machines.SALSA: Analyzing Logs as StAte Machines. Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Analysis of System Logs (WASL), San Diego, CA (December 2008). Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-111. September 2008.
    Abstract / PDF [630K]

  • A Fault Model for Upgrades in Distributed Systems. Tudor Dumitraş, Soila Kavulya, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-115, December 2008.
    Abstract / PDF [275K]

  • ASDF: Automated, Online Fingerpointing for Hadoop. Keith Bare, Michael P. Kasick, Soila Kavulya, Eugene Marinelli, Xinghao Pan, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-104. May 2008.
    Abstract / PDF [650K]

  • RAMS and BlackSheep: Inferring White-box Application Behavior Using Black-box Techniques. Jiaqi Tan, Priya Narasimhan. School of Computer Science Senior Honors Thesis and Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-08-103, May, 2008.
    Abstract / PDF [1.7M]

  • Lessons Learned From the Deployment of a Smartphone-Based Access-Control System. Lujo Bauer, Lorrie Faith Cranor, Michael K. Reiter, Kami Vaniea. Symposium On Usable Privacy and Security (SOUPS) 2007, July 18-20, 2007, Pittsburgh, PA, USA.
    Abstract / PDF [984K]

  • Improving Mobile Database Access Over Wide-Area Networks Without Degrading Consistency. Niraj Tolia, M. Satyanarayanan, Adam Wolbach. MobiSys’07, June 11–13, 2007, San Juan, Puerto Rico, USA.
    Abstract / PDF [1.1M]

  • VMM-Independent Graphics Acceleration. H. Andres Lagar-Cavilla, Niraj Tolia, M. Satyanarayanan, Eyal de Lara. VEE’07, June 13–15, 2007, San Diego, California, USA.
    Abstract / PDF [1.4M]

  • Consistency-preserving Caching of Dynamic Database Content. Niraj Tolia and M. Satyanarayanan. International World Wide Web Conference (WWW 2007), May 8-12, 2007, Banff, Alberta, Canada.
    Abstract / PDF [888K]

  • Fingerpointing Correlated Failures in Replicated Systems. Soila Pertet, Rajeev Gandhi and Priya Narasimhan. USENIX Workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML), Cambridge, MA (April 2007).
    Abstract / PDF [100K]

  • Exploiting Similarity for Multi-Source Downloads Using File Handprints. Himabindu Pucha, David G. Andersen, Michael Kaminsky. Proceedings of the 4th Symposium on Networked Systems Design and Implementation (NSDI ’07), Cambridge, Massachusetts, April 2007.
    Abstract / PDF [579K]

  • Static Analysis Meets Distributed Fault-Tolerance: Enabling State-Machine Replication with Nondeterminism. Joseph Slember, Priya Narasimhan. Proceedings of the 2nd Workshop on Hot Topics in System Dependability (HotDep '06), Seattle, WA. Nov. 8, 2006.
    Abstract / PDF [98K]

  • Living with Nondeterminism in Replicated Middleware Applications. Joseph Slember, Priya Narasimhan. Middleware 2006, ACM/IFIP/USENIX, 6th International Middleware Conference, Melbourne, Australia, November 27 - December 1, 2006, Proceedings. Lecture Notes in Computer Science 4290 Springer 2006.
    Abstract / PDF [387K]

  • Towards Fingerpointing in the Emulab Dynamic Distributed System. Michael P. Kasick, Priya Narasimhan, Kevin Atkinson, Jay Lepreau. Proceedings of the 3rd USENIX Workshop on Real, Large Distributed Systems (WORLDS '06), Seattle, WA. Nov. 5, 2006.
    Abstract / PDF [311K]

  • Routing, Disjoint Paths, and Classification. Shuheng Zhou. Carnegie Mellon University Parallel Data Lab Ph.D. Dissertation CMU-PDL-06-109, August 2006.
    Abstract / PDF [971K] / Postscript [1.1M]

  • Group Communication: Helping or Obscuring Failure Diagnosis? Soila Pertet, Rajeev Gandhi and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-06-107, June, 2006.
    Abstract / PDF [ 591K]

  • A Large-scale Study of Failures in High-performance-computing Systems. Bianca Schroeder, Garth Gibson. Proceedings of the International Conference on Dependable Systems and Networks (DSN2006), Philadelphia, PA, USA, June 25-28, 2006. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-112, December, 2005.
    Abstract / PDF [570K]

  • Design Tradeoffs in Applying Content Addressable Storage to Enterprise-scale Systems Based on Virtual Machines. Partho Nath, Michael Kozuch, David O'Hallaron, Jan Harkes, M. Satyanarayanan, Niraj Tolia, and Matt Toups. Proceedings of the 2006 USENIX Annual Technical Conference (USENIX '06), Boston, Massachusetts, May-June 2006.
    Abstract / PDF [912K]

  • An Architecture for Internet Data Transfer. Niraj Tolia, Michael Kaminsky, David G. Andersen, Swapnil Patil. Proceedings of the 3rd Symposium on Networked Systems Design and Implementation (NSDI'06), San Jose, California, May 2006.
    Abstract / PDF [211K]

  • Dimorphic Computing. H. Andres Lagar-Cavilla, Niraj Tolia, Rajesh Balan, Eyal de Lara, M. Satyanarananan, David O'Hallaron. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-06-123, April 2006.
    Abstract / PDF [1.7M]

  • Quantifying Interactive User Experience on Thin Clients. Niraj Tolia, David G. Andersen, and M. Satyanarayanan. IEEE Computer. March, 2006.
    Abstract / PDF [690K]

  • Challenges and Opportunities in Internet Data Mining. David G. Andersen, Nick Feamster. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-06-102, Jan 2006.
    Abstract / PDF [104K]

  • Causes of Failure in Web Applications. Soila Pertet and Priya Narasimhan. Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-109. December 2005.
    Abstract / PDF [367K]

  • Empirical Analysis of Rate Limiting Mechanisms. Cynthia Wong, Stan Bielski, Ahren Studer, Chenxi Wang. 8th International Symposium on Recent Advances in Intrusion Detection (RAID 2005), September 7-9, 2005, Seattle, Washington. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-05-103, March 2005.
    Abstract / PDF [207K]

  • Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations. Jurij Leskovec, Jon Kleinberg*, Christos Faloutsos. KDD’05, August 21–24, 2005, Chicago, Illinois, USA.
    Abstract / PDF [ 318K]

  • The Seductive Appeal of Thin Clients. Niraj Tolia, David G. Andersen, M. Satyanarayanan. Carnegie Mellon University School of Computer Science Technical Report CMU-CS-05-151. February 2005.
    Abstract / Postscript [250K] / PDF [300K]

  • On Hierarchical Routing in Doubling Metrics. Anupam Gupta, Bruce M. Maggs, Shuheng Zhou. Carnegie Mellon University Parallel Data Laboratory Technical Report CMU-PDL-04-106. December, 2004.
    Abstract / Postscript [914K] / PDF [344K]

  • A Study of Mass-mailing Worms. Cynthia Wong, Stan Bielski, Jonathan M. McCune, Chenxi Wang. WORM’04, October 29, 2004, Washington, DC, USA.
    Abstract / PDF [192K]

  • Secure Bootstrap is Not Enough: Shoring up the Trusted Computing Base. James Hendricks and Leendert van Doorn. Proceedings of the Eleventh SIGOPS European Workshop, ACM SIGOPS, Leuven, Belgium, September 2004.
    Abstract / PDF [175K]

  • Dynamic Quarantine of Internet Worms. Cynthia Wong, Chenxi Wang, Dawn Song, Stan Bielski, Gregory R. Ganger. Proceedings of the International Conference on Dependable Systems and Networks (DSN-2004). Palazzo dei Congressi, Florence, Italy. June 28th - July 1, 2004. Supercedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-03-108, December 2003.
    Abstract / Postscript [1.4M] / PDF [224K]

  • Balancing Locality and Randomness in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-203, November 2003.
    Abstract / Postscript [489K] / PDF [511K]

  • A Prototype User Interface for Coarse-Grained Desktop Access Control. A. Chris Long, Courtney Moskowitz, and Greg Ganger. Carnegie Mellon University Technical Report CMU-CS-03-200, November 2003.
    Abstract / Postscript [900K] / PDF [535K]

  • GEM: Graph EMbedding for Routing and Data-Centric Storage in Sensor Networks Without Geographic Information. James Newsome and Dawn Song. Proceedings of the First ACM Conference on Embedded Networked Sensor Systems (SenSys 2003). November 5-7, 2003, Redwood, CA.
    Abstract / Postscript [1.5M] / PDF [266K]

  • Location-based Node IDs: Enabling Explicit Locality in DHTs. Shuheng Zhou, Gregory R. Ganger, Peter Steenkiste. Carnegie Mellon University Technical Report CMU-CS-03-171, September 2003.
    Abstract / Postscript [282K] / PDF [160K]

  • Time Management for New Faculty. Anastassia Ailamaki, Johannes Gehrke. SIGMOD Record, 32(2): 102-106, June 2003.
    Abstract / PDF [110K]

  • Fast and Flexible Application-level Networking on Exokernel Systems. Gregory R. Ganger, Dawson R. Engler, M.Frans Kaashoek, Héctor M. Briceño, Russell Hunt, Thomas Pinckney. CMU SCS Technical Report CMU-CS-00-117, March 2000.
    Abstract / PDF [363K]

  • A Performance Study of Sequential I/O on Windows NT. Riedel, E., van Ingen, C. and Gray, J. Proceedings of the Second Usenix Windows/NT Symposium, Seattle, WA. August 1998.
    Abstract / Postscript [720K] / PDF [114K]

  • Selected Reports: Fall 1997 Software Systems Course. CMU SCS Technical Report CMU-CS-98-103, April 4, 1998.
    Abstract / Postscript [5.5M] / PDF [3.3M]