PARALLEL DATA LAB

Continuous Data Reorganization





Automated Tuning of Disk Layouts

The mechanical positioning delays of disk accesses continue to plague system performance. To address this, many heuristics have been developed for adapting on-disk data layouts to expected and observed workload characteristics. We are developing a two-tiered software architecture for cleanly and extensibly combining such heuristics. In this architecture, each heuristic is implemented independently and an adaptive combiner merges their suggestions based on how well they work in the given environment. The result is a simpler and more robust system for automated tuning of disk layouts.

People

FACULTY

Greg Ganger

STUDENTS

Mike Mesnier
Brandon Salmon
Craig Soules
Eno Thereska

Acknowledgements

We thank the members and companies of the PDL Consortium: Amazon, Bloomberg, Datadog, Google, Honda, Intel Corporation, IBM, Jane Street, Meta, Microsoft Research, Oracle Corporation, Pure Storage, Salesforce, Samsung Semiconductor Inc., Two Sigma, and Western Digital for their interest, insights, feedback, and support.