PARALLEL DATA LAB

Astro-DISC: Research Goals

The purpose of the Astro-DISC project is to create computational tools for eScience. We are working on new algorithms, data structures, and software architectures for analysis of massive scientific datasets. The project involves collaboration with domain scientists; development of efficient algorithms that address related scalability challenges; and application of cloud computing and supercomputing.

We follow two complementary approaches to these challenges. First, we are working with domain scientists to identify specific problems that require massive computation and developing specialized tools for these problems. The initial results include astronomical and cosmological applications. We are now also looking at problems in bioinformatics, sustainability, and gathering data on the web.

Second, we aim to develop more general toolkits, which will enable domain scientists to build their own applications for massive data processing, much in the same way as Excel enables users to build numeric applications. The long-term purpose is to create a distributed database for storing and indexing various scientific data, along with tools for querying and integrating these data. The initial results include techniques for distributed indexing of astronomical catalogs and cosmological simulations.

We are continually looking for new challenges and new collaborations with domain scientists. If you have related problems that you want to solve, we would love to hear from you. We can help you develop scalable algorithms. We can even write code for you and run it on a compute cluster.

astro-disc collage