Coupled Storage System for Efficient Management of Self-Describing Data Formats (CoSEMoS)
The project goal is to explore the benefits of a coupled storage system for self-describing data formats. It will introduce a novel hybrid approach leveraging storage technologies from the fields of high-performance computing and database systems, where each technology will be used according to its respective strengths and weaknesses. By coupling the storage system tightly with self-describing data formats, it can make use of structural information for selecting appropriate storage technologies and tiers. As such information is currently not available, storage systems have to employ heuristics, which often lead to suboptimal performance as well as unnecessary and expensive data movements. Moreover, the storage system will support adaptable I/O semantics to tune its performance according to application and data format requirements. Together, these features will enable completely new data management methods and provide significant performance improvements. Existing workflows of scientific users will be supported through a dedicated data analysis interface. All changes will be thoroughly tested to ensure backwards compatibility with existing applications and interfaces. Consequently, no modifications will be necessary to run applications on top of CoSEMoS, which helps preserve past investments in scientific software development.
More information can be found in the Problem Statement.
This project is funded by the German Research Foundation (DFG) under grant KU 3584/1-1.
Prof. Dr. Thomas Ludwig (director of the German Climate Computing Center and professor at Universität Hamburg)
Johann Lombardi (principal architect in the High Performance Data Division at Intel)
Uwe Schulzweida (one of the main developers of the Climate Data Operators at Max Planck Institute for Meteorology)