Title Requirements for a Data Management Infrastructure to support UK High-End Computing
Abstract There is a growing awareness that providing leading-edge computational and experimental facilities is not enough on its own. Many of these facilities produce large amounts of data, which have to be handled appropriately. Keeping records, archiving and exploring results, granting access to and disseminating new findings, and providing back-up capacity are major tasks; without adequate data archival and exploration, both historical and new data may well be lost. To ensure this does not happen, CLRC not only provides numerous data services but is also involved in a number of internal research projects on different topics within data management. In June 1998, CLRC Daresbury Laboratory started a new project called DAMP, the Data Management in Climate Research Project. The project is concerned with the data management requirements of climate researchers and environmental scientists who use high performance computing in their work. During the last year, the work has focused on analysing the current situation in the UK in the context of developments in other European countries and the USA. More importantly, future strategies have been developed to address the shortcomings identified during the analysis. Although the project was mainly concerned with environmental science and climate research, it also considers the needs of other computational science disciplines, and the report points out where results are of more general value. However, an exact picture for any other discipline would require a more detailed survey of its specific requirements.
Keywords Natural environment, data management, environmental science, high performance computing, HPC
