ID: 122

COUTADA, Miguel; SILVA, Hélder; RAMALHO, José Carlos 

Curriculum

Miguel Coutada holds a degree in Informatics Engineering from University of Minho, Portugal. He is currently pursing a Master's Degree in Informatics Engineering at the same university with his thesis "Database Preservation Toolkit".

 

Master degree by University of Minho in Informatics Engineering in 2011, with specializations in Natural Language Processing and Network Protocols and Infrastructures. Bachelor degree by University of Minho in Informatics Engineering in 2009. Joined KEEP SOLUTIONS in 2009, where he plays the role of Project Manager, carrying out several activities related with digital repositories and digital preservation. He was part of the technical team who developed RCAAP (Repositório Científico de Acesso Aberto de Portugal), having an important role in its developement and maintenance. Also participated in the national project RODA (Repositório de Objetos Digitais Autênticos), a digital repository specialized in digital preservation. In the European context, he participated in an EC-FP7 funded project in the digital preservation area called SCAPE (SCAlable Preservation Environments).

 

José Carlos Ramalho graduated in 1991 as a Systems and Informatics Engineer. During his graduation he has worked as a freelancer software developer and as a network technician for Apple. During this time he also taught several courses about computer programming for several public institutions.
He finishes his Phd in 2000 under the subject "Structured Documents Semantics". From 1996 until the present he has been researching in Structured Documents area (currently he is coordinating several projects in Digital Archives and Libraries field). He was responsible for the creation of the conference series called XATA (starting in 2003 until now). He wrote two books and many articles presented in internacional and nacional conferences.

During 2008, together with two former post-graduation students he launches KEEP Solutions, a spin-off software company focusing Information Archiving, Information Management, Information Access and Information Deployment.

Title:

Database Preservation Toolkit

Brief summary:

Database Preservation Toolkit allows the preservation of relational databases holding the structure and content of the the database in a preservation format in order to provide access to the database information in a long term period.

Content:

The preservation of information systems is one of the biggest challenges of digital preservation. Among those systems we can find databases. Databases support the majority of the information management systems, showing themselves as a valuable resource to preserve.

If in one hand there is a need to migrate databases to newer ones that appear with technological evolution, on the other hand there is also the need to preserve the information they hold for a long time period, due to legal duties but also due to archival issues. That being said, that information must be available no matter the database management system where the information came from.

In this area, the existing products for relational database preservation are still scarce - CHRONOS and SIARD are the main ones. The first one is, in most of the cases, unreachable due to the associated costs. The second one only supports basic features.

Therefore there is the urge to explore the main features and limitations of the existing products in order to improve 'db-preservation-toolkit' (http://keeps.github.io/db-preservation-toolkit/), an extracted component from the RODA project (http://www.roda-community.org).

Therefore, 'db-preservation-toolkit' was inproved with respect to performance and also adding new features in order to support more database management systems, address some missing features of the other products and provide an interface where it is possible to see and search the information of the archived database.

Scientific contribution:

The study of relational databases digital preservation, including analysis and definition of the significant properties and related database preservation format, and development of a system to allow format migration to the preservation format and extended features for continuous and adequate access of the information inside the databases to the community and external users.

Keywords:

database preservation, digital preservation, SIARD, DBML