XML representation of datasets
Alejandro Engelmann
Environmental Data Centre at SLU
Workshop “Critical issues for the preservation of datasets”, 06-04-26, Alejandro
Datasets
• Tables
• Text documents
• Spreadsheets
• Databases
• Metadata
• Reports
• Applications
Torque
• Apache Software Foundation project.
• Object-relational mapper for Java.
• Hides database-specific implementation details.
• Independent of a specific database.
• … if no exotic features of the database are used.
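To make the "independent of a specific database" point concrete, here is a minimal sketch of the general idea: one neutral table description rendered as DDL for several databases. It is not Torque's code or API. The type table `TYPE_MAP`, the function `create_table_sql`, and the `breed` table are all hypothetical names chosen for illustration.

```python
# Hypothetical neutral-to-vendor type table; an illustration, not taken from Torque.
TYPE_MAP = {
    "postgresql": {"INTEGER": "INTEGER", "VARCHAR": "VARCHAR", "BOOLEAN": "BOOLEAN"},
    "mysql": {"INTEGER": "INT", "VARCHAR": "VARCHAR", "BOOLEAN": "TINYINT(1)"},
    "oracle": {"INTEGER": "NUMBER(10)", "VARCHAR": "VARCHAR2", "BOOLEAN": "NUMBER(1)"},
}


def create_table_sql(table, columns, dialect):
    """Render a CREATE TABLE statement for one dialect.

    columns is a list of (name, neutral_type, size) tuples; size may be None.
    """
    types = TYPE_MAP[dialect]
    parts = []
    for name, neutral_type, size in columns:
        sql_type = types[neutral_type]
        if size is not None:
            sql_type += f"({size})"
        parts.append(f"{name} {sql_type}")
    return f"CREATE TABLE {table} ({', '.join(parts)})"


# Hypothetical table description, written once and reused for every dialect.
BREED_COLUMNS = [("id", "INTEGER", None), ("name", "VARCHAR", 50)]

for dialect in TYPE_MAP:
    print(dialect, "->", create_table_sql("breed", BREED_COLUMNS, dialect))
```

The mapping only works for types every target database supports, which is the caveat in the last bullet: vendor-specific features have no neutral equivalent.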
Schema
XML (DTD, XSD, XSL, XQuery, …)
Files
• schema.dtd
• schema.xml
• data.dtd
• data.xml
• metadata
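As an illustration of what a schema.xml file might hold, the sketch below reads table and column information from a database and writes it out as XML. It uses Python's built-in SQLite only so that it runs anywhere. The element and attribute names (`database`, `table`, `column`, `primaryKey`, `required`) are loosely modeled on Torque-style schema files and should be read as assumptions, as should the function name `describe_schema` and the sample `breed` table.

```python
import sqlite3
import xml.etree.ElementTree as ET


def describe_schema(conn, db_name):
    """Return a <database> element listing each user table and its columns."""
    root = ET.Element("database", name=db_name)
    table_names = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
        )
    ]
    for table_name in table_names:
        table = ET.SubElement(root, "table", name=table_name)
        # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk)
        for _cid, col, col_type, notnull, _default, pk in conn.execute(
            f'PRAGMA table_info("{table_name}")'
        ):
            attrs = {"name": col, "type": col_type or "UNKNOWN"}
            if pk:
                attrs["primaryKey"] = "true"
            if notnull:
                attrs["required"] = "true"
            ET.SubElement(table, "column", attrs)
    return root


# Hypothetical sample database, for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE breed (id INTEGER PRIMARY KEY, name TEXT NOT NULL, origin TEXT)"
)
schema = describe_schema(conn, "fowl_genetics")
print(ET.tostring(schema, encoding="unicode"))
```

A real archiving tool would also record foreign keys, indexes, and sizes; this sketch covers only names, types, and two constraints.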
Database archiving in DSpace (diagram)
Author: Alejandro Engelmann. Version: 1.0. Created: 3/16/2005 7:18:31 AM. Updated: 1/30/2006 2:36:52 PM.
• Database
• User input: table and fields info, database connection info, archiving metadata
• DbDoc: extract metadata, complete database description, extract data
• SIP: schema.xml, data.dtd, 1..* data.xml, Dublin Core metadata, databas.dtd
• DSpace: archive
• AIP
• DTools: extract, adapt
• DIP: file, schema view, SQL script, spreadsheet, text
Database archiving in DSpace (diagram detail: from database to SIP)
• Database
• User input: table and fields info, database connection info, archiving metadata
• DbDoc: extract metadata, complete database description, extract data
• SIP: schema.xml, data.dtd, 1..* data.xml, Dublin Core metadata, databas.dtd
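The "extract data" step and the Dublin Core metadata in the SIP can be sketched as follows. This is not the DbDoc implementation. The `dataset`/`row` layout, the function names `dump_table` and `dublin_core`, the sample rows, and the creator and date values are all assumptions made for illustration; only the Dublin Core element namespace is the standard one.

```python
import sqlite3
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace


def dump_table(conn, table_name):
    """Return a <dataset> element with one <row> child per table row."""
    root = ET.Element("dataset", table=table_name)
    cursor = conn.execute(f'SELECT * FROM "{table_name}"')
    columns = [d[0] for d in cursor.description]
    for row in cursor:
        # NULL values are omitted from the row element rather than written out.
        attrs = {c: str(v) for c, v in zip(columns, row) if v is not None}
        ET.SubElement(root, "row", attrs)
    return root


def dublin_core(title, creator, date):
    """Return a minimal metadata record using three Dublin Core elements."""
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")
    for tag, value in (("title", title), ("creator", creator), ("date", date)):
        ET.SubElement(root, f"{{{DC_NS}}}{tag}").text = value
    return root


# Hypothetical sample data, for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE breed (id INTEGER PRIMARY KEY, name TEXT NOT NULL, origin TEXT)"
)
conn.executemany(
    "INSERT INTO breed (id, name, origin) VALUES (?, ?, ?)",
    [(1, "Leghorn", "Italy"), (2, "Silkie", None)],
)
data = dump_table(conn, "breed")
# Creator and date are placeholders, not values from the presentation.
meta = dublin_core("Genetics of domestic fowls", "Example Creator", "2006-01-01")
print(ET.tostring(data, encoding="unicode"))
print(ET.tostring(meta, encoding="unicode"))
```

Writing one data file per table would match the "1..* data.xml" multiplicity in the diagram, though how the original tool splits its output is not stated on the slide.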
Database archiving in DSpace (diagram detail: from SIP to DIP)
• SIP: schema.xml, data.dtd, 1..* data.xml, Dublin Core metadata, databas.dtd
• DSpace: archive
• AIP
• DTools: extract, adapt
• DIP: file, schema view, SQL script, spreadsheet, text
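The "adapt" step, which turns archived XML into a SQL script or a spreadsheet-friendly text file, might look like the sketch below. It is not the DTools code. It assumes a hypothetical data layout in which a `dataset` element holds one `row` element per record with column values as attributes; the function names and sample rows are likewise invented for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical archived data file; the layout is an assumption.
DATA_XML = """<dataset table="breed">
  <row id="1" name="Leghorn" origin="Italy"/>
  <row id="2" name="Silkie"/>
</dataset>"""


def sql_literal(value):
    """Quote a value as a SQL string literal; a missing attribute becomes NULL."""
    if value is None:
        return "NULL"
    return "'" + value.replace("'", "''") + "'"


def to_sql_script(dataset, columns):
    """Render one INSERT statement per row. All values are written as strings."""
    table = dataset.get("table")
    column_list = ", ".join(columns)
    lines = []
    for row in dataset.findall("row"):
        values = ", ".join(sql_literal(row.get(c)) for c in columns)
        lines.append(f"INSERT INTO {table} ({column_list}) VALUES ({values});")
    return "\n".join(lines)


def to_csv(dataset, columns):
    """Render the rows as CSV text with a header line, for spreadsheet import."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(columns)
    for row in dataset.findall("row"):
        writer.writerow([row.get(c, "") for c in columns])
    return buffer.getvalue()


dataset = ET.fromstring(DATA_XML)
COLUMNS = ["id", "name", "origin"]
print(to_sql_script(dataset, COLUMNS))
print(to_csv(dataset, COLUMNS))
```

In practice the column list and types would come from the archived schema description rather than being hard-coded, so that numeric columns are not quoted as strings.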
Example: Genetics of domestic fowls