data standardization in digital libraries an etd case in turkey Özlem Şenyurt topçu, tolga...

18
Data Standardization in Digital Libraries An ETD Case in Turkey Özlem Şenyurt Topçu, Tolga Çakmak, Güleda Doğan Hacettepe University, Department of Information Management {ozlemsenyurt, tcakmak, gduzyol} @ hacettepe.edu.tr

Upload: randolph-blankenship

Post on 24-Dec-2015

213 views

Category:

Documents


1 download

TRANSCRIPT

Data Standardization in Digital Libraries

An ETD Case in Turkey

Özlem Şenyurt Topçu, Tolga Çakmak, Güleda DoğanHacettepe University,

Department of Information Management{ozlemsenyurt, tcakmak, gduzyol} @ hacettepe.edu.tr

ContentO Data StandardizationO Data CleaningO Data Standardization and MetadataO LIS ETDs Archive in Turkey

Content&Data Structure Standardization work-flow Data Standardization Process

Conclusion

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

O Quality data should have some criteria: accuracy integrity completeness validity consistency uniformity density

Data Standardization

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Data Standardization

Data standardization:O consistency,

between the content and format of data types represented in a system,

for data mapping and data outputs.

Oprocesses to improve information retrieval,Oloss of data,Ounduplicated data.

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Essential processes for;Ouser interactionOmeeting information needsAdvantages;Oincreases the clarity level of data,Oprovides internal consistency of data,Opresents a high quality and consistent platform for users,

Data Standardization

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Data CleaningDetecting and removing errors and inconsistencies from data (Rahm and Do, 2000).

Data cleaning is to weed out unsuitable or incorrectly entered data in the data set (Tekerek, 2011).

Data cleaning is also stated as a process that increases data quality and solves data quality problems.

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Data Standardization and Metadata

O Usage of digital collections effectively is dependent on the metadata quality.

O management of digital resources usefully at a wider level requires some degree of standardization of data and metadata (Gartner, 2008, p. 4).

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

O Standardized data utilize: efficient discovery, access, transfer and use of common terms, common definitions, etc. (Gartner, 2008, p. 5;

Why, 2013).

Data Standardization and Metadata

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Standardized metadataOenables users effective and efficient access to data by using a set of terminology (GeoNetwork, 2008, p. 32).

Oallows users’ access to the information they need (Xie and Shibasaki, 2013).

Data Standardization and Metadata

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

LIS ETD Archive in Turkeyhttp://bbytezarsivi.hacettepe.edu.tr

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

LIS ETD Archive in Turkey

Objectives of the project are:OCreating a union catalogODeveloping a digital archive that presents full texts and bibliographic descriptions OIdentification of ETDs via interoperable and standardized structures.OIncreasing access and visibility OProviding OAI-PMH standards and protocols, and an interoperable environment for search engines.

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

O 436 post-graduate (masters and doctorate)

O by the end of 2012.

Content & data structure

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

O First stage;O Data set was determined and created

via various supplementary resources,O Second stage;

O Rules and norms that will be used for the standardization processes were determined,

O Final stage;O Data integrity were provided, and false

and flawed data were corrected.

Data standardization processes and data standardization work-flow

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Standardization work-flow

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

Data standardization process

O Authorities: Virtual Authority File (VIAF)

O Title: AACR2O Date: Publication mm-dd-yyO Keywords and subject headingsO SummariesO Usage restrictions

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

ConclusionO interoperability with other systems,O mostly supporting open access and

scholarly communication,O effective information retrieval and

support critical thinking processes of users,

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

ReferencesGartner, R. (2008). Metadata for digital libraries: state of the art and future directions. JISC Technology & Standarts Watch. Retrieved June 17, 2013, from http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf

GeoNetwork Opensource  (2008). The complete manual. Retrieved June 23, 2013, from http://apps.who.int/geonetwork/docs/Manual.pdf

Xie, R. and Shibasaki, R. (2013).Standardization framework for CEOP metadata development and application. CEOP/IGWCO Joint Meeting. University of Tokyo, Japan. Retrieved June 20, 2013, from http://jaxa.ceos.org/wtf_ceop/documents/CEOP_Metadata_Report_20th.pdf

Rahm, E. and Do. H. H. (2000). Data Cleaning: problems and current approaches. Retrieved June 10, 2013, from http://dc-pubs.dbs.uni-leipzig.de/files/Rahm2000DataCleaningProblemsand.pdf.

Tekerek, A. (2011). Veri madenciliği süreçleri ve açık kaynak kodlu veri madenciliği araçları. paper presented at Akademik Bilişim 2011 Konferansı. Malatya: İnönü Üniversitesi.

Why standardize metadata? (2013). Retrieved June 21, 2013, from http://gep.frec.vt.edu/pdfFiles/Metadata_PDF's/3.0MD_Presentation-Section3.pdf

3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague

If you can’t explain it simply, you don’t understand it well enough

Albert Einstein

Thank you…