data standardization in digital libraries an etd case in turkey Özlem Şenyurt topçu, tolga...
TRANSCRIPT
Data Standardization in Digital Libraries
An ETD Case in Turkey
Özlem Şenyurt Topçu, Tolga Çakmak, Güleda DoğanHacettepe University,
Department of Information Management{ozlemsenyurt, tcakmak, gduzyol} @ hacettepe.edu.tr
ContentO Data StandardizationO Data CleaningO Data Standardization and MetadataO LIS ETDs Archive in Turkey
Content&Data Structure Standardization work-flow Data Standardization Process
Conclusion
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
O Quality data should have some criteria: accuracy integrity completeness validity consistency uniformity density
Data Standardization
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Data Standardization
Data standardization:O consistency,
between the content and format of data types represented in a system,
for data mapping and data outputs.
Oprocesses to improve information retrieval,Oloss of data,Ounduplicated data.
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Essential processes for;Ouser interactionOmeeting information needsAdvantages;Oincreases the clarity level of data,Oprovides internal consistency of data,Opresents a high quality and consistent platform for users,
Data Standardization
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Data CleaningDetecting and removing errors and inconsistencies from data (Rahm and Do, 2000).
Data cleaning is to weed out unsuitable or incorrectly entered data in the data set (Tekerek, 2011).
Data cleaning is also stated as a process that increases data quality and solves data quality problems.
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Data Standardization and Metadata
O Usage of digital collections effectively is dependent on the metadata quality.
O management of digital resources usefully at a wider level requires some degree of standardization of data and metadata (Gartner, 2008, p. 4).
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
O Standardized data utilize: efficient discovery, access, transfer and use of common terms, common definitions, etc. (Gartner, 2008, p. 5;
Why, 2013).
Data Standardization and Metadata
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Standardized metadataOenables users effective and efficient access to data by using a set of terminology (GeoNetwork, 2008, p. 32).
Oallows users’ access to the information they need (Xie and Shibasaki, 2013).
Data Standardization and Metadata
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
LIS ETD Archive in Turkeyhttp://bbytezarsivi.hacettepe.edu.tr
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
LIS ETD Archive in Turkey
Objectives of the project are:OCreating a union catalogODeveloping a digital archive that presents full texts and bibliographic descriptions OIdentification of ETDs via interoperable and standardized structures.OIncreasing access and visibility OProviding OAI-PMH standards and protocols, and an interoperable environment for search engines.
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
O 436 post-graduate (masters and doctorate)
O by the end of 2012.
Content & data structure
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
O First stage;O Data set was determined and created
via various supplementary resources,O Second stage;
O Rules and norms that will be used for the standardization processes were determined,
O Final stage;O Data integrity were provided, and false
and flawed data were corrected.
Data standardization processes and data standardization work-flow
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Standardization work-flow
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
Data standardization process
O Authorities: Virtual Authority File (VIAF)
O Title: AACR2O Date: Publication mm-dd-yyO Keywords and subject headingsO SummariesO Usage restrictions
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
ConclusionO interoperability with other systems,O mostly supporting open access and
scholarly communication,O effective information retrieval and
support critical thinking processes of users,
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague
ReferencesGartner, R. (2008). Metadata for digital libraries: state of the art and future directions. JISC Technology & Standarts Watch. Retrieved June 17, 2013, from http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf
GeoNetwork Opensource (2008). The complete manual. Retrieved June 23, 2013, from http://apps.who.int/geonetwork/docs/Manual.pdf
Xie, R. and Shibasaki, R. (2013).Standardization framework for CEOP metadata development and application. CEOP/IGWCO Joint Meeting. University of Tokyo, Japan. Retrieved June 20, 2013, from http://jaxa.ceos.org/wtf_ceop/documents/CEOP_Metadata_Report_20th.pdf
Rahm, E. and Do. H. H. (2000). Data Cleaning: problems and current approaches. Retrieved June 10, 2013, from http://dc-pubs.dbs.uni-leipzig.de/files/Rahm2000DataCleaningProblemsand.pdf.
Tekerek, A. (2011). Veri madenciliği süreçleri ve açık kaynak kodlu veri madenciliği araçları. paper presented at Akademik Bilişim 2011 Konferansı. Malatya: İnönü Üniversitesi.
Why standardize metadata? (2013). Retrieved June 21, 2013, from http://gep.frec.vt.edu/pdfFiles/Metadata_PDF's/3.0MD_Presentation-Section3.pdf
3rd International Conference on Integrated Information (IC-ININFO) – September 5-9, 2013 – Prague