etb.eun.org
12.03.2001, Kluck (HUB/IZ)
ETB IST 1999 - 11781
IuK 2001: Metadata + Heterogeneity in ETB
Metadata and Handling of Heterogeneity as Central Means for the Development of a European School Portal: The Project European Schools Treasury Browser (ETB)
Presentation at the 7th Annual Meeting of the IuK Initiative, Trier, 11.-14.03.2001
Michael Kluck
Humboldt University Berlin, Computer Uses in Education (HUB)
Social Sciences Information Centre Bonn (IZ)
Introduction (I)
The ETB project is embedded in the context of the European Schoolnet (EUN) www.eun.org
The European Schoolnet is the new framework for the co-operation between the European Ministries of Education on Information and Communication Technology in Education.
EUN builds a European network of national and regional computer networks of repositories on schools.
BUILD THE “SCHOOLNET INFORMATION SPACE”
Introduction (II)
ETB works out the technological and structural prerequisites for this network of networks.
Building on a preceding project, ETB will implement the technical infrastructure and the content-based integration of the different services and of their cultural and linguistic contexts.
This presentation concentrates on the content integration of the participating networks and repositories.
The main user groups will be teachers and pupils.
Developing a Common Metadata Set
Context and general purpose:
- Get similarly structured information
- Facilitate targeted search
- Avoid a mismatch between the specific search and the unstructured universe of the Internet:
  - Topic versus person (e.g. Ohm, Kierkegaard)
  - Different domain-specific meanings (e.g. the German terms Leistung, Disziplin)
  - Domain-specific meaning versus general meaning (e.g. Lehre, services)
ETB Metadata
- Derived from the Dublin Core metadata elements and the EUN Metadata Element Set (developed in the preceding EUN project)
- Quite minimalised, but with obligation types:
  - M = mandatory
  - O = optional
- Using RDF syntax
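As a rough illustration of what a record in RDF syntax could look like, the following Python sketch serialises a few Dublin Core elements as RDF/XML. The resource URL, the field values and the function name to_rdf_xml are invented for illustration; the actual ETB encoding (including any EUN-specific namespace) is not given in these slides.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for RDF and Dublin Core 1.1.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"


def to_rdf_xml(resource_url, fields):
    """Serialise Dublin Core style fields as a minimal RDF/XML description."""
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": resource_url}
    )
    for name, value in fields.items():
        ET.SubElement(description, f"{{{DC_NS}}}{name}").text = value
    return ET.tostring(root, encoding="unicode")


record = to_rdf_xml(
    "http://example.org/resource/42",  # hypothetical resource
    {"title": "Volcano project", "creator": "A. Teacher", "language": "en"},
)
print(record)
```

Only Dublin Core elements are shown; an element such as EUN User Level would need its own namespace, which is left out here because the slides do not name one.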
ETB Metadata Elements (I)
Title               M
Creator             M
Subject             O or M?!
Description         M
Publisher           O
Contributor         O
Date                O
Type                O
ETB Metadata Elements (II)
Format              O
Identifier          M
Source              O
Language            M
Relation            O
Coverage            O
Rights Management   O
Audience            O
EUN User Level      O
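The obligation types lend themselves to a simple completeness check. The Python sketch below is an illustration only: the validate function and the sample record are hypothetical, and treating Subject as optional is an assumption, since the slides mark it "O or M?!".

```python
# Obligation types as listed on the slides: "M" = mandatory, "O" = optional.
# Subject is marked "O or M?!" in the source; "O" here is an assumption.
ETB_ELEMENTS = {
    "Title": "M", "Creator": "M", "Subject": "O", "Description": "M",
    "Publisher": "O", "Contributor": "O", "Date": "O", "Type": "O",
    "Format": "O", "Identifier": "M", "Source": "O", "Language": "M",
    "Relation": "O", "Coverage": "O", "Rights Management": "O",
    "Audience": "O", "EUN User Level": "O",
}


def validate(record):
    """Return (missing mandatory elements, unknown elements) for a record dict."""
    missing = [
        name for name, obligation in ETB_ELEMENTS.items()
        if obligation == "M" and not record.get(name)
    ]
    unknown = [name for name in record if name not in ETB_ELEMENTS]
    return missing, unknown


# Hypothetical record with two mandatory elements missing and one stray field.
sample = {
    "Title": "Volcano project",
    "Creator": "A. Teacher",
    "Language": "en",
    "Keywords": "geography",
}
missing, unknown = validate(sample)
print("missing:", missing)
print("unknown:", unknown)
```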
ETB Metadata Elements (III)
Element Subject
Besides freely chosen keywords:
- ETB thesaurus terms
- Sound or video clip representing the content of an audio, audiovisual, visual or multimedia resource
ETB Metadata Elements (IV)
Element EUN User Level
- School level or age group:
  - Pre-school (education)
  - Primary (education)
  - Adult Education
  - Secondary (education)
  - Vocational (education and training)
  - Higher Education
  - Juvenile (material for children and adolescents in general)
  - Adult (material for adults in general)
Producing Metadata
- Direct entry by authors (adapting given rules/definitions or using an online template)
- Generation by repositories during input
- Extraction from existing un-coded data by defining extraction rules
Metadata Extraction and Mapping
For repositories with different metadata structures, mapping schemes into the ETB Metadata Element Set will be set up.
For repositories without metadata schemes, metadata will be extracted from the entries wherever structured elements of the resources can be detected and an algorithm for converting them into metadata fields can be applied.
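Both cases can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the ETB implementation: the local field names in LOCAL_TO_ETB, the extraction rules (HTML title and meta tags) and the sample page are all hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical mapping scheme from one repository's local field names
# to ETB element names.
LOCAL_TO_ETB = {
    "headline": "Title",
    "author": "Creator",
    "abstract": "Description",
    "lang": "Language",
}


def map_record(local_record, mapping=LOCAL_TO_ETB):
    """Rename local fields to ETB elements; unmapped fields are dropped."""
    return {mapping[k]: v for k, v in local_record.items() if k in mapping}


class MetaExtractor(HTMLParser):
    """Assumed extraction rules for pages without a metadata scheme:
    <title> becomes Title, selected <meta name=...> tags become ETB elements."""

    RULES = {"author": "Creator", "description": "Description"}

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            name = (attributes.get("name") or "").lower()
            if name in self.RULES:
                self.fields[self.RULES[name]] = attributes.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title and data.strip():
            self.fields["Title"] = data.strip()


def extract(html):
    """Apply the extraction rules to one HTML page."""
    parser = MetaExtractor()
    parser.feed(html)
    return parser.fields


mapped = map_record({"headline": "Volcano project", "author": "B. Pupil", "hits": 17})
page = (
    "<html><head><title>Volcano project</title>"
    '<meta name="author" content="B. Pupil"></head></html>'
)
extracted = extract(page)
print(mapped)
print(extracted)
```

Either route ends in the same target structure, so records from repositories with and without their own metadata scheme can be handled uniformly afterwards.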
Metadata Exchange via NNTP
Technical Goals of ETB
A new approach for a European network of repositories:
- Network based on “Publish” not “Pull”
- Added value to users from a thesaurus
- Retain full local editorial policy
- High quality control tools
- Wider outreach
- Support of multilinguality
ETB Thesaurus (I)
Search problems:
- Natural language problems: synonymy, homonymy, polysemy, phrases, compounds, spelling variations
- Lack of relevance control
- Multilinguality
ETB Thesaurus (II)
Thesaurus benefits:
- Effective control of indexing language (preferred terms, inter-language equivalence)
- Systematic display of descriptors (ease of navigation through the terminology)
- Indexing and searching by using post-coordination
- Following recommendations of Dublin Core
- Basics for solving heterogeneity
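Two of these benefits, control of the indexing language through preferred terms and searching by post-coordination, can be illustrated with a toy example. The Python sketch below uses an invented vocabulary; none of the entries are taken from the ETB thesaurus, and the function names are hypothetical.

```python
# Toy thesaurus: non-preferred term -> preferred term (descriptor).
# All entries are invented for illustration.
USE = {
    "pupils": "students",
    "schoolchildren": "students",
    "maths": "mathematics",
}


def descriptor(term):
    """Map a term to its preferred form; unknown terms pass through lowercased."""
    normalised = term.strip().lower()
    return USE.get(normalised, normalised)


def index(keywords):
    """Index a document with the descriptors of its keywords."""
    return {descriptor(k) for k in keywords}


def search(collection, query_terms):
    """Post-coordination: single descriptors are combined at search time (AND)."""
    wanted = {descriptor(t) for t in query_terms}
    return [doc_id for doc_id, descs in collection.items() if wanted <= descs]


collection = {
    "doc1": index(["Maths", "pupils"]),
    "doc2": index(["mathematics", "teachers"]),
}
hits = search(collection, ["schoolchildren", "mathematics"])
print(hits)
```

Because both indexing and searching go through the same descriptor lookup, a query using "schoolchildren" finds a document indexed with "pupils".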
ETB Thesaurus (III)
The content of the repositories in the EUN context (multimedia material, teaching material, school projects), with schools as the target area and teachers and pupils as the main target groups, needs specific terminology.
Only a few repositories have developed their own terminology.
Handling Heterogeneity (I)
Making use of existing content descriptions
Dealing with heterogeneity on the content level means:
The same words or phrases may indicate different meanings in different environments (e.g. education, or class):
- Occurring anywhere in the full text of an Internet resource
- Being the code of a classification scheme assigned to a document
- Being an indexing term taken from a specific thesaurus
Handling Heterogeneity (II)
Use of existing intellectual work done by the different repositories or resource authors: indexing or classifying documents even with different schemes or terminologies
Use of existing terminologies or classification schemes for automatic processing of transfer relations
Handling Heterogeneity (III)
Methods for solving heterogeneity problems:
- Intellectual building of cross-concordances between relevant terminologies and classification schemes and between different languages, and automatic (statistical) building of transfer components
- Developing transfer components between those terminologies and schemes, and between those and the words occurring in the full texts (co-occurrence analysis, fuzzy methods, neural networks etc.)
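To make the statistical route concrete, here is a minimal Python sketch of a transfer component built by co-occurrence analysis. It assumes a set of documents that have been indexed in two schemes and estimates, for each term of scheme A, how often each term of scheme B was assigned to the same documents. The function name build_transfer, the threshold min_prob and the tiny German/English corpus are illustrative assumptions; the slides do not specify the method at this level of detail.

```python
from collections import Counter, defaultdict


def build_transfer(doubly_indexed, min_prob=0.5):
    """Estimate P(term_b | term_a) from documents indexed in both schemes
    and keep the pairs whose estimate is at least min_prob."""
    count_a = Counter()
    count_ab = defaultdict(Counter)
    for terms_a, terms_b in doubly_indexed:
        for a in set(terms_a):
            count_a[a] += 1
            for b in set(terms_b):
                count_ab[a][b] += 1
    transfer = {}
    for a, co_counts in count_ab.items():
        scored = {
            b: n / count_a[a]
            for b, n in co_counts.items()
            if n / count_a[a] >= min_prob
        }
        if scored:
            transfer[a] = scored
    return transfer


# Each pair holds the terms one document received in scheme A and in scheme B.
corpus = [
    (["Unterricht"], ["teaching"]),
    (["Unterricht", "Schule"], ["teaching", "schools"]),
    (["Schule"], ["schools"]),
    (["Unterricht"], ["curriculum"]),
]
transfer = build_transfer(corpus)
for term_a, candidates in transfer.items():
    print(term_a, "->", candidates)
```

An intellectually built cross-concordance would take the place of the estimated table where one exists; the statistical table fills the gaps between schemes for which nobody has written a concordance.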
Multilingual Access
Using ETB thesaurus and heterogeneity handling
ETB thesaurus allows indexing or searching in any covered language and results can automatically be retrieved in all other languages.
Heterogeneity handling (intellectually or automatically processed) allows the use of any (language specific) scheme: results can also be retrieved in other schemes or languages.
Integration of results in the area of cross-language information retrieval and its evaluation (see: CLEF = Cross-Language Evaluation Forum at www.clef-campaign.org )
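The thesaurus-based part of this can be sketched as follows: a search term in one language is resolved to a language-independent concept, and documents indexed with that concept's descriptor in any covered language are returned. The concept identifiers, labels and document ids in this Python sketch are invented for illustration and do not come from the ETB thesaurus.

```python
# Toy multilingual thesaurus: one concept id per descriptor, with one label
# per covered language. All ids and labels are invented for illustration.
CONCEPTS = {
    "c1": {"en": "teacher", "de": "Lehrer", "fr": "enseignant"},
    "c2": {"en": "school project", "de": "Schulprojekt", "fr": "projet scolaire"},
}

# Reverse lookup: lowercased label in any language -> concept id.
LABEL_TO_CONCEPT = {
    label.lower(): concept_id
    for concept_id, labels in CONCEPTS.items()
    for label in labels.values()
}


def expand(term):
    """Return the equivalent descriptors in every covered language."""
    concept_id = LABEL_TO_CONCEPT.get(term.lower())
    return dict(CONCEPTS[concept_id]) if concept_id else {}


def search(documents, term):
    """Find documents indexed with the term's concept, whatever their language."""
    labels = {label.lower() for label in expand(term).values()}
    return [
        doc_id
        for doc_id, keywords in documents.items()
        if labels & {k.lower() for k in keywords}
    ]


documents = {
    "de-17": ["Lehrer"],
    "fr-03": ["enseignant"],
    "en-08": ["school project"],
}
hits = search(documents, "teacher")
print(hits)
```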
Conclusion
ETB is strongly integrated into an existing and rapidly developing application for practitioners (teachers and pupils) with good political support for handling ICT in education.
ETB is strongly integrated into top level research on distributed networking, metadata, (cross-language) information retrieval, multilingual thesauri, and heterogeneity handling.
Thank you for your attention!
Further information
On the multilingual ETB thesaurus:
http://www.en.eun.org/eun.org2/eun/en/etb/content_frame.cfm?lang=en&ov=3813
On other aspects of the ETB Project (collection description, quality management, technical solutions):
http://www.en.eun.org/eun.org2/eun/en/etb/sub_area_frame.cfm?sa=195&row=1
Michael Kluck's publications: http://www.educat.hu-berlin.de/~kluck/kl-personal.html