gesis dr. maximilian stempfhuber head of research and development social science information centre,...

Post on 28-Dec-2015

215 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

GESIS

Dr. Maximilian StempfhuberHead of Research and Development

Social Science Information Centre, Bonn, Germany

How to deal with heterogeneity when building the CRIS of tomorrow?

Keep the Best – Forget the Rest?

2GESIS

GESIS: Short Introduction

German Social Science Infrastructure Services Centre for Survey Research and Methodology

(ZUMA), Mannheim Central Archive for Empirical Social Research,

University of Cologne (ZA) Social Science Information Centre, Bonn (IZ)

3GESIS

GESIS: Mission Statement

GESIS...

... provides services in support of social science research including the development and supply of databases with information on social science literature and research activities as well as the archiving and provision of survey data from social research

4GESIS

IZ: Short Introduction

The Social Science Information Centre (IZ) Builds databases on literature, research projects and institutes Distributes information via portals, hosts, CD-ROM, printed publications and personalized search services Does research in information technology, information retrieval and UI design

5GESIS

Agenda

Heterogeneity – The Users Heterogeneity – The Domains Heterogeneity – The Data Heterogeneity – The Data Providers Standards – A Solution? We Need a Model! Examples for Current Activities Conclusion

6GESIS

Heterogeneity – The Users (1)

Who will use a CRIS? Researchers Teachers and scholars Decision makers (local, national, EU, …) Funding agencies Companies Journalists …

7GESIS

Heterogeneity – The Users (2)

Are all users the same?

Domain skills

Generalskills

Researcher?

Company?

Scholar?

8GESIS

Heterogeneity – The Users (3)

Users evolve: From novices to domain experts From casual users to power users …Users switch: Between simple questions and complex problems Between different search strategies …And users communicate

9GESIS

Heterogeneity – The Users (4)

What do we have to provide? Adequate support for all user groups “Data in context” becomes information Dynamic, adaptive and adaptable systems All the functions – but only when needed Intelligent and integrated systems…

10GESIS

Heterogeneity – The Domains

Are all domains the same? Traditions (scientific process) Use of information technology Sharing of knowledge and data (raw data, scientific discourse, pre-print serves, electronic vs. printed publications) Scientific output (quantity) Perception by the public “Valuable” (who will request / pay for their output)…

11GESIS

Heterogeneity – The Data (1)

Users need different data Reference databases Full text documents / online ordering Project information Raw data (surveys, timeseries data, specimen, ...) People, teams, institutes, networks ...

12GESIS

Heterogeneity – The Data (2)

Even the same data can be different: Languages (Meta-)Data structures Content analysis and indexing Storage systems Retrieval models User interfaces Free vs. liable for costs / public vs. scientific use ...

13GESIS

Heterogeneity – The Data Providers (1)

Who are they? Authors Research institutes Information centres and libraries Funding agencies Decision makers (local, national, EU, …)...

14GESIS

Heterogeneity – The Data Providers (2)

Do they all have the same goals? Recognition by the community Administration of current research Reports to boards / proof of excellence Services to the community Strategic planning ...

15GESIS

Heterogeneity – The Data Providers (3)

How do they differ? Willing to document research and results Providing metadata and/or “real” data Providing direct access Providing different media and formats Commercial vs. non-commercial Willing to cooperate ...

16GESIS

Standards – A Solution? (1)

Are standards really feasible? A single schema for every data type? A single indexing language for all domains? A single architecture for information systems? A single communication protocol? A single authentication authority? A single …?

17GESIS

Standards – A Solution? (2)

Where are the problems? Long time for defining standards Investments in other (own) standards Every day a new “standard” It’s not “my” standard! Where is my competitive edge? Are Dublin Core and OAI really useful? Do I have the resources to support the standard? Do I have to change my IT? Will I lose some information forever?

18GESIS

Standards – A Solution? (3)

Where do standards help? Communication Date exchange Creating new resources Connecting resources (local to global) Common user experience Universal access

19GESIS

Standards – A Solution? (4)

But still they are just single standards!

How do we choose the best standard?

Will everyone switch to the standard?

20GESIS

We Need A Model! (1)

No! We need...

... a model to connect them!

© IBM

21GESIS

We Need A Model! (2)

Models... For the user experience To cope with structural heterogeneity To cope with semantic heterogeneity To identify and connect objects For connecting distributed systems For making cooperation possible

... based on common standards!

22GESIS

Examples for Current Activities

Current activities dealing with heterogeneity: MORESS infoconnex (www.infoconnex.de) Vascoda (www.vascoda.de)

23GESIS

Cooperation in MORESS (1)

Mapping of Research in European Social Sciences and Humanities 25 countries participating Project leader: European University Association (EUA) April 2003 – May 2005 Cataloguing country-specific information sources (e. g. databases) Thematic (relevant to FP6) mapping of research teams and experts

24GESIS

Cooperation in MORESS (2)

Organizational infrastructure: Central catalogue of information sources, fed by project partners Common metadata model Common classification Open software platform (DBClear), also for building new information sources Live linking to information sources

25GESIS

Cooperation in MORESS (3)

Results: Catalogue accessible by the public New information collections Meta-Search over selected sources Map of European research on a selected FP6 topic Platform for future activities towards the ERA

26GESIS

Multidisciplinarity in infoconnex (1)

Information Network “Education – Social Sciences – Psychology” Multidisciplinary information portal Reference databases (phase 1) Linking to full text (e.g. journal articles) Pay-per-View / One-Stop-Shop for commercial content Domain-specific and cross-domain search

27GESIS

Multidisciplinarity in infoconnex (2)

Treating heterogeneity of database structures and indexing Novel user interfaces, e. g. for cross- database searches Additional content: library catalogues, clearinghouses, research projects, institutes, scholarly materials, … (phase 2) Linking of database objects (phase 2)

28GESIS

Multidisciplinarity in infoconnex (3)

Education SocialSciences

Psychology

User‘s query

User interface layer

Query processing

Heterogeneity

Query mapping

standard

specific

Result presentation

29GESIS

Multidisciplinarity in infoconnex (4)

Education

SocialSciences

Psychology

SWD

Mapping ofvocabularies

30GESIS

Multidisciplinarity in infoconnex (5)

Cross-database search

31GESIS

Multidisciplinarity in infoconnex (6)

Results from multiple databases

32GESIS

Multidisciplinarity in infoconnex (7)

Database-specific result presentation

33GESIS

Multidisciplinarity in Vascoda (1)

German portal for scientific information Combining „Information Networks“ and „Virtual Libraries“ initiatives Reference databases with full text linking and online ordering + clearinghouses No authentication or billing Integrated search, but domain-specific results Cluster and cross-domain searches

34GESIS

Multidisciplinarity in Vascoda (2)

Integration of research projects, data and other materials planned No treatment of heterogeneity right now! How can quality of results be guaranteed with multidisciplinary databases (e.g. library catalogues)?

35GESIS

Multidisciplinarity in Vascoda (3)

36GESIS

Multidisciplinarity in Vascoda (4)

37GESIS

Multidisciplinarity in Vascoda (5)

38GESIS

Current Research: Heterogeneity / IR

Stepwise query refinement with cross-concordances Statistical transfers of query terms Dynamic user interfaces for cross- database searches Merging and ranking of results

39GESIS

Current Research: Novel UIs

40GESIS

Conclusion

To build a model for the CRIS of tomorrow should be the

primary task of

41GESIS

The End

Thank you for your attention!

Maximilian Stempfhuber

st@iz-soz.de

top related