It has become a triviality to state the difficulty of finding relevant information on the web


Page 1: It has become a triviality to state the difficulty of finding relevant information on the web

Johannes Keizer

Food and Agriculture

Organization of the UN

Library and Documentation

Systems Division

The Agricultural

Ontology Service - project, a

cooperative approach to

improve organization and retrieval

of knowledge on the web.

21-03-2002

Slide 1

Introduction

It has become a triviality to state the difficulty of finding relevant information on the web

Slide 2

The Search Problem

How to evaluate search results?

Recall = Number of relevant documents identified / Number of relevant documents in the collection

Precision = Number of relevant documents identified / Total number of documents identified

Both parameters rank low today!
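The two measures on this slide can be computed for a single query as in the following minimal sketch. The function name and the document identifiers are hypothetical and serve only as illustration; the slides do not prescribe an implementation.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: ids of all documents the system identified
    relevant:  ids of all relevant documents in the collection
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant  # relevant documents identified
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: 4 documents returned, 5 relevant ones exist, 2 overlap.
retrieved = ["d1", "d2", "d3", "d4"]
relevant = ["d2", "d4", "d5", "d6", "d7"]
precision, recall = precision_recall(retrieved, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")  # precision=0.50 recall=0.40
```

Returning 0.0 for an empty result set or an empty relevant set is a convention chosen here to avoid division by zero; other conventions are possible.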

Slide 3

The Browse Problem

• Topic Trees from categorization schemes and thesauri are rigid and not very expressive

• Machine produced clusters are “flexible”, but very imprecise

Slide 4

Approaches to Resource Description and Discovery

• Statistical: uses words as lexical terms and statistical analysis (word counts); most full-text search engines work this way

• Semantic: assigns the information resource to a category or to thesaurus terms

Statistical methods are imprecise by nature; semantic methods depend on the knowledge organization system used
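The contrast between the two approaches can be illustrated with a small sketch. It assumes a toy thesaurus that maps surface terms to one preferred term; the entries, function names, and sample sentence are hypothetical and not taken from AGROVOC or any other real thesaurus.

```python
import re
from collections import Counter

# Hypothetical mini thesaurus: surface term -> preferred term.
THESAURUS = {"maize": "Maize", "corn": "Maize", "zea mays": "Maize"}


def statistical_score(query, text):
    """Statistical approach: count how often the query words occur in the text."""
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    return sum(counts[word] for word in re.findall(r"[a-z]+", query.lower()))


def semantic_terms(text):
    """Semantic approach: map the text to preferred thesaurus terms."""
    lowered = text.lower()
    return {
        preferred
        for term, preferred in THESAURUS.items()
        if re.search(r"\b" + re.escape(term) + r"\b", lowered)
    }


doc = "Corn yields rose after the new irrigation scheme."
print(statistical_score("maize", doc))                 # 0: the word "maize" never occurs
print(semantic_terms("maize") & semantic_terms(doc))   # {'Maize'}: both map to one concept
```

The word-count match misses the document because query and text use different words, while the thesaurus lookup finds it. The second result is only as good as the thesaurus behind it, which is the dependency the slide points out.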

Slide 5

Knowledge Organization Systems now

AGROVOC

NAL Thesaurus

CABI Thesaurus

Dedicated KOSs

Non-dedicated KOSs

e.g., ASFA thesaurus

e.g., the Multilingual Forestry Thesaurus

e.g., the Sustainable Development website classification

e.g., biological taxonomies such as NCBI and ITIS

GEMET

Other thematic thesauri

Existing Thesauri and Knowledge Organization Systems (KOSs)

Common concepts are not declared

No or very limited interoperability

Insufficient subject + language coverage

Severe maintenance problems

Very limited machine readability

Only very simple encoding of semantic relations

Slide 6

Consequences

• No common topic trees in one domain, no cross navigation between applications in a specific domain

• Keyword searches are based on statistical text analysis

• Automatic indexing systems mostly give poor results

• Web crawlers and harvesters do a good job only on already structured information sources

• The Semantic Web is very far away

Slide 7

But there is a solution…

• Development of Application and Domain Ontologies to assure:

– a formal description of the application knowledge

– possibility of machine processing

• Development of an Agricultural Ontology Service to assure:

– compatibility

– reusability

– sustainability

Slide 8

Ontologies?

An ontology is a formal knowledge organization system

It contains:

– concepts (and instances)

– definitions of concepts and instances

– relations between concepts and instances

Nearly everyone tries to build (implicit) ontologies:

– Directory structures, navigation trees

– Humans can overcome bad organization by intuition

– Machines have no intuition; machines need formal information
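As a rough illustration of what "formal information" for machines can look like, the sketch below stores concepts, their definitions, and named relations, and lets a program follow the relations. The class names, the relation name "is_a", and the sample concepts are assumptions made for this example; they are not the AOS data model.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    name: str
    definition: str
    # (relation_name, target_concept_name) pairs
    relations: list = field(default_factory=list)


class Ontology:
    def __init__(self):
        self.concepts = {}

    def add(self, name, definition):
        self.concepts[name] = Concept(name, definition)

    def relate(self, source, relation, target):
        # Both ends of a relation must be declared concepts.
        if source not in self.concepts or target not in self.concepts:
            raise KeyError(f"undeclared concept in {source!r} -> {target!r}")
        self.concepts[source].relations.append((relation, target))

    def ancestors(self, name):
        """Follow 'is_a' relations transitively and return the broader concepts."""
        found = []
        stack = [name]
        while stack:
            current = stack.pop()
            for relation, target in self.concepts[current].relations:
                if relation == "is_a" and target not in found:
                    found.append(target)
                    stack.append(target)
        return found


# Hypothetical sample concepts
onto = Ontology()
onto.add("Pest", "Organism that damages crops")
onto.add("Insect pest", "Pest that is an insect")
onto.add("Aphid", "Small sap-sucking insect")
onto.add("Wheat", "Cereal crop")
onto.relate("Insect pest", "is_a", "Pest")
onto.relate("Aphid", "is_a", "Insect pest")
onto.relate("Aphid", "damages", "Wheat")

print(onto.ancestors("Aphid"))  # ['Insect pest', 'Pest']
```

Because the relations are explicit, a program can conclude that an aphid is a pest without relying on the intuition a human reader would bring to a directory tree.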

Slide 9

A Sample ontology for crop pest management

Slide 10

Why Ontologies?

• Semantic organization of websites

– Knowledge maps

– Guided discovery of knowledge

– Easy retrieval of information without using complicated Boolean logic

• Text processing by machines

– Text mining on the Web (meaning-oriented access)

– Automatic indexing and text annotation tools

– Full-text search engines that create meaningful classifications (semantic clustering; FAO Schwarz is not related to FAO)

• Intelligent search of the Web

– Building dynamic catalogues from machine-readable metadata

• Natural language processing

– Better machine translation

– Queries using natural language

Slide 11

Guided Browse and Search Facilities

Records found: 5

1. xxxxxxxxxxx

2. xxxxxxxxxxx

3. xxxxxxxxxxx

4. xxxxxxxxxxx

5. xxxxxxxxxxx

What would you like to view?

– Forest rights issues

– Parasites of forests

– Pesticides used in forests

– Types of forest products

– Uses of forest products

You may also be interested in...

– Biotopes

– Cropping systems using forests

– Economics of forest production

– Forestry equipment

– Soil science

You can further limit by:

– Geographic area: Africa

– Type of resource: Web page

Slide 12

Context Sensitive Knowledge Access

Conservation agriculture

Farmers like it because it gives them a means of conserving, improving and making more efficient use of their natural resources

About camels and llamas

Descendants of the same rabbit-sized mammal, they have become two of humanity's most versatile domestic animals

Agribusiness and small farmers

Well managed contract farming contributes to both increased income for producers and higher profits for investors

Toward biosecurity

Biological and environmental risks associated with food and agriculture have intensified with economic globalization

Urban food marketing

In the “century of cities”, a major challenge will be providing adequate quantities of nutritional and affordable food for urban inhabitants

Crop science and ethics

In order to continue their contribution to human development, crop scientists must regain credibility

Use your right mouse button to learn more about an italicized word on the page.

Biosecurity: management of all biological and environmental risks associated with food and agriculture, including forestry and fisheries

See also: Biosafety, Food Safety, Risk Management

Or are you interested in...: Food Security, Biological Diversity

Agricultural Web Page
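The pop-up shown on this slide could be driven by a simple term lookup, sketched below. The Biosecurity entry is copied from the slide; the dictionary layout, the key names, and the function `lookup` are hypothetical and only show one way such a facility might be wired.

```python
# One glossary entry, with the content taken from the slide.
GLOSSARY = {
    "biosecurity": {
        "definition": (
            "management of all biological and environmental risks associated "
            "with food and agriculture, including forestry and fisheries"
        ),
        "see_also": ["Biosafety", "Food Safety", "Risk Management"],
        "related": ["Food Security", "Biological Diversity"],
    }
}


def lookup(word):
    """Return the pop-up text for a highlighted word, or None if it is unknown."""
    entry = GLOSSARY.get(word.lower())
    if entry is None:
        return None
    return "\n".join(
        [
            f"{word.capitalize()}: {entry['definition']}",
            "See also: " + ", ".join(entry["see_also"]),
            "Or are you interested in...: " + ", ".join(entry["related"]),
        ]
    )


print(lookup("biosecurity"))
```

In an ontology-backed service the "see also" and "related" lists would be derived from the relations between concepts rather than typed in by hand for each page.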

Slide 13

Why an Agricultural Ontology Service?

The production of knowledge organization systems in our subject area can only be:

– International: the Internet must become multilingual

– Multidisciplinary: FAO's subject area is broad and needs various inputs

– Cooperative: different expert knowledge has to be brought together and used

– Distributed: no central ownership should be sought

– Coordinated: coordination must ensure reusability and standardization

Slide 14

AOS: Iterative Knowledge Registration

Application

KOS uses components to build an application

Discussions and choices for amendments to components

Partner

Agricultural Ontology Service (AOS): federated storage and description facility

Components: terms, definitions, relationships

Partner

Components: terms, definitions, relationships

Users search and browse application using components

User feedback

Slide 15

The Start of the AOS Initiative

• The first workshop took place in Rome, November 2001

• A launch group was established with participation of

– Content providers (FAO, CABI)

– Solution providers in the agricultural area (ATO-Wageningen, University of Florida)

– Ontology development groups (AIFB Karlsruhe, CNR Italy)

• Key international partners now have to be identified

Slide 16

AOS Initiative – the development

• The second AOS workshop took place in January in Oxford, where prototypes were planned

– Fishery Ontology

– Crop Pest Ontology

– Microbial Agents Ontology

• The third AOS workshop will take place in May in Florida

– To discuss the prototypes

– To bring in key players from the Americas (IICA has already confirmed participation)

• Are you interested in participation?

Slide 17

AOS Initiative – Sustainability

FAO has played a key role in defining knowledge organization systems since the 1970s (AGRIS categories, AGROVOC)

But FAO needs committed partners for this project:

– for subject coverage

– for language coverage

External funding is necessary for the first years of the project

The Launch Group is preparing a proposal for the 6th Framework Programme (EU)

Other avenues must be explored

Slide 18

The Evolution of Knowledge Management

Libraries/Archives/File systems

Libraries/Archives/File Systems/Websites

Electronic Repositories

Bibliographic Catalogues on Cards or Computers

Human Indexing

Bibliographies

Reviews

Machine Readable Metadata Repositories

Bibliographic Catalogues Machine Index Catalogues

Human reading, checking and classifying

Human Indexing Machine Indexing

Machine Indexing Human Indexing

Statistical Analysis by Machines

Bibliographies/Output from Full-text Search Engines

Semantic Analysis by Machines

Knowledge-based specialized web portals

Books, Magazines, Articles, ….

Books, Magazines, Articles Databases, Webpages

Defined Electronic Information Elements

Knowledge Mining

Pre-Web / Web / Semantic Web

Ontologies

Thesauri, Classification Schemes, Glossaries

Slide 19

Further Information

http://www.fao.org/agris/AOS

http://www.fao.org/agris/AGMES

[email protected]

[email protected]