data products & problems in agriculture

63
Supported by EU projects 29/11/2013 Athens, Greece Open Data for Agriculture Joint offering by Intro to Big Data

Upload: cthanopoulos

Post on 11-May-2015

731 views

Category:

Technology


0 download

DESCRIPTION

Lecture at the Intro Course "Big data in Agriculture" http://wiki.agroknow.gr/agroknow/index.php/Athens_Green_Hackathon_2013

TRANSCRIPT

Page 1: Data Products & Problems in Agriculture

Supported by EU projects

29/11/2013Athens, Greece

Open Data for Agriculture

Joint offering by

Intro to Big Data

Page 2: Data Products & Problems in Agriculture

Charalampos ThanopoulosAgro-Know Technologies

Data Products & Problems in Agriculture

Page 3: Data Products & Problems in Agriculture

Slide 3 of 63

Intro

• This presentation aims to provide information about the open data in agriculture, examples of agricultural data problems and how these can be described with the drivetrain approach

Page 4: Data Products & Problems in Agriculture

Slide 4 of 63

Objectives

This presentation aims to provide basic information on data-related issues in agriculture

• Provide an intro to agricultural sciences• Describe the use of open data in agriculture• Define the agricultural data formats• Provide examples of agricultural data problems

Page 5: Data Products & Problems in Agriculture

Slide 5 of 63

Structure

• The presentation consists of the following sections:– Intro to agriculture & agricultural sciences– Intro to agricultural market & potential– Intro to Open Data in agriculture– Review of agricultural data problems

• 4 agricultural case studies

Page 6: Data Products & Problems in Agriculture

INTRO TO AGRICULTURE & AGRICULTURAL SCIENCES

Sou

rce:

htt

p://

ww

w.a

gric

orne

r.com

/sha

reho

lder

-dem

ands

-to-

shap

e-m

oder

n-ag

ricu

ltur

e/

Page 7: Data Products & Problems in Agriculture

Slide 7 of 63

About agriculture

Definition 1: “the science or practice of farming, including cultivation of the soil for the growing of crops and the rearing of animals to provide food, wool, and other products”

Definition 2: “the set of activities that transform the environment for the production of animals and plants for human use. Agriculture concerns techniques, including the application of agronomic research”

Page 8: Data Products & Problems in Agriculture

Slide 8 of 63

About agricultural sciences

• Agricultural science: a broad multidisciplinary field encompassing the parts of exact, natural, economic and social sciences that are used in the practice and understanding of agriculture.– Veterinary science, but not animal science, is often

excluded from the definition

Page 9: Data Products & Problems in Agriculture

INTRO TO AGRICULTURAL MARKET & POTENTIAL

Sou

rce:

htt

p://

ww

w.a

rchi

ves.

gov.

on.c

a/en

/exp

lore

/onl

ine/

agri

cult

ure/

big/

big_

12_f

arm

ers_

mar

ket.a

spx

Page 10: Data Products & Problems in Agriculture

Slide 10 of 63

a huge market, globally

Food & Agricultural commodities production, http://faostat.fao.org

Page 11: Data Products & Problems in Agriculture

Slide 11 of 63

some figures

• Food - Gross Production Value globally in 2011: $2,318,966,621

• Agriculture - Gross Production Value globally in 2011: $2,405,001,443

• Investment in agriculture - Gross Capital Stock globally: $5,356,830,000

… they are big

Page 12: Data Products & Problems in Agriculture

Slide 12 of 63

examples of EU production in 2010

Source: Eurostat

Page 13: Data Products & Problems in Agriculture

Slide 13 of 63

how many businesses?

Page 14: Data Products & Problems in Agriculture

INTRO TO OPEN DATA IN AGRICULTURE

Page 15: Data Products & Problems in Agriculture

Slide 15 of 63

Definition of Open Data

“Open data is data that can be freely used, reused and redistributed by anyone -

subject only, at most, to the requirement to attribute and sharealike”

Page 16: Data Products & Problems in Agriculture

Slide 16 of 63

why open data?

• Open data, especially open government data, is a tremendous resource that is as yet largely untapped– individuals and organisations collect broad range of

different types of data to perform their tasks• Government is particularly significant in this

respect– quantity and centrality of data it collects– most is public data by law, could be made open and

made available for others to use

Page 17: Data Products & Problems in Agriculture

Slide 17 of 63

closed data

Examples:

…always bad?

Page 18: Data Products & Problems in Agriculture

Slide 18 of 63

open data for businesses

“new businesses and new business models are beginning to emerge: Suppliers, aggregators, developers, enrichers and enablers”“key link in the value chain for open data is the consumer…direct relevance to the choices individuals make as part of their day-to-day lives”

Page 19: Data Products & Problems in Agriculture

Slide 19 of 63

Open Data in agriculture: a political priority

“How Open Data can be harnessed to help meet the challenge of sustainably feeding nine billion people by 2050”

Page 20: Data Products & Problems in Agriculture

Slide 20 of 63

Agriculture is about to experience a “growth shock” in order to cover the exponentially increasing food needs

of the global population

• Key facts about agricultural trends• All demographic and food demand projections suggest that,

by 2050, the planet will face severe food crises due to our inability to meet agricultural demand – by 2050:– 9.3 billion global population, 34% higher than today– 70% of the world’s population will be urban, compared to 49%

today– food production (net of food used for biofuels) must increase by

70%

• According to these projections, and in order to achieve the forecasted food levels by 2050, a total investment of USD 83 billion per annum will be required

Page 21: Data Products & Problems in Agriculture

Slide 21 of 63

One of the most promising routes to agriculture modernisation is the provision of Open Data to all

interested parties

Open data in agriculture• In an era of Big Data, one of the most promising routes to

bootstrap innovation in agriculture is by the use of Open Data:– e.g. provisioning, maintaining, enriching with relevant metadata,

making openly available a vast amount of information• The use and wide dissemination of these data sets is strongly

advocated by a number of global and national policy makers such as:– The New Alliance for Food Security and Nutrition G-8 initiative– Food & Agriculture Organization of the UN– DEFRA & DFID in UK– USDA & USAID in the US

Page 22: Data Products & Problems in Agriculture

Slide 22 of 63

examples of variety & diversity

Page 23: Data Products & Problems in Agriculture

Slide 23 of 63

data sets

Page 24: Data Products & Problems in Agriculture

Slide 24 of 63

maps

Page 25: Data Products & Problems in Agriculture

Slide 25 of 63

photos

Page 26: Data Products & Problems in Agriculture

Slide 26 of 63

databases

Page 27: Data Products & Problems in Agriculture

Slide 27 of 63

• publications, theses, reports, other grey literature• educational material and content, courseware• primary data, such as measurements & observations

– structured, e.g. datasets as tables– digitized, e.g. images, videos

• secondary data, such as processed elaborations– e.g. dendrograms, pie charts, models

• provenance information, incl. authors, their organizations and projects

• experimental protocols & methods• social data, tags, ratings, etc.

Agricultural data formats

Page 28: Data Products & Problems in Agriculture

Slide 28 of 63

.. examples of Big Data in agriculture

Page 29: Data Products & Problems in Agriculture

REVIEW OF AGRICULTURAL DATA PROBLEMS

Page 30: Data Products & Problems in Agriculture

CASE STUDY 1A: PRODUCING HIGHLY NUTRITIOUS GREEN VEGETABLES

Page 31: Data Products & Problems in Agriculture

Slide 31 of 63

Radiki.com

• food scouter collecting edible plants like of wild Taraxacum officinale W. http://en.wikipedia.org/wiki/Taraxacumbusiness

• opportunity: gourmet restaurants are looking for such highly nutritious & appreciated greens

Page 32: Data Products & Problems in Agriculture

Slide 32 of 63

data problems

Page 33: Data Products & Problems in Agriculture

Slide 33 of 63

data problems

Dried export in US

Page 34: Data Products & Problems in Agriculture

Slide 34 of 63

Issues identified

1. finding right & relevant only legislation2. finding right, natural drying techniques for

these plants3. finding scientific info on proper packaging

Page 35: Data Products & Problems in Agriculture

Slide 35 of 63

1: finding right &

relevant only legislation

Page 36: Data Products & Problems in Agriculture

Slide 36 of 63

Page 37: Data Products & Problems in Agriculture

Slide 37 of 63

Page 38: Data Products & Problems in Agriculture

Slide 38 of 63

2: finding right, natural drying

techniques for these plants

Page 39: Data Products & Problems in Agriculture

Slide 39 of 63

Page 40: Data Products & Problems in Agriculture

Slide 40 of 63

Page 41: Data Products & Problems in Agriculture

Slide 41 of 63

3: finding scientific info

on proper packaging

Page 42: Data Products & Problems in Agriculture

Slide 42 of 63

Page 43: Data Products & Problems in Agriculture

Slide 43 of 63

Page 44: Data Products & Problems in Agriculture

Slide 44 of 63

The Drivetrain approach

• Enrich existing bibliographic information• Link bibliographic information with related Web resources• Allow users to access the full-text of a publication and all the information the Web knows about a specific research area in the agricultural domain

• Users’ requirements• Linked data infrastructure• Selection of available data sources

• Existing bibliographic information• Available additional data sources

• Develop algorithms for linking data from various data sources (i.e. DBPedia, World Bank etc) using a linked-data approach involving AGROVOC

Page 45: Data Products & Problems in Agriculture

CASE STUDY 1B: AGRO-FOOD COOPERATIVE

Page 46: Data Products & Problems in Agriculture

Slide 46 of 63

• Christos Stamatis – (CEO of the Stevia Coop)

• Crowd funding model– 250 growers– first Greek Stevia

Stevia Hellas

Page 47: Data Products & Problems in Agriculture

Slide 47 of 63

Page 48: Data Products & Problems in Agriculture

Slide 48 of 63

Issues identified

1. Strengthen the knowledge about food safety2. Where to set up the adding value processing

unit3. Organic portion of the coops cultivation4. Define product price

Page 49: Data Products & Problems in Agriculture

Slide 49 of 63

• The coop needs to strengthen the knowledge in Food Safety and to follow the standards

• Needs to have access to a portal that provides access to such information

• It could be extended to cover also other food products and domains

• What kind of open data are needed– OER from Educational Institutions– Open courses– Data from the ministry on which are the food standards– Data from FAO e.g. FAO codex

Problem 1: Strengthen the knowledge about food safety

Page 50: Data Products & Problems in Agriculture

Slide 50 of 63

• Benefits for a stakeholder– personnel that you need to ensure that you will

follow the food safety standards– find the food safety standards that should be

followed– define relevant training for your employees

How such service can help

Page 51: Data Products & Problems in Agriculture

Slide 51 of 63

• The cooperative would like to have a product on the shelf

• Valuable information– available energy resources– shipping roots– availability of land

Problem 2: Where to set up the adding value processing unit

Page 52: Data Products & Problems in Agriculture

Slide 52 of 63

• A cooperative would like to invest more in organic cultivation

• Valuable information– market needs in organic products and stevia

specifically– prices of the last years for conventional products– climate conditions– soil quality maps

Problem 3: Organic portion of the coops cultivation

Page 53: Data Products & Problems in Agriculture

Slide 53 of 63

• Estimate the price for coops’ product • Valuable information

– sugar prices– international prices of stevia– meteo data– import prices in Greece

Problem 4: Define product price

Page 54: Data Products & Problems in Agriculture

CASE STUDY 2A: PRODUCE TRAINING MATERIAL FOR NATURAL PRODUCTS

Page 55: Data Products & Problems in Agriculture

Slide 55 of 63

• creating natural effective and holistic products since 1979 to promote health & beauty– Lately involved in the agricultural education and

training– Producing training material, creating courses etc.

related to the ingredients of APIVITA natural products

APIVITA

Page 56: Data Products & Problems in Agriculture

Slide 56 of 63

APIVITA

finding OER material for natural products

Page 57: Data Products & Problems in Agriculture

Slide 57 of 63

The Drivetrain approach

Creating additional services for APIVITA web site

• Existing (generic) user requirements• Existing appropriate functionalities• Data models available to support the functionalities of the new web site

• APIVITA-owned content• External Open Educational Resources with related content• Requirements from the expected users of the APIVITA micro-site• Feedback (rating/reviews) of available resources from the APIVITA users

Develop algorithms for • Filtering results from the linked data stes, • Fine-tuning content based on the feedback received• Revising user interface/facets based on new requirements

Page 58: Data Products & Problems in Agriculture

CASE STUDY 2B: ORGANIZE TRAINING MATERIAL FOR ORGANIC PRODUCTS

Page 59: Data Products & Problems in Agriculture

Slide 59 of 63

Association of organic products

• SEAE: Sociedad Española de Agricultura Ecologica

• A non-profit organization promoting organic agriculture in Spain

• Organizes training events and Conferences/Workshops– Produces training material and collects

publications from Conference submissions

Page 60: Data Products & Problems in Agriculture

Slide 60 of 63

Issues identified

• Issue: – Material produced not described with metadata– Only available (partially) in SEAE website – All information only available in Spanish

Page 61: Data Products & Problems in Agriculture

Slide 61 of 63

Page 62: Data Products & Problems in Agriculture

Slide 62 of 63

The Drivetrain approach

Create a collection of multilingual metadata for describing resources and publish metadata in other websites

• Multilingual metadata authoring tool (e.g. AgLR)• Automatic translation tools• Agricultural educational portals

• Training material produced by SEAE• Conference submissions, publications & proceedings

• Develop algorithms for the selection of content for the SEAE collection• Publication of multilingual metadata in other OER web portals

Page 63: Data Products & Problems in Agriculture

Thank you!

Charalampos Thanopoulos

Agro-Know Technologies

[email protected]