open linked governmental data for citizen engagement
DESCRIPTION
Open Linked Governmental Data for Citizen Engagement – A workshop about the benefits and restrictions of open linked governmental data and the role of metadata in citizen engagement (Anneke Zuiderwijk, Marijn Janssen, Keith Jeffery, Yannis Charalabidis) #cedem12TRANSCRIPT
CEDEM 2012, May 3-4
Open Linked Governmental Data for Citizen Engagement
A workshop about the benefits and restrictions of open linked governmental data and the role of metadata in citizen engagement
Anneke Zuiderwijk*, Marijn Janssen*, Keith Jeffery**, Yannis Charalabidis***
*Delft University of Technology, The Netherlands**Science and Technology Facilities Council, United Kingdom*** University of AEGEAN, Greece
Agenda
0 Introduction0 The ENGAGE project0 Questionnaire0 Discussion about questionnaire and first results0 Presentations
0 Anneke Zuiderwijk - Benefits and restrictions on the use of open linked governmental data from the ENGAGE project
0 Keith Jeffery - The use of metadata for citizen engagement
0 Discussion
CEDEM Workshop, Krems, May 3-4, 2012
Introduction0 Considerable attention is paid to open governmental data (e.g. EC
PSI-directives, national open data platforms, local initiatives)0 EU Public Sector Information (PSI) directive (European Commission,
2003) 0 “A general framework is needed in order to ensure fair, proportionate and non-
discriminatory conditions for the re-use of [PSI]” (p. 1) 0 “PSI is an important primary material for digital content products and services”
(p. 1)
0 Many directives and implementation guidelines followed0 Obama administration “establishment of an unprecedented level of
openness of the Government” (Obama, 2009)0 Open Data Strategy for Europe (European Commission, 2011)
0 “It will be made a general rule that all documents that are made accessible by public sector bodies can be re-used for any purpose, commercial or non-commercial, unless protected by third party copyright” (p. 1)
0 “public bodies should not be allowed to charge more than costs triggered by the individual request for data (marginal costs)” (p. 1)
CEDEM Workshop, Krems, May 3-4, 2012
Introduction0 Open governmental data can be defined as “all stored
data of the public sector which could be made accessible by government in the public interest without any restrictions on usage and distribution” (Geiger & Von Lucke, 2011, p. 185).
0 For example, public sector data can be:0 Geographic data (e.g. cadastral information)0 Legal data (e.g. courts decisions, legislation)0 Meteorological data (e.g. climate data, weather forecasts)0 Social data (e.g. population, public administration)0 Transport data (e.g. traffic congestion, work on roads)0 Business data (e.g. chamber of commerce, patents) (MEPSIR study,
Dekkers et al., 2006)
CEDEM Workshop, Krems, May 3-4, 2012
Introduction
0 Information and Communication Technologies (ICT) have the potential to improve the responsiveness of governments to the needs of citizens and scientific communities
0 Example: feedback loop (derived from Janssen & Zuiderwijk, forthcoming)
CEDEM Workshop, Krems, May 3-4, 2012
government publicOpen data
using
processing
searching
discussing
recommendingparticipation
publishingmake available ?data
?
finding
The ENGAGE project
0 However, significant barriers hinder the effective exploration, management and distribution of the vast amounts of available public sector data ENGAGE project
0 ENGAGE (FP7): An Infrastructure for Open, Linked Governmental Data Provision towards Research Communities and Citizens (http://www.engage-project.eu)
0 Main goal: the development and use of a data infrastructure, incorporating distributed and diverse public sector information (PSI) resources.
CEDEM Workshop, Krems, May 3-4, 2012
The ENGAGE project
0 The ENGAGE project:0 Opens up diverse government data to researchers0 European Level (all countries of the EU)0 Establishes Metadata Standardization framework0 Provides access and discovery on cross-country datasets0 Provides feedback back to public data agencies
0 The ENGAGE platform will enable researchers and citizens to:0 Discover and browse datasets across diverse and dispersed public
sector information resources (local, national and European) in their own language
0 Download the datasets0 Perform geospatial search of datasets0 Visualize properly structured datasets in data tables, maps and charts
CEDEM Workshop, Krems, May 3-4, 2012
The ENGAGE Project0 A European Infrastructure0 Integrating Public Sector Data 0 Providing Public Sector Information (PSI) to Research Communities
and Citizens
CEDEM Workshop, Krems, May 3-4, 2012
The ENGAGE Project - Questionnaire
CEDEM Workshop, Krems, May 3-4, 2012
0 The deposition, access and use of open public sector data should be improved
we conduct a questionnaire to find out needs regarding to the use of public sector data (e.g. deposit, access and use needs).
0 You are asked to participate in this survey, because you might (potentially) use open public sector data
0 The results will be used to develop and further specify the requirements of the ENGAGE e-infrastructure for open data
The ENGAGE Project - Questionnaire
2nd ENGAGE Meeting, Haifa, Nov 14-15, 2011
0 The information provided by you participating is treated in a confidential way
0 Completing the questionnaire will take about 10-25 minutes of your time (14-23 questions)
0 Receive the results of the questionnaire (please leave your contact details at the end)
0 Attention: each time that the term 'open data' is used in the questionnaire, this refers to 'open governmental/public sector data'!
CEDEM Workshop, Krems, May 3-4, 2012
The ENGAGE Project - Questionnaire
CEDEM Workshop, Krems, May 3-4, 2012
0 Taking the questionnaire.
0 The results of this questionnaire will be used to find out your needs regarding the use of public sector data and to develop and further specify the requirements for the ENGAGE e-infrastructure for open public sector data.
0 Your response is very valuable to us. Thank you very much for participating in this survey.
Questionnaire - Approach
0 Target groups:0 (Potential) Users of open public sector data 0 Researchers and citizens from all fields of research and all EU-
countries0 Questions
0 Based on interviews and Unified Theory of Acceptance and Use of Technology (UTAUT)
0 Background questions (gender, age and function)0 Current use of open public sector data (use, type of data, frequency,
websites, purpose, ability, usefulness)0 Metadata (use, benefits, restrictions, needs)0 Statements about use (perceptions of easiness, expectations,
voluntariness, intentions)0 Other (comments and contact details)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire - Approach
0 Questionnaire was put on line and:0 Sent to e-mail lists of conferences (e.g. the E-GOV list)0 Put on the ENGAGE project website0 Sent to contacts via LinkedIn0 Sent to contacts directly (e.g. via the ENGAGE contact list)0 Sent to organizations that employ researchers that probably
work with open data0 Sent to contacts working for open data platforms and asked
them to put the link to the questionnaire on their website (e.g. EPSI-platform and Dutch governmental open data websites)
0 Questionnaire was printed on paper and used for:0 Workshops 0 Handing out to conference participants
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire - Approach
0 Aim: at least 246 respondents finishing the questionnaire (taking into account the confidence level and a margin of error)
0 On April 19, 2012, 129 people started the questionnaire and 60 people (24% of aim) finished the questionnaire
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Background
75
25
0
20
40
60
80
100
Man Woman
Gender (%) (N=120)
1 2
9
16
28 27
12
7
0
5
10
15
20
25
30
Under 18 18-21 22-25 26-30 31-40 41-50 51-60 61 or over
Age (%) (N=120)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Background
* Multiple answers possible
42
8
2015 14
05
1015202530354045
Social sciences Natural sciences Non-scientific Non-scientificindustry
Other
Working field (%) (N=118)
26%28%
32%
0%
5%
10%
15%
20%
25%
30%
35%
Economics Political science Sociology
Social sciences* (N=47)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Did you ever use open public sector data?
0 More or less than 75 percent?0 And which types of open public sector data are used most?
(geographic, legal, meteorological, social, transport, business or other?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Type of use
* Multiple answers possible
Two paths:0 Potential users0 Users
85%
13%
2%0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
Yes No Don't know
Did you ever use open data? (N=113)
65%
37%29%
76%
47%54%
12%
0%
10%
20%
30%
40%
50%
60%
70%
80%
Geographic data Legal data Meteorologicaldata
Social data Transport data Business data Other data
Which types of open public sector data?* (N=94)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 How often do you use open public sector data?
0 What did the majority say? (yearly, monthly, weekly, daily?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Type of use
0%
24%
39%
24%
14%
0%0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
Never Yearly or a fewtimes per year
Monthly or a fewtimes per month
Weekly or a fewtimes per week
Daily or multipletimes per day
Don't know
How often? (N=93)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Which of the following non-European Union open public sector data sources/websites have you used in the past?
0 What did the majority say? (E.g. data.gov? Which other websites?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Websites
Use of non-EU websites (N=78) %
United States: www.data.gov 65%
Canada: www.data.gc.ca 12%
New Zealand: http://www.data.govt.nz/ 6%
Australia: http://data.australia.gov.au 14%
Morocco: http://data.gov.ma/ 0%
Moldova: http://data.gov.md 1%
Albany: http://open.data.al 0%
Israel: http://data.gov.il 0%
Kenya: http://www.opendata.go.ke/ 4%
Other, namely: 12%
None of these 28%
I do not remember 6%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Which of the following European Union open public sector data sources/websites have you used in the past?
0 What did the majority say? (E.g. data.gov.uk? Which other websites?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Websites
Use of EU-Websites (N=80) %
Europe: www.epsiplatform.eu 25%
United Kingdom: www.data.gov.uk 53%
France: www.data.gouv.fr 5%
Greece: www.observatory.gr 4%
Netherlands: www.data.overheid.nl 13%
Luxembourg: www.statistiques.public.lu 0%
Italy: www.dati.gov.it 6%
Belgium: http://data.gov.be 1%
Norway: www.data.norge.no 4%
Denmark: www.digitaliser.dk 1%
Estonia: http://www.opendata.ee/ 1%
Spain: http://datos.gob.es 8%
Other, namely: 13%
None of these 14%
I do not remember 5%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Which other data sources/websites of open public sector data have you used in the past?
0 Many other websites? Which websites?
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – WebsitesUse of other websites (N=88)
EurostatCBS (Dutch statistics offi ce)United NationsWorld Bankhttp://diavgeia.gov.grhttp://geodata.gov.grhttp://www.gsis.grOECDLand RegistryHistoric weather dataUNESCOhttp://daten.berlin.de/www.norway.no
http://data.wien.gv.at/http://www.denhaag.nl/opendata Openstreetmap.orgwww.joinup.eu http://offeneskoeln.de/maps.geoportal.gov.plwww.eubusinessregister.comwww.e-practice.eu
And many other websites…
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 To which extent are the following purposes important for your use of open public sector data?
0 Which purposes were mentioned most?
(academic publications, statistical analysis, policy research, non-scientific and non-policy investigations, political and policy-making decisions, data linking, news reporting, daily operation in work, curiosity/recreation, other?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results – Purpose
Which purposes are important for use?Very unimportant Unimportant Neutral Important Very important Don't know
Academic publications 7% 5% 15% 22% 47% 4%Statistical analysis 4% 4% 10% 23% 56% 3%Policy research 3% 9% 13% 33% 40% 3%Investigations (non-scientific, non-policy) 4% 10% 19% 43% 21% 3%Political and policy-making decisions 7% 9% 12% 41% 30% 1%Data linking 1% 7% 14% 29% 47% 3%
News reporting 13% 17% 28% 19% 20% 3%
Daily operation in work 7% 11% 28% 24% 27% 3%
Curiosity/recreation 6% 9% 23% 33% 26% 4%
Other 1% 0% 4% 4% 7% 4%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 To which extent are you currently able to perform the following actions when you use open data?
0 To which extent do you find the following actions useful for your use of open public sector data?
0 Which actions were mentioned as difficult/useful by the majority of respondents?
(searching, searching by using an API, finding, finding by the use of metadata, finding linked material, discover and browse datasets on different levels in the own language, downloading, downloading supplementary data?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results - Requirements
Assessment of usefulness of performing the following actions when using open public sector data (N=56)
Very useless Useless Neutral Useful Very useful Don't knowSearching 2% 7% 7% 25% 56% 2%Searching by using an API 2% 2% 15% 25% 36% 20%Finding (getting data) 2% 4% 11% 17% 65% 2%Finding by use of metadata 2% 4% 13% 22% 48% 11%Finding linked material 0% 5% 11% 39% 43% 2%Discover and browse datasets (different levels, own language) 5% 5% 15% 29% 44% 2%Downloading 0% 4% 13% 22% 59% 2%Downloading supplementary data (e.g. metadata) 2% 0% 16% 29% 42% 11%
Current ability to perform the following actions when using open public sector data (N=70)
Very diffi cult Diffi cult Neutral Easy Very easy Don't knowSearching 9% 23% 22% 28% 16% 3%Searching by using an API 7% 28% 19% 13% 4% 27%Finding (getting data) 11% 31% 27% 21% 5% 5%Finding by use of metadata 11% 24% 29% 26% 5% 6%Finding linked material 17% 21% 30% 15% 6% 11%Discover and browse datasets (different levels, own language) 18% 26% 25% 19% 4% 7%Downloading 9% 17% 25% 28% 17% 4%Downloading supplementary data (e.g. metadata) 16% 19% 34% 16% 4% 10%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 To which extent are you currently able to perform the following actions when you use open data?
0 To which extent do you find the following actions useful for your use of open public sector data?
0 Which actions were mentioned as difficult/useful by the majority of respondents?
(processing, processing by transforming data/linking data/linking metadata, visualizing, analyzing, feedback by rating, feedback by putting needs, uploading, uploading processed data, viewing usage statistics, getting training?)
CEDEM Workshop, Krems, May 3-4, 2012
Current ability to perform the following actions when using open public sector data (N=61)Very diffi cult Diffi cult Neutral Easy Very easy Don't know
Processing 6% 15% 32% 25% 13% 9%Processing by transforming data 5% 18% 15% 32% 13% 17%Processing by linking data 15% 24% 22% 17% 5% 17%Processing by linking metadata 16% 23% 21% 11% 12% 18%Processing by visualising data in tables, maps and charts 5% 13% 25% 27% 23% 7%Processing by analysing data 5% 20% 20% 38% 12% 5%Providing feedback by rating the data 20% 27% 25% 12% 3% 12%Providing feedback to the data producer by putting needs 15% 25% 20% 15% 5% 20%Uploading datasets 14% 24% 19% 12% 7% 24%Uploading processed, enhanced, extended, annotated and/or linked datasets
19% 22% 22% 5% 2% 31%
Viewing usage statistics 12% 25% 25% 8% 7% 22%Getting training on the use of open data 12% 35% 23% 13% 5% 12%
Assessment of usefulness of performing the following actions when using open public sector data (N=53)Very useless Useless Neutral Useful Very useful Don't know
Processing 2% 4% 8% 38% 44% 4%Processing by transforming data 2% 4% 13% 37% 40% 4%Processing by linking data 2% 4% 20% 33% 37% 4%Processing by linking metadata 2% 4% 20% 27% 35% 12%Processing by visualising data in tables, maps and charts 0% 6% 12% 20% 59% 4%Processing by analysing data 0% 4% 6% 33% 53% 4%Providing feedback by rating the data 2% 4% 29% 29% 29% 8%Providing feedback to the data producer by putting needs 4% 8% 19% 23% 40% 6%Uploading datasets 2% 9% 26% 32% 19% 11%Uploading processed, enhanced, extended, annotated and/or linked datasets
2% 9% 19% 38% 19% 13%
Viewing usage statistics 4% 8% 22% 33% 20% 14%Getting training on the use of open data 8% 10% 23% 29% 25% 6%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Do you currently use metadata in the context of your work or for other activities?
0 How many respondents used metadata? More or less than 90%?
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results - Metadata
82%
14%
4%
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
Yes No Don't know
Use metadata? (N=56)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 When you use metadata for open public sector data in your current practice, how often do you personally obtain the following benefits from it?
0 When you use metadata for open public sector data in your current practice, how often do you personally notice the following problems?
0 Which benefits are mentioned by the majority of respondents? (metadata can make reuse, interpretation, searching and browsing and linking easier)
0 Which problems are mentioned by the majority of respondents? (difficult to interpret, insufficient data about data quality, data gathering and data measuring, no structure, difficult to search and browse?)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results - Metadata
Assessment of noticing problems when using open public sector data (N=53)Never Rarely Sometimes Often Always Don't know
Insuffi cient metadata and therefore diffi cult to interpret the data 0% 7% 25% 57% 5% 7%Insuffi cient data about the data quality 0% 0% 35% 47% 14% 5%Insuffi cient metadata about data gathering and measuring 0% 0% 30% 49% 16% 5%Metadata have no structure and are therefore diffi cult to search and browse 0% 14% 27% 36% 14% 9%
Assessment of noticing benefits when using open public sector data (N=47)Never Rarely SometimesOften Always Don't know
Metadata can make reusing data easier 0% 2% 18% 42% 36% 2%Metadata can make interpretation of data easier 0% 2% 13% 33% 50% 2%Metadata can make searching and browsing data easier 0% 9% 16% 18% 56% 2%Metadata can make linking data easier 0% 11% 13% 33% 33% 9%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Discussion
0 Which of the following metadata would you like to use when you use (e.g. search, browse, retrieve and evaluate) open public sector data?
0 Which metadata?
(e.g. description of dataset, title, creator, publisher, country, source, type/theme/category, format, language, keywords/tags, geographical/spatial coverage, temporal coverage, release data, license, linked datasets, organizations and persons involved, projects related, funding, data collection period, helpdesk, quality, completeness, parameters used by software)
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – Results - MetadataType of metadata % Type of metadata %Description of dataset 95% Linked datasets 86%Title of dataset 88% Organizations involved in creating the dataset 70%Creator of dataset 74% Persons involved in creating the dataset 60%Publisher of dataset 70% Projects related to the dataset 65%Country where the dataset was created 79% Funding information of the dataset 47%Source of dataset 86% Data collection period (from-to) 84%Type/theme/category of data 74% Helpdesk for the dataset 58%Format of dataset 81% Quality as declared by the data provider 74%Language used in dataset 72% Quality as declared by the data user (feedback) 74%Keywords/tags in dataset 84% Completeness of the dataset 81%
Geographical or spatial coverage of dataset93%
Parameters used by software accessing and processing the dataset
53%
Temporal coverage of dataset 81% Other metadata, namely: 21%Release date of dataset 77% Don't know 0%License of dataset 67%
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – discussion
0 Statement: Using open public sector data is of benefit for me
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: Using open public sector data is of benefit for me
CEDEM Workshop, Krems, May 3-4, 2012
0% 0% 0%
31%
67%
2%0%
10%
20%
30%
40%
50%
60%
70%
80%
Stronglydisagree
Disagree Neutral Agree Stronglyagree
Don't know
Using open public sector data is of benefit for me (N=56)
Questionnaire – discussion
0 Statement: Using open public sector data will enable me to accomplish my research more quickly
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: Using open public sector data will enable me to accomplish my research more quickly
CEDEM Workshop, Krems, May 3-4, 2012
0% 0%5%
35%
51%
9%
0%
10%
20%
30%
40%
50%
60%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
Using open public sector data will enable me to accomplish my research more quickly
(N=56)
Questionnaire – discussion
0 Statement: I have the resources necessary to use open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: I have the resources necessary to use open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
4%
16%20%
38%
18%
4%
0%
5%
10%
15%
20%
25%
30%
35%
40%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
I have the resources necessary to use open public sector data (N=56)
Questionnaire – discussion
0 Statement: A specific person or group is available for assistance with difficulties concerning the use of open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: A specific person or group is available for assistance with difficulties concerning the use of open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
18%
33%
20%
4%
11%
15%
0%
5%
10%
15%
20%
25%
30%
35%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
A specific person or group is available for assistance with difficulties concerning the
use of open public sector data (N=56)
Questionnaire – discussion
0 Statement: It will be easy for me to become skillful at using open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: It will be easy for me to become skillful at using open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
2%
7%
24%
45%
20%
2%
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
It will be easy for me to become skillful at using open public sector data (N=56)
Questionnaire – discussion
0 Statement: People who are important to me (e.g. colleagues) think that I should use open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: People who are important to me (e.g. colleagues) think that I should use open public sector data
CEDEM Workshop, Krems, May 3-4, 2012
4%7%
24%
31%
24%
11%
0%
5%
10%
15%
20%
25%
30%
35%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
People who are important to me (e.g. colleagues) think that I should use open
public sector data (N=56)
Questionnaire – discussion
0 Statement: I intend to use open public sector data in the future
CEDEM Workshop, Krems, May 3-4, 2012
Questionnaire – first results
0 Statement: I intend to use open public sector data in the future
CEDEM Workshop, Krems, May 3-4, 2012
0% 0%4%
40%
55%
2%
0%
10%
20%
30%
40%
50%
60%
Stronglydisagree
Disagree Neutral Agree Strongly agree Don't know
I intend to use open public sector data in the future (N=56)
Presentations
0 Anneke Zuiderwijk - Benefits and restrictions of the use of open linked governmental data from the ENGAGE project
0 Keith Jeffery - The use of meta-data for citizen engagement
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) - Benefits
0 Literature overview and two use-cases to identify benefits of the use of open linked governmental data for the ENGAGE project
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Benefits (user perspective)
Category Benefits Political and social Obtaining new insights in the public sector
Creating new ways of understanding problems and interpreting data Easier to participate in policy making More participation and self-empowerment of users Improvement of policy-making processes New (innovative) and/or improved governmental services for users Improving citizen satisfaction Improving life-quality of user
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Benefits (user perspective)
Category Benefits Economical Economic growth
Stimulating innovation Stimulating scientific progress Less dependency on other (governmental) organizations Development of new products and services Easier to perform research Easier to do job Reuse of data and therefore not having to collect the same data again Counteracting unnecessary duplication of costs (public money Availability of information for investors and companies More competition
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Benefits (user perspective)
Category Benefits
Operational and technical
Being able to scrutinize data Creating new data and obtaining new knowledge by merging, integrating and mashing public and private data (linked data) Fair decision-making by enabling comparison Sustainability of data (no data loss on the long term) Cooperation with data provider Ability to use the wisdom of the crowds
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Restrictions (user perspective)
0 However, there are also many restrictions of the use of open linked governmental data
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Restrictions (user perspective)
Categories Barriers
Task complexity and access restrictions
Not able to discover the appropriate data The data are (temporarily) not available/open Not having access to the original data (only processed data) Difficult to search and browse; few central websites No information about the way access to data may be obtained No/few central website(s) to request access to data Prior written permission is required to get access to and reproduce data Not being free to creatively reuse data because of licences Registration required before being able to download the data Having to pay a fee for the data Language issues Data about the data (metadata) are not available Not being aware of the potential use of data Data are available in various forms resulting in discussing what is the right source No tooling support or helpdesk Focus is on making use of single datasets, whereas the real value might come from combining various datasets Contradicting outcomes based on the use of the same data
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Restrictions (user perspective)
Categories Barriers Use and participation
No incentives for users Public organizations do not react on user input No time to make use of the open data Lack of knowledge to make sense and therefore to make use of data Lack of capabilities – users do not have the information capabilities necessary No statistical knowledge and understanding of the potential and the limitations of statistics Data are poorly annotated Data format is not reusable Insufficient metadata available No explanation of the meaning of data Invalid conclusions based on the reused data Data formats and datasets are too complex to handle and use easily Barriers stemming from laws and guidelines Risk on privacy violation Risk on dispute and litigations; threat of lawsuits or other violations Unclearness because there is no uniform policy for opening data
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Restrictions (user perspective)
Categories Barriers I nformation Quality
Lack of information Accuracy/ imprecise information Obsolete data Information may appear to be irrelevant or benign when viewed in isolation, but when linked and analyzed collectively it may add value Too much information to process and not sure what to look at (Essential) Information is missing Similar data stored in different systems yield different results
Categories Barriers Technical Restrictions on data format for deposition and use
Absence of standards (e.g. for architecture) Lack of metadata standards No standard software for processing open data Fragmentation of software and applications
CEDEM Workshop, Krems, May 3-4, 2012
Fishbone diagram (Zuiderwijk, Janssen & Choenni, forthcoming)
Impediments of current open data policies
Data access (A)
Data deposition (D)
Political, Economical, Social, Technical (PEST)
Legacy system complicates the opening of data (1)
No funding
Limited types of data formats accepted
Deposition requires registration or becoming
a member
Few central websites (fragmentation of
sources) (5)
Data are currently not available (2)
Data cannot be found (5)
Only a part of the data is available
Access requires registration or becoming a member
Access requires (filling a form for) a data request (4)
Access requires written permission (1)
Access requires a fee (2)
Access is limited to professionals
Access requires accepting a variety of use agreements
No uniform set of licensing terms for reuse
No awareness of data (3)
No information about structurally updating data in
the future
No information about which data will be published in the future
Complex to understand data provenance
Little knowledge about data quality (1)
Data use (U)
Use (especially comparability) requires data transformations (2)
Too few concessions to statistical needs
Losing track because of size of dataset
Reproductions must comply with standard conditionsInsufficient metadata (7)
Little attention paid to data gathering
Users cannot make sense of data and extract the knowledge contained within (4)
Metadata have no structure (8)
Unfamiliar with data format
Unfamiliar with format of metadata
Data-infrastructure is not easily expandable when PSI-amount increases
massively
No or few visualization facilities
No tooling support or helpdesk (6)
No dialogue between the data-producing public body and data user (4)
Use (especially linking data) requires domain expertise (5)
No access to original data (only processed data) (3)
Difficult to search and browse data
Users lack capabilities to use data
(Essential) Information is missing
Data are covered by copyright (act) and
other regulations (1)
Data about the same topic are displayed in different ways
Downloading data requires a lot of disk space (3)
Unfamiliar with (meta)data language
No access to recent data, only out-dated data
Threat of lawsuits or other violations
Language problems
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (1) – Main challenges
0 Rectify fragmentation by creating a single shop for PSI0 Create open access for all users0 Create interoperability and provide users with possibilities to
analyse data0 Create an infrastructure for processing PSI
(Zuiderwijk, Janssen & Choenni, forthcoming)
CEDEM Workshop, Krems, May 3-4, 2012
Presentations (2) – Metadata for citizen engagement
0 The survey shows that a key technology for making open data available is metadata
0 The metadata is used for0 Discovery (finding appropriate datasets)0 Contextualising (the data was collected for what purpose,
which project(s), how funded, by whom, which organisations, any related publications….
0 Data processing: here detailed domain (or even project) specific metadata is used to link the software used for analysis / reporting / visualising to the dataset
CEDEM Workshop, Krems, May 3-4, 2012
The Vision: Metadata for Data Model
DISCOVERY(DC, eGMS…)
CONTEXT(CERIF)
DETAIL(SUBJECT OR TOPIC SPECIFIC)
Generate
Point to
Linked open data
Formal Information
Systems
CEDEM Workshop, Krems, May 3-4, 2012
Models for an infrastructure
0 The data model with its metadata described is only one relevant model
0 The other models are0 User model0 Processing model0 Resource model
CEDEM Workshop, Krems, May 3-4, 2012
The Vision: The Models
Complete ICT environment for PSIComplete cohort of users
Processing Model
User Model
Data Model
CEDEM Workshop, Krems, May 3-4, 2012
Models
0 User Model: controls the way in which the end-user interacts with the e-infrastructure.0 User profile, security certification, privacy;0 Device and interaction mode preferences (keyboard/mouse through voice and
gesture to brain-connected), language preference;0 Resource preferences (including contacts) with directories;
0 METADATA
CEDEM Workshop, Krems, May 3-4, 2012
Models
0 Process Model controls the way processes are constructed and executed in the e-infrastructure.0 Services
0 Described for discovery, described for functional and non-functional (security, privacy, performance) properties
0 Mobile (deployed in distributed / parallel execution environments)
0 Open source where possible0 Service composition
0 Dynamically (re-) composable during execution0 METADATA
CEDEM Workshop, Krems, May 3-4, 2012
Models
0 Resource Model catalogs the available computing resources in the e-infrastructure0 This allows virtualisation so the user neither knows nor
cares from where the data comes, or where the processing is done, as long as quality of service is maintained;
0 Requires updating by resource owners – together with conditions of use
0 METADATA
CEDEM Workshop, Krems, May 3-4, 2012
Discussion
0 I do not have difficulty in explaining why using metadata for open public sector data may be beneficial
0 I clearly understand how to use metadata for open public sector data
CEDEM Workshop, Krems, May 3-4, 2012