unlocking the value of open data in europe
DESCRIPTION
A presentation on how open data is being used by companies in Europe. Presented at 'Virk Data Dag' in Copenhagen, at invitation of Danish Business AuthorityTRANSCRIPT
Unlocking the Value of Open Data in Europe emergent patterns, lessons learned and examples
Virk Data Dag, Copenhagen, 25 Nov 2014
Ton Zijlstra, @ton_zylstra
2
EU/DK
GeneralValue
Which data?
Activities
Roles
Issues
“new oil”, “new gold”
July 2015, the right to re-use
http://www.flickr.com/photos/59937401@N07/5858059202/
DK addresses 2010: value > 70 * cost
ETLA, SME’s geo data grow 15% faster
Spain 2011/12, up to 600 million Euro
POPSIS 2012, 21 cases
Vickery / EC 2012, 2% GDP EU
McKinsey 2014, $3 trillion+
See „Open Data for Economic Growth” (WB, june 2014)
all empirical evidence points same way
You are here
a European market, not just a Danish one
http://global.census.okfn.org/
http://epsiplatform.eu/content/european-psi-scoreboard
http://www.opendataresearch.org/content/2013/535/get-data-open-data-barometer-2013
OGP Action plan
General value
Market and civic domain Government domain
There always was value in gov-data
Government Data
Re-user 1
Re-user 2
Open Data
2: effect of ‘Open’
- free or not? - ∆ demand - Ev price
- ∆ fiscal revenue > ∆ sales revenue ?
OXFORD 2009 POPSIS 2011
1: Market value PSI
PIRA 2000 (€95 B) MEPSIR 2006 (€ 37 B)
e.g. geo, legal, meteo3: cost of ‘Open’
<<1% of costs
Market and civic domain Government domain
Gov body 1 DataOpen Data
Government - Interaction w market/citizens changes
- Quality of data improves - Increased efficiency & effectiveness - public tasks shift (smaller, different)
Digitization has shifted that valueMarket
- Barriers to entry drop away
- market dynamics: chains change
- paradigm shift: from owning to using
Value looks different: - less lineair - not monetary - more equally spread - hard to measure - hard to correlate - but doable
four valuable reasons
more efficient government
better public services
more transparent government
new socio-economic value
16
efficiency? effectiveness? economic value? all!
efficiency? effectiveness? transparency? all!
OPENdata
openDATAvs
Which data?
untapped abundance
maps and geospatial
addresses and properties
https://www.flickr.com/photos/add1sun/3551744984
government procurement & spending
public transport
https://www.flickr.com/photos/caro2francq/8889505867
25
census and statistics
weather
‘base layers’ for other things
seriousissues
seriousdata=
but there is much more
1 Core reference data
hansjebrinker.com
impacts infrastructure, mobility, coastal defence, costs
http://hansjebrinker.com
crops, fertilizer use, pesticide use
32
2 Data people care about
cleaning up the country side
3 Data gov cares about
value, efficiency, impact
general rule: providing new affordances
important issue
stakeholders open data
new actors (500 apps, 5k ppl, TfL stopped)
Activities
38
All sectors
50% SME’s
40% founded in last 5 years
(weather, geo, health, finance, demographic)
businessoptimization
new business&
Business optimization:
greater efficiency
competitive advantage
improved decision making
44
open data in business intelligence
45
strawberry pop-tarts & hurricanes
https://www.flickr.com/photos/simon_shek/313608149
New business:
data services / products
information services / products
novelties
gov as client
one may deliver data
then another builds a service
the first 1B+ open data exit?
53
encourage what you cannot do yourself
ddj stories not just about gov then, but the real issues of today
DDJ, not just about gov
Roles
57
aggregators
58
app builders
59
enablers
enrichers
also non-profits
“I am not an open data company”
Issues
Costs too much; What’s the business case; Has commercial value; Possible privacy issues; Confidential info; It’s not ours, and we don’t know who owns it; It’s not ours and supplier won’t allow it; The quality isn’t very good; We don’t know where it is; Not our job; It’s in a useless format anyway; I don’t have the authority; People will misuse the data; People will use it wrongly; Only we understand our data; We’ll get sued; Files are just too big; Too little bandwith; It starts with this, but where’s the end? It’s there, but can’t be opened; Data is dated/too old; It’s not in digital format; Is this even legal?; Our Minister says no; We never have done this before, why start now?; I don’t see the use; Nobody will be interested; No time; No resources; Just do FOIA requests; We’ll publish it redacted; It’s not complete; It contains errors; It’s commercially sensitive; Combining this with other data is dangerous; People will come to wrong conclusions; People will get lost and confused; It will trigger endless discussions; We can’t confirm or deny we have that data; We’ll get feedback, and can’t handle that; Our IT supplier says it’s not possible; Our IT supplier will charge too much; Our site will crash; It’s already online! (but in unfindable PDFs); If people download it and use it later it will be outdated; I can’t take responsibility for all the reuse; People will get angry; Our data is in contradiction to the data of the department that is in charge of the topic; Only we truly understand.....statistics/meteo/geo/laws; We’ll disrupt the market; It will only be used to attack us.
many shades of ‘no’
Слишком дорого; В чем выгода; Есть коммерческая ценность; Возможные проблемы с личными данными; Конфиденциальные данные; Эти данные не наши, мы не знаем, кому они принадлежат; Мы не владеем данными, а поставщик не разрешит; Качество не очень хорошее; Мы не знаем, где они; Не наша работа; Они в бесполезном формате; Я за это не отвечаю; Люди воспользуются данными неправильно; Люди используют данные в неправильных целях; Только мы понимаем наши данные; Нас засудят; Файлы очень тяжелые; Низкая скорость загрузки; Данные устарели; Данные не в цифровом формате; Это вообще законно?; Министр запретил; Мы никогда этим не занимались, зачем начинать?; Я не вижу в этом пользы; Никому не будет интересно; Нет времени; Нет ресурсов; Мы будем отвечать только на запросы, связанные со свободой информации; Мы опубликуем информацию выборочно; Данные неполные; В файле есть ошибки; Это коммерчески важная информация; Если объединить эти данные с другими, они могут быть опасными; Люди сделают неверные выводы; Люди запутаются; Начнутся бесконечные обсуждения; Мы не можем ни подтвердить, ни опровергнуть, что у нас есть эти данные; Начнется обратная связь, у нас нет возможности всем отвечать; поставщик наших ИТ систем говорит, что это невозможно; Наш сайт упадет; Эти данные уже онлайн! (но в PDF, и их невозможно найти); Если люди загрузят их сейчас, а используют потом, они устареют; Я не могу отвечать за всех повторных пользователей; Люди разозлятся; Наши данные не совпадают с данными ведомства, которое отвечает за этот вопрос; Только мы по-настоящему понимаем … статистику, погоду, законы; Мы нарушим баланс на рынке; Ими воспользуются, чтобы нас атаковать.
Множество оттенков «нет»
real barriers: it’s a transitionSee http://www.flickr.com/photos/epsiplatform/5737203950/
http://www.flickr.com/photos/59937401@N07/5858059202/
economic value? transparency?
not my departments job!
68
intrinsic motivation: 6.5m GBP saved
data quality
https://www.flickr.com/photos/umdrums/8733821251
no instant usability
https://www.flickr.com/photos/cacophonyx/5128009885
6 years lead time after data publication
72
beware
What, but not why What gov knows, what gov does Evidence based: correlation / causation Direct re-use vs proxies Pre-hypothesis tool: q’s, context, experiments Big data also Σ small data (access / ownership)
openness privacy&
http://www.flickr.com/photos/mdid/3271972434/
too important to just fix at the end
76
paying for data, a bad idea
https://www.flickr.com/photos/add1sun/3551744984
Who is to pay the bill? 5 options
Data setsRegistration+ updating
Delivery to users
Public sector
Private sector + society
Reporting fee
Registration fee
User fee
User fee
State
budget
Who is to pay the bill?
Data setsRegistration+ updating
Delivery to users
Public sector
Private sector + society
Reporting fee
Registration fee
User fee
User fee
State
budget➢Discourage use
➢Affect quality data
➢Discourage use ➢ govt pays govt ➢Admin costs
➢If there is a concrete return ➢If admin costs low
➢Discourage use ➢Miss out on value
Some Dutch figures on income key registers 2011 * 1 million euros
Registration fees
Public sector use
Private sector use
Own re-use activities
State budget Total
Buisness registers
67 6 42 5 - 120
Cadastral registers
130 15 35 30 - 210
Topogra-phical map
- 9,5 0,5 - 14 24
Adresses - - - - 4 4
Total 197 30,5 77,5 35 18 358
iinformality important ingredient
Networked life, networked work, networked learning
know the gov does not exist. find 1 civil servant
informality important ingredient
value, efficiency, impact
building the triangle of new connections
important issue
stakeholders open data
Issue driven
Techdrivenvs
CreditsAll photos: Ton Zijlstra, byExcept where mentioned on the photo.
Slides: Ton Zijlstra / The Green Land, by nc sa
Slides: http://slideshare.net/thegreenland Blog: http://zylstra.org/blogContact: [email protected] @ton_zylstra