lt@ec.eu roberto cencioni dg information society & media digital content & cognitive systems...
Post on 26-Dec-2015
218 Views
Preview:
TRANSCRIPT
LT@EC.eu
Roberto Cencioni
DG Information Society & Media
Digital Content & Cognitive Systems
infso-e1@ec.europa.eu
Outline:
• Background
• (recent) Past & Present
• (near) Future & Conclusions
3
ForewordA digital lingua franca?
“If you talk to a man in a language he understands, that goes to his head. If you talk to him in his own language, that goes to his heart.” Nelson Mandela
•globalisation & lingua franca; the 30% element:
– English web content accounts for <30% of the total
– English postings account for <30% of global social media
– only 30% or so of EU online buyers are prepared to do so in a language other than theirs
•Europe’s digital single market is multilingual
– and so are export markets, and our societies!
•but some languages are more “equal” than others…
Today’s applicationstext & speech
• authoring aids, terminology management
• computer-aided translation
• cross-lingual information gisting
• enterprise search & content management
• web, text & audio mining
• dictation, command/control systems
• voice-based assistance, call centres
• in-car navigation & control systems
• edu-entertainment incl. games
• aids for the handicapped…
5
Digital content in an international setting
• "translation underpins successful business globalisation"
• global content management chain, from authoring through to publishing
– growing demand, 10% per year
– more and more social, volatile, personalised content
– more markets, more languages
• industry cannot keep the pace
– whether translation, web globalisation, media localisation
– whether corporate, commercial, creative… content
• how to cut time & cost, cope with growing volumes, preserve quality… if you are not a corporation?
• well, technology can help…
6
Making sense of smart content
• huge and growing amounts of digital content, largely unstructured
– documents, web pages, messages, audio-visual material…
• how to extract, interpret, correlate… information (people, products, events, relations, etc.) from multiple textual sources?
• how to classify and find a video clip or broadcast material?
• how to detect trends & opinions in online blogs and news feeds?
• how to get to the “hidden knowledge” in a corporate repository?
– typically through language-encoded information embedded in digital objects
7
A long-term commitment
• EC has supported Language Technologies (LT) since 1980s
• a fresh start since mid-2008
– renewed political commitment
– explosion of online content, esp. social networks
– promising technical advances
– sizeable language-services industry
• ~150 Meuro in 2009-2012
• ~60 projects underway by mid-2012
• next call for proposals expected in July
8
Our approach- early years
• provide a broad framework to boost cross-border partnerships
− from 27 project proposals in 2009 …
− … to 200 proposals in 2011
• try and address structural weaknesses
– duplication & fragmentation
shared vision: needs & core technologies
− reuse of basic assets (data & tools)
− smoother technology transfer & early deployment
99
Project lines
“… effective solutions that support business and inter-personal communication and enable people to make sense of online content and services in Europe's many languages.”
• global content processing eg automated translation
• mining of unstructured information eg text analytics
• natural interaction eg speech understanding & dialogue
in and across multiple languages
• broad participation, old hands with newcomers
− ICT-7 call: 90 submissions,400+ unique participants requesting 260 M (1:6)
− ICT-SME: 110 submissions,400+ unique participants requesting 170 M (1:13)
− 40+ countries, year after year
• balanced academic/industrial consortia
− ICT-7: enterprises account for 39% of the submitters
− ICT-SME: 57% of the proposers are SMEs
• above-average score of EU12 participants (15.6%)
Partnerships2011 calls
Cross-cutting issues:language resources
raw material for research & product development
around 30% of our efforts so far:
• research projects – “tools”- automate the compilation of resources
• innovation projects – “data” (CESAR)- make existing resources accessible & fit for online consumption, help develop a share/reuse culture
• newly started projects from the “SME call”,- add value to commercial language data
• META-SHARE – develop a community-driven “service” framework for resource sharing (CESAR)
12
Groundwork:beyond individual actions
• research: towards a federating vision & roadmap
– META-NET (2010-2012)
• business: ease market uptake & deployment
– LT-INNOVATE (2012-2013)
– forum for business stakeholders esp. vendors
– also, market data, value chains & non-technical barriers, demand & supply
• public sector: promote the use of LT by our own institution
– inter-service group with major EC departments
Upcomingopportunities
• last phase of current programmes (2012-13)
• start of new programmes from 2014:
– research & innovation: Horizon 2020 (H2020)
– infrastructure: Connecting Europe Facility (CEF)
• heavy competition with other policy areas;emphasis on economic growth & jobs
• so to be successful:
– clear & credible research agenda
– well understood impact on economy & society
– well connected & mobilized community
– national recognition & support
14
Last call under FP72012-2013
• objective 4.1 of Challenge 4 of the ICT work programme
– open: Jul 2012; close: Jan 2013; budget: 27 Meuro
• 3 project lines:
1) cross-media content analytics
2) high-quality machine translation
3) speech-enabled multimodal interaction
• in addition, objective 4.3 under a separate call & budget:
4) industrialisation of promising technologies, field validation, business models & exploitation paths
15
Last call under FP72012-2013
• for each project line:
– ambitious endeavours, cross-disciplinary, up- & out-ward looking
multilingual, multimedia, multimodal…
– 1 road-mapping actionto help define, organize & bootstrap future collaborative work under H2020
– 1 to 3 projectsacting as a bridge towards H2020
... social, mobile, cloud, big data ...
16
Upcomingprogrammes
H2020
H2020 2014-2020:– research & innovation; budget (EC proposal): 80 Beuro,
~20% for ICT
science (including FET)
technologies (most of the mainstream ICT themes)
societal challenges (e.g. health, ageing, education…)
– on-going discussions with Council & EP
outcome depends on concomitant discussion of Multiannual Financial Framework (= money for all EU policies)
– meanwhile, concrete orientations re early priorities & operations by Q1 2013
first series of calls towards Q4 2013
17
Upcomingprogrammes
CEF
CEF 2014-2020:– TEN model: pan-European, public-interest infrastructures
– 3 pillars: Transport, Energy & ICT (9 Beuro proposed)
– for ICT: (i) Broadband & (ii) Digital Service Platforms
eGovernment services, eHealth, eCulture…
Multilingual Access to online Services (new!)
– aim is to enable service providers to offer their content & services in as many languages as possible, in a most effective fashion
powerful hw/sw platform + general-purpose tools & datasets + hosted services co-developed with industry & governments
– still a long way to go vis-à-vis Member States, update towards the end of the year
18
Conclusions
• the task is immense, and so is the reward‾ technical
‾ economic
‾ societal
• a lot of work accomplished since 2009– S & T
– networking & community building
– academia/industry partnerships
• definite need for more awareness & recognition
• heavy competition ahead, need to move
‾ from sponsorship…
‾ … to partnership a shared commitment!
Thank you!
infso-e1@ec.europa.eu
top related