MyMemory Creating the world’s largest Translation Memory Marco Trombetti Translated ASLIB, London November 2009 http://mymemory.translated.net

Page 1: My Memory Keynote London Aslib Marco Trombetti

MyMemory

Creating the world’s largest Translation Memory

Marco Trombetti

Translated

ASLIB, London

November 2009

http://mymemory.translated.net

Page 2

Translated

Language Service Provider

Human translation in 80 languages, 9,000 customers, 100K translations delivered.

Core Technology

A fully automated translation workflow. Customers can access 35,000 translators through a Web Service.

Page 3

Translation Memories fail!

Prob(10-word-sentence) is too small
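As a rough illustration of why exact matches become rare as sentences grow, the sketch below uses a toy model that is an assumption of this example and not a claim from the talk: each word of a new sentence is taken to coincide with the corresponding word of a stored segment independently, with a hypothetical per-word probability `p_word`. Under that assumption the chance of an exact whole-sentence match shrinks exponentially with length.

```python
def exact_match_probability(p_word: float, n_words: int) -> float:
    """Toy model: probability that all n_words match, assuming independence.

    p_word is a hypothetical per-word agreement probability chosen only
    for illustration; real text is far from independent word by word.
    """
    return p_word ** n_words


# Even with a generous 50% per-word agreement, a 10-word exact match is rare.
for n in (1, 5, 10):
    print(n, exact_match_probability(0.5, n))
```

The exact numbers are meaningless outside the toy model; the point is only the exponential decay with sentence length.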

Page 4

Why is MT not focused towards helping HT?

Human Translation: 99%

Machine Translation: 1%

Source: Allied Business Intelligence

Page 5

My wish-list for the perfect TM Platform

Billions of segments, but still FAST

MyMemory

Fully integrated statistical MT

Web 2.0, Wiki-like contributions

One-click upload/download of TM

Adaptive matching, a priori probability

Propagate changes in real-time

Open standard and CAT support
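For readers unfamiliar with how a translation memory lookup works, here is a minimal baseline sketch. It is not MyMemory's adaptive matching, which the slides do not describe in detail; it swaps in a plain token-level similarity score from Python's standard `difflib` module. The function name `fuzzy_matches`, the 0.7 threshold, and the sample segments are all hypothetical choices made for this example.

```python
from difflib import SequenceMatcher


def fuzzy_matches(query, memory, threshold=0.7):
    """Return (score, source, target) tuples at or above threshold, best first.

    memory is a list of (source_segment, target_segment) pairs.
    The score is difflib's ratio over lower-cased word tokens.
    """
    query_tokens = query.lower().split()
    scored = []
    for source, target in memory:
        score = SequenceMatcher(None, query_tokens, source.lower().split()).ratio()
        if score >= threshold:
            scored.append((score, source, target))
    return sorted(scored, reverse=True)


# Illustrative sample data, not taken from MyMemory.
memory = [
    ("Click the Save button", "Fare clic sul pulsante Salva"),
    ("Close the window", "Chiudere la finestra"),
]
results = fuzzy_matches("Click the Cancel button", memory)
for score, source, target in results:
    print(f"{score:.2f}  {source}  ->  {target}")
```

A linear scan like this would not scale to billions of segments; a platform of the size described in the wish-list would need an index, and an adaptive ranking would presumably weigh more than surface similarity.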

Page 6

The Solution

Free, Collaborative Web Translation Memory

Page 7

Barriers to contribution

Privacy and IP

Time vs Value

Page 8

Will people contribute?

Yes! 100 words per second for 1.5 years

5 billion words
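The figure on this slide can be checked with simple arithmetic. The sketch below assumes a constant contribution rate and a 365-day year (both simplifications of this example); the function name `words_collected` is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60
DAYS_PER_YEAR = 365  # assumption: leap days ignored


def words_collected(words_per_second: float, years: float) -> float:
    """Total words gathered at a constant rate over the given number of years."""
    return words_per_second * SECONDS_PER_DAY * DAYS_PER_YEAR * years


total = words_collected(100, 1.5)
print(f"{total:,.0f} words")  # prints "4,730,400,000 words"
```

That is about 4.7 billion words, consistent with the rounded 5 billion quoted on the slide.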

Page 9

Sources

Web, Translators, Customers

Page 10

Benefits

LSP & Clients

More control

Free contribution from professionals & the crowd

Quick reference and QA

Translators

Quick and relevant reference

Improved organisation and usability

Free memories

Page 11

Technology Challenges

Scalable Real Time Indexing

Adaptive Ranking

Cost-Effective Scalability

Page 12

Generating the interest

Pre-Populate

Make it free

Make it easy to contribute

Page 13

Protecting IP

Private memories

Hide proper nouns

One-click to report abuse

Robots.txt
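The slides do not say how proper nouns are hidden. As one possible illustration, the sketch below uses a deliberately naive heuristic that is an assumption of this example: any capitalised word that is not the first word of the segment is replaced with a placeholder. The function name `mask_proper_nouns` and the `[NAME]` placeholder are hypothetical.

```python
def mask_proper_nouns(sentence: str, placeholder: str = "[NAME]") -> str:
    """Replace capitalised, non-initial words with a placeholder.

    Naive heuristic for illustration only: it misses proper nouns at the
    start of a segment and wrongly masks capitalised common nouns
    (for example, every noun in German).
    """
    masked = []
    for index, word in enumerate(sentence.split()):
        core = word.strip(".,;:!?")  # keep surrounding punctuation intact
        if index > 0 and core[:1].isupper():
            masked.append(word.replace(core, placeholder))
        else:
            masked.append(word)
    return " ".join(masked)


print(mask_proper_nouns("Please send the contract to Marco in London."))
```

A production system would more plausibly rely on named-entity recognition; the heuristic only shows where such a filter would sit before a segment is shared.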

Page 14

Quality

Feedback from the crowd

QA correlation

Automatic QA

Page 15

Is 5 billion words enough?

No. Professional translators produce tens of billions of words per year.

Page 16

Next Steps

More data & technology

More data extraction from the web

Real time SMT training

Every CAT tool connected

QA crowd platform

Fast and cost-effective MT evaluation

Parallel corpus evaluation and clean-up