using translation memory to speed up translation process
DESCRIPTION
Using Translation Memory to Speed up Translation Process. Marija Brkić , [email protected] Department of Informatics - University of Rijeka Sanja Seljan , [email protected] Department of Information Sciences Faculty of Humanities and Social Sciences - University of Zagreb - PowerPoint PPT PresentationTRANSCRIPT
Using Translation Memory to Speed up Translation
ProcessMarija Brkić, [email protected]
Department of Informatics - University of Rijeka
Sanja Seljan, [email protected] of Information Sciences
Faculty of Humanities and Social Sciences- University of Zagreb
Božena Bašić Mikulić, [email protected] school “Turnić”
2/22
Outline
• Motivation
• Translation memories
• Experimental study• Description• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
3/22
Motivation
• Globalization
• EU accession negotiations
• Information exchange
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
languages differ culturally, stylistically, syntactically and lexically
consistency
data sharing
insufficient number of translators
short deadlines
4/22
Outline
• Motivation
• Translation memories
• Experimental study• Description• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
5/22
Translation memories
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
• Approaches• Sentence-based approach• Character-string-in-bitext-based approach
• Advantages
• Disadvantages
6/22
Sentence-based approach
• Translation Units (Tus)– Sentences– Titles– Subtitles– List entries
• High precision, low recall search
• Fuzzy matching • Noise• Silence
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
7/22
Character-string-in-bitext-based approach
• Character strings of any length– Several consecutive identical sentences
• Unreliably small matches• Noise
• No support for fuzzy matching• Silence
• Recycling internal repetitions through terminology databases
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
8/22
Advantages• Consistency
– Internal and external repetitions– Shared projects
• Speed– Globalization
• Cost reduction• Preservation of original page layout• Statistics
– Scheduling localization projects
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
9/22
Disadvantages
• Initial decrease in productivity
• Beneficiary effects felt only on repetitive texts
• TM needs to be of considerable size
• Is alignment worthwhile and preconditions met?
• Database maintenance
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
10/22
Disadvantages (cont.)
• File format support
• The quality of translation
• Lack of language knowledge
• Context insensitivity
• Additional costs
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
11/22
Outline
• Motivation
• Translation memories
• Experimental study• Description• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
12/22
Experimental study
• Feature: the speed of the translation process
• Tool: ATRIL – Déjà vu• Parameters:
• Measure• Evaluation procedure• Score• Metric• Languages• Text type• System used
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
13/22
Experimental study (cont.)
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
• Text selection phase
• Preparation phase
• Translation phase
• Revision phase
14/22
Experimental study (cont.)
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
Figure 1: Matches found by DVX after the third source text was inserted
15/22
Problems
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
• morphology
• word order
• capitalization
• punctuation
16/22
Outline
• Motivation
• Translation memories
• Experimental study• Description• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
17/22
Evaluation• Automatic
• Manual– Post-editing– 1-4 scale
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
20%
71%
7% 2%
absolutely uselessmany changes neededfew changes neededno changes needed
Chart 1: Effectiveness of TM
18/22
Evaluation (cont.)
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
42 36
69 63
3436
01530456075
First text Secondtext
Third text
Tim
e (m
inu
tes)
Traditionaltranslation
TM
Chart 2: Total time for traditional translation and TM
19/22
Evaluation (cont.)
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
Chart 3: Time spent in translation process for traditional translation and TM
37
29
37
23
3130
05
10152025303540
First text Secondtext
Third text
Tim
e (
min
ute
s)
Traditionaltranslation
TM
20/22
Outline
• Motivation
• Translation memories
• Experimental study• Description• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
21/22
Conclusion
• TMs speed up the translation process
• Database maintenance time-consuming only in the beginning
• Fast, consistent and professional-quality translations
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
22/22
Thank you on your attention!
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić