TRANSCRIPT
Bled, 30 September 2014
Multilingual dictionary projects @ K Dictionaries
Ilan Kernerman
K Dictionaries Ltd, Tel Aviv
K DICTIONARIES
ENeL • Bled 20140930 1
Established in 1993, based in Tel Aviv
Focus on Technology-Driven Content
Create lexicographic data covering 47 languages
Cooperate worldwide with
• editors • translators • programmers • designers • publishers • ICT firms • academe • associations
RESOURCES
English dictionaries for learners & native speakers
dictionaries for native & foreign language learning
monolingual, bilingual & multilingual dictionaries
multi-language/layer lexical datasets
lexicographic editorial tools & applications
morphology & pronunciation
language supplements, audio & images
LINGUISTIC
macro & micro planning
editorial & translation styleguides
headword lists
special features (e.g. alternate script, audio)
corpora, morphology lists, conversion tables
L1 lexicographer teams & L2 translators
content & format revisions
technical infrastructure synchronization
TECHNOLOGIC
editorial software configuration
XML, DTD, XSLT, CSS (→RDF)
QA & statistics
data processing utilities
digital versions & print-ready data
maintenance, update & upgrade
technical support
R&D
EVOLUTION
1. Monolingual English learner’s dictionary
2. (Semi-)Bilingual English learner’s dictionary
3. Multilingual English dictionary
4. L2-English reversed indices
5. L2, L3 etc. multilingual dictionaries
6. L2-L3 bilingual glossaries
7. Multi-language networks
MULTI-LAYER
network
Monolingual
Multilingual
Bilingual
SAMPLES. ENGLISH
from PASSWORD to MULTILINGUAL
L2 MULTILINGUALS
Extract list of Translations of any language (L2) with their corresponding English (EN) Entries & POS
Edit the L2 Translations into L2 Headwords, keeping the default EN links
Revise the links from the new Headword & POS to the relevant sense of the EN Entry
Each sense of the L2 Headword now addresses its counterpart sense(s) in the EN Entries, and through it translation equivalents in all other languages
[Expand the lexical data of the L2 Headword and turn it into a full Entry]
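The extraction and linking steps above can be sketched in Python. This is only an illustration under stated assumptions, not K Dictionaries' actual tooling: the `EN_SENSES` layout, the sense IDs, and both function names are hypothetical, and the Swedish translation `bortsprungen` is assumed to be attached to the English senses of "runaway" and "stray" (the Spanish and Russian equivalents are copied from the Swedish sample slides). The editorial steps (revising links, expanding the headword into a full entry) are manual and are not modeled.

```python
from collections import defaultdict

# Hypothetical EN sense records; IDs and structure are assumptions for illustration.
EN_SENSES = [
    {"id": "runaway.n.1", "headword": "runaway", "pos": "noun",
     "translations": {"sv": ["bortsprungen"], "es": ["fugitivo"], "ru": ["беглец"]}},
    {"id": "stray.adj.1", "headword": "stray", "pos": "adjective",
     "translations": {"sv": ["bortsprungen"],
                      "es": ["perdido", "extraviado", "callejero"],
                      "ru": ["бездомный"]}},
]


def extract_l2_index(en_senses, l2):
    """Turn each L2 translation into an L2 headword, keeping the default EN link and POS."""
    index = defaultdict(list)
    for sense in en_senses:
        for l2_headword in sense["translations"].get(l2, []):
            index[l2_headword].append({"pos": sense["pos"], "en_sense": sense["id"]})
    return dict(index)


def equivalents(en_senses, links, exclude_lang):
    """Follow each EN sense link to the translation equivalents in all other languages."""
    by_id = {sense["id"]: sense for sense in en_senses}
    result = []
    for link in links:
        sense = by_id[link["en_sense"]]
        others = {lang: words for lang, words in sense["translations"].items()
                  if lang != exclude_lang}
        result.append((sense["headword"], link["pos"], others))
    return result


sv_index = extract_l2_index(EN_SENSES, "sv")
for en_headword, pos, others in equivalents(EN_SENSES, sv_index["bortsprungen"], "sv"):
    print(en_headword, pos, others)
```

Keeping the link at sense level rather than entry level is what lets one L2 headword reach different equivalents per sense, as in the two "bortsprungen" samples.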
PROCESS
Generating an L2-English Index automatically
― produce L2 Index table
― produce EN Senses table
Editing the L2 Index
― include/exclude HW in L2 Index
― revise the L2 HW and POS
― add new L2 HW
― revise the Senses – add, remove, re-order
Translating multilingually
― link L2 HW via EN Sense to all the translations
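The editing operations on the L2 Index could be modeled roughly as below. This is a sketch under assumptions and does not describe how KIET works internally: the `IndexRow` class, its fields, the helper functions, and the sense IDs are all hypothetical, and the sample edits (adding a link to an adjective sense of "runaway", excluding the noun row) are for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class IndexRow:
    """One row of a hypothetical L2 Index table."""
    headword: str
    pos: str
    en_senses: List[str] = field(default_factory=list)  # ordered EN sense IDs
    included: bool = True  # include/exclude flag for the final index


def revise(row: IndexRow, headword: Optional[str] = None, pos: Optional[str] = None) -> None:
    """Revise the L2 headword and/or POS in place."""
    if headword is not None:
        row.headword = headword
    if pos is not None:
        row.pos = pos


def reorder_senses(row: IndexRow, new_order: List[str]) -> None:
    """Re-order the sense links; the new order must contain exactly the existing IDs."""
    if sorted(new_order) != sorted(row.en_senses):
        raise ValueError("new_order must contain exactly the existing sense IDs")
    row.en_senses = list(new_order)


def included_rows(rows: List[IndexRow]) -> List[IndexRow]:
    """Rows that remain in the L2 Index after editing."""
    return [row for row in rows if row.included]


rows = [
    IndexRow("bortsprungen", "noun", ["runaway.n.1"]),
    IndexRow("bortsprungen", "adjective", ["stray.adj.1"]),
]
rows[1].en_senses.append("runaway.adj.1")                   # add a sense link
reorder_senses(rows[1], ["runaway.adj.1", "stray.adj.1"])   # re-order senses
rows[0].included = False                                    # exclude a row
```

Storing sense links as an ordered list per row keeps "add, remove, re-order" as plain list operations, while the `included` flag lets an editor drop a row without losing its links.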
KIET. MAIN SCREEN
KIET. EDIT L2-ENGLISH INDEX
KIET. EXPORT TO HTML (TO REVIEW BY EDITOR)
SAMPLE. SWEDISH MULTILINGUAL (RAW)
bortsprungen noun
runaway a person, animal etc that runs away ◊ The police caught the two runaways. ■ (also adjective) ◊ a runaway horse.
af wegloper | ar هارب، شارد، جامح | bg беглец | br fugitivo | ca fugitiu | cs uprchlík/-ice, uprchlý | de der/die Ausreißer(in); durchgebrannt | dk bortløben | el φυγάδας | es fugitivo | et põgenik | fa فراری | fi karkuri | fr fugitif/-ive | he בורח | hi अनियंत्रित, उच्छृंखल, बहुत सहज | hr odbjegao | hu szökevény | id pelarian | it fuggiasco, fuggitivo | ja 逃亡者 | ko 도망자 | lt pabėgėlis; pabėgęs | lv bēglis; izbēdzis | ml cabut lari | nl vluchteling | pl zbiec | ps فراری | pt fugitivo | ro evadat, fugar | ru беглец | sk utečenec/ka; na úteku, ktorý ušiel | sl ubežnik; pobegel | sr odbegao | th ผู้หลบหนี | tr kaçak, firari | tw 逃跑的人或動物 | uk утікач; дезертир | ur فرار ہو جانا | vi kẻ chạy trốn | zh 潜逃者,逃跑者
SAMPLE. SWEDISH MULTILINGUAL (RAW)
bortsprungen adjective
stray wandering or lost ◊ stray cats and dogs.
af weglopend | ar شارد، ضال، تائه | bg изгубен | br perdido | ca perdut, extraviat, llista de carrers | cs zatoulaný | de streunend | dk omstrejfende; herreløs | el αδέσποτος | es perdido, extraviado, callejero | et hulkuv | fa گمشده | fi kuljeksiva | fr errant | he חסר בית | hi भटका, भूला-भटका | hr zalutao, zabludio | hu elkóborolt | id sesat | it randagio | ja はぐれた | ko 길잃은 | lt benamis, valkataujantis | lv noklīdis; klaiņojošs | ml terbiar | nl zwerf- | pl bezdomny | ps ورک شوی | pt perdido | ro rătăcit | ru бездомный | sk zatúlaný | sl klateški | sr izgubljen | th ซึ่งพลัดหลง | tr başıboş dolaşan | tw 漫遊的 | uk бездомний | ur آوارہ یا لاوارث | vi lạc, mất | zh 漫游的
GLOBAL
Create a rich lexical dataset for each language
Apply English pedagogical lexicography principles
lexical deconstruction and reconstruction
Each language set serves as a base to develop monolingual, bilingual & multilingual combinations
Add translation equivalents to L1 in other languages
Adapt content specifically for each language pair
Use differently to suit each target group and usage
L1/L2-speakers, language learning, translation, etc.
Produce the data in digital & print forms
Incorporate data with NLP/LT
AlternativeScripting
AlternativeSpelling
Antonym
CompositionalPhrase
CrossReference
Definition
Example
GeographicalUsage
GrammaticalGender
GrammaticalNumber
HomographNumber
Lemma
Morphology
PartOfSpeech
Pronunciation
RangeOfApplication
Register
SenseIndicator
SenseQualifier
SubCategorization
SubjectField
Synonym
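To show how a few of these categories might appear in XML data, here is a small sketch using Python's standard `xml.etree.ElementTree`. Only the category names (`Lemma`, `PartOfSpeech`, `Definition`, `Example`) come from the list above; the `Entry` and `Sense` wrapper elements, the nesting, and the `build_entry` function are assumptions for illustration and do not reproduce the actual K Dictionaries schema or DTD. The sample values are taken from the "runaway" entry shown in the Swedish sample.

```python
import xml.etree.ElementTree as ET


def build_entry(lemma, pos, definition, examples):
    """Build a toy entry; the Entry/Sense wrappers are hypothetical, not the real schema."""
    entry = ET.Element("Entry")
    ET.SubElement(entry, "Lemma").text = lemma
    ET.SubElement(entry, "PartOfSpeech").text = pos
    sense = ET.SubElement(entry, "Sense")
    ET.SubElement(sense, "Definition").text = definition
    for example in examples:
        ET.SubElement(sense, "Example").text = example
    return entry


entry = build_entry(
    lemma="runaway",
    pos="noun",
    definition="a person, animal etc that runs away",
    examples=["The police caught the two runaways."],
)
xml_text = ET.tostring(entry, encoding="unicode")
print(xml_text)
```

Other categories from the list (for example `Pronunciation` or `Register`) could be added as further child elements in the same way.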
MAPPING
LANGUAGES
• Arabic • Chinese Simp. • Chinese Trad. • Czech • Danish • Dutch (2) • English • French (2)
• German (2) • Greek • Hebrew • Italian (2) • Japanese • Korean • Latin • Norwegian
• Polish • Portuguese Braz. • Portuguese Port. • Russian • Spanish (3) • Swedish (2) • Thai • Turkish
SAMPLES. GLOBAL
Dutch French
VISION
THANK YOU [θæŋk juː] interj. I thank you: Thank you for your attention!
Afrikaans dankie
Arabic شكرا، أشكرك
Bulgarian благодаря
Chinese Simplified 谢谢(你)
Chinese Traditional 謝謝(你)
Croatian hvala
Czech děkuji
Danish tak
Dutch dank je
Estonian aitäh, tänan teid
Farsi ممنون
Finnish kiitos
French merci
German danke
Greek (σε, σας) ευχαριστώ
Hebrew תודה
Hindi धन्यवा�द द�ने� य� मने� करने� क� एक
Hungarian köszönöm!
Icelandic þakka þér
Indonesian terima kasih
Italian grazie
Japanese ありがとう
Korean 감사합니다
Latvian paldies; pateicos
Lithuanian ačiū
Malay terima kasih
Norwegian tusen takk (for)
Polish dziękuję
Portuguese Brazil obrigado/-da
Portuguese Portugal obrigado/-da
Romanian mulţumesc
Russian благодарю
Serbian hvala
Slovak ďakujem
Slovene hvala
Spanish gracias
Swedish tack [ska du/ni ha]!, tackar!
Thai การแสดงความขอบคุณ
Turkish teşekkür ederim
Ukrainian дякую; спасибі
Urdu آپ کا شکریہ
Vietnamese cảm ơn