the practice of terminology work · don’t forget the maintenance ... fr "indicateur [visuel]...
TRANSCRIPT
May 2014 Tekom Belgium - © D. O. G. GmbH 1
The Practice of Terminology work
Dr. François Massion, D. O. G. Dokumentation ohne Grenzen GmbH ([email protected])
May 2014 Tekom Belgium - © D. O. G. GmbH 2
The most important aspects of terminology work
Use of terminology
Terminology extraction
Term entry: Models and concepts
Management and distribution
Programs and standards
Overview
May 2014 Tekom Belgium - © D. O. G. GmbH 3
This is Mr. Smith, design engineer
Mr. Smith has
designed a "thing"
Mr. Smith
I am the "thing"
May 2014 Tekom Belgium - © D. O. G. GmbH 4
Journey of a concept
I need 300 timingcircuits
CustomerWe will start a sales
campaign for thetimer A-12
Sales
Now thetime emitter A-12 is being produced.
Supplier
I get a quote for theproduction of the electronic
timer
Purchasing
This is my new thing. I call it A-12 Timer
Developper
May 2014 Tekom Belgium - © D. O. G. GmbH 6
Test
How do you call this thing “N”? "soap dispenser" "detergent dispenser" "detergent drawer" "detergent tray" "soap drawer" "洗涤剂抽屉"
"清洁剂抽屉"
"洗洁精抽屉"
May 2014 Tekom Belgium - © D. O. G. GmbH 9
On the authoring side
Written by Mr. Smith, June 2007 Written by Mr. Jones, April 2009Modules
May 2014 Tekom Belgium - © D. O. G. GmbH 10
On the translation side
Translated by Mr. Martinez, May 2010
Machine translated, April 2007
(Change Hidden Key : Input the new hidden key .)
Translated by Ms. Fernandez, July 2010
Translated by Mr. Caballero, Oct. 2008
Sentences (or "segments")
May 2014 Tekom Belgium - © D. O. G. GmbH 11
Growing importance of terminology
Globalization of the world economy means: More trade and competition More products and shorter
product development cycles(example: iPhone)
More laws and regulations This leads to:
More documents in shorter time intervals More translators and authors More frequent updates More Languages
May 2014 Tekom Belgium - © D. O. G. GmbH 12
Data bases
Docu-ments
Content Management System
Machine Translation (MT)
Translation MemorySystem
"Single Source Publishing"
New translation processes
<XML>
May 2014 Tekom Belgium - © D. O. G. GmbH 13
Terminology and the outside world
Within the company: Standardize With the outside world: Understand and be
understood Customers (sales, support, user-friendly documentation) Government & institutions Suppliers Public at large
Know and use the language of the outside world
May 2014 Tekom Belgium - © D. O. G. GmbH 14
Terminological influence (in %)
100%
50%
75%
30%10%
0%
20%
40%
60%
80%
100%
120%
Ownpublications
Customers Suppliers Govt. Agencies Public
May 2014 Tekom Belgium - © D. O. G. GmbH 16
Terminology and knowledge
The philosophy of the inductive sciences, Band 1, 1840
William Whewell (1794-1866)
Whewell was a co-founder and president of the British Association for the Advancement of Science. He was a Master of Trinity College, Cambridge. He wrote two important studies on the history and philosophy of science. He coined the word "scientist" in 1833.
May 2014 Tekom Belgium - © D. O. G. GmbH 17
Terminology and knowledge
Knowledge is a dynamic development process Terminology helps ordering and transfer
knowledge Break down knowledge in its components:
Creation of concepts Concepts may be different between languages! Relationships between concepts
(e.g. "Belongs to") Add clarity with definitions Work out semantic characteristics
(with respect to terminology gap)
May 2014 Tekom Belgium - © D. O. G. GmbH 18
Use of terminology
Terminology extraction
Term entry: Models and concepts
Management and distribution
Programs and standards
May 2014 Tekom Belgium - © D. O. G. GmbH 19
Types of extraction
Terminology extraction
Manual
Monolingual Multilingual
Automatic
Monolingual Multilingual
Interactive
Monolingual Multilingual
May 2014 Tekom Belgium - © D. O. G. GmbH 20
Exercise 1: Extraction(downloaded from the Internet) on 28.04.2014
May 2014 Tekom Belgium - © D. O. G. GmbH 22
Extraction in the source language
Which words do you want to extract? Which word categories are considered?
– Grammatically: • Nouns, verbs and adjectives• Bi-grams, tri-grams and collocations
– Content: • Specific technical term of a company• General technical terms• Abbreviations• General terms
May 2014 Tekom Belgium - © D. O. G. GmbH 23
How many terms shall be extracted?
Examples: Manufacturing company: 3-10,000 terms per
language Steel plant: 58,000 entries! Open biomedical ontologies: 1.5 million concepts!
Optimal scope Depends on goals, time, and resources Don’t forget the maintenance If little time: only the very specific terms for a
domain / company
May 2014 Tekom Belgium - © D. O. G. GmbH 24
Terminology and goals
Goals have an impact on terminology work Examples of goals:
1. Reduce costs of translation through standardization
2. Facilitate trade between countries
3. Reach more customers4. Organize knowledge5. Improve communication
Tools Data model
Type ofcontent # of entries
GOALS
May 2014 Tekom Belgium - © D. O. G. GmbH 25
Terminology "at the press of a button"
Our vision
Start
Terminologyentries
May 2014 Tekom Belgium - © D. O. G. GmbH 26
But ...
Text = highly complex system of lexical entities Automatic recognition of:
Various appearances of same term ("lemmatize") Collocations: "user interface" vs. „some interface" Lexical variants:
„program", „application", „software", „tool "... Correct relations (ambiguity):
„Check settings" = „Settings of the check" or „Check the settings"?
Correct grammatical category: "Copy" = verb or a noun?
May 2014 Tekom Belgium - © D. O. G. GmbH 27
Learning from computational linguistics
Methods
Nothingcomes
out of it!
May 2014 Tekom Belgium - © D. O. G. GmbH 28
Computational linguistics
Objective: Understand natural languages in order to process them with programs
Natural Language Processing (NLP) Create a language model, e.g. recognize
components and their relations Extraction of patterns such as Adj + Noun, Noun +
Noun. Application examples:
Machine translation (MT) Terminology extraction Information mining
May 2014 Tekom Belgium - © D. O. G. GmbH 29
Word analysis: Morphology
Analytical languages (such as Chinese) and synthetic languages with affixes and inflections (case, number, gender, etc. ). Example: Drukontluchtingsklep / 减压阀 / pressure relief valve / vanne de dépressurisation
Stemming: Reduce to root form of a word (=stem): Activated, activation, actively, actives activ (stem)
Lemmatization: Reduction to base form of a word, lexical entry (=lemma) activated activate (verb)
N-grams: bi-gram, tri-gram = sequences of N consecutive words. Often adjacent, but not always: EN "[visual] clogging indicator" FR "indicateur [visuel] de colmatage"
May 2014 Tekom Belgium - © D. O. G. GmbH 30
Extraction steps
Corpus Import•Remove format and punctuation•Create list of words (single words, n-grams)
Clean-up•Remove duplicates, unwanted words, stop words• Lemmatize
Enter data•Add information according to data model (status, etc. )•Add equivalents in other languages
May 2014 Tekom Belgium - © D. O. G. GmbH 31
Which types of words? Raw corpus
Belangrijk!Zorg voor voldoende plaats voor onderhoud en instandhouding!Voldoende ruimte inplannen voor het openen van de aansluitkast en de elektrische aansluiting en de eventuele frequentieomzetter inplannen.Het betonfundament moet- geheel zijn uitgehard,- voldoende stijfheid hebben (minimaal klasse X0 conform DIN EN 206),- een horizontaal en effen oppervlak hebben,- vibraties, krachtinvloeden en stoten kunnen opnemen- zo zijn bemeten, dat de stervormige grepen (004) op de filterdeksel (003) handmatig moeten kunnen worden geopend.Plaats de pomp als de vloer aan deze voorwaarden voldoet.
May 2014 Tekom Belgium - © D. O. G. GmbH 32
Which types of words? First categorization
Belangrijk!Zorg voor voldoende plaats voor onderhoud en instandhouding!Voldoende ruimte inplannen voor het openen van de aansluitkast en de elektrische aansluiting en de eventuele frequentieomzetter inplannen.Het betonfundament moet- geheel zijn uitgehard,- voldoende stijfheid hebben (minimaal klasse X0 conform DIN EN 206),- een horizontaal en effen oppervlak hebben,- vibraties, krachtinvloeden en stoten kunnen opnemen- zo zijn bemeten, dat de stervormige grepen (004) op de filterdeksel (003) handmatig moeten kunnen worden geopend.Plaats de pomp als de vloer aan deze voorwaarden voldoet.
NewTerminology Known Terminology Stop words Words to be qualified Abbreviations/Acronyms/Product names
May 2014 Tekom Belgium - © D. O. G. GmbH 33
The Word/Excel method
Quick help in urgent situations Word:
1. Save file as a text file (=remove formatting)2. Remove superfluous characters3. Generate a list of words (replace blanks with
return) Excel:
1. Delete duplicates2. Remove stop words3. Reduce to basic form („lemmatize")
May 2014 Tekom Belgium - © D. O. G. GmbH 34
Remove superfluous words (Excel)
Stop words (MATCH function) General words (is, the, always... ) Existing terms in database Save rejected words for later
clean-up actions “recycling”
Stop words become more efficient over time
Special stop words for group of words (phrases)
Include existing terminology in stop words list
May 2014 Tekom Belgium - © D. O. G. GmbH 35
Sorting out in Excel
step 1:Insert an empty column
step 2:Enter symbol for terms you want to reject (or accept)
step 3:Sort list with this column step 4:
Edit terms
step 5:Save list of rejected entries for later re-use
May 2014 Tekom Belgium - © D. O. G. GmbH 36
Lemmatization
New column with formulaRIGHT(A1;2)
Sort on endings Edit inflectional endings
May 2014 Tekom Belgium - © D. O. G. GmbH 37
Used Excel functions
Comparison of entries (here "A1 ") with lists=MATCH(A1;MYSTOPWORDS;0) or =MATCH(A1;$C$1:$C$1000;0)
[on German OS: "VERGLEICH"]
"MYSTOPWORDS" is a variable, self-defined name Length of an entry (e.g. cell „A1")
=LEN(A1) Two last characters from the right (of „A1")
=RIGHT(A1;2)
May 2014 Tekom Belgium - © D. O. G. GmbH 38
What are N-grams? Set groups of 1, 2, n words Example: „elektrische aansluiting" More frequent in some languages (English, Italian
etc. ) Challenge: How do you recognize them?
Bi-grams, tri-grams, n-grams (1/4)
May 2014 Tekom Belgium - © D. O. G. GmbH 39
Bi-grams, tri-grams, n-grams (2/4)
Copy the unsorted list of words with an offset of one cell up
May 2014 Tekom Belgium - © D. O. G. GmbH 40
Sort in a text editor (Word, Notepad etc. ) + replace tabs with spaces
Copy in a column in XL and evaluate (partial results)
Bi-grams, tri-grams, n-grams (3/4)
May 2014 Tekom Belgium - © D. O. G. GmbH 41
Excel result:
Review list manually
Bi-grams, tri-grams, n-grams (4/4)
May 2014 Tekom Belgium - © D. O. G. GmbH 42
No one-to-one symmetry:
1) Different morphology: Dutch: frequentieomzetter (1 word); English: frequency converter (2 words);
Chinese: 目录(2 characters)
Chinese: No plural in Chinese symbols: treaty / treaties = 条约
2) Unmatching word categories
3) Different syntactical patterns
4) Semantical differences (terminology gap): Different nomenclatures (e.g. international classification of diseases (ICD) but differing
categories in many countries)
Bilingual extraction: Issues
May 2014 Tekom Belgium - © D. O. G. GmbH 43
Approaches to extract bilingual terminology:Option #1 Align texts (e.g. translate with CAT) Extract separately key terms in both
languages and highlight them Bilingual term alignmentOption #2 Align texts Sort according to length Shorter entries (up to 30 characters)
may be matching termsOption #3 Use extraction software like
Multiterm Extract
Methodology
May 2014 Tekom Belgium - © D. O. G. GmbH 44
Equivalence and standards
Standard ISO 12620 describes equivalence categories: narrower equivalent quasi-equivalent (or near-equivalent) broader equivalent phrase
• Qualifier assigned to a phraseological unit in one language that expresses the same semantic content as a phraseological unit in another language.
Automatic translation of "time" in various languages
May 2014 Tekom Belgium - © D. O. G. GmbH 45
Tips & tricks (1)
Where do you find terminology? Databases (IATE = Inter-Active Terminology for Europe, LEO) Online dictionaries (e.g. : "inurl:terminology") Collections of dictionaries Discussion groups (www.proz.com)
Search engines Helpful search methods
Searched phrase in quotation marks Search in multiple languages "socket" + "ist ein" Search in multilingual sites: "site:ch" term + "de-en" (example: Schraube de-en)
May 2014 Tekom Belgium - © D. O. G. GmbH 46
Tips & tricks (2)
Use the Internet
(a) To validate an expression
(b) To find the frequency of alternatives
Example: "Schakelschema" in French? Schéma de câblage Schéma des connexions Schéma des circuits Schéma électrique
May 2014 Tekom Belgium - © D. O. G. GmbH 47
Tips & tricks (3)
Use the image search feature
Frequency converter = 变频器 ???
May 2014 Tekom Belgium - © D. O. G. GmbH 48
Use of terminology
Terminology extraction
Term entry: Models and concepts
Management and distribution
Programs and standards
May 2014 Tekom Belgium - © D. O. G. GmbH 49
Organizing the terminology
Classical dictionaries are sorted in alphabetical order.
No way to group words with the same meaning: Synonyms such as "application" and "software" Abbreviations and full names: "afz." and "afzender",
Classical dictionaries list all meanings of one word. "bank": what are the meanings of this word?
Same with multilingual dictionaries
May 2014 Tekom Belgium - © D. O. G. GmbH 50
The „semiotic triangle"
" Electric socket "
socket jack wall outlet junction box
Term (= designation)„stopcontact“
Mental representation = "concept"
(actual) object
May 2014 Tekom Belgium - © D. O. G. GmbH 52
Concept / Term
Term-based terminologies Traditional dictionaries Encyclopedias
Concept-based terminologies The basic entry is a concept Can include multiple terms
(e.g. synonyms) and languages
May 2014 Tekom Belgium - © D. O. G. GmbH 53
But what is really a concept?
Reality can be looked at in different ways
Different visions and perceptions: „drawer“ vs. „schuiflade“„exit“ „uitgang“ / „afrit“
Influence of culture, educational system, languagecommunity, classification systems … Translation issues
Why is this the same?Why is this different?
May 2014 Tekom Belgium - © D. O. G. GmbH 54
Componential Analysis (CA): relevantcharacteristics
Useful to define the degree of equivalence between words in a language or between languages
Concepts, terms and synonyms
May 2014 Tekom Belgium - © D. O. G. GmbH 55
Meaning relationships
Synonyms - Homonyms
Concept Term
Synonymy
Polysemy
Homonymy
1
123
123
TTT
TaTbTc
T
softwareprogramapplication
foot
mouse (pl. "mice")mouse (pl. "mouses", "mice")
May 2014 Tekom Belgium - © D. O. G. GmbH 56
Exercise 2: Data model
Which information should be added to the terms?
What are the hierarchical levels in the data model?
Which levels the additional information should be allocated to?
May 2014 Tekom Belgium - © D. O. G. GmbH 57
Data categories
ISO 12620:2009-12 defines more than 200 data categories
"Terminology and other language and content resources - Specification of data categories and management of a Data Category Registry for language resources" Concept oriented categories Administrative categories Term related categories
www.isocat.org
May 2014 Tekom Belgium - © D. O. G. GmbH 58
A data model (example)
• Definition• Source of definition• Image• Attribute• Comment
Concept Level
• Homonym available (Y/N)Language level
• Status• Word Type• Usage• Example• Term comment
Term level
ApprovedTo be checked
NounVerbAdjectiveAdverbProper NameAbbreviationShort FormDisplay Text
AuthorizedProhibitedUnclearNot requiredTo be deleted
May 2014 Tekom Belgium - © D. O. G. GmbH 59
Semantic enrichment
Adds important information to the meaning of the concept
Particularly important for complex or ambiguous terms: "device", "disk" ...
Instruments: Definition Reference Image Contextual data (example) Remarks
May 2014 Tekom Belgium - © D. O. G. GmbH 60
Definitions must: Have a coherent structure Give unique essential characteristics
(function, appearance, consistence) Be as short as possible
Definitions must not: Consist of synonyms ("a dog is a hound") Be circular (A is B and B is A) Be negative (XYZ is no ABC) Be too narrow / broad
What is a good definition?
May 2014 Tekom Belgium - © D. O. G. GmbH 61
Define a touch screen
Touch screenBij een aanraakscherm of touch screen kan je met de vinger of een speciale pen (stylus genoemd) een handeling op de monitor uitvoeren. Er komt dus geen toetsenbord of muis bij kijken.
touch screenaanraakgevoelig schermType beeldscherm dat reageert bij het aanwijzen met je vinger. Veel gebruikt bij infokiosken. Maakt een toetsenbord quasi overbodig.
Aanraakschermbeeldscherm waarmee je je computer bedient door bepaalde plaatsen aan te raken computers
Aanraakschermeen beeldscherm dat ook als invoerapparaat voor een computer kan worden gebruikt door het scherm aan te raken.
AanraakschermBij een aanraakscherm of touchscreen wijs je met de vinger of een speciale pen op de monitor om een handeling uit te voeren. Hier komt dus geen muis of toetsenbord aan te pas. Het grootste voordeel hiervan is dat de gebruiker in principe een oneindig aantal (virtuele) knoppen kan aansturen met hetzelfde oppervlakte.
AanraakschermEen aanraakscherm (vaak met het Engelse touchscreen aangeduid) is een beeldscherm dat ook als invoerapparaat voor een computer of embedded system (zoals een smartphone of tablet) kan worden gebruikt door het scherm aan te raken.
May 2014 Tekom Belgium - © D. O. G. GmbH 62
Usage of terminology
Terminology extraction
Term entry: Models and concepts
Management and distribution
Programs and standards
May 2014 Tekom Belgium - © D. O. G. GmbH 63
Distribution of terminology
Usage: What / whom for? Work: Translator, interpreter, author, programmer… Quality check Information and knowledge
Applications Translation Memories Authoring tools,
quality assurance tools Data bases etc…
Central repository Settle responsibilities and rights Stores all modification from various users
May 2014 Tekom Belgium - © D. O. G. GmbH 64
Exchange formats
Excel: List with information in columns List with information in cells
PDF, TXT, CSV, MS-Word XML based standards:
TBX (TermBase eXchange) OLIF (Open Lexicon Interchange Format) MARTIF
(Machine readable terminological interchange format) UTX (universal terminology eXchange)
[designed by AAMT (Asia-Pacific Association for Machine Translation)]
May 2014 Tekom Belgium - © D. O. G. GmbH 65
List with term data in columns
Problem with variable number of columns
Many languages, many attributes: very large tables
Def. Image EN1 Attr. EN2 Attr. DE Attr. FR ...
XXX PC Computer Rechner
May 2014 Tekom Belgium - © D. O. G. GmbH 66
List term data in rows
IDs must be allocated Difficult exchange of data:
Database is three-dimensional List is two-dimensional
ID Def. Image Language Term Attr. 1 Attr. 2
1 DE Rechner
1 EN PC
1 EN Computer
2 DE Maus
..
N
May 2014 Tekom Belgium - © D. O. G. GmbH 67
Maintenance of terminology
Terminology is never complete: Status Optional fields Control the use of synonyms Detection of polysemous terms Use experience gained from Quality Assurance (texts,
translations, terminology). Depending on the structure of the dictionary, efforts vary
(definitions or not, additional information, etc. ) Approx. 10% of the time needed to create new terminology
entries
May 2014 Tekom Belgium - © D. O. G. GmbH 69
LookUp – a short introduction
Access via the internet or an intranet Definition of the different user rights
Read Comment Write Also for individual languages (notification by mail)
Free definition of data model Definable queries Suggestion system Recognition of homonyms Various export and import formats (XLS, TBX)
May 2014 Tekom Belgium - © D. O. G. GmbH 71
Access Data
http://88.79.119.98 User: Tekom_May14 Password: Antwerp_011
May 2014 Tekom Belgium - © D. O. G. GmbH 72
Exercise 3: Terminology quality
Which entries are not correct? Entry rules
Singular without article Uniform spelling One type/piece of information per data field
May 2014 Tekom Belgium - © D. O. G. GmbH 73
Is terminology always recognized?
Mit elektrischem Handrad,Tasten für Achs-Richtung,Vorschub, Eilgang, Not-Aus-und Zustimmtaste, sowie drei belegbaren Funktionstasten.
With electrical handwheel, keys for axial direction axis direction, feed, rapid traverse, Emergency buttonEmergency Stop button and acceptance key as well as three assignable functional keys function keys.
In database and recognized In database and not recognized Not entered in database
Entries in the terminology database: Not-Aus-Taste Achsrichtung (Bindestrich bitte prüfen)
3 unrecognized errors!
May 2014 Tekom Belgium - © D. O. G. GmbH 74
Can you always rely on terminology?
10%
15%
15%60%
Checking a project: Terminology is….
SuperfluousMissingNot usableCorrect
May 2014 Tekom Belgium - © D. O. G. GmbH 75
1. Grammar and language usage ("apple(s)")2. Unambiguiousness:
Resistance = value or hardware?Control = action or hardware?
3. Consistent spelling (timeout / time out / time-out). (Consistency of terms, ISO standard 704:2000)
4. Comprehensibility ("Hippopotomonstrosesquippediliophobia" = "the fear of long words“[Hip-po-pot-o-mon-stro-ses-quip-pe-da-li-o-phob-i-a])
5. Natural order (sealing, elastic)6. Singular without article
Quality of terminologyLinguistic criteria
May 2014 Tekom Belgium - © D. O. G. GmbH 76
Quality of terminologyTechnical criteria
Format, no parenthesis, no question marks
Data elementarity(information is separated into individual data categories)
Term autonomy in concept based dictionaries
Comprehensiveness and correctness of information(e.g. status, mandatory fields)
May 2014 Tekom Belgium - © D. O. G. GmbH 77
Overview of the problems to be solved
Detection of synonyms Special case: wrongly separated concepts Special case: contextual synonyms (access point/point)
Detection of homonyms Obsolete entries Missing, wrong, contradictory attributes Definitions: Structure and content, homonyms
without definition Organization: References, relationships
May 2014 Tekom Belgium - © D. O. G. GmbH 78
Quality of terminologyAutonomy of terms
Adapted from: Birgit Wöllbrink – Tools für Terminologiemanagement in Tekom – Schriften zur Technischen Kommunikation Nr. 12, 2008, S. 81
May 2014 Tekom Belgium - © D. O. G. GmbH 79
Translation-friendly terminology
Takes into account the international use, "neutral with regard to culture" Always use the complete form
Takes into account the issue of equivalence (e.g. narrower/broader meaning): Definitions, fields for translation related information
Terms should be understandable for readers, who do not see the actual object (machine, software, device, service).
respiratie-apparaathuishoudelijk apparaat
espresso-apparaat massage apparaat
politie apparaat iOS-apparaat
respiratie-apparaatrespiration device
household applianceespresso machine
massagerpolice apparatus
iOS deviceiOS-apparaat
apparaat
May 2014 Tekom Belgium - © D. O. G. GmbH 80
Usage of terminology
Terminology extraction
Term entry: Models and concepts
Management and distribution
Programs and standards
May 2014 Tekom Belgium - © D. O. G. GmbH 85
Some software packages…Software Description Web
acrolinx® Terminology Lifecycle Management
Terminology extraction and management, terminology platform www.acrolinx.com
Across crossTerm System for the extraction, management and review of terminology. www.across.net
AntConc3.2 Statistical analysis of text, free www.antlab.sci.waseda.ac.jpConcord Wordsmith: concordance and keywords www.lexically.net
Concordance Monolingual terminology extraction with sentence examples. Price value. www.concordancesoftware.co.uk
Extphr33 Monolingual terminology extraction, free http://publish.uwo.ca/~craven/freeware.htmHeartsome Dictionary Editor XML-based dictionary www.heartsome.net
KWIC Concordance Corpus statistics, concordance, free www.chs.nihon-u.ac.jp/eng_dpt/tukamoto/kwic_e.html
KWICKWIC Concordance tool www.kwickwic.com
Lookup Internet-enabled platform terminology - purchase + rental software version www.dog-gmbh.de
MultiTerm 2011 und MultiTerm Extract Offered by SDL / Trados. Integrated in Trados technology. Multilingual. www.sdl.com
qTerm Web-enabled system for managing terminology. www.kilgray.com
SDLX Phrase Finder Terminology extraction with linguistic skills. Available for seven languages. www.sdl.com
Simple Concordance Program (SCP) Concordance and extraction program (monolingual) - Freeware www.textworld.com
Synchroterm Bilingual terminology extraction, semi-automatic, efficient www.terminotix.com
Terminology Extractor Extraction of Monolingual terminology www.chamblon.com/terminologyextractor.htm
TermStar Terminology module of Star Transit www.star-ag.ch
Textanz Monolingual terminology extraction with context specification www.cro-code.com
Textstat Statistical analysis of text, free www.niederlandistik.fu-berlin.de/textstat/
May 2014 Tekom Belgium - © D. O. G. GmbH 86
ISO Standards
ISO Technical Committee (ISO/TC 37). Since 1951: ISO 704:2009 Terminology work – Principles and methods ISO 1087:2000 Terminology work – Vocabulary – Part 1: Theory and
application ISO 12620:1999 Computer applications in terminology - Data
CategoriesISO 12620:2009 is now an open category database("Data Category Registry"): www.isocat.org
ISO 16642:2003 Computer applications in terminology –Terminological markup framework
ISO 15188:2001 Project management guidelines for terminology standardization
ISO/DIS 22274 Systems to manage terminology, knowledge and content - Concept-related aspects for developing and internationalizing classification systems
May 2014 Tekom Belgium - © D. O. G. GmbH 87
Bibliography
Best Practices in der Terminologiearbeit. 2.0 Akten des Symposions. Deutscher Terminologie-Tag e.V., Heidelberg, (2014) Price: € 50
Cabré, M. Teresa. – Terminology : Theory, Methods and Applications. – John Benjamins Publishing; 1999
COTSOES: Recommendations for Terminology Work. Bern; 2002 (can be downloaded: http://termcoord.files.wordpress.com/2013/07/here.pdf)
Sager, Juan C. - A Practical Course in Terminology Processing. John Benjamins Publishing; 1990
Wright, S.E./Budin, G. comp. (1997/2001). Handbook of Terminology Management. John Benjamins Publishing;Volume 1: 1997; Volume 2: 2001
May 2014 Tekom Belgium - © D. O. G. GmbH 88
Thank you for your attention!
Dr. F. MassionD. O. G. Dokumentation ohne Grenzen GmbHNeue Ramtelstr. 1271229 Leonberg, [email protected]