fifteen centuries dutch vocabulary
DESCRIPTION
Fifteen centuries Dutch vocabulary. Four historical dictionaries in one internet application. Historical Dictionaries of Dutch. 2009 - Oudnederlands woordenboek - ONW Dictionary of Old Dutch , 2009 (online) coverage 500-1200 1999 - Vroegmiddelnederlands woordenboek – VMNW - PowerPoint PPT PresentationTRANSCRIPT
Marijke Mooijaart INL Leiden Bullay Deutschland april 2010
Fifteen centuries Dutch vocabulary
Four historical dictionaries in one internet application
Historical Dictionaries of Dutch
2009 - Oudnederlands woordenboek - ONWDictionary of Old Dutch, 2009 (online)
coverage 500-1200
1999 - Vroegmiddelnederlands woordenboek – VMNWDictionary of Early Middle Dutch (hard copy and online)
coverage 1200-1300
1882-1929/1941 - Middelnederlands Woordenboek – MNWDictionary of Middle Dutch (hard copy, CD-ROM, online)
coverage 1250-1550
1864-1998/2001 Woordenboek der Nederlandsche Taal - WNTDictionary of the Dutch Language (hard copy, CD-ROM, online)
coverage 1500-1976
Differences
• source materialopen or closed corpus, size
• grammatical informationspelling and sound variants; etymology
• regional informationorigin of texts and authors
• quantitative data spelling and sound variants
• datingfirst attestation, dated citations
• definition style; sense structureanalytical or synonym definition; sense hierachy
WNT (1500-1921/1976)3 meters of WNT volumes
I WNT (1500-1976)
• source material: open corpus, expanding from 1000-12.000 titles; no proper names
• grammatical information:often detailed etymological information
• regional information :no, or only very global information
• quantitative data: none• dating: originally only in recent volumes• definition style; sense structure:
analytical definitions; hierarchical structure
MNW (1250-1550)
Jakob Verdam *22 January 1845 - † 19 July 1919
II MNW (1250-1550)• source material:
open but restricted, mainly literary corpus; basis often 19th-century, critical editions; no proper names
• grammatical information:spelling and sound variants are included, often as reference entries
• regional information:information attached to source information (separate volume)
• quantitative data: none• dating: originally only attached to source information
(separate volume)• definition style and sense structure
synonym definitions; more linear sense structure
VMNW (1200-1300)
Jacob van Maerlantauthor and scribe ca. 1225 – ca. 1300
III VMNW (1200-1300)
• source material: closed corpus of official and literary texts; diplomatic editions; proper names included
• grammatical information: succinct etymology
• regional information:information attached to citations
• quantitative data on spelling forms and sound variants• dating of all citations• definition; sense development:
synonym definitions; more linear sense structure
ONW (600-1200)
Love song or probatio pennae? Manuscript 11th c.
Oldest sentence: maltho, thi afrio, letu ‘I declare, I let you free, half-free man’
Lex Salica, 6th century
IV ONW (600-1200)
• source material: closed corpus of three longer literary religious
texts; proper names included; fragments and glosses
• grammatical information: succinct etymology
• regional information:information in secondary literature about sources
• quantitative data on spelling forms and sound variants• dating of all citations• sense development:
synonym definitions; more linear sense structure
Integrated search: dictionaries in the Language Bank
INL Dutch Language Bank:Electronic database of Dutch from 6th-21st centuryincludes
• Texts• Dictionaries• Computational lexica
Searches are combined: one search for the results in the various works in the three components
Dictionary data model
The four dictionaries are different in many aspects,but are similar as to the macrostructure of the electronicentries:
• headword• grammatical information• semantic analysis, including illustrative citations• related entries (subentries: compounds, derivatives)
Encoding
All data was converted to TEI-XML; various information categories were uniformly encoded
For some dictionaries additional data development was necessary:
• modern Dutch headword for all entries• uniform indication of part of speech• citations: separate encoding of citation text, source
indication and dating• abbreviated forms of compounds (as subentries) were
completed,e.g. apen-, burger-, dominees-, varkensbrood > apenbrood, burgerbrood, domineesbrood, varkensbrood
Modern Dutch lemma as standard headword
head fish deal/ ‘to divide’
Haupt Fisch teilen
ONW Old Dutch standard hōvit fisk dēlon(10th c.)
VMNWMiddle Dutch standard hovet visc delenMNW Middle Dutch standard hovet visch delenWNT Spelling
De Vries-te Winkel hoofd visch deelen2009 modern spelling form hoofd vis delen
Headwords problems
• modern headword form for lost words or misreadings (MNW)
e.g. ONW swēneri ‘swineherd’MNW swinerWNT zwener (not after 17th c.)
modern headword zwijner or zwener?
• homonyms are usually numbered differently in the four dictionaries: MNW wel III = ‘well’, adverb
WNT wel III = ‘bubble’, noun wel V = ‘well’, adverb
wel ‘well’ 1, ONW1 – VMNW2 – MNW3
wel ‘well’ 2, MNW-3
wel ‘well’ 3, MNW-3, WNT4
Internet application: search options
• various combinations of dictionaries
• entries (search key: modern headword)
• words in sense section (definitions)• words in citations• words in full text
• sources
• external links
Internet application: results
• full article text OR display of article structure (sense numbers and definitions)
• display with OR without citations• all results in articles OR in concordances with number of
attestations• results in different sections (see advanced search)• highlighting of key words in results• search results can be exported• external links to other reference works (etymology,
dialect) and pictures
Further developments: lexicon GiGant
• project of Dutch Language Bank• computational lexicon 6th – 21th century• each entry consists minimally of standard form – word
form – pos – attestation, with additional semantic, morphological, syntactic information
• provides access to texts and dictionaries• historical variants are retrieved from historical texts AND
from historical dictionary citations; e.g. uiterlijk in WNT
uiterlijk – utterly – äusserlich ‘exterior’ WNT UITERLIJK adj.
regular, predictable variants: • anlautvowel: u – uu – ui – uj – uij – uy – w• suffix -lijk: ij - i – y – ie (?) / c – k – ck• endings; -e – -st – -ste – -er – -en – -s
uytterlijcste uyterlijkste d'uyterlijke uiterlyke uyterlijcke uiterlijke uyterlijck uiterlyken uiterlijkste uiterlicke wterlicke wterlijcke ulterlijk uiterlyk uiterlijk uyterlick wterlicken d'uyterlijcke uiterlijken uiterlijks wterlijck uytterlicke uitterlijke ujterlijke uytterlijk uyterlycke uyterlicken uijterlicke d'uiterlijcke wtterlijcke wterlyke wtterlijk uiterlijke uuterlick uuterlic uyterlijke uyterlijcken uyterlicke d'uiterlyke wterlijke vuyterlijcke uuterlycke uuterlicke wterlijken uyterlijcksten uuyterlicke uuyterlick uuyterlycke uytterl uytterlijcke uytterlycke uytterlick vuytterlicke uiterlijker uyterlyck uterliek wterlijcken uiterlijkst uitterlijk uytterlijcken uyterlyk uiterlijk-net wterlick uutterlijck uuyterlicken uyttelijck uijterlijk uytterlijck uuterlijck uiterlick uitterlyk uuyterlic uuyterlyck uuyterlijck uiterlijck uytterlyck uterlyc wterlijk
Further research: sense development from Old Dutch through Modern Dutch
tuin, old germanic tūn (german zaun, english town)
ONW: ‘fence, palings, enclosure(?)’. 1 citation from the Wachtendonck Psalms (10th c.)
VMNW: ‘fence, palings’. Oldest attestation 1268. Much onomastic material.
MNW: 1. ‘fence, palings; wattle’. 2. ‘by the fence enclosed area; yard’.Citations from the whole Middle Dutch period.
WNT: 1. ‘fence’. Obsolete; current in the 16th c., after that less common; still to be found in dialects. 2. ‘separate piece of land, among other things for growing vegetables or flowers; (engl. ‘garden’)’. Since the 17th c., common from the 19th c. onwards.