elhanan adler [email protected] english language subject headings in the catalog of the...
TRANSCRIPT
Elhanan [email protected]
English language subject headings in the catalog of the National
Library of Israel
Background
2
• Subject retrieval at the NLI has traditionally been based on a classified catalog using a system which is a mixture of Dewey Classification, Scholem Classification (expansion of Dewey for Judaica), UDC and descriptive text (including many synonyms)
• This system is unique to the NLI. Much effort was expended in its upkeep.
• Most Israeli libraries use alphabetic subject headings (either LCSH or Hebrew)
• The stacks of the NLI (closed) are arranged by inventory number. Reading rooms have unique shelving systems.
Examples of the current system
3
001.091.767.1(08) Knowledge (History, description, critical appraisal of intellectual activity in general) - Muslim, Islamic, Arab countries - Collections, congresses
616(01)(09)(P) Human pathology; Clinical medicine; Diseases; Illnesses - Works of historic value; Sources
296.52 Halakha and poseqim before Moshe ben Maimon (Maimonides), Geonic literature and responsa, tshuvot ha-geonim
296.562.22(42) Shabbat - Halakha - Eruv hatserot (eruv of courtyards), eruv tehumin (eruv of boundaries) - England, Great Britain
E114(020.8) Yerushalayim, Jerusalem - History and description - Audiovisual treatment including video cassettes
The major problems
6
• The system is not known or understood outside the Library
• Today, most users of the NLI catalog are remote users
• Other libraries cannot use this data in their cataloging
• The NLI cannot make use of subject data from other libraries
• High maintenance cost
The logical conclusion
7
The NLI should switch to a subject retrieval system understandable to all users, particularly remote users
A widely used system which does not require local development should be used
The system chosen should allow transfer of subject data to and from the NLI
Why LCSH?
8
Most widely used system in Israeli academic libraries
Most widely used system in academic libraries in the English speaking world
Scholars worldwide are familiar with the system
Use of LCSH will enable the NLI to receive and provide usable subject data
Why not Hebrew?
9
• Creation of a Hebrew subject thesaurus for a large research library (even based on existing thesauri) is a major, expensive professional task and would take years
• It is unlikely other Israeli academic libraries would adopt it (another unique system!)
• It would not serve non-Hebrew speaking scholars (the Library has a universal mission)
• But there is still a need for Hebrew subject access – particularly for the non-academic segment of the NLI’s local user community
Isn’t LCSH passé?
10
Calhoun report (2006): “Abandon the attempt to do comprehensive subject analysis manually with LCSH in favor of subject keywords; urge LC to dismantle LCSH”
But LC has decided to continue LCSH with some simplifications: “Following the Calhoun report, LC did an analysis and determined we would continue with pre-coordinated subject strings (and LCSH for the foreseeable future) complemented by social tagging when that becomes feasible in our firewall environment.” (communication from Barbara Tillett, Feb. 2010)
When change does come, presumably there will be LCSH based conversion tools
The NLI can lead Israeli libraries into future systems only if it starts out together with them
How do we get from here to there?
11
Matching NLI records against LC MARC database (to 2006) held by the MALMAD Consortium [after 2006 MALMAD ceased purchasing LC data and joined OCLC]
Matching NLI records against Israel ULI Union List records from LCSH-using libraries
Creating conversion tables from NLI classification to LCSH
80+% of incoming data from major card conversion project (outsourced to Backstage Library Works) contains LCSH
Not uniform but…“Quick and dirty” with ongoing cleanup procedures
12
Example of conversion table$$a296(07) = 650 $$aJudaism$$xStudy and teaching$$a296.01 = 650 $$aJudaism$$xBibliography$$a296.01(020.8) = 650 $$aJudaism$$xBibliography @@ 655 $$aAudio-visual$$a296.01(04) = 650 $$aJudaism$$xBibliography$$a296.01(05) = 650 $$aJudaism$$xBibliography$$vPeriodicals$$a296.01(06) = 650 $$aJudaism$$xBibliography$$xSocieties, etc.$$a296.01(07) = 650 $$aJudaism$$xBibliography$$xStudy and teaching$$a296.01(08) = 650 $$aJudaism$$xBibliography$$a296.01(09) = 650 $$aJudaism$$xBibliography$$xHistory$$a296.012[100] = 650 $$aJewish philosophy$$vBibliography @@ 650 $$aJewish philosophers$$vBibliography$$a296.012[400] = 650 $$aJewish linguists$$vBibliography$$a296.012[530] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aPhysicists$$vBibliography$$a296.012[574] = 650 $$aJewish scientists$$vBibliography @@ 650 $$aBiologists$$vBibliography$$a296.012[800] = 650 $$aJewish authors$$vBibliography$$a296.012[933.09] = 650 $$aJewish historians$$vBibliography$$a296.014 = 650 $$aAnonyms and pseudonyms, Hebrew @@ 650 $$aAnonyms and pseudonyms, Jewish
Progress in 1.5 years
13
Jan. 2010 – 50,000 NLI records with LCSHJune 2010 – 500,000 NLI records with LCSH
(over 50% of relevant records)June 2011 – 1,127,000 NLI records with LCSH
(over 90% of relevant records)
Authority controlNLI purchased the retrospective LCSH
authority file (370,000 records) from LCSet up as a separate ALEPH authority file
controlling the NLIs subject headingsGeneral policy – make minimal changes and
only in the NLIs core collection areas (Judaica & Israel)Add xrefs from more common (in Israel) termPrefer Israeli standard termAdd specific terms (kosher and/or ‘kosher-
style’)Quantity of records being processed
precluded going through SACO, etc.14
Personal and corporate subjects
16
NLI already has 600 & 610 subjects – but in all 4 scripts, and usually without subdivision
English/romanized 600s and 610s are added when found but this is not a major goal of the current project
Inconsistent romanization (LC vs. Academy of the Hebrew Language)
Future linking of personal and corporate names in various scripts (a separate project) will improve cross-alphabet searching
Uniform title subjectsFollow standard Israeli usage:
Uniform Headings in Judaica authority filee.g. Talmud Bavli and Talmud Yerushalmirather than Talmud and Talmud Yerushalmi
17
Theological issues - Bible
Bible used for Jewish Bible (Tanakh)Bible. O.T. changed to BibleBible. O.T. Apocrypha changed to
ApocryphaBible.N.T. changed to stand-alone New
TestamentRequires parallel series of subjects :
… in the New Testament… $xNew Testament teachingetc.
19
Other theological issuesB.C./A.D. or B.C.E./C.E.?Personal subjects: Jesus, Mary, Muhammad,
saints, etc.
20
Political issues: Israel/Palestine
LC policy: Israel = 1948/1967 bordersPre-1948 = PalestinePost-1948 area of British mandate (Israel + West
Bank) = PalestinePalestine not to be used for post-Oslo Palestinian
Authority“680 __ |i Here are entered works on the region
on the eastern coast of the Mediterranean Sea that in ancient times was called the Land of Canaan, later the kingdoms of Israel and Judah, and in modern times comprises the entire state of Israel, as well as the various disputed territories. Works on a specific jurisdiction or territory within this region are entered under its respective name”21
Existing Israeli solutions Follow LCIsrael for all periods and all borders (just as
Persia = Iran)Israel (State) vs. IsraelIsrael and Eretz Israel (NLI choice)
Problems distinguishing between periods and geographic boundariesAntiquitiesGeography, geology, etc.Overlapping periods (20th century)
23
Other geo-political issuesWest Bank or “Judea and Samaria”Jerusalem (Israel)-- Israel -- JerusalemNablus or Shechem (and other multiple
place name issues)
24
Historical subjectsIsrael – History -- War of Independence,
1948-1949not: Israel-Arab War, 1948-1949
Six Day War, 1967not: Israel-Arab War, 1967
Yom Kippur War, 1973not: Israel-Arab War, 1973
Operation Peace for Galilee, 1982-1985not: Lebanon -- History -- Israeli intervention, 1982-1985
25
Uniquely Israeli topicsImmigrant absorptionPre-army yeshivotYeshivot hesderBagrut examinations (Matriculation
examinations) Aliyah/Yeridah (much more than Emigration
and immigration), also:Aliyah, 1st (1882-1903)Aliyah, 2nd (1904-1914)Aliyah, 3rd (1919-1923)Aliyah, 4th (1924-1929)Aliyah, 5th (1929-1939)Aliyah Bet (1933-1948)
26
Extension/expansion of LCSH terms
Judaism – [Moroccan, Tunisian, etc.] rite[subject] (Jewish law)Chronological subdivisions under Hasidism,
Cabala, Jewish philosophy, etc.More groups of Hasidim
27
28
Alexander HasidimAmshinov HasidimBelz HasidimBiala HasidimBobov HasidimBoyan HasidimBratslav HasidimBuhusi HasidimChernobyl HasidimChortkov HasidimGrodzhisk HasidimGur HasidimHusiatyn HasidimIzbica HasidimKaliv HasidimKarlin HasidimKomarno HasidimKozhnitz HasidimKretshniff HasidimKuzmir HasidimLelov HasidimModzitz HasidimMogielnica Hasidim
Munkacs HasidimNadvorna HasidimPiaseczno HasidimPremishlan HasidimPrshischa HasidimRachmastrivka HasidimRadomsk HasidimRuzhin HasidimSadigura HasidimSatmar HasidimSighet HasidimSkulen HasidimSkvira HasidimSlonim HasidimSochaczew HasidimSpinka HasidimStefanesti HasidimToldot Aharon Hasidim (LCSH = Guardian-of-the-Faithful Hasidim)Tolna HasidimTosh HasidimVizhnitz HasidimZanz HasidimZidichov HasidimZvolin Hasidim
Red = LCSHBlack = NLI
Problem of subjects which require Jewish + general subjectJewish … educationJewish scientists + Physicists, etc.
29
Standard 667 (internal) notesChanged to standard Israeli formCross references added by NLIHeading created by NLIHeading changed by NLI
30
Use of Genre/Form headingsPrimarily in conversion from Dewey literature
numbers655_7 with $2localExample:
Polish dramaPolish essaysPolish fictionPolish lettersPolish poetryPolish prose literaturePolish wit and humor
Also: Audio-visual, Israeli themes in literature, Jewish themes in literature
31
Ongoing cleanupCreate initial files of valid main terms ($a)
and subfields (vxyz) from LC LCSH fileAdd terms & subfields as validatedRun test against all 630/650/651 2nd
indicator 0/4 (existing records and new records being added)
Report errors & automatically fix manyGroup of programs and data files
32
33
LCSH options 1 - Retest LCSH records 2 - Run nnl01_fix_lcsh on lcsh_retest.dat creating rep.dat 3 - Edit 082_to_lcsh.dat 4 - Edit lcsh_subfielda.dat 5 - Edit 082_to_lcsh_subdivisions.dat 6 - Run nnl01_lcsh_subfield_clash_retest 9 - Run nnl01_match_082_to_lcsh 11- Edit nnl01_fix_lcsh.dat 12- Edit vxyz_subfields.dat 15- Edit nnl01_lcsh_test.dat 16- Edit israel_eretz_israel.dat 17- Edit nnl01_lcsh_test_ok.dat 19- Run nnl01_drop_lcsh on safety creating rep.dat 22- Run nnl01_reselect_no_lcsh 23- Drop app.dat records from nnl01_no_lcsh.dat 30- Retrieve lcsh.toload from ram1 and process to app.dat 35- Make ram1:lcsh.650 from scratch file safety 40- match nnl01_no_lcsh.dat to nnl6xx_bib_fields.dat 41- match nnl01_no_lcsh.dat to nnl6xx_bib_fields2.dat 50- retest 650 0/650 4 subfield a against NNL15 (for lcsh_fix_a routine) 51- Edit lcsh_fix_a.dat 52- Run lcsh_fix_a 99- Pretest datafile ram2 before adding to 082_to_lcsh.dat
34
Correct a to b$$aBible.$$pApocrypha. = $$aApocrypha.$$aBible.$$pN.T.$$ = $$aNew Testament$$$$aBible.$$pN.T.$$p = $$aNew Testament.$$p$$aBible.$$pO.T.$$ = $$aBible$$$$aBible.$$pO.T.$$p = $$aBible.$$p$$aYehuda weShomeron$$ = $$aWest Bank$$$$xAddresses, essays, lectures = ~$$zYehuda weShomeron = $$zWest Bank
correct $$aIsrael/$$aEretz Israel based on string in subfieldE $$xAntiquities…E $$y18E $$y190…E $$yTalmudicE $$yWorld WarI $$y195I $$y196I $$y197…
35
Example – test of subfields:000385128 Unknown subfield: $$v Bibligraphy at: 000385128 650 4 L $$aJewish periodicals$$xIndexes$$vBibligraphy.000273103 Unknown subfield: $$v Bibligrpahy at: 000273103 650 0 L $$aCommunism$$xPeriodicals$$vBibligrpahy.002808570 Unknown subfield: $$v Bibliographie at: 002808570 650 0 L $$aJuifs$$xHistoire$$vBibliographie$$vCatalogs.002617234 Unknown subfield: $$v Biografias at: 002617234 650 4 L $$aEconomistas chilenos$$vBiografias.002916818 Unknown subfield: $$v Biyografi at: 002916818 650 4 L $$aIslam$$vBiyografi.002997635 Unknown subfield: $$v Biyografi at: 002997635 651 4 L $$aTurkey$$vBiyografi.002204402 Unknown subfield: $$v Bubliography at: 002204402 650 0 L $$aGeology$$zSoviet Union$$vBubliography.002858437 Unknown subfield: $$v Chrestomathies and readers at: 002858437 650 4 L $$aRussian language$$vChrestomathies and readers.002975569 Unknown subfield: $$v Chrestomathies and readers at: 002975569 650 4 L $$aGreek language$$vChrestomathies and readers.
Ongoing & future plansSubjects are being sent to Worldcat as 650_4Continued mass loading from conversion projectsConversion of subject headings from NLI special
collections (manuscripts, music, Scholem, others) to LCSH
Migrate RAMBI to LCSHBegin work on Hebrew language search to LCSH
headingsUltimately provide single search interface (Hebrew
and English) to all NLI bibliographic dataIntroduction of LC classification for open shelf
collections
36