sorting documents by base theme with synthetic classification · udc seminar 2013, the hague...

20
UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method Claudio Gnoli & Alberto Cheti

Upload: vomien

Post on 18-Jul-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

UDC Seminar 2013, The Hague

Sorting documents by base theme with synthetic classification:

the double query method

Claudio Gnoli & Alberto Cheti

Page 2: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Knowledge organization A to Z ?...

Friday

Monday

Sathurday

Sunday

Thursday

Tuesday

Wednesday

Page 3: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

1 Sunday2 Monday3 Tuesday4 Wednesday5 Thursday6 Friday7 Sathurday

Knowledge organization A to Z ?...

A solution:

good old classification :-)

Page 4: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Systematic presentationcan act as an intellectual guideto contents

Knowledge organization A to Z ?...

Page 5: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Knowledge organization A to Z ?...

Original German term:

Wissensordnung

= “ordering of knowledge”

Page 6: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Classification

Often poorly applied in online resources…

Lack of integrationbetween cataloguers’ and OPACmasters’ work

[Bland & Stoffan, 2008; Rozman, 2009; Casson et al. 2011]

Page 7: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Compound subjects

Most real documents are about combinations of concepts, e.g.:

«the corrosion of tinplace by acid fruit products»…

[Foskett 1958]

Synthetic classmarks needed

(subdivisions, auxiliaries, facets, roles, links…)

Page 8: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Citation order matters

1:34 «philosophy – law»

34:1 «law – philosophy»

Page 9: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

The PRECIS-GRIS tradition

(Verbal) subject stringsshould be ordered combinations of terms (concepts)

Law – influence of philosophy – U.K. – dictionaries

Page 10: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Base vs. particular theme

Notions coming from text linguistics

[Beaugrande & Dressler 1981]

«Influence of the abundance of wild ungulates on wolf diet in Northern Apennines»

Wolf – diet – effect of ungulate abundance – N Apennines

Page 11: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Two-step search [GRIS]

Interfaces should allow to:

-- identify a concept (finding the right term, discarding homographs etc.)

-- examine all combinationsof it with other concepts

…starting with those where it is the base theme!

Page 12: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Double query method

Let’s give the userwhat (s)he’s asked for:

(1) all combinations where the search term is the base theme

(2) all combinations wherethe search term is a particular theme

Page 13: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

An application

Page 14: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

(1) Results as base theme

Page 15: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

…either alone or combined…

Page 16: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

(2) Results as a particular theme

Page 17: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Double query method

$queryA =

"SELECT * FROM `literature`

WHERE `classmark` REGEXP '^757*'

ORDER BY classmark";

$queryB =

"SELECT * FROM `literature`

WHERE `classmark` REGEXP ';757*'

ORDER BY classmark";

Page 18: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Position depends on search

Page 19: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Conclusions

-- Principles for combination in verbal indexing(base vs. particular themes)can be extended to classification

-- They help users to locatewhat they are actually searching foramong many possible results

-- They can be applied to search interfacesby any script (e.g. PHP + MySQL)managing a double query

Page 20: Sorting documents by base theme with synthetic classification · UDC Seminar 2013, The Hague Sorting documents by base theme with synthetic classification: the double query method

Thank you!

[email protected]

@scritur