-
Townley, H.M.; Gee, R.G.: Thesaurus-making : grow your own wordstock (1980)
0.15
0.14965992 = product of:
0.29931983 = sum of:
0.038525887 = weight(_text_:und in 548) [ClassicSimilarity], result of:
0.038525887 = score(doc=548,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.29405463 = fieldWeight in 548, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.09375 = fieldNorm(doc=548)
0.26079395 = weight(_text_:headings in 548) [ClassicSimilarity], result of:
0.26079395 = score(doc=548,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.9098252 = fieldWeight in 548, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.09375 = fieldNorm(doc=548)
0.5 = coord(2/4)
- LCSH
- Subject headings
- Subject
- Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Gilchrist, A.: ¬The thesaurus in retrieval (1971)
0.14
0.13646781 = product of:
0.27293563 = sum of:
0.055607334 = weight(_text_:und in 5593) [ClassicSimilarity], result of:
0.055607334 = score(doc=5593,freq=6.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.42443132 = fieldWeight in 5593, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.078125 = fieldNorm(doc=5593)
0.2173283 = weight(_text_:headings in 5593) [ClassicSimilarity], result of:
0.2173283 = score(doc=5593,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.7581877 = fieldWeight in 5593, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.078125 = fieldNorm(doc=5593)
0.5 = coord(2/4)
- Classification
- AN 94200 Allgemeines / Buch- und Bibliothekswesen, Informationswissenschaft / Informationswissenschaft / Grundlagen, Theorie / Thesauruskunde
- LCSH
- Subject headings
- RVK
- AN 94200 Allgemeines / Buch- und Bibliothekswesen, Informationswissenschaft / Informationswissenschaft / Grundlagen, Theorie / Thesauruskunde
- Subject
- Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Aitchison, J.; Gilchrist, A.: Thesaurus construction : a practical manual (1987)
0.12
0.1247166 = product of:
0.2494332 = sum of:
0.03210491 = weight(_text_:und in 6938) [ClassicSimilarity], result of:
0.03210491 = score(doc=6938,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.24504554 = fieldWeight in 6938, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.078125 = fieldNorm(doc=6938)
0.2173283 = weight(_text_:headings in 6938) [ClassicSimilarity], result of:
0.2173283 = score(doc=6938,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.7581877 = fieldWeight in 6938, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.078125 = fieldNorm(doc=6938)
0.5 = coord(2/4)
- LCSH
- Subject headings
- Subject
- Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Aitchison, J.; Gilchrist, A.; Bawden, D.: Thesaurus construction and use : a practical manual (1997)
0.10
0.09977328 = product of:
0.19954656 = sum of:
0.025683926 = weight(_text_:und in 254) [ClassicSimilarity], result of:
0.025683926 = score(doc=254,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.19603643 = fieldWeight in 254, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=254)
0.17386264 = weight(_text_:headings in 254) [ClassicSimilarity], result of:
0.17386264 = score(doc=254,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.60655016 = fieldWeight in 254, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0625 = fieldNorm(doc=254)
0.5 = coord(2/4)
- LCSH
- Subject headings / Terminology
- Subject
- Subject headings / Terminology
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Aitchison, J.; Gilchrist, A.; Bawden, D.: Thesaurus construction and use : a practical manual (2000)
0.10
0.09977328 = product of:
0.19954656 = sum of:
0.025683926 = weight(_text_:und in 1130) [ClassicSimilarity], result of:
0.025683926 = score(doc=1130,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.19603643 = fieldWeight in 1130, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=1130)
0.17386264 = weight(_text_:headings in 1130) [ClassicSimilarity], result of:
0.17386264 = score(doc=1130,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.60655016 = fieldWeight in 1130, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0625 = fieldNorm(doc=1130)
0.5 = coord(2/4)
- LCSH
- Subject headings / Terminology
- Subject
- Subject headings / Terminology
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Bellamy, L.M.; Bickham, L.: Thesaurus development for subject cataloging (1989)
0.09
0.089482985 = product of:
0.17896597 = sum of:
0.019262943 = weight(_text_:und in 2261) [ClassicSimilarity], result of:
0.019262943 = score(doc=2261,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.14702731 = fieldWeight in 2261, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=2261)
0.15970303 = weight(_text_:headings in 2261) [ClassicSimilarity], result of:
0.15970303 = score(doc=2261,freq=6.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.5571519 = fieldWeight in 2261, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=2261)
0.5 = coord(2/4)
- Abstract
- The biomedical book collection in the Genetech Library and Information Services was first inventoried and cataloged in 1983 when it totaled about 2000 titles. Cataloging records were retrieved from the OCLC system and used as a basis for cataloging. A year of cataloging produced a list of 1900 subject terms. More than one term describing the same concept often appears on the list, and no hierarchical structure related the terms to one another. As the collection grew, the subject catalog became increasingly inconsistent. To bring consistency to subject cataloging, a thesaurus of biomedical terms was constructed using the list of subject headings as a basis. This thesaurus follows the broad categories of the National Library of Medicine's Medical Subject Headings and, with some exceptions, the Guidelines for the Establishment and Development of Monolingual Thesauri. It has enabled the cataloger in providing greater in-depth subject analysis of materials added to the collection and in consistently assigning subject headings to cataloging record.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Shiri, A.: Powering search : the role of thesauri in new information environments (2012)
0.07
0.07482996 = product of:
0.14965992 = sum of:
0.019262943 = weight(_text_:und in 2322) [ClassicSimilarity], result of:
0.019262943 = score(doc=2322,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.14702731 = fieldWeight in 2322, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=2322)
0.13039698 = weight(_text_:headings in 2322) [ClassicSimilarity], result of:
0.13039698 = score(doc=2322,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.4549126 = fieldWeight in 2322, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=2322)
0.5 = coord(2/4)
- LCSH
- Subject headings
- Subject
- Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Gillman, P.: Transferring text (1993)
0.07
0.07431169 = product of:
0.14862338 = sum of:
0.025683926 = weight(_text_:und in 6245) [ClassicSimilarity], result of:
0.025683926 = score(doc=6245,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.19603643 = fieldWeight in 6245, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=6245)
0.122939445 = weight(_text_:headings in 6245) [ClassicSimilarity], result of:
0.122939445 = score(doc=6245,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.4288957 = fieldWeight in 6245, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0625 = fieldNorm(doc=6245)
0.5 = coord(2/4)
- Abstract
- Describes a consultancy project for the development of a health care thesaurus involving the movement of text between different application programs. The thesaurus was built from existing text within the organisation originating from 3 sources: natural language registry file headings; descriptions from an internal business directory and a controlled vocabulary. The software used was WordPerfect and Cardbox
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Rada, R.: Connecting and evaluating thesauri : issues and cases (1987)
0.07
0.06502272 = product of:
0.13004544 = sum of:
0.022473436 = weight(_text_:und in 822) [ClassicSimilarity], result of:
0.022473436 = score(doc=822,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.17153187 = fieldWeight in 822, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0546875 = fieldNorm(doc=822)
0.10757201 = weight(_text_:headings in 822) [ClassicSimilarity], result of:
0.10757201 = score(doc=822,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.37528375 = fieldWeight in 822, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0546875 = fieldNorm(doc=822)
0.5 = coord(2/4)
- Abstract
- Connecting and evaluating thesauri is an important task for the systematic development of better information retrieval systems. Connecting thesauri includes not only determining when terms in different thesauri are the same but also determining what kinds of relationships can be transferred from one thesaurus to another. This paper first presents issues in connecting and evaluating thesauri. Various experiments in connecting a particular thesaurus, the Medical Subject Headings, with other medical thesauri are described. In these experiments, similar terms in two thesauri are recognized and then differences in two thesauri are exploited to create more powerful thesauri. Part of the evaluation requires the thesaurus to support automatic indexing and retrieving of documents
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Rolland-Thomas, P.: Thesaural codes : an appraisal of their use in the Library of Congress Subject Headings (1993)
0.06
0.059655324 = product of:
0.11931065 = sum of:
0.012841963 = weight(_text_:und in 674) [ClassicSimilarity], result of:
0.012841963 = score(doc=674,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.098018214 = fieldWeight in 674, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=674)
0.106468685 = weight(_text_:headings in 674) [ClassicSimilarity], result of:
0.106468685 = score(doc=674,freq=6.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.3714346 = fieldWeight in 674, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.03125 = fieldNorm(doc=674)
0.5 = coord(2/4)
- Abstract
- LCSH is known as such since 1975. It always has created headings to serve the LC collections instead of a theoretical basis. It started to replace cross reference codes by thesaural codes in 1986, in a mechanical fashion. It was in no way transformed into a thesaurus. Its encyclopedic coverage, its pre-coordinate concepts make it substantially distinct, considering that thesauri usually map a restricted field of knowledge and use uniterms. The questions raised are whether the new symbols comply with thesaurus standards and if they are true to one or to several models. Explanations and definitions from other lists of subject headings and thesauri, literature in the field of classification and subject indexing will provide some answers. For instance, see refers from a subject heading not used to another or others used. Exceptionally it will lead from a specific term to a more general one. Some equate a see reference with the equivalence relationship. Such relationships are pointed by USE in LCSH. See also references are made from the broader subject to narrower parts of it and also between associated subjects. They suggest lateral or vertical connexions as well as reciprocal relationships. They serve a coordination purpose for some, lay down a methodical search itinerary for others. Since their inception in the 1950's thesauri have been devised for indexing and retrieving information in the fields of science and technology. Eventually they attended to a number of social sciences and humanities. Research derived from thesauri was voluminous. Numerous guidelines are designed. They did not discriminate between the "hard" sciences and the social sciences. RT relationships are widely but diversely used in numerous controlled vocabularies. LCSH's aim is to achieve a list almost free of RT and SA references. It thus restricts relationships to BT/NT, USE and UF. This raises the question as to whether all fields of knowledge can "fit" in the Procrustean bed of RT/NT, i.e., genus/species relationships. Standard codes were devised. It was soon realized that BT/NT, well suited to the genus/species couple could not signal a whole-part relationship. In LCSH, BT and NT function as reciprocals, the whole-part relationship is taken into account by ISO. It is amply elaborated upon by authors. The part-whole connexion is sometimes studied apart. The decision to replace cross reference codes was an improvement. Relations can now be distinguished through the distinct needs of numerous fields of knowledge are not attended to. Topic inclusion, and topic-subtopic, could provide the missing link where genus/species or whole/part are inadequate. Distinct codes, BT/NT and whole/part, should be provided. Sorting relationships with mechanical means can only lead to confusion.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Kuhr, P.S.: Putting the world back together : mapping multiple vocabularies into a single thesaurus (2003)
0.06
0.055733763 = product of:
0.111467525 = sum of:
0.019262943 = weight(_text_:und in 4813) [ClassicSimilarity], result of:
0.019262943 = score(doc=4813,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.14702731 = fieldWeight in 4813, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=4813)
0.09220458 = weight(_text_:headings in 4813) [ClassicSimilarity], result of:
0.09220458 = score(doc=4813,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.32167178 = fieldWeight in 4813, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=4813)
0.5 = coord(2/4)
- Abstract
- This paper describes an ongoing project in which the subject headings contained in twelve controlled vocabularies covering multiple disciplines from the humanities to the sciences and including law and education among others are being collapsed into a single vocabulary and reference structure. The design of the database, algorithms created to programmatically link like-concepts, and daily maintenance are detailed. The problems and pitfalls of dealing with multiple vocabularies are noted, as well as the difficulties in relying purely an computer generated algorithms. The application of this megathesaurus to bibliographic records and methodology of retrieval is explained.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Landry, P.: Multilingual subject access : the linking approach of MACS (2004)
0.06
0.055733763 = product of:
0.111467525 = sum of:
0.019262943 = weight(_text_:und in 9) [ClassicSimilarity], result of:
0.019262943 = score(doc=9,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.14702731 = fieldWeight in 9, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=9)
0.09220458 = weight(_text_:headings in 9) [ClassicSimilarity], result of:
0.09220458 = score(doc=9,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.32167178 = fieldWeight in 9, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=9)
0.5 = coord(2/4)
- Abstract
- The MACS (Multilingual access to subjects) project is one of the many projects that are currently exploring solutions to multilingual subject access to online catalogs. Its strategy is to develop a Web based link and search interface through which equivalents between three Subject Heading Languages: SWD/RSWK (Schlagwortnormdatei/Regeln für den Schlagwortkatalog) for German, RAMEAU (Repertoire d'Autorite-Matière Encyclopedique et Alphabetique Unifie) for French and LCSH (Library of Congress Subject Headings) for English can be created and maintained, and by which users can access online databases in the language of their choice. Factors that have lead to this approach will be examined and the MACS linking strategy will be explained. The trend to using mapping or linking strategies between different controlled vocabularies to create multilingual access challenges the traditional view of the multilingual thesaurus.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Cazan, C.: Medizinische Ontologien : das Ende des MeSH (2006)
0.05
0.05475997 = product of:
0.10951994 = sum of:
0.048050225 = weight(_text_:und in 1132) [ClassicSimilarity], result of:
0.048050225 = score(doc=1132,freq=28.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.36675057 = fieldWeight in 1132, product of:
5.2915025 = tf(freq=28.0), with freq of:
28.0 = termFreq=28.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=1132)
0.061469723 = weight(_text_:headings in 1132) [ClassicSimilarity], result of:
0.061469723 = score(doc=1132,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.21444786 = fieldWeight in 1132, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.03125 = fieldNorm(doc=1132)
0.5 = coord(2/4)
- Abstract
- Die Komplexizität medizinischer Fragestellungen und des medizinischen Informationsmanagements war seit den Anfängen der Informatik immer ein besonders wichtiges Thema. Trotz des Scheiterns der Künstlichen Intelligenz in den 80er Jahren des vorigen Jahrhunderts haben deren Kernideen Früchte getragen. Durch kongruente Entwicklung einer Reihe anderer Wissenschaftsdisziplinen und der exponentiellen Entwicklung im Bereich Computerhardware konnten die gestellten, hohen Anforderungen bei der medizinischen Informationssuche doch noch erfüllt werden. Die programmatische Forderung von Tim Berners-Lee betreffend "Semantic Web" im Jahr 2000 hat dem Thema Ontologien für maschinenlesbare Repositorien in Allgemein- und Fachsprache breitere Aufmerksamkeit gewonnen. Da in der Medizin (PubMed) mit dem von NLM schon vor 20 Jahren entwickelten Unified Medical Language System (UMLS) eine funktionierende Ontologie in Form eines semantischen Netzes in Betrieb ist, ist es auch für Medizinbibliothekare und Medizindokumentare hoch an der Zeit, sich damit zu beschäftigen. Ontologien können im Wesen, trotz der informatisch vernebelnden Terminologie, als Werkzeuge der Klassifikation verstanden werden. Hier sind von seiten der Bibliotheks- und Dokumentationswissenschaft wesentliche Beiträge möglich. Der vorliegende Bericht bietet einen Einstieg in das Thema, erklärt wesentliche Elemente des UMLS und schließt mit einer kommentierten Anmerkungs- und Literaturliste für die weitere Beschäftigung mit Ontologien.
- Content
- Dieser Aufsatz ist kein Abgesang auf MeSH (= Medical Subject Headings in Medline/PubMed), wie man/frau vielleicht vermuten könnte. Vielmehr wird - ohne informatiklastiges Fachchinesisch - an Hand des von der National Library of Medicine entwickelten Unified Medical Language System erklärt, worin die Anforderungen an Ontologien bestehen, die im Zusammenhang mit dem Semantic Web allerorten eingefordert und herbeigewünscht werden. Eine Lektüre für Einsteigerinnen, die zum Vertiefen der gewonnenen Begriffssicherheit an Hand der weiterführenden Literaturhinweise anregt. Da das UMLS hier vor allem als Beispiel verwendet wird, werden auch Bibliothekarlnnen, Dokumentarlnnen und Informationsspezialistinnen anderer Fachbereiche den Aufsatz mit Gewinn lesen - und erfahren, dass unser Fachwissen aus der Sacherschließung und der Verwendung und Mitgestaltung von Normdateien und Thesauri bei der Entwicklung von Ontologien gefragt ist! (Eveline Pipp, Universitätsbibliothek Innsbruck). - Die elektronische Version dieses Artikels ist verfügbar unter: http://www.egms.de/en/journals/mbi/2006-6/mbi000049.shtml.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Tavakolizadeh-Ravari, M.: Analysis of the long term dynamics in thesaurus developments and its consequences (2017)
0.05
0.05103979 = product of:
0.10207958 = sum of:
0.040609855 = weight(_text_:und in 4081) [ClassicSimilarity], result of:
0.040609855 = score(doc=4081,freq=20.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.3099608 = fieldWeight in 4081, product of:
4.472136 = tf(freq=20.0), with freq of:
20.0 = termFreq=20.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=4081)
0.061469723 = weight(_text_:headings in 4081) [ClassicSimilarity], result of:
0.061469723 = score(doc=4081,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.21444786 = fieldWeight in 4081, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.03125 = fieldNorm(doc=4081)
0.5 = coord(2/4)
- Abstract
- Die Arbeit analysiert die dynamische Entwicklung und den Gebrauch von Thesaurusbegriffen. Zusätzlich konzentriert sie sich auf die Faktoren, die die Zahl von Indexbegriffen pro Dokument oder Zeitschrift beeinflussen. Als Untersuchungsobjekt dienten der MeSH und die entsprechende Datenbank "MEDLINE". Die wichtigsten Konsequenzen sind: 1. Der MeSH-Thesaurus hat sich durch drei unterschiedliche Phasen jeweils logarithmisch entwickelt. Solch einen Thesaurus sollte folgenden Gleichung folgen: "T = 3.076,6 Ln (d) - 22.695 + 0,0039d" (T = Begriffe, Ln = natürlicher Logarithmus und d = Dokumente). Um solch einen Thesaurus zu konstruieren, muss man demnach etwa 1.600 Dokumente von unterschiedlichen Themen des Bereiches des Thesaurus haben. Die dynamische Entwicklung von Thesauri wie MeSH erfordert die Einführung eines neuen Begriffs pro Indexierung von 256 neuen Dokumenten. 2. Die Verteilung der Thesaurusbegriffe erbrachte drei Kategorien: starke, normale und selten verwendete Headings. Die letzte Gruppe ist in einer Testphase, während in der ersten und zweiten Kategorie die neu hinzukommenden Deskriptoren zu einem Thesauruswachstum führen. 3. Es gibt ein logarithmisches Verhältnis zwischen der Zahl von Index-Begriffen pro Aufsatz und dessen Seitenzahl für die Artikeln zwischen einer und einundzwanzig Seiten. 4. Zeitschriftenaufsätze, die in MEDLINE mit Abstracts erscheinen erhalten fast zwei Deskriptoren mehr. 5. Die Findablity der nicht-englisch sprachigen Dokumente in MEDLINE ist geringer als die englische Dokumente. 6. Aufsätze der Zeitschriften mit einem Impact Factor 0 bis fünfzehn erhalten nicht mehr Indexbegriffe als die der anderen von MEDINE erfassten Zeitschriften. 7. In einem Indexierungssystem haben unterschiedliche Zeitschriften mehr oder weniger Gewicht in ihrem Findability. Die Verteilung der Indexbegriffe pro Seite hat gezeigt, dass es bei MEDLINE drei Kategorien der Publikationen gibt. Außerdem gibt es wenige stark bevorzugten Zeitschriften."
- Footnote
- Dissertation, Humboldt-Universität zu Berlin - Institut für Bibliotheks- und Informationswissenschaft.
- Imprint
- Berlin : Humboldt-Universität zu Berlin / Institut für Bibliotheks- und Informationswissenschaft
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Hedden, H.: ¬The accidental taxonomist (2012)
0.05
0.04988664 = product of:
0.09977328 = sum of:
0.012841963 = weight(_text_:und in 3915) [ClassicSimilarity], result of:
0.012841963 = score(doc=3915,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.098018214 = fieldWeight in 3915, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=3915)
0.08693132 = weight(_text_:headings in 3915) [ClassicSimilarity], result of:
0.08693132 = score(doc=3915,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.30327508 = fieldWeight in 3915, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.03125 = fieldNorm(doc=3915)
0.5 = coord(2/4)
- LCSH
- Subject headings
- Subject
- Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
¬The thesaurus: review, renaissance and revision (2004)
0.05
0.048266858 = product of:
0.096533716 = sum of:
0.016682202 = weight(_text_:und in 4243) [ClassicSimilarity], result of:
0.016682202 = score(doc=4243,freq=6.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.12732941 = fieldWeight in 4243, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0234375 = fieldNorm(doc=4243)
0.079851516 = weight(_text_:headings in 4243) [ClassicSimilarity], result of:
0.079851516 = score(doc=4243,freq=6.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.27857596 = fieldWeight in 4243, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0234375 = fieldNorm(doc=4243)
0.5 = coord(2/4)
- Footnote
- The didactic parts of the book are a collection of exercises, readings and resources constituting a "Teach yourself " chapter written by Alan Thomas, ending with the warning that "New challenges include how to devise multi-functional and usersensitive vocabularies, corporate taxonomies and ontologies, and how to apply the transformative technology to them." This is absolutely right, and there is a need for some good writing that would tackle these issues. Another chapter, by James Shearer, skilfully manages to compress a practical exercise in building a thesaurus into some twenty A5 size pages. The third chapter in this set, by Marianne Lykke Nielsen, contains extensive reviews of key issues and selected readings under eight headings from the concept of the thesaurus, through the various construction stages and ending with automatic construction techniques. . . . This is a useful and approachable book. It is a pity that the index is such a poor advertisement for vocabulary control and usefulness."
- LCSH
- Subject headings
- RSWK
- Informations- und Dokumentationswissenschaft / Information Retrieval / Inhaltserschließung / Thesaurus (BVB)
- Subject
- Informations- und Dokumentationswissenschaft / Information Retrieval / Inhaltserschließung / Thesaurus (BVB)
Subject headings
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Compatibility and integration of order systems : Research Seminar Proceedings of the TIP/ISKO Meeting, Warsaw, 13-15 September 1995 (1996)
0.05
0.046444803 = product of:
0.09288961 = sum of:
0.016052455 = weight(_text_:und in 6118) [ClassicSimilarity], result of:
0.016052455 = score(doc=6118,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.12252277 = fieldWeight in 6118, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0390625 = fieldNorm(doc=6118)
0.07683715 = weight(_text_:headings in 6118) [ClassicSimilarity], result of:
0.07683715 = score(doc=6118,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.26805982 = fieldWeight in 6118, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0390625 = fieldNorm(doc=6118)
0.5 = coord(2/4)
- Content
- Enthält die Beiträge: SCHMITZ-ESSER, W.: Language of general communication and concept compatibility; RIESTHUIS, G.: Theory of compatibility of information languages; DAHLBERG, I.: The compatibility guidelines - a re-evaluation; SOERGEL, D.: Data structure and software support for integrated thesauri; MURASZKIEWICZ, M., H. RYBINSKI u. W. STRUK: Software problems of merging multilingual thesauri; CHMIELEWSKA-GORCZYCA, E.: Compatibility of indexing tools in multidatabase environment; NEGRINI, G.: Towards structural compatibility between concept systems; SCIBOR, E.: Some remarks on the establishment of concordances between a universal classification system and an interdisciplinary thesaurus; HOPPE, S.: The UMLS - a model for knowledge integration in a subject field; DEXTRE-CLARKE, S.: Integrating thesauri in the agricultural sciences; ROULIN, C.: Bringing multilingual thesauri together: a feasibility study; ZIMMERMANN, H.: Conception and application possibilities of classification concordances in an OPAC environment; SOSINSKA-KALATA, B.: The Universal Decimal Classification as an international standard for knowledge organization in bibliographic databases and library catalogues; WOZNIAK, J. u. T. GLOWACKA: KABA Subject Authority File - an example of an integrated Polish-French-English subject headings system; BABIK, W.: Terminology as a level for the compatibility of indexing languages - some remarks; STANCIKOVA, P.: International integrated database systems linked to multilingual thesauri covering the field of environment and agriculture; SAMEK, T.: Indexing languages integration and the EUROVOC Thesaurus in the Czech Republic; SIWEK, K.: Compatibility discrepancies between Polish and foreign databases; GLINSKI, W. u. M. MURASZKIEWICZ: An intelligent front-end processor for accessing information systems
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Mu, X.; Lu, K.; Ryu, H.: Explicitly integrating MeSH thesaurus help into health information retrieval systems : an empirical user study (2014)
0.05
0.046444803 = product of:
0.09288961 = sum of:
0.016052455 = weight(_text_:und in 3703) [ClassicSimilarity], result of:
0.016052455 = score(doc=3703,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.12252277 = fieldWeight in 3703, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0390625 = fieldNorm(doc=3703)
0.07683715 = weight(_text_:headings in 3703) [ClassicSimilarity], result of:
0.07683715 = score(doc=3703,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.26805982 = fieldWeight in 3703, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0390625 = fieldNorm(doc=3703)
0.5 = coord(2/4)
- Abstract
- When consumers search for health information, a major obstacle is their unfamiliarity with the medical terminology. Even though medical thesauri such as the Medical Subject Headings (MeSH) and related tools (e.g., the MeSH Browser) were created to help consumers find medical term definitions, the lack of direct and explicit integration of these help tools into a health retrieval system prevented them from effectively achieving their objectives. To explore this issue, we conducted an empirical study with two systems: One is a simple interface system supporting query-based searching; the other is an augmented system with two new components supporting MeSH term searching and MeSH tree browsing. A total of 45 subjects were recruited to participate in the study. The results indicated that the augmented system is more effective than the simple system in terms of improving user-perceived topic familiarity and question-answer performance, even though we did not find users spend more time on the augmented system. The two new MeSH help components played a critical role in participants' health information retrieval and were found to allow them to develop new search strategies. The findings of the study enhanced our understanding of consumers' search behaviors and shed light on the design of future health information retrieval systems.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Broughton, V.: Essential thesaurus construction (2006)
0.04
0.038103186 = product of:
0.07620637 = sum of:
0.032740712 = weight(_text_:und in 3924) [ClassicSimilarity], result of:
0.032740712 = score(doc=3924,freq=52.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.2498984 = fieldWeight in 3924, product of:
7.2111025 = tf(freq=52.0), with freq of:
52.0 = termFreq=52.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.015625 = fieldNorm(doc=3924)
0.04346566 = weight(_text_:headings in 3924) [ClassicSimilarity], result of:
0.04346566 = score(doc=3924,freq=4.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.15163754 = fieldWeight in 3924, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.015625 = fieldNorm(doc=3924)
0.5 = coord(2/4)
- Footnote
- Rez. in: Mitt. VÖB 60(2007) H.1, S.98-101 (O. Oberhauser): "Die Autorin von Essential thesaurus construction (and essential taxonomy construction, so der implizite Untertitel, vgl. S. 1) ist durch ihre Lehrtätigkeit an der bekannten School of Library, Archive and Information Studies des University College London und durch ihre bisherigen Publikationen auf den Gebieten (Facetten-)Klassifikation und Thesaurus fachlich einschlägig ausgewiesen. Nach Essential classification liegt nun ihr Thesaurus-Lehrbuch vor, mit rund 200 Seiten Text und knapp 100 Seiten Anhang ein handliches Werk, das seine Genese zum Grossteil dem Lehrbetrieb verdankt, wie auch dem kurzen Einleitungskapitel zu entnehmen ist. Das Buch ist der Schule von Jean Aitchison et al. verpflichtet und wendet sich an "the indexer" im weitesten Sinn, d.h. an alle Personen, die ein strukturiertes, kontrolliertes Fachvokabular für die Zwecke der sachlichen Erschliessung und Suche erstellen wollen bzw. müssen. Es möchte dieser Zielgruppe das nötige methodische Rüstzeug für eine solche Aufgabe vermitteln, was einschliesslich der Einleitung und der Schlussbemerkungen in zwanzig Kapiteln geschieht - eine ansprechende Strukturierung, die ein wohldosiertes Durcharbeiten möglich macht. Zu letzterem tragen auch die von der Autorin immer wieder gestellten Übungsaufgaben bei (Lösungen jeweils am Kapitelende). Zu Beginn der Darstellung wird der "information retrieval thesaurus" von dem (zumindest im angelsächsischen Raum) weit öfter mit dem Thesaurusbegriff assoziierten "reference thesaurus" abgegrenzt, einem nach begrifflicher Ähnlichkeit angeordneten Synonymenwörterbuch, das gerne als Mittel zur stilistischen Verbesserung beim Abfassen von (wissenschaftlichen) Arbeiten verwendet wird. Ohne noch ins Detail zu gehen, werden optische Erscheinungsform und Anwendungsgebiete von Thesauren vorgestellt, der Thesaurus als postkoordinierte Indexierungssprache erläutert und seine Nähe zu facettierten Klassifikationssystemen erwähnt. In der Folge stellt Broughton die systematisch organisierten Systeme (Klassifikation/ Taxonomie, Begriffs-/Themendiagramme, Ontologien) den alphabetisch angeordneten, wortbasierten (Schlagwortlisten, thesaurusartige Schlagwortsysteme und Thesauren im eigentlichen Sinn) gegenüber, was dem Leser weitere Einordnungshilfen schafft. Die Anwendungsmöglichkeiten von Thesauren als Mittel der Erschliessung (auch als Quelle für Metadatenangaben bei elektronischen bzw. Web-Dokumenten) und der Recherche (Suchformulierung, Anfrageerweiterung, Browsing und Navigieren) kommen ebenso zur Sprache wie die bei der Verwendung natürlichsprachiger Indexierungssysteme auftretenden Probleme. Mit Beispielen wird ausdrücklich auf die mehr oder weniger starke fachliche Spezialisierung der meisten dieser Vokabularien hingewiesen, wobei auch Informationsquellen über Thesauren (z.B. www.taxonomywarehouse.com) sowie Thesauren für nicht-textuelle Ressourcen kurz angerissen werden.
In den stärker ins Detail gehenden Kapiteln weist Broughton zunächst auf die Bedeutung des systematischen Teils eines Thesaurus neben dem alphabetischen Teil hin und erläutert dann die Elemente des letzteren, wobei neben den gängigen Thesaurusrelationen auch die Option der Ausstattung der Einträge mit Notationen eines Klassifikationssystems erwähnt wird. Die Thesaurusrelationen selbst werden später noch in einem weiteren Kapitel ausführlicher diskutiert, wobei etwa auch die polyhierarchische Beziehung thematisiert wird. Zwei Kapitel zur Vokabularkontrolle führen in Aspekte wie Behandlung von Synonymen, Vermeidung von Mehrdeutigkeit, Wahl der bevorzugten Terme sowie die Formen von Thesauruseinträgen ein (grammatische Form, Schreibweise, Zeichenvorrat, Singular/Plural, Komposita bzw. deren Zerlegung usw.). Insgesamt acht Kapitel - in der Abfolge mit den bisher erwähnten Abschnitten didaktisch geschickt vermischt - stehen unter dem Motto "Building a thesaurus". Kurz zusammengefasst, geht es dabei um folgende Tätigkeiten und Prozesse: - Sammlung des Vokabulars unter Nutzung entsprechender Quellen; - Termextraktion aus den Titeln von Dokumenten und Probleme hiebei; - Analyse des Vokabulars (Facettenmethode); - Einbau einer internen Struktur (Facetten und Sub-Facetten, Anordnung der Terme); - Erstellung einer hierarchischen Struktur und deren Repräsentation; - Zusammengesetzte Themen bzw. Begriffe (Facettenanordnung: filing order vs. citation order); - Konvertierung der taxonomischen Anordnung in ein alphabetisches Format (Auswahl der Vorzugsbegriffe, Identifizieren hierarchischer Beziehungen, verwandter Begriffe usw.); - Erzeugen der endgültigen Thesaurus-Einträge.
Diese Abschnitte sind verständlich geschrieben und trotz der mitunter gar nicht so einfachen Thematik auch für Einsteiger geeignet. Vorteilhaft ist sicherlich, dass die Autorin die Thesauruserstellung konsequent anhand eines einzelnen thematischen Beispiels demonstriert und dafür das Gebiet "animal welfare" gewählt hat, wohl nicht zuletzt auch deshalb, da die hier auftretenden Facetten und Beziehungen ohne allzu tiefgreifende fachwissenschaftliche Kenntnisse für die meisten Leser nachvollziehbar sind. Das methodische Gerüst der Facettenanalyse wird hier deutlich stärker betont als etwa in der (spärlichen) deutschsprachigen Thesaurusliteratur. Diese Vorgangsweise soll neben der Ordnungsbildung auch dazu verhelfen, die Zahl der Deskriptoren überschaubar zu halten und weniger auf komplexe (präkombinierte) Deskriptoren als auf postkoordinierte Indexierung zu setzen. Dafür wird im übrigen das als Verfeinerung der bekannten Ranganathanschen PMEST-Formel geltende Schema der 13 "fundamental categories" der UK Classification Research Group (CRG) vorgeschlagen bzw. in dem Beispiel verwendet (Thing / Kind / Part / Property; Material / Process / Operation; Patient / Product / By-product / Agent; Space; Time). Als "minor criticism" sei erwähnt, dass Broughton in ihrem Demonstrationsbeispiel als Notation für die erarbeitete Ordnung eine m.E. schwer lesbare Buchstabenfolge verwendet, obwohl sie zugesteht (S. 165), dass ein Zifferncode vielfach als einfacher handhabbar empfunden wird.
In einem abschliessenden Kapitel geht das Buch auf Thesauruspflege und -verwaltung ein, wobei auch das Thema "Thesaurussoftware" angerissen wird - letzteres vielleicht ein wenig zu kurz. Erst hier mag manchem unbefangenen Leser bewusst werden, dass die in den vorhergehenden Kapiteln dargestellte Methodik eigentlich ohne den Einsatz dezidierter Software besprochen wurde, ja vielleicht auch so besprochen werden musste, um ein entsprechendes Verständnis herzustellen. In der nachfolgenden zweiseitigen Conclusio wird erwähnt, dass die britische Norm Structured Vocabularies for Information Retrieval (BS 8723) vor einer Neufassung stehe - was den Rezensenten darauf hinzuweisen gemahnt, dass sich dieses Buch natürlich ausschliesslich auf den anglo-amerikanischen Sprachraum und die dort geltenden Thesaurus-Gepflogenheiten bezieht. Der relativ umfangreiche Anhang beinhaltet ausser Materialie zum erwähnten Demonstrationsbeispiel auch ein nützliches Glossarium sowie ein professionell gefertigtes Sachregister. Literaturhinweise werden - in sparsamer Dosierung - jeweils am Ende der einzelnen Kapitel gegeben, sodass sich die "Bibliography" am Ende des Buches auf einige Normen und zwei Standardwerke beschränken kann. Realistisch betrachtet, darf vermutlich nicht davon ausgegangen werden, dass Leser, die dieses Buch durchgearbeitet haben, sogleich in der Lage sein werden, eigenständig einen Thesaurus zu erstellen. Ein Lehrbuch allein kann weder einen Kurs noch die praktische Erfahrung ersetzen, die für eine solche Tätigkeit vonnöten sind. Ich kann mir aber gut vorstellen, dass die Kenntnis der in diesem Buch vermittelten Inhalte sehr wohl in die Lage versetzt, in einem Team, das einen Thesaurus erstellen soll, kompetent mitzuarbeiten, mit den Konzepten und Fachtermini zurechtzukommen und selbst konstruktive Beiträge zu leisten. Ausserdem erscheint mir das Werk hervorragend als Begleitmaterial zu einer Lehrveranstaltung geeignet - oder auch als Grundlage für die Planung einer solchen. Ein britischer Einführungstext eben, im besten Sinne."
- LCSH
- Subject headings / Terminology
- Subject
- Subject headings / Terminology
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Willis, C.; Losee, R.M.: ¬A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013)
0.04
0.037155844 = product of:
0.07431169 = sum of:
0.012841963 = weight(_text_:und in 2016) [ClassicSimilarity], result of:
0.012841963 = score(doc=2016,freq=2.0), product of:
0.13101609 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.05907216 = queryNorm
0.098018214 = fieldWeight in 2016, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=2016)
0.061469723 = weight(_text_:headings in 2016) [ClassicSimilarity], result of:
0.061469723 = score(doc=2016,freq=2.0), product of:
0.2866418 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.05907216 = queryNorm
0.21444786 = fieldWeight in 2016, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.03125 = fieldNorm(doc=2016)
0.5 = coord(2/4)
- Abstract
- Relationships between terms and features are an essential component of thesauri, ontologies, and a range of controlled vocabularies. In this article, we describe ways to identify important concepts in documents using the relationships in a thesaurus or other vocabulary structures. We introduce a methodology for the analysis and modeling of the indexing process based on a weighted random walk algorithm. The primary goal of this research is the analysis of the contribution of thesaurus structure to the indexing process. The resulting models are evaluated in the context of automatic subject indexing using four collections of documents pre-indexed with 4 different thesauri (AGROVOC [UN Food and Agriculture Organization], high-energy physics taxonomy [HEP], National Agricultural Library Thesaurus [NALT], and medical subject headings [MeSH]). We also introduce a thesaurus-centric matching algorithm intended to improve the quality of candidate concepts. In all cases, the weighted random walk improves automatic indexing performance over matching alone with an increase in average precision (AP) of 9% for HEP, 11% for MeSH, 35% for NALT, and 37% for AGROVOC. The results of the analysis support our hypothesis that subject indexing is in part a browsing process, and that using the vocabulary and its structure in a thesaurus contributes to the indexing process. The amount that the vocabulary structure contributes was found to differ among the 4 thesauri, possibly due to the vocabulary used in the corresponding thesauri and the structural relationships between the terms. Each of the thesauri and the manual indexing associated with it is characterized using the methods developed here.
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus