-
Mandl, T.: Evaluierung im Information Retrieval : die Hildesheimer Antwort auf aktuelle Herausforderungen der globalisierten Informationsgesellschaft (2010)
0.10
0.099214815 = product of:
0.19842963 = sum of:
0.1434502 = weight(_text_:jahren in 11) [ClassicSimilarity], result of:
0.1434502 = score(doc=11,freq=2.0), product of:
0.34421802 = queryWeight, product of:
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.073006414 = queryNorm
0.4167423 = fieldWeight in 11, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.0625 = fieldNorm(doc=11)
0.05497943 = weight(_text_:und in 11) [ClassicSimilarity], result of:
0.05497943 = score(doc=11,freq=6.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.33954507 = fieldWeight in 11, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=11)
0.5 = coord(2/4)
- Abstract
- Die Forschung zur Evaluierung von Information Retrieval Systemen hat in den letzten Jahren neue Richtungen eingeschlagen und interessante Ergebnisse erzielt. Während früher primär die Überlegenheit einzelner Verfahren in heterogenen Anwendungsszenarien im Fokus stand, gerät zunehmend die Validität der Evaluierungsmethodik ins Zentrum der Aufmerksamkeit. Dieser Artikel fasst die aktuelle Forschung zu innovativen Evaluierungsmaßen und zur Zuverlässigkeit des so genannten Cranfield-Paradigmas zusammen.
- Source
- Information - Wissenschaft und Praxis. 61(2010) H.6/7, S.341-348
-
Kluck, M.; Mandl, T.; Womser-Hacker, C.: Cross-Language Evaluation Forum (CLEF) : Europäische Initiative zur Bewertung sprachübergreifender Retrievalverfahren (2002)
0.09
0.090534054 = product of:
0.18106811 = sum of:
0.12551893 = weight(_text_:jahren in 1266) [ClassicSimilarity], result of:
0.12551893 = score(doc=1266,freq=2.0), product of:
0.34421802 = queryWeight, product of:
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.073006414 = queryNorm
0.3646495 = fieldWeight in 1266, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.0546875 = fieldNorm(doc=1266)
0.055549175 = weight(_text_:und in 1266) [ClassicSimilarity], result of:
0.055549175 = score(doc=1266,freq=8.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.34306374 = fieldWeight in 1266, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0546875 = fieldNorm(doc=1266)
0.5 = coord(2/4)
- Abstract
- Seit einigen Jahren hat sich in Europa eine Initiative zur Bewertung von Information Retrieval in mehrsprachigen Kontexten etabliert. Das Cross Language Evaluation forum (CLEF) wird von der EU gefördert und kooperiert mit Evaluierungsprojekten in den USA (TREC) und in Japan (NTCIR). Dieser Artikel stellt das CLEF in den Rahmen der anderen internationalen Initiativen. Neue Entwicklungen sowohl bei den Information Retrieval Systemen als auch bei den Evaluierungsmethoden werden aufgezeit. Die hohe Anzahl von Teilnehmern aus Forschungsinstitutionen und der Industrie beweist die steigende Bedeutung des sprachübergreifenden Retrievals
- Source
- Information - Wissenschaft und Praxis. 53(2002) H.2, S.82-89
-
Voorbij, H.: ¬Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend oderzoek titelwoorden - trefwoorden (1997)
0.08
0.081412874 = product of:
0.3256515 = sum of:
0.3256515 = weight(_text_:headings in 2446) [ClassicSimilarity], result of:
0.3256515 = score(doc=2446,freq=12.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.91925365 = fieldWeight in 2446, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0546875 = fieldNorm(doc=2446)
0.25 = coord(1/4)
- Abstract
- A recent survey at the Royal Library in the Netherlands showed that subject headings are more efficient than title keywords for retrieval purposes. 475 Dutch publications were selected at random and assigned subject headings. The study showed that subject headings provided additional useful information in 56% of titles. Subsequent searching of the library's online catalogue showed that 88% of titles were retrieved via subject headings against 57% through title keywords. Further precision may be achieved with the help of indexing staff, but at considerable cost
- Footnote
- Übers. d. Titels: A good title has no need of subject headings, or does it?: a comparative study of title keywords against subject headings
-
Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003)
0.08
0.07772744 = product of:
0.15545487 = sum of:
0.089656375 = weight(_text_:jahren in 283) [ClassicSimilarity], result of:
0.089656375 = score(doc=283,freq=2.0), product of:
0.34421802 = queryWeight, product of:
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.073006414 = queryNorm
0.26046392 = fieldWeight in 283, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.0390625 = fieldNorm(doc=283)
0.0657985 = weight(_text_:und in 283) [ClassicSimilarity], result of:
0.0657985 = score(doc=283,freq=22.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.4063621 = fieldWeight in 283, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0390625 = fieldNorm(doc=283)
0.5 = coord(2/4)
- Abstract
- Automatische Indexierung verzeichnet schon seit einigen Jahren aufgrund steigender Informationsflut ein wachsendes Interesse. Allerdings gibt es immer noch Vorbehalte gegenüber der intellektuellen Indexierung in Bezug auf Qualität und größerem Aufwand der Systemimplementierung bzw. -pflege. Neuere Entwicklungen aus dem Bereich des Wissensmanagements, wie beispielsweise Verfahren aus der Künstlichen Intelligenz, der Informationsextraktion, dem Text Mining bzw. der automatischen Klassifikation sollen die automatische Indexierung aufwerten und verbessern. Damit soll eine intelligentere und mehr inhaltsbasierte Erschließung geleistet werden. In dieser Masterarbeit wird außerhalb der Darstellung von Grundlagen und Verfahren der automatischen Indexierung sowie neueren Entwicklungen auch Möglichkeiten der Evaluation dargestellt. Die mögliche Anwendung der automatischen Indexierung im DFG-ProjektGemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" bilden den Schwerpunkt der Arbeit. Im Portal steht die bibliothekarische Erschließung von Texten im Vordergrund. In einem umfangreichen Test werden drei deutsche, linguistische Systeme mit statistischen Verfahren kombiniert (die aber teilweise im System bereits integriert ist) und evaluiert, allerdings nur auf der Basis der ausgegebenen Indexate. Abschließend kann festgestellt werden, dass die Ergebnisse und damit die Qualität (bezogen auf die Indexate) von intellektueller und automatischer Indexierung noch signifikant unterschiedlich sind. Die Gründe liegen in noch zu lösenden semantischen Problemen bzw, in der Obereinstimmung mit Worten aus einem Thesaurus, die von einem automatischen Indexierungssystem nicht immer nachvollzogen werden kann. Eine Inhaltsanreicherung mit den Indexaten zum Vorteil beim Retrieval kann, je nach System oder auch über die Einbindung durch einen Thesaurus, erreicht werden.
- Imprint
- Konstanz : Universität / Fachbereich Informatik und Informationswissenschaft
-
Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002)
0.07
0.07441111 = product of:
0.14882222 = sum of:
0.10758765 = weight(_text_:jahren in 1883) [ClassicSimilarity], result of:
0.10758765 = score(doc=1883,freq=2.0), product of:
0.34421802 = queryWeight, product of:
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.073006414 = queryNorm
0.3125567 = fieldWeight in 1883, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.714901 = idf(docFreq=1081, maxDocs=44421)
0.046875 = fieldNorm(doc=1883)
0.041234575 = weight(_text_:und in 1883) [ClassicSimilarity], result of:
0.041234575 = score(doc=1883,freq=6.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.25465882 = fieldWeight in 1883, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=1883)
0.5 = coord(2/4)
- Abstract
- In Anlehnung an die in den neunziger Jahren durchgeführten Erschließungsprojekte MILOS I und MILOS II, die die Eignung eines Verfahrens zur automatischen Indexierung für Bibliothekskataloge zum Thema hatten, wurde eine empirische Untersuchung anhand einer repräsentativen Stichprobe von Titelsätzen aus dem Österreichischen Verbundkatalog durchgeführt. Ziel war die Prüfung und Bewertung der Einsatzmöglichkeit dieses Verfahrens in den Online-Katalogen des Verbundes. Der Realsituation der OPAC-Benutzung gemäß wurde ausschließlich die Auswirkung auf den automatisch generierten Begriffen angereicherten Basic Index ("Alle Felder") untersucht. Dazu wurden 100 Suchanfragen zunächst im ursprünglichen Basic Index und sodann im angereicherten Basic Index in einem OPAC unter Aleph 500 durchgeführt. Die Tests erbrachten einen Zuwachs an relevanten Treffern bei nur leichten Verlusten an Precision, eine Reduktion der Nulltreffer-Ergebnisse sowie Aufschlüsse über die Auswirkung einer vorhandenen verbalen Sacherschließung.
-
Voorbij, H.: Titelwoorden - trefwoorden : een vergelijkend onderzoek (1997)
0.07
0.06647334 = product of:
0.26589337 = sum of:
0.26589337 = weight(_text_:headings in 4175) [ClassicSimilarity], result of:
0.26589337 = score(doc=4175,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.7505675 = fieldWeight in 4175, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.109375 = fieldNorm(doc=4175)
0.25 = coord(1/4)
- Footnote
- Übers. d. Titels: Title words - subject headings: a comparative research
-
Schabas, A.H.: ¬A comparative evaluation of the retrieval effectiveness of titles, Library of Congress Subject Headings and PRECIS strings for computer searching of UK MARC data (1979)
0.06
0.05697715 = product of:
0.2279086 = sum of:
0.2279086 = weight(_text_:headings in 5276) [ClassicSimilarity], result of:
0.2279086 = score(doc=5276,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.64334357 = fieldWeight in 5276, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.09375 = fieldNorm(doc=5276)
0.25 = coord(1/4)
-
Drabenstott, K.M.; Vizine-Goetz, D.: Using subject headings for online retrieval : theory, practice and potential (1994)
0.05
0.04934366 = product of:
0.19737464 = sum of:
0.19737464 = weight(_text_:headings in 386) [ClassicSimilarity], result of:
0.19737464 = score(doc=386,freq=6.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.5571519 = fieldWeight in 386, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=386)
0.25 = coord(1/4)
- Abstract
- Using subject headings for Online Retrieval is an indispensable tool for online system desingners who are developing new systems or refining exicting ones. The book describes subject analysis and subject searching in online catalogs, including the limitations of retrieval, and demonstrates how such limitations can be overcome through system design and programming. The book describes the Library of Congress Subject headings system and system characteristics, shows how information is stored in machine readable files, and offers examples of and recommendations for successful methods. Tables are included to support these recommendations, and diagrams, graphs, and bar charts are used to provide results of data analyses.
-
Byrne, J.R.: Relative effectiveness of titles, abstracts, and subject headings for machine retrieval from the COMPENDEX services (1975)
0.05
0.04700375 = product of:
0.188015 = sum of:
0.188015 = weight(_text_:headings in 1603) [ClassicSimilarity], result of:
0.188015 = score(doc=1603,freq=4.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.5307314 = fieldWeight in 1603, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0546875 = fieldNorm(doc=1603)
0.25 = coord(1/4)
- Abstract
- We have investigated the relative merits of searching on titles, subject headings, abstracts, free-language terms, and combinations of these elements. The COMPENDEX data base was used for this study since it combined all of these data elements of interest. In general, the results obtained from the experiments indicate that, as expected, titles alone are not satisfactory for efficient retrieval. The combination of titles and abstracts came the closest to 100% retrieval, with searching of abstracts alone doing almost as well. Indexer input, although necessary for 100% retrieval in almost all cases, was found to be relatively unimportant
-
Brown, M.E.: By any other name : accounting for failure in the naming of subject categories (1995)
0.03
0.03323667 = product of:
0.13294668 = sum of:
0.13294668 = weight(_text_:headings in 5666) [ClassicSimilarity], result of:
0.13294668 = score(doc=5666,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.37528375 = fieldWeight in 5666, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0546875 = fieldNorm(doc=5666)
0.25 = coord(1/4)
- Abstract
- Research shows that 65-80% of subject search terms fail to match the appropriate subject heading and one third to one half of subject searches result in no references being retrieved. Examines the subject search terms geberated by 82 school and college students in Princeton, NJ, evaluated the match between the named terms and the expected subject headings, proposes an explanation for match failures in relation to 3 invariant properties common to all search terms: concreteness, complexity, and syndeticity. Suggests that match failure is a consequence of developmental naming patterns and that these patterns can be overcome through the use of metacognitive naming skills
-
Schultz Jr., W.N.; Braddy, L.: ¬A librarian-centered study of perceptions of subject terms and controlled vocabulary (2017)
0.03
0.03323667 = product of:
0.13294668 = sum of:
0.13294668 = weight(_text_:headings in 156) [ClassicSimilarity], result of:
0.13294668 = score(doc=156,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.37528375 = fieldWeight in 156, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0546875 = fieldNorm(doc=156)
0.25 = coord(1/4)
- Abstract
- Controlled vocabulary and subject headings in OPAC records have proven to be useful in improving search results. The authors used a survey to gather information about librarian opinions and professional use of controlled vocabulary. Data from a range of backgrounds and expertise were examined, including academic and public libraries, and technical services as well as public services professionals. Responses overall demonstrated positive opinions of the value of controlled vocabulary, including in reference interactions as well as during bibliographic instruction sessions. Results are also examined based upon factors such as age and type of librarian.
-
Tibbo, H.R.: ¬The epic struggle : subject retrieval from large bibliographic databases (1994)
0.03
0.028488575 = product of:
0.1139543 = sum of:
0.1139543 = weight(_text_:headings in 2247) [ClassicSimilarity], result of:
0.1139543 = score(doc=2247,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.32167178 = fieldWeight in 2247, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=2247)
0.25 = coord(1/4)
- Abstract
- Discusses a retrieval study that focused on collection level archival records in the OCLC OLUC, made accessible through the EPIC online search system. Data were also collected from the local OPAC at North Carolina University at Chapel Hill (UNC-CH) in which UNC-CH produced OCLC records are loaded. The chief objective was to explore the retrieval environments in which a random sample of USMARC AMC records produced at UNC-CH were found: specifically to obtain a picture of the density of these databases in regard to each subject heading applied and, more generally, for each records. Key questions were: how many records would be retrieved for each subject heading attached to each of the records; and what was the nature of these subject headings vis a vis the numer of hits associated with them. Results show that large retrieval sets are a potential problem with national bibliographic utilities and that the local and national retrieval environments can vary greatly. The need for specifity in indexing is emphasized
-
McJunkin, M.C.: Precision and recall in title keyword searching (1995)
0.03
0.028488575 = product of:
0.1139543 = sum of:
0.1139543 = weight(_text_:headings in 3419) [ClassicSimilarity], result of:
0.1139543 = score(doc=3419,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.32167178 = fieldWeight in 3419, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=3419)
0.25 = coord(1/4)
- Abstract
- Investigates the extent to which title keywords convey subject content and compares the relative effectiveness of searching title keywords using 2 search strategies to examine whether adjacency operators in title keyword searches are effective in improving recall and precision of online searching. Title keywords from a random sample of titles in the field of economics were searched on FirstSearch, using the WorldCat database, which is equivalent in coverage to the OCLC OLUC, with and without adjacency of the keywords specified. The LCSH of the items retrieved were compared with the sample title subject headings to determine the degree of match or relevance and the values for precision and recall were calculated. Results indicated that, when keywords were discipline specific, adjacency operators improved precision with little degradation of recall. Systems that allow positional operators or rank output by proximity of terms may increase search success
-
Abdou, S.; Savoy, J.: Searching in Medline : query expansion and manual indexing evaluation (2008)
0.03
0.028488575 = product of:
0.1139543 = sum of:
0.1139543 = weight(_text_:headings in 3062) [ClassicSimilarity], result of:
0.1139543 = score(doc=3062,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.32167178 = fieldWeight in 3062, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.046875 = fieldNorm(doc=3062)
0.25 = coord(1/4)
- Abstract
- Based on a relatively large subset representing one third of the Medline collection, this paper evaluates ten different IR models, including recent developments in both probabilistic and language models. We show that the best performing IR models is a probabilistic model developed within the Divergence from Randomness framework [Amati, G., & van Rijsbergen, C.J. (2002) Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM-Transactions on Information Systems 20(4), 357-389], which result in 170% enhancements in mean average precision when compared to the classical tf idf vector-space model. This paper also reports on our impact evaluations on the retrieval effectiveness of manually assigned descriptors (MeSH or Medical Subject Headings), showing that by including these terms retrieval performance can improve from 2.4% to 13.5%, depending on the underling IR model. Finally, we design a new general blind-query expansion approach showing improved retrieval performances compared to those obtained using the Rocchio approach.
-
Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2017)
0.02
0.023740478 = product of:
0.09496191 = sum of:
0.09496191 = weight(_text_:headings in 4868) [ClassicSimilarity], result of:
0.09496191 = score(doc=4868,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.26805982 = fieldWeight in 4868, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0390625 = fieldNorm(doc=4868)
0.25 = coord(1/4)
- Abstract
- Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This paper reports on an analysis of the loss levels that would result if a bibliographic database, namely the Australian Education Index (AEI), were missing the subject descriptors and identifiers assigned by its professional indexers, employing the methodology developed by Gross and Taylor (2005), and later by Gross et al. (2015). The results indicate that AEI users would lose a similar proportion of hits per query to that experienced by library catalog users: on average, 27% of the resources found by a sample of keyword queries on the AEI database would not have been found without the subject indexing, based on the Australian Thesaurus of Education Descriptors (ATED). The paper also discusses the methodological limitations of these studies, pointing out that real-life users might still find some of the resources missed by a particular query through follow-up searches, while additional resources might also be found through iterative searching on the subject vocabulary. The paper goes on to describe a new research design, based on a before - and - after experiment, which addresses some of these limitations. It is argued that this alternative design will provide a more realistic picture of the value that professionally assigned subject indexing and controlled subject vocabularies can add to literature searching of a more scholarly and thorough kind.
-
Hider, P.: ¬The search value added by professional indexing to a bibliographic database (2018)
0.02
0.023740478 = product of:
0.09496191 = sum of:
0.09496191 = weight(_text_:headings in 300) [ClassicSimilarity], result of:
0.09496191 = score(doc=300,freq=2.0), product of:
0.35425642 = queryWeight, product of:
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.073006414 = queryNorm
0.26805982 = fieldWeight in 300, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.8524013 = idf(docFreq=942, maxDocs=44421)
0.0390625 = fieldNorm(doc=300)
0.25 = coord(1/4)
- Abstract
- Gross et al. (2015) have demonstrated that about a quarter of hits would typically be lost to keyword searchers if contemporary academic library catalogs dropped their controlled subject headings. This article reports on an investigation of the search value that subject descriptors and identifiers assigned by professional indexers add to a bibliographic database, namely the Australian Education Index (AEI). First, a similar methodology to that developed by Gross et al. (2015) was applied, with keyword searches representing a range of educational topics run on the AEI database with and without its subject indexing. The results indicated that AEI users would also lose, on average, about a quarter of hits per query. Second, an alternative research design was applied in which an experienced literature searcher was asked to find resources on a set of educational topics on an AEI database stripped of its subject indexing and then asked to search for additional resources on the same topics after the subject indexing had been reinserted. In this study, the proportion of additional resources that would have been lost had it not been for the subject indexing was again found to be about a quarter of the total resources found for each topic, on average.
-
Mandl, T.: Neue Entwicklungen bei den Evaluierungsinitiativen im Information Retrieval (2006)
0.02
0.020995615 = product of:
0.08398246 = sum of:
0.08398246 = weight(_text_:und in 975) [ClassicSimilarity], result of:
0.08398246 = score(doc=975,freq=14.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.51866364 = fieldWeight in 975, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=975)
0.25 = coord(1/4)
- Abstract
- Im Information Retrieval tragen Evaluierungsinitiativen erheblich zur empirisch fundierten Forschung bei. Mit umfangreichen Kollektionen und Aufgaben unterstützen sie die Standardisierung und damit die Systementwicklung. Die wachsenden Anforderungen hinsichtlich der Korpora und Anwendungsszenarien führten zu einer starken Diversifizierung innerhalb der Evaluierungsinitiativen. Dieser Artikel gibt einen Überblick über den aktuellen Stand der wichtigsten Evaluierungsinitiativen und neuen Trends.
- Source
- Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
-
Lohmann, H.: Verbesserung der Literatursuche durch Dokumentanreicherung und automatische Inhaltserschließung : Das Projekt 'KASCADE' an der Universitäts- und Landesbibliothek Düsseldorf (1999)
0.02
0.020617288 = product of:
0.08246915 = sum of:
0.08246915 = weight(_text_:und in 2221) [ClassicSimilarity], result of:
0.08246915 = score(doc=2221,freq=6.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.50931764 = fieldWeight in 2221, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.09375 = fieldNorm(doc=2221)
0.25 = coord(1/4)
- Imprint
- Köln : Fachhochschule, Fachbereich Bibliotheks- und Informationswesen
-
Mandl, T.: Web- und Multimedia-Dokumente : Neuere Entwicklungen bei der Evaluierung von Information Retrieval Systemen (2003)
0.02
0.019438164 = product of:
0.07775266 = sum of:
0.07775266 = weight(_text_:und in 2734) [ClassicSimilarity], result of:
0.07775266 = score(doc=2734,freq=12.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.48018923 = fieldWeight in 2734, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=2734)
0.25 = coord(1/4)
- Abstract
- Die Menge an Daten im Internet steigt weiter rapide an. Damit wächst auch der Bedarf an qualitativ hochwertigen Information Retrieval Diensten zur Orientierung und problemorientierten Suche. Die Entscheidung für die Benutzung oder Beschaffung von Information Retrieval Software erfordert aussagekräftige Evaluierungsergebnisse. Dieser Beitrag stellt neuere Entwicklungen bei der Evaluierung von Information Retrieval Systemen vor und zeigt den Trend zu Spezialisierung und Diversifizierung von Evaluierungsstudien, die den Realitätsgrad derErgebnisse erhöhen. DerSchwerpunkt liegt auf dem Retrieval von Fachtexten, Internet-Seiten und Multimedia-Objekten.
- Source
- Information - Wissenschaft und Praxis. 54(2003) H.4, S.203-210
-
Kluck, M.; Winter, M.: Topic-Entwicklung und Relevanzbewertung bei GIRT : ein Werkstattbericht (2006)
0.02
0.019438164 = product of:
0.07775266 = sum of:
0.07775266 = weight(_text_:und in 967) [ClassicSimilarity], result of:
0.07775266 = score(doc=967,freq=12.0), product of:
0.16192086 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.073006414 = queryNorm
0.48018923 = fieldWeight in 967, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.0625 = fieldNorm(doc=967)
0.25 = coord(1/4)
- Abstract
- Der Zusammenhang zwischen Topic-Entwicklung und Relevanzbewertung wird anhand einiger Fallbeispiele aus der CLEF-Evaluierungskampagne 2005 diskutiert. Im fachspezifischen Retrievaltest für multilinguale Systeme wurden die Topics am Dokumentenbestand von GIRT entwickelt. Die Zusammenhänge von Topic-Formulierung und Interpretationsspielräumen bei der Relevanzbewertung werden untersucht.
- Source
- Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker