Search (1975 results, page 4 of 99)

Sauperl, A.: Four views of a novel : characteristics of novels as described by publishers, librarians, literary theorists, and readers (2013) 0.03
```
0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 2952) [ClassicSimilarity], result of:
      0.11045824 = score(doc=2952,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 2952, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=2952)
  0.25 = coord(1/4)
```
Abstract

Publishers present novels with summaries, librarians provide subject headings, classification numbers and annotations, literary theorists write reviews. Readers share opinions and tags in social networks. These groups share interest in the same novel and possibly in the same library catalogs. I analyze the descriptions of novels written by these four groups to propose the enhancement of library catalogs. Results show that the story, information about the author, genre, personal experience with reading the novel, and an evaluation (awards, personal evaluation) are consistently presented by all four groups and should become standard elements for the subject description of fiction.
Junger, U.: Can indexing be automated? : the example of the Deutsche Nationalbibliothek (2014) 0.03
```
0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 2969) [ClassicSimilarity], result of:
      0.11045824 = score(doc=2969,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 2969, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=2969)
  0.25 = coord(1/4)
```
Abstract

The German Integrated Authority File (Gemeinsame Normdatei, GND), provides a broad controlled vocabulary for indexing documents on all subjects. Traditionally used for intellectual subject cataloging primarily for books, the Deutsche Nationalbibliothek (DNB, German National Library) has been working on developing and implementing procedures for automated assignment of subject headings for online publications. This project, its results, and problems are outlined in this article.
Gross, T.; Taylor, A.G.; Joudrey, D.N.: Still a lot to lose : the role of controlled vocabulary in keyword searching (2015) 0.03
```
0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 3007) [ClassicSimilarity], result of:
      0.11045824 = score(doc=3007,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 3007, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=3007)
  0.25 = coord(1/4)
```
Abstract

In their 2005 study, Gross and Taylor found that more than a third of records retrieved by keyword searches would be lost without subject headings. A review of the literature since then shows that numerous studies, in various disciplines, have found that a quarter to a third of records returned in a keyword search would be lost without controlled vocabulary. Other writers, though, have continued to suggest that controlled vocabulary be discontinued. Addressing criticisms of the Gross/Taylor study, this study replicates the search process in the same online catalog, but after the addition of automated enriched metadata such as tables of contents and summaries. The proportion of results that would be lost remains high.
Bardenheier, P.; Wilkinson, E.H.; Dale, H.: Ki te Tika te Hanga, Ka Pakari te Kete : with the right structure we weave a strong basket (2015) 0.03
```
0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 3176) [ClassicSimilarity], result of:
      0.11045824 = score(doc=3176,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 3176, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=3176)
  0.25 = coord(1/4)
```
Abstract

Two Indigenous frameworks were successfully applied to a significant collection of junior Maori language material at the University of Auckland, New Zealand. Nga Kete Korero Framework is used to assign levels to readers designed for structured literacy development and formed the basis of a new classification system. Nga Upoko Tukutuku is an Indigenous subject headings schema developed to empower and enrich records using Maori knowledge systems and terminology. Library staff worked collaboratively with Maori language literacy experts to transform access to the material. The Indigenous frameworks, their application for reclassification and record enhancement, and associated benefits of the project are described.
Schultz Jr., W.N.; Braddy, L.: ¬A librarian-centered study of perceptions of subject terms and controlled vocabulary (2017) 0.03
```
0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 156) [ClassicSimilarity], result of:
      0.11045824 = score(doc=156,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 156, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=156)
  0.25 = coord(1/4)
```
Abstract

Controlled vocabulary and subject headings in OPAC records have proven to be useful in improving search results. The authors used a survey to gather information about librarian opinions and professional use of controlled vocabulary. Data from a range of backgrounds and expertise were examined, including academic and public libraries, and technical services as well as public services professionals. Responses overall demonstrated positive opinions of the value of controlled vocabulary, including in reference interactions as well as during bibliographic instruction sessions. Results are also examined based upon factors such as age and type of librarian.

Biswas, P.: Rooted in the past : use of "East Indians" in Library of Congress Subject Headings (2018) 0.03

0.02761456 = product of:
  0.11045824 = sum of:
    0.11045824 = weight(_text_:headings in 167) [ClassicSimilarity], result of:
      0.11045824 = score(doc=167,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.37528375 = fieldWeight in 167, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0546875 = fieldNorm(doc=167)
  0.25 = coord(1/4)

Leong, J.H.-t.: ¬The convergence of metadata and bibliographic control? : trends and patterns in addressing the current issues and challenges of providing subject access (2010) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 342) [ClassicSimilarity], result of:
      0.09467849 = score(doc=342,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 342, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=342)
  0.25 = coord(1/4)
```
Abstract

Resource description and discovery have been facilitated generally in two approaches, namely bibliographic control and metadata, which now may converge in response to current issues and challenges of providing subject access. Four categories of major issues and challenges in the provision of subject access to digital and non-digital resources are: 1) the advancement of new knowledge; 2) the fall of controlled vocabulary and the rise of natural language; 3) digitizing and networking the traditional catalogue systems; and 4) electronic publishing and the Internet. The creation of new knowledge and the debate about the use of natural language and controlled vocabulary as subject headings becomes even more intense in the digital and online environment. The third and fourth categories are conceived after the emergence of networked environments and the rapid expansion of electronic resources. Recognizing the convergence of metadata schemas and bibliographic control calls for adapting to the new environment by developing tools that exploit the strengths of both.
Gemberling, T.: Thema and FRBR's third group (2010) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 158) [ClassicSimilarity], result of:
      0.09467849 = score(doc=158,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 158, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=158)
  0.25 = coord(1/4)
```
Abstract

The treatment of subjects by Functional Requirements for Bibliographic Records (FRBR) has attracted less attention than some of its other aspects, but there seems to be a general consensus that it needs work. While some have proposed elaborating its subject categories-concepts, objects, events, and places-to increase their semantic complexity, a working group of the International Federation of Library Associations and Institutions (IFLA) has recently made a promising proposal that essentially bypasses those categories in favor of one entity, thema. This article gives an overview of the proposal and discusses its relevance to another difficult problem, ambiguities in the establishment of headings for buildings.Use of dynamic links from subject-based finding aids to records for electronic resources in the OPAC is suggested as one method for by-passing the OPAC search interface, thus making the library's electronic resources more accessible. This method simplifies maintenance of links to electronic resources and aids instruction by providing a single, consistent access point to them. Results of a usage study from before and after this project was completed show a consistent, often dramatic increase in use of the library's electronic resources.
Buckland, M.K.: Obsolescence in subject description (2012) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 1299) [ClassicSimilarity], result of:
      0.09467849 = score(doc=1299,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 1299, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=1299)
  0.25 = coord(1/4)
```
Abstract

Purpose - The paper aims to explain the character and causes of obsolescence in assigned subject descriptors. Design/methodology/approach - The paper takes the form of a conceptual analysis with examples and reference to existing literature. Findings - Subject description comes in two forms: assigning the name or code of a subject to a document and assigning a document to a named subject category. Each method associates a document with the name of a subject. This naming activity is the site of tensions between the procedural need of information systems for stable records and the inherent multiplicity and instability of linguistic expressions. As languages change, previously assigned subject descriptions become obsolescent. The issues, tensions, and compromises involved are introduced. Originality/value - Drawing on the work of Robert Fairthorne and others, an explanation of the unavoidable obsolescence of assigned subject headings is presented. The discussion relates to libraries, but the same issues arise in any context in which subject description is expected to remain useful for an extended period of time.
Julien, C.-A.; Tirilly, P.; Leide, J.E.; Guastavino, C.: Constructing a true LCSH tree of a science and engineering collection (2012) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 1512) [ClassicSimilarity], result of:
      0.09467849 = score(doc=1512,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 1512, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=1512)
  0.25 = coord(1/4)
```
Abstract

The Library of Congress Subject Headings (LCSH) is a subject structure used to index large library collections throughout the world. Browsing a collection through LCSH is difficult using current online tools in part because users cannot explore the structure using their existing experience navigating file hierarchies on their hard drives. This is due to inconsistencies in the LCSH structure, which does not adhere to the specific rules defining tree structures. This article proposes a method to adapt the LCSH structure to reflect a real-world collection from the domain of science and engineering. This structure is transformed into a valid tree structure using an automatic process. The analysis of the resulting LCSH tree shows a large and complex structure. The analysis of the distribution of information within the LCSH tree reveals a power law distribution where the vast majority of subjects contain few information items and a few subjects contain the vast majority of the collection.
Hooland, S. van; Verborgh, R.; Wilde, M. De; Hercher, J.; Mannens, E.; Wa, R.Van de: Evaluating the success of vocabulary reconciliation for cultural heritage collections (2013) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 1662) [ClassicSimilarity], result of:
      0.09467849 = score(doc=1662,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 1662, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=1662)
  0.25 = coord(1/4)
```
Abstract

The concept of Linked Data has made its entrance in the cultural heritage sector due to its potential use for the integration of heterogeneous collections and deriving additional value out of existing metadata. However, practitioners and researchers alike need a better understanding of what outcome they can reasonably expect of the reconciliation process between their local metadata and established controlled vocabularies which are already a part of the Linked Data cloud. This paper offers an in-depth analysis of how a locally developed vocabulary can be successfully reconciled with the Library of Congress Subject Headings (LCSH) and the Arts and Architecture Thesaurus (AAT) through the help of a general-purpose tool for interactive data transformation (OpenRefine). Issues negatively affecting the reconciliation process are identified and solutions are proposed in order to derive maximum value from existing metadata and controlled vocabularies in an automated manner.
Tirilly, P.; Julien, C.-A.: Random walks for subject hierarchy simplification (2012) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 1835) [ClassicSimilarity], result of:
      0.09467849 = score(doc=1835,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 1835, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=1835)
  0.25 = coord(1/4)
```
Abstract

Although subject hierarchies are widely used to index document collections, few tools leverage their structure to facilitate collection browsing. This is mostly due to the complexity of such structures that include thousands of nodes. This paper proposes a new approach to simplify subject hierarchies based on the distribution of documents among the nodes. A random walk algorithm simulates the route of a user within the hierarchy, under the assumption that the user is attracted by the most populated nodes. Poorly visited nodes can be identified and eliminated, leaving a structure containing only the nodes that best represent the content of the collection. Experiments on a collection indexed using the Library of Congress Subject Headings (LCSH) show that, as compared to the state-of-the-art simplification method, the random walk-based approach gives access to a larger part of the collection for the same structure size, and offers more flexibility to customize the complexity of thestructure.

Wisser, K.: ¬The errors of our ways : using metadata quality research to understand common error patterns in the application of name headings (2014) 0.02

0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 2574) [ClassicSimilarity], result of:
      0.09467849 = score(doc=2574,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 2574, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=2574)
  0.25 = coord(1/4)

Tuttle, J.: ¬The aphasia of modern subject access (2012) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 2948) [ClassicSimilarity], result of:
      0.09467849 = score(doc=2948,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 2948, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=2948)
  0.25 = coord(1/4)
```
Abstract

Why do catalogers use two systems, one notational like Library of Congress Classification (LCC) and the other terminological like Library of Congress Subject Headings (LCSH), to reach the same goal: subject description and access? This article, divided into two parts, first surveys the library science literature to address the unsatisfying answers given to that question and, secondly, provides a new answer based on the linguistic theory of Roman Jakobson. Jakobson's theory that language is always twofold, the act of selecting words paired with the act of combining words, is proposed as a theory of subject access, with LCSH doing the work of selection and LCC thework of combination.
Rotolo, D.; Leydesdorff, L.: Matching Medline/PubMed data with Web of Science: A routine in R language (2015) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 3224) [ClassicSimilarity], result of:
      0.09467849 = score(doc=3224,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 3224, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=3224)
  0.25 = coord(1/4)
```
Abstract

We present a novel routine, namely medlineR, based on the R language, that allows the user to match data from Medline/PubMed with records indexed in the ISI Web of Science (WoS) database. The matching allows exploiting the rich and controlled vocabulary of medical subject headings (MeSH) of Medline/PubMed with additional fields of WoS. The integration provides data (e.g., citation data, list of cited reference, list of the addresses of authors' host organizations, WoS subject categories) to perform a variety of scientometric analyses. This brief communication describes medlineR, the method on which it relies, and the steps the user should follow to perform the matching across the two databases. To demonstrate the differences from Leydesdorff and Opthof (Journal of the American Society for Information Science and Technology, 64(5), 1076-1080), we conclude this artcle by testing the routine on the MeSH category "Burgada syndrome."
Vukadin, A.: Development of a classification-oriented authority control : the experience of the National and University Library in Zagreb (2015) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 3296) [ClassicSimilarity], result of:
      0.09467849 = score(doc=3296,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 3296, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=3296)
  0.25 = coord(1/4)
```
Abstract

The paper presents experiences and challenges encountered during the planning and creation of the Universal Decimal Classification (UDC) authority database in the National and University Library in Zagreb, Croatia. The project started in 2014 with the objective of facilitating classification data management, improving the indexing consistency at the institutional level and the machine readability of data for eventual sharing and re-use in the Web environment. The paper discusses the advantages and disadvantages of UDC, which is an analytico-synthetic classification scheme tending towards a more faceted structure, in regard to various aspects of authority control. This discussion represents the referential framework for the project. It determines the choice of elements to be included in the authority file, e.g. distinguishing between syntagmatic and paradigmatic combinations of subjects. It also determines the future lines of development, e.g. interlinking with the subject headings authority file in order to provide searching by verbal expressions.
Baga, J.; Hoover, L.; Wolverton, R.E.: Online, practical, and free cataloging resources (2013) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 3603) [ClassicSimilarity], result of:
      0.09467849 = score(doc=3603,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 3603, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=3603)
  0.25 = coord(1/4)
```
Abstract

This comprehensive annotated webliography describes online cataloging resources that are free to use, currently updated, and of high quality. The major aim of this webliography is to provide assistance for catalogers who are new to the profession, unfamiliar with cataloging specific formats, or unable to access costly print and subscription resources. The annotated resources include general websites and webpages, databases, workshop presentations, streaming media, and local documentation. The scope of the webliography is limited to resources reflecting traditional cataloging practices using the Anglo-American Cataloguing Rules, 2nd edition, RDA: Resource Description and Access, and MAchine Readable Cataloging (MARC) standards. Non-MARC metadata schemas like Dublin Core are not covered. Most components of cataloging are represented in this webliography, such as authority control, classification, subject headings, and genre terms. Guidance also is provided for cataloging miscellaneous formats including sound and videorecordings, streaming media, e-books, video games, graphic novels, kits, rare materials, maps, serials, realia, government documents, and music.

Weinheimer, J.: ¬A visual explanation of the areas defined by AACR2, RDA, ISBD, LC NAF, LC Classification, LC Subject Headings, Dewey Classification, MARC21 : plus a quick look at ISO2709, MARCXML and a version of BIBFRAME (2015) 0.02

0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 3882) [ClassicSimilarity], result of:
      0.09467849 = score(doc=3882,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 3882, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=3882)
  0.25 = coord(1/4)

Gil-Leiva, I.: SISA-automatic indexing system for scientific articles : experiments with location heuristics rules versus TF-IDF rules (2017) 0.02
```
0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 4622) [ClassicSimilarity], result of:
      0.09467849 = score(doc=4622,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 4622, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=4622)
  0.25 = coord(1/4)
```
Abstract

Indexing is contextualized and a brief description is provided of some of the most used automatic indexing systems. We describe SISA, a system which uses location heuristics rules, statistical rules like term frequency (TF) or TF-IDF to obtain automatic or semi-automatic indexing, depending on the user's preference. The aim of this research is to ascertain which rules (location heuristics rules or TF-IDF rules) provide the best indexing terms. SISA is used to obtain the automatic indexing of 200 scientific articles on fruit growing written in Portuguese. It uses, on the one hand, location heuristics rules founded on the value of certain parts of the articles for indexing such as titles, abstracts, keywords, headings, first paragraph, conclusions and references and, on the other, TF-IDF rules. The indexing is then evaluated to ascertain retrieval performance through recall, precision and f-measure. Automatic indexing of the articles with location heuristics rules provided the best results with the evaluation measures.

Liu, Y.-H.; Wacholder, N.: Evaluating the impact of MeSH (Medical Subject Headings) terms on different types of searchers (2017) 0.02

0.023669623 = product of:
  0.09467849 = sum of:
    0.09467849 = weight(_text_:headings in 96) [ClassicSimilarity], result of:
      0.09467849 = score(doc=96,freq=2.0), product of:
        0.2943326 = queryWeight, product of:
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.0606571 = queryNorm
        0.32167178 = fieldWeight in 96, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.8524013 = idf(docFreq=942, maxDocs=44421)
          0.046875 = fieldNorm(doc=96)
  0.25 = coord(1/4)

Search (1975 results, page 4 of 99)

Authors

Languages

Types

Themes

Subjects

Classifications