-
Haas, S.W.; Losee, R.M.: Looking in text windows : their size and composition (1994)
0.04
0.0356988 = product of:
0.1427952 = sum of:
0.1427952 = weight(_text_:having in 139) [ClassicSimilarity], result of:
0.1427952 = score(doc=139,freq=2.0), product of:
0.36014074 = queryWeight, product of:
5.981156 = idf(docFreq=304, maxDocs=44421)
0.060212567 = queryNorm
0.39649835 = fieldWeight in 139, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.981156 = idf(docFreq=304, maxDocs=44421)
0.046875 = fieldNorm(doc=139)
0.25 = coord(1/4)
- Abstract
- A text window is a group of words appearing in contiguous positions in text used to exploit a variety of lexical, syntactics, and semantic relationships without having to analyze the text explicitely for their structure. This supports the previously suggested idea that natural grouping of words are best treated as a unit of size 7 to 11 words, that is, plus or minus 3 to 5 words. The text retrieval experiments varying the size of windows, both with full text and with stopwords removed, support these size ranges. The characteristcs of windows that best match terms in queries are examined in detail, revealing intersting differences between those for queries with good results and those for queries with poorer results. Queries with good results tend to contain morte content word phrase and few terms with high frequency of use in the database. Information retrieval systems may benefit from expanding thesaurus-style relationships or incorporating statistical dependencies for terms within these windows
-
Losee, R.M.: Decisions in thesaurus construction and use (2007)
0.00
0.004908705 = product of:
0.01963482 = sum of:
0.01963482 = weight(_text_:und in 1924) [ClassicSimilarity], result of:
0.01963482 = score(doc=1924,freq=2.0), product of:
0.1335454 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.060212567 = queryNorm
0.14702731 = fieldWeight in 1924, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.046875 = fieldNorm(doc=1924)
0.25 = coord(1/4)
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus
-
Willis, C.; Losee, R.M.: ¬A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013)
0.00
0.0032724703 = product of:
0.013089881 = sum of:
0.013089881 = weight(_text_:und in 2016) [ClassicSimilarity], result of:
0.013089881 = score(doc=2016,freq=2.0), product of:
0.1335454 = queryWeight, product of:
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.060212567 = queryNorm
0.098018214 = fieldWeight in 2016, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
2.217899 = idf(docFreq=13141, maxDocs=44421)
0.03125 = fieldNorm(doc=2016)
0.25 = coord(1/4)
- Theme
- Konzeption und Anwendung des Prinzips Thesaurus