Search (4 results, page 1 of 1)

Schneider, J.W.; Borlund, P.: ¬A bibliometric-based semiautomatic approach to identification of candidate thesaurus terms : parsing and filtering of noun phrases from citation contexts (2005) 0.02

0.021500306 = product of:
  0.043000612 = sum of:
    0.0052644373 = weight(_text_:e in 156) [ClassicSimilarity], result of:
      0.0052644373 = score(doc=156,freq=2.0), product of:
        0.047356583 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03294669 = queryNorm
        0.1111659 = fieldWeight in 156, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.0546875 = fieldNorm(doc=156)
    0.02732059 = weight(_text_:u in 156) [ClassicSimilarity], result of:
      0.02732059 = score(doc=156,freq=2.0), product of:
        0.107882105 = queryWeight, product of:
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.03294669 = queryNorm
        0.25324488 = fieldWeight in 156, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2744443 = idf(docFreq=4547, maxDocs=44218)
          0.0546875 = fieldNorm(doc=156)
    0.010415585 = product of:
      0.031246753 = sum of:
        0.031246753 = weight(_text_:22 in 156) [ClassicSimilarity], result of:
          0.031246753 = score(doc=156,freq=2.0), product of:
            0.1153737 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03294669 = queryNorm
            0.2708308 = fieldWeight in 156, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=156)
      0.33333334 = coord(1/3)
  0.5 = coord(3/6)

Date: 8. 3.2007 19:55:22
Language: e
Source: Context: nature, impact and role. 5th International Conference an Conceptions of Library and Information Sciences, CoLIS 2005 Glasgow, UK, June 2005. Ed. by F. Crestani u. I. Ruthven

Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.01

0.008960012 = product of:
  0.026880037 = sum of:
    0.0090247495 = weight(_text_:e in 4483) [ClassicSimilarity], result of:
      0.0090247495 = score(doc=4483,freq=2.0), product of:
        0.047356583 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03294669 = queryNorm
        0.19057012 = fieldWeight in 4483, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.09375 = fieldNorm(doc=4483)
    0.017855287 = product of:
      0.05356586 = sum of:
        0.05356586 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
          0.05356586 = score(doc=4483,freq=2.0), product of:
            0.1153737 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03294669 = queryNorm
            0.46428138 = fieldWeight in 4483, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=4483)
      0.33333334 = coord(1/3)
  0.33333334 = coord(2/6)

Date: 15. 3.2000 10:22:37
Language: e

Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.00

0.0037558496 = product of:
  0.011267548 = sum of:
    0.0037603125 = weight(_text_:e in 5226) [ClassicSimilarity], result of:
      0.0037603125 = score(doc=5226,freq=2.0), product of:
        0.047356583 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03294669 = queryNorm
        0.07940422 = fieldWeight in 5226, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5226)
    0.007507236 = product of:
      0.022521708 = sum of:
        0.022521708 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
          0.022521708 = score(doc=5226,freq=2.0), product of:
            0.11589616 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03294669 = queryNorm
            0.19432661 = fieldWeight in 5226, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5226)
      0.33333334 = coord(1/3)
  0.33333334 = coord(2/6)

Abstract: Tseng constructs a word co-occurrence based thesaurus by means of the automatic analysis of Chinese text. Words are identified by a longest dictionary match supplemented by a key word extraction algorithm that merges back nearby tokens and accepts shorter strings of characters if they occur more often than the longest string. Single character auxiliary words are a major source of error but this can be greatly reduced with the use of a 70-character 2680 word stop list. Extracted terms with their associate document weights are sorted by decreasing frequency and the top of this list is associated using a Dice coefficient modified to account for longer documents on the weights of term pairs. Co-occurrence is not in the document as a whole but in paragraph or sentence size sections in order to reduce computation time. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles and judges asked to review the top 50 terms associated with each of 30 single word query terms. They determined 69% to be relevant.
Language: e

Rahmstorf, G.: Information retrieval using conceptual representations of phrases (1994) 0.00

7.520625E-4 = product of:
  0.0045123748 = sum of:
    0.0045123748 = weight(_text_:e in 7862) [ClassicSimilarity], result of:
      0.0045123748 = score(doc=7862,freq=2.0), product of:
        0.047356583 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03294669 = queryNorm
        0.09528506 = fieldWeight in 7862, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.046875 = fieldNorm(doc=7862)
  0.16666667 = coord(1/6)

Language: e

Search (4 results, page 1 of 1)

Authors

Years