Search (49 results, page 3 of 3)

Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010) 0.00

8.1174844E-4 = product of:
  0.0048704906 = sum of:
    0.0048704906 = product of:
      0.024352452 = sum of:
        0.024352452 = weight(_text_:28 in 5854) [ClassicSimilarity], result of:
          0.024352452 = score(doc=5854,freq=2.0), product of:
            0.12305808 = queryWeight, product of:
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03435205 = queryNorm
            0.19789396 = fieldWeight in 5854, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5854)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 6. 1.1997 18:30:28
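
The score explanation above can be reproduced by hand. The sketch below recomputes the value for doc 5854 from the numbers printed in the tree. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the reported tf and idf values; the function names are illustrative and not part of any search library.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf form; it matches the reported idf(docFreq=3342, maxDocs=44218).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coords):
    """Recompute a single-term score as laid out in the explanation tree."""
    tf = math.sqrt(freq)                    # tf(freq=2.0) = 1.4142135
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm         # "queryWeight" line
    field_weight = tf * idf * field_norm    # "fieldWeight" line
    score = query_weight * field_weight     # "weight(_text_:28 ...)" line
    for matched, total in coords:           # coord(1/5), then coord(1/6)
        score *= matched / total
    return score


# Values copied from the explanation for doc 5854.
score = classic_term_score(
    freq=2.0,
    doc_freq=3342,
    max_docs=44218,
    query_norm=0.03435205,
    field_norm=0.0390625,
    coords=[(1, 5), (1, 6)],
)
print(f"{score:.7e}")
```

Up to floating-point rounding, the result should agree with the reported 8.1174844E-4. The two coord factors are what pull the score so close to zero: only one of five, then one of six, query clauses matched.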

Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.00

7.827461E-4 = product of:
  0.0046964767 = sum of:
    0.0046964767 = product of:
      0.023482382 = sum of:
        0.023482382 = weight(_text_:29 in 5226) [ClassicSimilarity], result of:
          0.023482382 = score(doc=5226,freq=2.0), product of:
            0.12083977 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03435205 = queryNorm
            0.19432661 = fieldWeight in 5226, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5226)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Abstract

Tseng constructs a thesaurus based on word co-occurrence through automatic analysis of Chinese text. Words are identified by longest dictionary match, supplemented by a keyword extraction algorithm that merges nearby tokens back together and accepts shorter character strings if they occur more often than the longest string. Single-character auxiliary words are a major source of error, but this error can be greatly reduced with a stop list of 70 characters and 2,680 words. Extracted terms, with their associated document weights, are sorted by decreasing frequency, and the terms at the top of this list are associated using a Dice coefficient, modified to account for longer documents, applied to the weights of term pairs. To reduce computation time, co-occurrence is measured not over the document as a whole but within paragraph- or sentence-sized sections. A window of 29 characters or 11 words was found to be sufficient. A thesaurus was produced from 25,230 Chinese news articles, and judges were asked to review the top 50 terms associated with each of 30 single-word query terms. They judged 69% to be relevant.
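
To make the association step in the abstract concrete, here is a minimal sketch of term association by co-occurrence within sentence-sized windows. It uses the plain Dice coefficient over binary window counts, not the length-adjusted, weight-based variant the abstract attributes to Tseng, whose exact form is not given here. The function name and toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def dice_associations(windows):
    """Return the plain Dice coefficient for every term pair that co-occurs.

    `windows` is a list of term lists, one per sentence- or paragraph-sized
    section. Dice(a, b) = 2 * n_ab / (n_a + n_b), where n_a is the number of
    windows containing a and n_ab the number containing both a and b.
    """
    term_count = Counter()
    pair_count = Counter()
    for window in windows:
        terms = sorted(set(window))          # count each term once per window
        term_count.update(terms)
        pair_count.update(combinations(terms, 2))
    return {
        (a, b): 2 * n_ab / (term_count[a] + term_count[b])
        for (a, b), n_ab in pair_count.items()
    }


# Toy windows standing in for segmented text (illustrative only).
windows = [
    ["thesaurus", "index", "term"],
    ["thesaurus", "term"],
    ["index", "query"],
]
scores = dice_associations(windows)
for pair, value in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(value, 3))
```

Restricting co-occurrence to small windows keeps the number of candidate pairs per section small, which is the computational saving the abstract describes.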

Mu, X.; Lu, K.; Ryu, H.: Explicitly integrating MeSH thesaurus help into health information retrieval systems : an empirical user study (2014) 0.00

7.827461E-4 = product of:
  0.0046964767 = sum of:
    0.0046964767 = product of:
      0.023482382 = sum of:
        0.023482382 = weight(_text_:29 in 2703) [ClassicSimilarity], result of:
          0.023482382 = score(doc=2703,freq=2.0), product of:
            0.12083977 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03435205 = queryNorm
            0.19432661 = fieldWeight in 2703, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2703)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 25. 1.2016 18:43:29

Dextre Clarke, S.G.; Gilchrist, A.; Will, L.: Revision and extension of thesaurus standards (2004) 0.00

6.493987E-4 = product of:
  0.0038963922 = sum of:
    0.0038963922 = product of:
      0.01948196 = sum of:
        0.01948196 = weight(_text_:28 in 2615) [ClassicSimilarity], result of:
          0.01948196 = score(doc=2615,freq=2.0), product of:
            0.12305808 = queryWeight, product of:
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03435205 = queryNorm
            0.15831517 = fieldWeight in 2615, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03125 = fieldNorm(doc=2615)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 6. 1.1997 18:30:28

Shiri, A.A.; Revie, C.; Chowdhury, G.: Assessing the impact of user interaction with thesaural knowledge structures : a quantitative analysis framework (2003) 0.00

6.493987E-4 = product of:
  0.0038963922 = sum of:
    0.0038963922 = product of:
      0.01948196 = sum of:
        0.01948196 = weight(_text_:28 in 2766) [ClassicSimilarity], result of:
          0.01948196 = score(doc=2766,freq=2.0), product of:
            0.12305808 = queryWeight, product of:
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03435205 = queryNorm
            0.15831517 = fieldWeight in 2766, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03125 = fieldNorm(doc=2766)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 6. 1.1997 18:30:28

Willis, C.; Losee, R.M.: A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013) 0.00

6.493987E-4 = product of:
  0.0038963922 = sum of:
    0.0038963922 = product of:
      0.01948196 = sum of:
        0.01948196 = weight(_text_:28 in 1016) [ClassicSimilarity], result of:
          0.01948196 = score(doc=1016,freq=2.0), product of:
            0.12305808 = queryWeight, product of:
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03435205 = queryNorm
            0.15831517 = fieldWeight in 1016, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03125 = fieldNorm(doc=1016)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 28. 7.2013 14:20:39

Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010) 0.00

6.2619685E-4 = product of:
  0.003757181 = sum of:
    0.003757181 = product of:
      0.018785905 = sum of:
        0.018785905 = weight(_text_:29 in 4639) [ClassicSimilarity], result of:
          0.018785905 = score(doc=4639,freq=2.0), product of:
            0.12083977 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03435205 = queryNorm
            0.15546128 = fieldWeight in 4639, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.03125 = fieldNorm(doc=4639)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 29. 7.2011 14:44:56

Mooers, C.N.: The indexing language of an information retrieval system (1985) 0.00

5.4299337E-4 = product of:
  0.00325796 = sum of:
    0.00325796 = product of:
      0.0162898 = sum of:
        0.0162898 = weight(_text_:22 in 3644) [ClassicSimilarity], result of:
          0.0162898 = score(doc=3644,freq=2.0), product of:
            0.120295025 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03435205 = queryNorm
            0.1354154 = fieldWeight in 3644, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3644)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Footnote: Originally published in: Information retrieval today: papers presented at an Institute conducted by the Library School and the Center for Continuation Study, University of Minnesota, Sept. 19-22, 1962. Ed. by Wesley Simonton. Minneapolis, Minn.: The Center, 1963. pp. 21-36.

Moreira, A.; Alvarenga, L.; Paiva Oliveira, A. de: "Thesaurus" and "Ontology" : a study of the definitions found in the computer and information science literature (2004) 0.00

4.87049E-4 = product of:
  0.002922294 = sum of:
    0.002922294 = product of:
      0.01461147 = sum of:
        0.01461147 = weight(_text_:28 in 3726) [ClassicSimilarity], result of:
          0.01461147 = score(doc=3726,freq=2.0), product of:
            0.12305808 = queryWeight, product of:
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.03435205 = queryNorm
            0.11873637 = fieldWeight in 3726, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5822632 = idf(docFreq=3342, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3726)
      0.2 = coord(1/5)
  0.16666667 = coord(1/6)

Date: 6. 1.1997 18:30:28
