Document (#1811)

Author
Ruge, G.
Title
Experiments on linguistically-based term associations
Source
Information processing and management. 28(1992) no.3, S.317-332
Year
1992
Abstract
Describes the hyperterm system REALIST (REtrieval Aids by LInguistic and STatistics) and describes its semantic component. The semantic component of REALIST generates semantic term relations such synonyms. It takes as input a free text data base and generates as output term pairs that are semantically related with respect to their meanings in the data base. In the 1st step an automatic syntactic analysis provides linguistical knowledge about the terms of the data base. In the 2nd step this knowledge is compared by statistical similarity computation. Various experiments with different similarity measures are described
Theme
Computerlinguistik
Object
REALIST

Similar documents (author)

  1. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 5.92
    5.9235125 = sum of:
      5.9235125 = weight(author_txt:ruge in 4506) [ClassicSimilarity], result of:
        5.9235125 = fieldWeight in 4506, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.625 = fieldNorm(doc=4506)
    
  2. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 5.92
    5.9235125 = sum of:
      5.9235125 = weight(author_txt:ruge in 3535) [ClassicSimilarity], result of:
        5.9235125 = fieldWeight in 3535, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.625 = fieldNorm(doc=3535)
    
  3. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 4.74
    4.73881 = sum of:
      4.73881 = weight(author_txt:ruge in 2310) [ClassicSimilarity], result of:
        4.73881 = fieldWeight in 2310, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.5 = fieldNorm(doc=2310)
    
  4. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 4.74
    4.73881 = sum of:
      4.73881 = weight(author_txt:ruge in 5544) [ClassicSimilarity], result of:
        4.73881 = fieldWeight in 5544, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.5 = fieldNorm(doc=5544)
    
  5. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 4.74
    4.73881 = sum of:
      4.73881 = weight(author_txt:ruge in 4636) [ClassicSimilarity], result of:
        4.73881 = fieldWeight in 4636, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.5 = fieldNorm(doc=4636)
    

Similar documents (content)

  1. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.14
    0.1445758 = sum of:
      0.1445758 = product of:
        0.9035988 = sum of:
          0.09241453 = weight(abstract_txt:statistics in 5544) [ClassicSimilarity], result of:
            0.09241453 = score(doc=5544,freq=1.0), product of:
              0.117957056 = queryWeight, product of:
                1.0437486 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.018031077 = queryNorm
              0.7834591 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.125 = fieldNorm(doc=5544)
          0.117604144 = weight(abstract_txt:aids in 5544) [ClassicSimilarity], result of:
            0.117604144 = score(doc=5544,freq=1.0), product of:
              0.1385199 = queryWeight, product of:
                1.1310714 = boost
                6.792043 = idf(docFreq=131, maxDocs=43254)
                0.018031077 = queryNorm
              0.8490054 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.792043 = idf(docFreq=131, maxDocs=43254)
                0.125 = fieldNorm(doc=5544)
          0.12604663 = weight(abstract_txt:term in 5544) [ClassicSimilarity], result of:
            0.12604663 = score(doc=5544,freq=1.0), product of:
              0.2092305 = queryWeight, product of:
                2.407726 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.018031077 = queryNorm
              0.6024295 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.125 = fieldNorm(doc=5544)
          0.5675335 = weight(abstract_txt:realist in 5544) [ClassicSimilarity], result of:
            0.5675335 = score(doc=5544,freq=1.0), product of:
              0.4983886 = queryWeight, product of:
                3.0341218 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.018031077 = queryNorm
              1.138737 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.125 = fieldNorm(doc=5544)
        0.16 = coord(4/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.14
    0.1412378 = sum of:
      0.1412378 = product of:
        0.5044207 = sum of:
          0.07486223 = weight(abstract_txt:input in 56) [ClassicSimilarity], result of:
            0.07486223 = score(doc=56,freq=3.0), product of:
              0.11281976 = queryWeight, product of:
                1.0207669 = boost
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.018031077 = queryNorm
              0.663556 = fieldWeight in 56, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1296678 = idf(docFreq=255, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.017215 = weight(abstract_txt:knowledge in 56) [ClassicSimilarity], result of:
            0.017215 = score(doc=56,freq=1.0), product of:
              0.076948196 = queryWeight, product of:
                1.1921974 = boost
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.018031077 = queryNorm
              0.22372195 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.042676196 = weight(abstract_txt:data in 56) [ClassicSimilarity], result of:
            0.042676196 = score(doc=56,freq=4.0), product of:
              0.10163922 = queryWeight, product of:
                1.6781286 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.018031077 = queryNorm
              0.41987923 = fieldWeight in 56, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.10447533 = weight(abstract_txt:similarity in 56) [ClassicSimilarity], result of:
            0.10447533 = score(doc=56,freq=2.0), product of:
              0.20320119 = queryWeight, product of:
                1.9373678 = boost
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.018031077 = queryNorm
              0.5141472 = fieldWeight in 56, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.10157984 = weight(abstract_txt:semantic in 56) [ClassicSimilarity], result of:
            0.10157984 = score(doc=56,freq=4.0), product of:
              0.18119346 = queryWeight, product of:
                2.2406077 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.018031077 = queryNorm
              0.56061536 = fieldWeight in 56, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.063023314 = weight(abstract_txt:term in 56) [ClassicSimilarity], result of:
            0.063023314 = score(doc=56,freq=1.0), product of:
              0.2092305 = queryWeight, product of:
                2.407726 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.018031077 = queryNorm
              0.30121475 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.1005888 = weight(abstract_txt:base in 56) [ClassicSimilarity], result of:
            0.1005888 = score(doc=56,freq=1.0), product of:
              0.28575286 = queryWeight, product of:
                2.8137784 = boost
                5.632212 = idf(docFreq=420, maxDocs=43254)
                0.018031077 = queryNorm
              0.35201326 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.632212 = idf(docFreq=420, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Taylor, C.: Navigation via similarity (1997) 0.14
    0.1396018 = sum of:
      0.1396018 = product of:
        0.58167416 = sum of:
          0.052156094 = weight(abstract_txt:takes in 2156) [ClassicSimilarity], result of:
            0.052156094 = score(doc=2156,freq=1.0), product of:
              0.110199705 = queryWeight, product of:
                1.0088444 = boost
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.018031077 = queryNorm
              0.47328705 = fieldWeight in 2156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.058074 = idf(docFreq=274, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
          0.0763638 = weight(abstract_txt:semantically in 2156) [ClassicSimilarity], result of:
            0.0763638 = score(doc=2156,freq=1.0), product of:
              0.14209172 = queryWeight, product of:
                1.1455613 = boost
                6.8790545 = idf(docFreq=120, maxDocs=43254)
                0.018031077 = queryNorm
              0.5374261 = fieldWeight in 2156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8790545 = idf(docFreq=120, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
          0.025925461 = weight(abstract_txt:describes in 2156) [ClassicSimilarity], result of:
            0.025925461 = score(doc=2156,freq=1.0), product of:
              0.08712405 = queryWeight, product of:
                1.2685803 = boost
                3.8088896 = idf(docFreq=2606, maxDocs=43254)
                0.018031077 = queryNorm
              0.2975695 = fieldWeight in 2156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8088896 = idf(docFreq=2606, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
          0.20648749 = weight(abstract_txt:similarity in 2156) [ClassicSimilarity], result of:
            0.20648749 = score(doc=2156,freq=5.0), product of:
              0.20320119 = queryWeight, product of:
                1.9373678 = boost
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.018031077 = queryNorm
              1.0161726 = fieldWeight in 2156, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8169117 = idf(docFreq=349, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
          0.14196216 = weight(abstract_txt:semantic in 2156) [ClassicSimilarity], result of:
            0.14196216 = score(doc=2156,freq=5.0), product of:
              0.18119346 = queryWeight, product of:
                2.2406077 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.018031077 = queryNorm
              0.78348386 = fieldWeight in 2156, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
          0.07877914 = weight(abstract_txt:term in 2156) [ClassicSimilarity], result of:
            0.07877914 = score(doc=2156,freq=1.0), product of:
              0.2092305 = queryWeight, product of:
                2.407726 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.018031077 = queryNorm
              0.37651843 = fieldWeight in 2156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.078125 = fieldNorm(doc=2156)
        0.24 = coord(6/25)
    
  4. Kantardzic, M.: Data mining : concepts, models, methods, and algorithms (2003) 0.14
    0.1361433 = sum of:
      0.1361433 = product of:
        0.42544782 = sum of:
          0.046207264 = weight(abstract_txt:statistics in 4292) [ClassicSimilarity], result of:
            0.046207264 = score(doc=4292,freq=1.0), product of:
              0.117957056 = queryWeight, product of:
                1.0437486 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.018031077 = queryNorm
              0.39172956 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.058802072 = weight(abstract_txt:aids in 4292) [ClassicSimilarity], result of:
            0.058802072 = score(doc=4292,freq=1.0), product of:
              0.1385199 = queryWeight, product of:
                1.1310714 = boost
                6.792043 = idf(docFreq=131, maxDocs=43254)
                0.018031077 = queryNorm
              0.4245027 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.792043 = idf(docFreq=131, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.017215 = weight(abstract_txt:knowledge in 4292) [ClassicSimilarity], result of:
            0.017215 = score(doc=4292,freq=1.0), product of:
              0.076948196 = queryWeight, product of:
                1.1921974 = boost
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.018031077 = queryNorm
              0.22372195 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.020740367 = weight(abstract_txt:describes in 4292) [ClassicSimilarity], result of:
            0.020740367 = score(doc=4292,freq=1.0), product of:
              0.08712405 = queryWeight, product of:
                1.2685803 = boost
                3.8088896 = idf(docFreq=2606, maxDocs=43254)
                0.018031077 = queryNorm
              0.2380556 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8088896 = idf(docFreq=2606, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.08520575 = weight(abstract_txt:computation in 4292) [ClassicSimilarity], result of:
            0.08520575 = score(doc=4292,freq=1.0), product of:
              0.17737661 = queryWeight, product of:
                1.279918 = boost
                7.685861 = idf(docFreq=53, maxDocs=43254)
                0.018031077 = queryNorm
              0.48036632 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.685861 = idf(docFreq=53, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.07693561 = weight(abstract_txt:data in 4292) [ClassicSimilarity], result of:
            0.07693561 = score(doc=4292,freq=13.0), product of:
              0.10163922 = queryWeight, product of:
                1.6781286 = boost
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.018031077 = queryNorm
              0.75694805 = fieldWeight in 4292, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.3590338 = idf(docFreq=4087, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.057318486 = weight(abstract_txt:experiments in 4292) [ClassicSimilarity], result of:
            0.057318486 = score(doc=4292,freq=1.0), product of:
              0.17157614 = queryWeight, product of:
                1.7802353 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.018031077 = queryNorm
              0.33407027 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
          0.063023314 = weight(abstract_txt:term in 4292) [ClassicSimilarity], result of:
            0.063023314 = score(doc=4292,freq=1.0), product of:
              0.2092305 = queryWeight, product of:
                2.407726 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.018031077 = queryNorm
              0.30121475 = fieldWeight in 4292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.0625 = fieldNorm(doc=4292)
        0.32 = coord(8/25)
    
  5. Quillian, M.R.: Word concepts : a theory and simulation of some basic semantic capabilities. (1967) 0.12
    0.124637835 = sum of:
      0.124637835 = product of:
        0.44513512 = sum of:
          0.05472816 = weight(abstract_txt:associations in 879) [ClassicSimilarity], result of:
            0.05472816 = score(doc=879,freq=1.0), product of:
              0.13204572 = queryWeight, product of:
                1.104323 = boost
                6.6314197 = idf(docFreq=154, maxDocs=43254)
                0.018031077 = queryNorm
              0.41446373 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6314197 = idf(docFreq=154, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.08343831 = weight(abstract_txt:meanings in 879) [ClassicSimilarity], result of:
            0.08343831 = score(doc=879,freq=2.0), product of:
              0.13883024 = queryWeight, product of:
                1.1323378 = boost
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.018031077 = queryNorm
              0.6010096 = fieldWeight in 879, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.799648 = idf(docFreq=130, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.060018603 = weight(abstract_txt:pairs in 879) [ClassicSimilarity], result of:
            0.060018603 = score(doc=879,freq=1.0), product of:
              0.1404239 = queryWeight, product of:
                1.1388184 = boost
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.018031077 = queryNorm
              0.4274102 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.838563 = idf(docFreq=125, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.017215 = weight(abstract_txt:knowledge in 879) [ClassicSimilarity], result of:
            0.017215 = score(doc=879,freq=1.0), product of:
              0.076948196 = queryWeight, product of:
                1.1921974 = boost
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.018031077 = queryNorm
              0.22372195 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5795512 = idf(docFreq=3278, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.057318486 = weight(abstract_txt:experiments in 879) [ClassicSimilarity], result of:
            0.057318486 = score(doc=879,freq=1.0), product of:
              0.17157614 = queryWeight, product of:
                1.7802353 = boost
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.018031077 = queryNorm
              0.33407027 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3451242 = idf(docFreq=560, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.071827784 = weight(abstract_txt:semantic in 879) [ClassicSimilarity], result of:
            0.071827784 = score(doc=879,freq=2.0), product of:
              0.18119346 = queryWeight, product of:
                2.2406077 = boost
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.018031077 = queryNorm
              0.3964149 = fieldWeight in 879, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.484923 = idf(docFreq=1325, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
          0.1005888 = weight(abstract_txt:base in 879) [ClassicSimilarity], result of:
            0.1005888 = score(doc=879,freq=1.0), product of:
              0.28575286 = queryWeight, product of:
                2.8137784 = boost
                5.632212 = idf(docFreq=420, maxDocs=43254)
                0.018031077 = queryNorm
              0.35201326 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.632212 = idf(docFreq=420, maxDocs=43254)
                0.0625 = fieldNorm(doc=879)
        0.28 = coord(7/25)