Document (#10431)

Author
Schwarz, C.
Title
THESYS: Thesaurus Syntax System : a fully automatic thesaurus building aid
Source
Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Ed. by H.-J. Hermes and J. Hölzl
Imprint
Frankfurt : Indeks
Year
1988
Pages
pp. 63-70
Series
Studien zur Klassifikation; vol. 18
Abstract
THESYS is based on the natural language processing of free-text databases. It yields statistically evaluated correlations between words of the database. These correlations correspond to traditional thesaurus relations. The person who has to build a thesaurus is thus assisted by the proposals made by THESYS. THESYS is being tested on commercial databases under real-world conditions. It is part of a text processing project at Siemens, called TINA (Text-Inhalts-Analyse). Software from TINA is actually being applied and evaluated by the US Department of Commerce for patent search and indexing (REALIST: REtrieval Aids by Linguistics and STatistics).
Theme
Computational linguistics
Object
THESYS
TINA

Similar documents (author)

  1. Schwarz, C.: Natural language and information retrieval : Kommentierte Literaturliste zu Systemen, Verfahren und Tools (1986) 5.13
    5.125237 = sum of:
      5.125237 = weight(author_txt:schwarz in 408) [ClassicSimilarity], result of:
        5.125237 = fieldWeight in 408, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.200379 = idf(docFreq=32, maxDocs=44218)
          0.625 = fieldNorm(doc=408)
    
  2. Schwarz, C.: Linguistische Hilfsmittel beim Information Retrieval (1984) 5.13
    5.125237 = sum of:
      5.125237 = weight(author_txt:schwarz in 545) [ClassicSimilarity], result of:
        5.125237 = fieldWeight in 545, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.200379 = idf(docFreq=32, maxDocs=44218)
          0.625 = fieldNorm(doc=545)
    
  3. Schwarz, B.: Book House: ein OPAC für die Erschließung und Recherche Schöner Literatur (1991) 5.13
    5.125237 = sum of:
      5.125237 = weight(author_txt:schwarz in 1022) [ClassicSimilarity], result of:
        5.125237 = fieldWeight in 1022, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.200379 = idf(docFreq=32, maxDocs=44218)
          0.625 = fieldNorm(doc=1022)
    
  4. Schwarz, C.: Freitextrecherche: Grenzen und Möglichkeiten (1982) 5.13
    5.125237 = sum of:
      5.125237 = weight(author_txt:schwarz in 1349) [ClassicSimilarity], result of:
        5.125237 = fieldWeight in 1349, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.200379 = idf(docFreq=32, maxDocs=44218)
          0.625 = fieldNorm(doc=1349)
    
  5. Schwarz, R.: Buch und Bahn : Auskunftsdienst per CD-ROM (1995) 5.13
    5.125237 = sum of:
      5.125237 = weight(author_txt:schwarz in 4142) [ClassicSimilarity], result of:
        5.125237 = fieldWeight in 4142, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.200379 = idf(docFreq=32, maxDocs=44218)
          0.625 = fieldNorm(doc=4142)
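
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the tf and idf definitions that the listed numbers are consistent with: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of any library; the input values are copied from the listing.

```python
import math

def classic_tf(freq):
    # Assumed term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Assumed inverse document frequency, inferred from the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the "product of" lines.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values from the author_txt:schwarz entries (freq=1, docFreq=32, maxDocs=44218).
idf = classic_idf(32, 44218)
score = field_weight(freq=1.0, doc_freq=32, max_docs=44218, field_norm=0.625)
print(f"idf={idf:.6f} score={score:.6f}")
```

Because the author query has a single term, the document score equals the field weight. All five entries share the same tf, idf and fieldNorm, which is why they all show the same score of 5.13.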
    

Similar documents (content)

  1. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.58
    0.5760953 = sum of:
      0.5760953 = product of:
        1.2001985 = sum of:
          0.06305439 = weight(abstract_txt:department in 3567) [ClassicSimilarity], result of:
            0.06305439 = score(doc=3567,freq=1.0), product of:
              0.1300101 = queryWeight, product of:
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.020942518 = queryNorm
              0.48499608 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.07484218 = weight(abstract_txt:syntax in 3567) [ClassicSimilarity], result of:
            0.07484218 = score(doc=3567,freq=1.0), product of:
              0.1457464 = queryWeight, product of:
                1.0587913 = boost
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.020942518 = queryNorm
              0.51350963 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.07566835 = weight(abstract_txt:actually in 3567) [ClassicSimilarity], result of:
            0.07566835 = score(doc=3567,freq=1.0), product of:
              0.14681701 = queryWeight, product of:
                1.062673 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.020942518 = queryNorm
              0.51539224 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.0875702 = weight(abstract_txt:patent in 3567) [ClassicSimilarity], result of:
            0.0875702 = score(doc=3567,freq=1.0), product of:
              0.16183451 = queryWeight, product of:
                1.1156989 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.020942518 = queryNorm
              0.54110956 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.08855064 = weight(abstract_txt:commerce in 3567) [ClassicSimilarity], result of:
            0.08855064 = score(doc=3567,freq=1.0), product of:
              0.1630402 = queryWeight, product of:
                1.1198473 = boost
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.020942518 = queryNorm
              0.5431215 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.11431831 = weight(abstract_txt:yields in 3567) [ClassicSimilarity], result of:
            0.11431831 = score(doc=3567,freq=1.0), product of:
              0.19330545 = queryWeight, product of:
                1.2193644 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.020942518 = queryNorm
              0.5913869 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.18353212 = weight(abstract_txt:inhalts in 3567) [ClassicSimilarity], result of:
            0.18353212 = score(doc=3567,freq=1.0), product of:
              0.26503807 = queryWeight, product of:
                1.4277941 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.020942518 = queryNorm
              0.69247454 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.04840205 = weight(abstract_txt:being in 3567) [ClassicSimilarity], result of:
            0.04840205 = score(doc=3567,freq=1.0), product of:
              0.13732597 = queryWeight, product of:
                1.453459 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.020942518 = queryNorm
              0.35246098 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.21192315 = weight(abstract_txt:siemens in 3567) [ClassicSimilarity], result of:
            0.21192315 = score(doc=3567,freq=1.0), product of:
              0.2917108 = queryWeight, product of:
                1.4979168 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.020942518 = queryNorm
              0.72648376 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.06323097 = weight(abstract_txt:processing in 3567) [ClassicSimilarity], result of:
            0.06323097 = score(doc=3567,freq=1.0), product of:
              0.16410813 = queryWeight, product of:
                1.5888815 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.020942518 = queryNorm
              0.38530064 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.09854475 = weight(abstract_txt:evaluated in 3567) [ClassicSimilarity], result of:
            0.09854475 = score(doc=3567,freq=1.0), product of:
              0.22059679 = queryWeight, product of:
                1.8421545 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.020942518 = queryNorm
              0.44671887 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.09056139 = weight(abstract_txt:text in 3567) [ClassicSimilarity], result of:
            0.09056139 = score(doc=3567,freq=3.0), product of:
              0.16549909 = queryWeight, product of:
                1.954204 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020942518 = queryNorm
              0.54720175 = fieldWeight in 3567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
        0.48 = coord(12/25)
    
  2. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.20
    0.19773923 = sum of:
      0.19773923 = product of:
        0.82391346 = sum of:
          0.1042588 = weight(abstract_txt:statistics in 5544) [ClassicSimilarity], result of:
            0.1042588 = score(doc=5544,freq=1.0), product of:
              0.13289094 = queryWeight, product of:
                1.0110186 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.020942518 = queryNorm
              0.78454405 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.123843685 = weight(abstract_txt:linguistics in 5544) [ClassicSimilarity], result of:
            0.123843685 = score(doc=5544,freq=1.0), product of:
              0.14905143 = queryWeight, product of:
                1.0707289 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.020942518 = queryNorm
              0.8308789 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.13341771 = weight(abstract_txt:aids in 5544) [ClassicSimilarity], result of:
            0.13341771 = score(doc=5544,freq=1.0), product of:
              0.15663755 = queryWeight, product of:
                1.0976386 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.020942518 = queryNorm
              0.8517607 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.072468534 = weight(abstract_txt:databases in 5544) [ClassicSimilarity], result of:
            0.072468534 = score(doc=5544,freq=1.0), product of:
              0.13138019 = queryWeight, product of:
                1.4216458 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.020942518 = queryNorm
              0.5515941 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.30626774 = weight(abstract_txt:realist in 5544) [ClassicSimilarity], result of:
            0.30626774 = score(doc=5544,freq=1.0), product of:
              0.27257606 = queryWeight, product of:
                1.4479558 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.020942518 = queryNorm
              1.1236047 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.083657034 = weight(abstract_txt:text in 5544) [ClassicSimilarity], result of:
            0.083657034 = score(doc=5544,freq=1.0), product of:
              0.16549909 = queryWeight, product of:
                1.954204 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020942518 = queryNorm
              0.5054833 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
        0.24 = coord(6/25)
    
  3. Ruge, G.: Experiments on linguistically-based term associations (1992) 0.09
    0.09053538 = sum of:
      0.09053538 = product of:
        0.56584615 = sum of:
          0.0781941 = weight(abstract_txt:statistics in 1810) [ClassicSimilarity], result of:
            0.0781941 = score(doc=1810,freq=1.0), product of:
              0.13289094 = queryWeight, product of:
                1.0110186 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.020942518 = queryNorm
              0.58840805 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.09375 = fieldNorm(doc=1810)
          0.10006328 = weight(abstract_txt:aids in 1810) [ClassicSimilarity], result of:
            0.10006328 = score(doc=1810,freq=1.0), product of:
              0.15663755 = queryWeight, product of:
                1.0976386 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.020942518 = queryNorm
              0.6388205 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.09375 = fieldNorm(doc=1810)
          0.32484597 = weight(abstract_txt:realist in 1810) [ClassicSimilarity], result of:
            0.32484597 = score(doc=1810,freq=2.0), product of:
              0.27257606 = queryWeight, product of:
                1.4479558 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.020942518 = queryNorm
              1.1917627 = fieldWeight in 1810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.09375 = fieldNorm(doc=1810)
          0.06274277 = weight(abstract_txt:text in 1810) [ClassicSimilarity], result of:
            0.06274277 = score(doc=1810,freq=1.0), product of:
              0.16549909 = queryWeight, product of:
                1.954204 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020942518 = queryNorm
              0.37911248 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=1810)
        0.16 = coord(4/25)
    
  4. Kousha, K.; Thelwall, M.: Patent citation analysis with Google (2017) 0.08
    0.08351951 = sum of:
      0.08351951 = product of:
        0.521997 = sum of:
          0.22153704 = weight(abstract_txt:patent in 3317) [ClassicSimilarity], result of:
            0.22153704 = score(doc=3317,freq=10.0), product of:
              0.16183451 = queryWeight, product of:
                1.1156989 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.020942518 = queryNorm
              1.368911 = fieldWeight in 3317, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=3317)
          0.05124299 = weight(abstract_txt:databases in 3317) [ClassicSimilarity], result of:
            0.05124299 = score(doc=3317,freq=2.0), product of:
              0.13138019 = queryWeight, product of:
                1.4216458 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.020942518 = queryNorm
              0.3900359 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.0625 = fieldNorm(doc=3317)
          0.0788358 = weight(abstract_txt:evaluated in 3317) [ClassicSimilarity], result of:
            0.0788358 = score(doc=3317,freq=1.0), product of:
              0.22059679 = queryWeight, product of:
                1.8421545 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.020942518 = queryNorm
              0.3573751 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.0625 = fieldNorm(doc=3317)
          0.17038114 = weight(abstract_txt:correlations in 3317) [ClassicSimilarity], result of:
            0.17038114 = score(doc=3317,freq=1.0), product of:
              0.36874932 = queryWeight, product of:
                2.3817275 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.020942518 = queryNorm
              0.4620514 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0625 = fieldNorm(doc=3317)
        0.16 = coord(4/25)
    
  5. Larson, R.R.: Cheshire 2 : design and evaluation of a next-generation online catalog system (1995) 0.07
    0.0728174 = sum of:
      0.0728174 = product of:
        0.45510876 = sum of:
          0.1042588 = weight(abstract_txt:statistics in 3820) [ClassicSimilarity], result of:
            0.1042588 = score(doc=3820,freq=1.0), product of:
              0.13289094 = queryWeight, product of:
                1.0110186 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.020942518 = queryNorm
              0.78454405 = fieldWeight in 3820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.125 = fieldNorm(doc=3820)
          0.10952132 = weight(abstract_txt:being in 3820) [ClassicSimilarity], result of:
            0.10952132 = score(doc=3820,freq=2.0), product of:
              0.13732597 = queryWeight, product of:
                1.453459 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.020942518 = queryNorm
              0.7975281 = fieldWeight in 3820, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.125 = fieldNorm(doc=3820)
          0.1576716 = weight(abstract_txt:evaluated in 3820) [ClassicSimilarity], result of:
            0.1576716 = score(doc=3820,freq=1.0), product of:
              0.22059679 = queryWeight, product of:
                1.8421545 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.020942518 = queryNorm
              0.7147502 = fieldWeight in 3820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.125 = fieldNorm(doc=3820)
          0.083657034 = weight(abstract_txt:text in 3820) [ClassicSimilarity], result of:
            0.083657034 = score(doc=3820,freq=1.0), product of:
              0.16549909 = queryWeight, product of:
                1.954204 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020942518 = queryNorm
              0.5054833 = fieldWeight in 3820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=3820)
        0.16 = coord(4/25)
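
The content scores combine several matching terms. The following is a minimal sketch of how the total for entry 5 (doc 3820) can be recomputed from the listed factors, assuming the same tf and idf definitions the numbers are consistent with (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The boost and queryNorm values are copied verbatim from the listing; how they were derived is not shown there, so they are treated as given inputs. The function and variable names are illustrative only.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.020942518   # copied from the listing
FIELD_NORM = 0.125         # fieldNorm(doc=3820)
COORD = 4 / 25             # 4 of the 25 query terms matched

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf definition, inferred from the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq):
    # score = queryWeight * fieldWeight
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight

# (term, boost, docFreq, termFreq) as listed for doc 3820.
matches = [
    ("statistics", 1.0110186, 225, 1.0),
    ("being",      1.453459,  1319, 2.0),
    ("evaluated",  1.8421545, 394, 1.0),
    ("text",       1.954204,  2106, 1.0),
]

per_term = {term: term_score(b, df, f) for term, b, df, f in matches}
total = sum(per_term.values()) * COORD
print(per_term)
print(f"total={total:.7f}")
```

The coord factor scales the summed term scores by the fraction of query terms found in the document, which is why entry 1 (12 of 25 terms) ranks well above the entries that match only 4 terms.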