Document (#10431)

Author
Schwarz, C.
Title
THESYS: Thesaurus Syntax System : a fully automatic thesaurus building aid
Source
Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Hrsg.: H.-J. Hermes u. J. Hölzl
Imprint
Frankfurt : Indeks
Year
1988
Pages
S.63-70
Series
Studien zur Klassifikation; Bd.18
Abstract
THESYS is based on the natural language processing of free-text databases. It yields statistically evaluated correlations between words of the database. These correlations correspond to traditional thesaurus relations. The person who has to build a thesaurus is thus assisted by the proposals made by THESYS. THESYS is being tested on commercial databases under real world conditions. It is part of a text processing project at Siemens, called TINA (Text-Inhalts-Analyse). Software from TINA is actually being applied and evaluated by the US Department of Commerce for patent search and indexing (REALIST: REtrieval Aids by Linguistics and STatistics)
Theme
Computerlinguistik
Object
THESYS
TINA

Similar documents (author)

  1. Schwarz, C.: Natural language and information retrieval : Kommentierte Literaturliste zu Systemen, Verfahren und Tools (1986) 5.12
    5.1211123 = sum of:
      5.1211123 = weight(author_txt:schwarz in 408) [ClassicSimilarity], result of:
        5.1211123 = fieldWeight in 408, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.19378 = idf(docFreq=31, maxDocs=42596)
          0.625 = fieldNorm(doc=408)
    
  2. Schwarz, C.: Linguistische Hilfsmittel beim Information Retrieval (1984) 5.12
    5.1211123 = sum of:
      5.1211123 = weight(author_txt:schwarz in 545) [ClassicSimilarity], result of:
        5.1211123 = fieldWeight in 545, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.19378 = idf(docFreq=31, maxDocs=42596)
          0.625 = fieldNorm(doc=545)
    
  3. Schwarz, B.: Book House: ein OPAC für die Erschließung und Recherche Schöner Literatur (1991) 5.12
    5.1211123 = sum of:
      5.1211123 = weight(author_txt:schwarz in 1022) [ClassicSimilarity], result of:
        5.1211123 = fieldWeight in 1022, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.19378 = idf(docFreq=31, maxDocs=42596)
          0.625 = fieldNorm(doc=1022)
    
  4. Schwarz, C.: Freitextrecherche: Grenzen und Möglichkeiten (1982) 5.12
    5.1211123 = sum of:
      5.1211123 = weight(author_txt:schwarz in 1349) [ClassicSimilarity], result of:
        5.1211123 = fieldWeight in 1349, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.19378 = idf(docFreq=31, maxDocs=42596)
          0.625 = fieldNorm(doc=1349)
    
  5. Schwarz, R.: Buch und Bahn : Auskunftsdienst per CD-ROM (1995) 5.12
    5.1211123 = sum of:
      5.1211123 = weight(author_txt:schwarz in 4142) [ClassicSimilarity], result of:
        5.1211123 = fieldWeight in 4142, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.19378 = idf(docFreq=31, maxDocs=42596)
          0.625 = fieldNorm(doc=4142)
    

Similar documents (content)

  1. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.58
    0.5754661 = sum of:
      0.5754661 = product of:
        1.1988877 = sum of:
          0.06218253 = weight(abstract_txt:department in 3636) [ClassicSimilarity], result of:
            0.06218253 = score(doc=3636,freq=1.0), product of:
              0.128554 = queryWeight, product of:
                6.1914554 = idf(docFreq=236, maxDocs=42596)
                0.02076313 = queryNorm
              0.48370746 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1914554 = idf(docFreq=236, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.07415599 = weight(abstract_txt:syntax in 3636) [ClassicSimilarity], result of:
            0.07415599 = score(doc=3636,freq=1.0), product of:
              0.14456755 = queryWeight, product of:
                1.0604559 = boost
                6.5657654 = idf(docFreq=162, maxDocs=42596)
                0.02076313 = queryNorm
              0.5129504 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5657654 = idf(docFreq=162, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.075216636 = weight(abstract_txt:actually in 3636) [ClassicSimilarity], result of:
            0.075216636 = score(doc=3636,freq=1.0), product of:
              0.14594278 = queryWeight, product of:
                1.0654879 = boost
                6.5969205 = idf(docFreq=157, maxDocs=42596)
                0.02076313 = queryNorm
              0.51538444 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5969205 = idf(docFreq=157, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.08596771 = weight(abstract_txt:patent in 3636) [ClassicSimilarity], result of:
            0.08596771 = score(doc=3636,freq=1.0), product of:
              0.15953779 = queryWeight, product of:
                1.1140097 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.02076313 = queryNorm
              0.53885484 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.087275 = weight(abstract_txt:commerce in 3636) [ClassicSimilarity], result of:
            0.087275 = score(doc=3636,freq=1.0), product of:
              0.16115108 = queryWeight, product of:
                1.1196282 = boost
                6.932128 = idf(docFreq=112, maxDocs=42596)
                0.02076313 = queryNorm
              0.5415725 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.932128 = idf(docFreq=112, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.1165667 = weight(abstract_txt:yields in 3636) [ClassicSimilarity], result of:
            0.1165667 = score(doc=3636,freq=1.0), product of:
              0.1954443 = queryWeight, product of:
                1.233016 = boost
                7.634164 = idf(docFreq=55, maxDocs=42596)
                0.02076313 = queryNorm
              0.59641904 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.634164 = idf(docFreq=55, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.18791981 = weight(abstract_txt:inhalts in 3636) [ClassicSimilarity], result of:
            0.18791981 = score(doc=3636,freq=1.0), product of:
              0.26871282 = queryWeight, product of:
                1.4457773 = boost
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.02076313 = queryNorm
              0.69933325 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.048536427 = weight(abstract_txt:being in 3636) [ClassicSimilarity], result of:
            0.048536427 = score(doc=3636,freq=1.0), product of:
              0.13730781 = queryWeight, product of:
                1.4615707 = boost
                4.524625 = idf(docFreq=1254, maxDocs=42596)
                0.02076313 = queryNorm
              0.3534863 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.524625 = idf(docFreq=1254, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.20813785 = weight(abstract_txt:siemens in 3636) [ClassicSimilarity], result of:
            0.20813785 = score(doc=3636,freq=1.0), product of:
              0.28765643 = queryWeight, product of:
                1.4958713 = boost
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.02076313 = queryNorm
              0.7235641 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.2616205 = idf(docFreq=10, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.0635611 = weight(abstract_txt:processing in 3636) [ClassicSimilarity], result of:
            0.0635611 = score(doc=3636,freq=1.0), product of:
              0.164353 = queryWeight, product of:
                1.5990462 = boost
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.02076313 = queryNorm
              0.38673526 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9502115 = idf(docFreq=819, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.09898912 = weight(abstract_txt:evaluated in 3636) [ClassicSimilarity], result of:
            0.09898912 = score(doc=3636,freq=1.0), product of:
              0.22082165 = queryWeight, product of:
                1.8535018 = boost
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.02076313 = queryNorm
              0.44827634 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
          0.090378724 = weight(abstract_txt:text in 3636) [ClassicSimilarity], result of:
            0.090378724 = score(doc=3636,freq=3.0), product of:
              0.16494943 = queryWeight, product of:
                1.961974 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.02076313 = queryNorm
              0.5479178 = fieldWeight in 3636, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.078125 = fieldNorm(doc=3636)
        0.48 = coord(12/25)
    
  2. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.20
    0.19879751 = sum of:
      0.19879751 = product of:
        0.82832295 = sum of:
          0.10289984 = weight(abstract_txt:statistics in 5544) [ClassicSimilarity], result of:
            0.10289984 = score(doc=5544,freq=1.0), product of:
              0.13147298 = queryWeight, product of:
                1.0112894 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.02076313 = queryNorm
              0.7826691 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
          0.124721505 = weight(abstract_txt:linguistics in 5544) [ClassicSimilarity], result of:
            0.124721505 = score(doc=5544,freq=1.0), product of:
              0.1494586 = queryWeight, product of:
                1.0782455 = boost
                6.675909 = idf(docFreq=145, maxDocs=42596)
                0.02076313 = queryNorm
              0.83448863 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.675909 = idf(docFreq=145, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
          0.13089672 = weight(abstract_txt:aids in 5544) [ClassicSimilarity], result of:
            0.13089672 = score(doc=5544,freq=1.0), product of:
              0.1543521 = queryWeight, product of:
                1.0957551 = boost
                6.7843184 = idf(docFreq=130, maxDocs=42596)
                0.02076313 = queryNorm
              0.8480398 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7843184 = idf(docFreq=130, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
          0.070993274 = weight(abstract_txt:databases in 5544) [ClassicSimilarity], result of:
            0.070993274 = score(doc=5544,freq=1.0), product of:
              0.12933463 = queryWeight, product of:
                1.4185009 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.02076313 = queryNorm
              0.5489116 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
          0.31532335 = weight(abstract_txt:realist in 5544) [ClassicSimilarity], result of:
            0.31532335 = score(doc=5544,freq=1.0), product of:
              0.27737296 = queryWeight, product of:
                1.46889 = boost
                9.094566 = idf(docFreq=12, maxDocs=42596)
                0.02076313 = queryNorm
              1.1368208 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.094566 = idf(docFreq=12, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
          0.08348829 = weight(abstract_txt:text in 5544) [ClassicSimilarity], result of:
            0.08348829 = score(doc=5544,freq=1.0), product of:
              0.16494943 = queryWeight, product of:
                1.961974 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.02076313 = queryNorm
              0.50614476 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.125 = fieldNorm(doc=5544)
        0.24 = coord(6/25)
    
  3. Ruge, G.: Experiments on linguistically-based term associations (1992) 0.09
    0.09158632 = sum of:
      0.09158632 = product of:
        0.5724145 = sum of:
          0.07717488 = weight(abstract_txt:statistics in 1810) [ClassicSimilarity], result of:
            0.07717488 = score(doc=1810,freq=1.0), product of:
              0.13147298 = queryWeight, product of:
                1.0112894 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.02076313 = queryNorm
              0.58700186 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.09375 = fieldNorm(doc=1810)
          0.09817254 = weight(abstract_txt:aids in 1810) [ClassicSimilarity], result of:
            0.09817254 = score(doc=1810,freq=1.0), product of:
              0.1543521 = queryWeight, product of:
                1.0957551 = boost
                6.7843184 = idf(docFreq=130, maxDocs=42596)
                0.02076313 = queryNorm
              0.63602984 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7843184 = idf(docFreq=130, maxDocs=42596)
                0.09375 = fieldNorm(doc=1810)
          0.3344509 = weight(abstract_txt:realist in 1810) [ClassicSimilarity], result of:
            0.3344509 = score(doc=1810,freq=2.0), product of:
              0.27737296 = queryWeight, product of:
                1.46889 = boost
                9.094566 = idf(docFreq=12, maxDocs=42596)
                0.02076313 = queryNorm
              1.2057805 = fieldWeight in 1810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.094566 = idf(docFreq=12, maxDocs=42596)
                0.09375 = fieldNorm(doc=1810)
          0.06261622 = weight(abstract_txt:text in 1810) [ClassicSimilarity], result of:
            0.06261622 = score(doc=1810,freq=1.0), product of:
              0.16494943 = queryWeight, product of:
                1.961974 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.02076313 = queryNorm
              0.37960857 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.09375 = fieldNorm(doc=1810)
        0.16 = coord(4/25)
    
  4. Kousha, K.; Thelwall, M.: Patent citation analysis with Google (2017) 0.08
    0.0832869 = sum of:
      0.0832869 = product of:
        0.5205431 = sum of:
          0.21748301 = weight(abstract_txt:patent in 4318) [ClassicSimilarity], result of:
            0.21748301 = score(doc=4318,freq=10.0), product of:
              0.15953779 = queryWeight, product of:
                1.1140097 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.02076313 = queryNorm
              1.3632069 = fieldWeight in 4318, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.0625 = fieldNorm(doc=4318)
          0.050199825 = weight(abstract_txt:databases in 4318) [ClassicSimilarity], result of:
            0.050199825 = score(doc=4318,freq=2.0), product of:
              0.12933463 = queryWeight, product of:
                1.4185009 = boost
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.02076313 = queryNorm
              0.3881391 = fieldWeight in 4318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3912926 = idf(docFreq=1433, maxDocs=42596)
                0.0625 = fieldNorm(doc=4318)
          0.0791913 = weight(abstract_txt:evaluated in 4318) [ClassicSimilarity], result of:
            0.0791913 = score(doc=4318,freq=1.0), product of:
              0.22082165 = queryWeight, product of:
                1.8535018 = boost
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.02076313 = queryNorm
              0.35862106 = fieldWeight in 4318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.0625 = fieldNorm(doc=4318)
          0.17366892 = weight(abstract_txt:correlations in 4318) [ClassicSimilarity], result of:
            0.17366892 = score(doc=4318,freq=1.0), product of:
              0.37273893 = queryWeight, product of:
                2.4081004 = boost
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.02076313 = queryNorm
              0.46592644 = fieldWeight in 4318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.0625 = fieldNorm(doc=4318)
        0.16 = coord(4/25)
    
  5. Larson, R.R.: Cheshire 2 : design and evaluation of a next-generation online catalog system (1995) 0.07
    0.07273538 = sum of:
      0.07273538 = product of:
        0.45459613 = sum of:
          0.10289984 = weight(abstract_txt:statistics in 3889) [ClassicSimilarity], result of:
            0.10289984 = score(doc=3889,freq=1.0), product of:
              0.13147298 = queryWeight, product of:
                1.0112894 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.02076313 = queryNorm
              0.7826691 = fieldWeight in 3889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.125 = fieldNorm(doc=3889)
          0.1098254 = weight(abstract_txt:being in 3889) [ClassicSimilarity], result of:
            0.1098254 = score(doc=3889,freq=2.0), product of:
              0.13730781 = queryWeight, product of:
                1.4615707 = boost
                4.524625 = idf(docFreq=1254, maxDocs=42596)
                0.02076313 = queryNorm
              0.7998482 = fieldWeight in 3889, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.524625 = idf(docFreq=1254, maxDocs=42596)
                0.125 = fieldNorm(doc=3889)
          0.1583826 = weight(abstract_txt:evaluated in 3889) [ClassicSimilarity], result of:
            0.1583826 = score(doc=3889,freq=1.0), product of:
              0.22082165 = queryWeight, product of:
                1.8535018 = boost
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.02076313 = queryNorm
              0.7172421 = fieldWeight in 3889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.737937 = idf(docFreq=372, maxDocs=42596)
                0.125 = fieldNorm(doc=3889)
          0.08348829 = weight(abstract_txt:text in 3889) [ClassicSimilarity], result of:
            0.08348829 = score(doc=3889,freq=1.0), product of:
              0.16494943 = queryWeight, product of:
                1.961974 = boost
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.02076313 = queryNorm
              0.50614476 = fieldWeight in 3889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.049158 = idf(docFreq=2018, maxDocs=42596)
                0.125 = fieldNorm(doc=3889)
        0.16 = coord(4/25)