Document (#20529)

Author
Ruge, G.
Goeser, S.
Title
Information Retrieval ohne Linguistik
Source
nfd Information - Wissenschaft und Praxis. 49(1998) H.6, S.361-369
Year
1998
Abstract
Natürlicherweise sollte man erwarten, daß linguistische Textanalyseverfahren die Effektivität und Benutzerfreundlichkeit von Information Retrieval Systemen verbessern, da sowohl Dokumente als auch Suchanfragen die interessierenden Inhalte linguistisch enkodieren. Ein Retrievalabgleich auf der Ebene der linguistischen Inhaltsdarstellung müßte demzufolge zu besseren Retrievalsystemen führen als ein Abgleich auf Wort- oder gar Zeichenebene. Tatsächlich aber ist immer noch weitgehend unklar, inwieweit linguistische Textanalyseverfahren Retrievalsysteme verbessern können. Evaluationen von Retrievalsystemen mit linguistischen Komponenten führen nach wie vor zu unterschiedlichen, teils gegenläufigen Ergebnissen, obwohl die dazu erforderliche Computerlinguistik große Fortschritte gemacht hat. Wir gehen der Frage nach, wie es zu diesen kontraintuitiven Ergenissen kommt. Dazu wird der Stand der Kunst im linguistischen IR zusammengefaßt, so daß die Ergebnisse anhand des Vergleich verschiedener Evaluierungen diskutiert werden können.
Footnote
Vgl. auch die Erwiderung: Ladewig, C.: 'Information Retrieval ohne Linguistik?' in: nfd 49(1998) H.8, S.476-478
Theme
Computerlinguistik

Similar documents (author)

  1. Ruge, G.: Experiments on linguistically-based term associations (1992) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:ruge in 1810) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1810, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1810)
    
  2. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:ruge in 4506) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 4506, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=4506)
    
  3. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:ruge in 1534) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1534)
    
  4. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:ruge in 2310) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 2310, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=2310)
    
  5. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:ruge in 5544) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 5544, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=5544)
    

Similar documents (content)

  1. Bachfeld, S.: Möglichkeiten und Grenzen linguistischer Verfahren der automatischen Indexierung : Entwurf einer Simulation für den Einsatz im Grundstudium (2003) 0.12
    0.115799546 = sum of:
      0.115799546 = product of:
        0.57899773 = sum of:
          0.019211281 = weight(abstract_txt:nach in 2827) [ClassicSimilarity], result of:
            0.019211281 = score(doc=2827,freq=1.0), product of:
              0.08075247 = queryWeight, product of:
                1.1418906 = boost
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.016256195 = queryNorm
              0.23790333 = fieldWeight in 2827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2827)
          0.020541191 = weight(abstract_txt:können in 2827) [ClassicSimilarity], result of:
            0.020541191 = score(doc=2827,freq=1.0), product of:
              0.0844375 = queryWeight, product of:
                1.1676543 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.016256195 = queryNorm
              0.24327096 = fieldWeight in 2827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2827)
          0.06486954 = weight(abstract_txt:führen in 2827) [ClassicSimilarity], result of:
            0.06486954 = score(doc=2827,freq=1.0), product of:
              0.18175185 = queryWeight, product of:
                1.713113 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.016256195 = queryNorm
              0.35691267 = fieldWeight in 2827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2827)
          0.15372685 = weight(abstract_txt:linguistische in 2827) [ClassicSimilarity], result of:
            0.15372685 = score(doc=2827,freq=1.0), product of:
              0.32306117 = queryWeight, product of:
                2.2839625 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.016256195 = queryNorm
              0.47584438 = fieldWeight in 2827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2827)
          0.3206489 = weight(abstract_txt:linguistischen in 2827) [ClassicSimilarity], result of:
            0.3206489 = score(doc=2827,freq=2.0), product of:
              0.4791725 = queryWeight, product of:
                3.4067335 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016256195 = queryNorm
              0.66917217 = fieldWeight in 2827, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2827)
        0.2 = coord(5/25)
    
  2. Fuhr, N.: Zur Überwindung der Diskrepanz zwischen Retrievalforschung und -praxis (1990) 0.11
    0.11051488 = sum of:
      0.11051488 = product of:
        0.5525744 = sum of:
          0.09510473 = weight(abstract_txt:besseren in 6625) [ClassicSimilarity], result of:
            0.09510473 = score(doc=6625,freq=1.0), product of:
              0.12997332 = queryWeight, product of:
                1.0243744 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.016256195 = queryNorm
              0.73172504 = fieldWeight in 6625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.09375 = fieldNorm(doc=6625)
          0.032933623 = weight(abstract_txt:nach in 6625) [ClassicSimilarity], result of:
            0.032933623 = score(doc=6625,freq=1.0), product of:
              0.08075247 = queryWeight, product of:
                1.1418906 = boost
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.016256195 = queryNorm
              0.40783426 = fieldWeight in 6625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.09375 = fieldNorm(doc=6625)
          0.049799368 = weight(abstract_txt:können in 6625) [ClassicSimilarity], result of:
            0.049799368 = score(doc=6625,freq=2.0), product of:
              0.0844375 = queryWeight, product of:
                1.1676543 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.016256195 = queryNorm
              0.5897779 = fieldWeight in 6625, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.09375 = fieldNorm(doc=6625)
          0.11120493 = weight(abstract_txt:führen in 6625) [ClassicSimilarity], result of:
            0.11120493 = score(doc=6625,freq=1.0), product of:
              0.18175185 = queryWeight, product of:
                1.713113 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.016256195 = queryNorm
              0.6118503 = fieldWeight in 6625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.09375 = fieldNorm(doc=6625)
          0.26353174 = weight(abstract_txt:linguistische in 6625) [ClassicSimilarity], result of:
            0.26353174 = score(doc=6625,freq=1.0), product of:
              0.32306117 = queryWeight, product of:
                2.2839625 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.016256195 = queryNorm
              0.81573325 = fieldWeight in 6625, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.09375 = fieldNorm(doc=6625)
        0.2 = coord(5/25)
    
  3. Sprachtechnologie, mobile Kommunikation und linguistische Ressourcen : Beiträge zur GLDV Tagung 2005 in Bonn (2005) 0.09
    0.09161062 = sum of:
      0.09161062 = product of:
        0.76342183 = sum of:
          0.11120493 = weight(abstract_txt:führen in 3578) [ClassicSimilarity], result of:
            0.11120493 = score(doc=3578,freq=1.0), product of:
              0.18175185 = queryWeight, product of:
                1.713113 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.016256195 = queryNorm
              0.6118503 = fieldWeight in 3578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.09375 = fieldNorm(doc=3578)
          0.26353174 = weight(abstract_txt:linguistische in 3578) [ClassicSimilarity], result of:
            0.26353174 = score(doc=3578,freq=1.0), product of:
              0.32306117 = queryWeight, product of:
                2.2839625 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.016256195 = queryNorm
              0.81573325 = fieldWeight in 3578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.09375 = fieldNorm(doc=3578)
          0.38868517 = weight(abstract_txt:linguistischen in 3578) [ClassicSimilarity], result of:
            0.38868517 = score(doc=3578,freq=1.0), product of:
              0.4791725 = queryWeight, product of:
                3.4067335 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016256195 = queryNorm
              0.8111592 = fieldWeight in 3578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.09375 = fieldNorm(doc=3578)
        0.12 = coord(3/25)
    
  4. Luckhardt, H.-D.: Computerlinguistik und Informationswissenschaft : Facetten des wissenschaftlichen Wirkens von Harald H. Zimmermann (2006) 0.09
    0.085722044 = sum of:
      0.085722044 = product of:
        0.7143504 = sum of:
          0.11661989 = weight(abstract_txt:linguistik in 6079) [ClassicSimilarity], result of:
            0.11661989 = score(doc=6079,freq=1.0), product of:
              0.13436002 = queryWeight, product of:
                1.0415176 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.016256195 = queryNorm
              0.86796576 = fieldWeight in 6079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.109375 = fieldNorm(doc=6079)
          0.14426447 = weight(abstract_txt:computerlinguistik in 6079) [ClassicSimilarity], result of:
            0.14426447 = score(doc=6079,freq=1.0), product of:
              0.15483217 = queryWeight, product of:
                1.1180525 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016256195 = queryNorm
              0.9317474 = fieldWeight in 6079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.109375 = fieldNorm(doc=6079)
          0.45346603 = weight(abstract_txt:linguistischen in 6079) [ClassicSimilarity], result of:
            0.45346603 = score(doc=6079,freq=1.0), product of:
              0.4791725 = queryWeight, product of:
                3.4067335 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016256195 = queryNorm
              0.94635236 = fieldWeight in 6079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.109375 = fieldNorm(doc=6079)
        0.12 = coord(3/25)
    
  5. Maschinelle Sprachsynthese (1996) 0.07
    0.07175884 = sum of:
      0.07175884 = product of:
        0.59799033 = sum of:
          0.10610175 = weight(abstract_txt:fortschritte in 5872) [ClassicSimilarity], result of:
            0.10610175 = score(doc=5872,freq=1.0), product of:
              0.12615466 = queryWeight, product of:
                1.0092139 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.016256195 = queryNorm
              0.841045 = fieldWeight in 5872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.109375 = fieldNorm(doc=5872)
          0.038422562 = weight(abstract_txt:nach in 5872) [ClassicSimilarity], result of:
            0.038422562 = score(doc=5872,freq=1.0), product of:
              0.08075247 = queryWeight, product of:
                1.1418906 = boost
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.016256195 = queryNorm
              0.47580665 = fieldWeight in 5872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.350232 = idf(docFreq=1550, maxDocs=44218)
                0.109375 = fieldNorm(doc=5872)
          0.45346603 = weight(abstract_txt:linguistischen in 5872) [ClassicSimilarity], result of:
            0.45346603 = score(doc=5872,freq=1.0), product of:
              0.4791725 = queryWeight, product of:
                3.4067335 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016256195 = queryNorm
              0.94635236 = fieldWeight in 5872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.109375 = fieldNorm(doc=5872)
        0.12 = coord(3/25)