Document (#35916)

Author
Nhongkai, S.N.
Bentz, H.-J.
Title
Bilinguale Suche mittels Konzeptnetzen
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.203-222
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
Eine neue Methode der Volltextsuche in bilingualen Textsammlungen wird vorgestellt und anhand eines parallelen Textkorpus (Englisch-Deutsch) geprüft. Die Brücke liefern passende Wortcluster, die aus einer Kookkurrenzanalyse stammen, geliefert von der neuartigen Suchmaschine SENTRAX (Essente Extractor Engine). Diese Cluster repräsentieren Konzepte, die sich in beiden Textsammlungen finden. Die Hypothese ist, dass das Finden mittels solcher Strukturvergleiche erfolgreich möglich ist.
Theme
Computerlinguistik
Sprachretrieval
Object
SENTRAX

Similar documents (content)

  1. Glogau, R.: Suchmaschine mit Köpfchen (1996) 0.12
    0.115967005 = sum of:
      0.115967005 = product of:
        0.7247938 = sum of:
          0.090170555 = weight(abstract_txt:anhand in 4904) [ClassicSimilarity], result of:
            0.090170555 = score(doc=4904,freq=1.0), product of:
              0.10093478 = queryWeight, product of:
                5.7174697 = idf(docFreq=381, maxDocs=42740)
                0.01765375 = queryNorm
              0.89335465 = fieldWeight in 4904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7174697 = idf(docFreq=381, maxDocs=42740)
                0.15625 = fieldNorm(doc=4904)
          0.12741455 = weight(abstract_txt:suchmaschine in 4904) [ClassicSimilarity], result of:
            0.12741455 = score(doc=4904,freq=1.0), product of:
              0.12709947 = queryWeight, product of:
                1.1221514 = boost
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.01765375 = queryNorm
              1.0024791 = fieldWeight in 4904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4158664 = idf(docFreq=189, maxDocs=42740)
                0.15625 = fieldNorm(doc=4904)
          0.33211437 = weight(abstract_txt:volltextsuche in 4904) [ClassicSimilarity], result of:
            0.33211437 = score(doc=4904,freq=1.0), product of:
              0.24072589 = queryWeight, product of:
                1.5443331 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.01765375 = queryNorm
              1.3796371 = fieldWeight in 4904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.15625 = fieldNorm(doc=4904)
          0.17509432 = weight(abstract_txt:finden in 4904) [ClassicSimilarity], result of:
            0.17509432 = score(doc=4904,freq=1.0), product of:
              0.19793491 = queryWeight, product of:
                1.980413 = boost
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.01765375 = queryNorm
              0.8846056 = fieldWeight in 4904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.15625 = fieldNorm(doc=4904)
        0.16 = coord(4/25)
    
  2. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.11
    0.105556354 = sum of:
      0.105556354 = product of:
        0.37698698 = sum of:
          0.03904499 = weight(abstract_txt:anhand in 3055) [ClassicSimilarity], result of:
            0.03904499 = score(doc=3055,freq=3.0), product of:
              0.10093478 = queryWeight, product of:
                5.7174697 = idf(docFreq=381, maxDocs=42740)
                0.01765375 = queryNorm
              0.38683388 = fieldWeight in 3055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7174697 = idf(docFreq=381, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.036278345 = weight(abstract_txt:möglich in 3055) [ClassicSimilarity], result of:
            0.036278345 = score(doc=3055,freq=2.0), product of:
              0.110016875 = queryWeight, product of:
                1.044021 = boost
                5.969158 = idf(docFreq=296, maxDocs=42740)
                0.01765375 = queryNorm
              0.32975253 = fieldWeight in 3055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.969158 = idf(docFreq=296, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.05756877 = weight(abstract_txt:methode in 3055) [ClassicSimilarity], result of:
            0.05756877 = score(doc=3055,freq=2.0), product of:
              0.14967605 = queryWeight, product of:
                1.2177433 = boost
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.01765375 = queryNorm
              0.38462245 = fieldWeight in 3055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.96241 = idf(docFreq=109, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.042953372 = weight(abstract_txt:liefern in 3055) [ClassicSimilarity], result of:
            0.042953372 = score(doc=3055,freq=1.0), product of:
              0.1551324 = queryWeight, product of:
                1.2397406 = boost
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.01765375 = queryNorm
              0.27688202 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.044329483 = weight(abstract_txt:erfolgreich in 3055) [ClassicSimilarity], result of:
            0.044329483 = score(doc=3055,freq=1.0), product of:
              0.15842831 = queryWeight, product of:
                1.252841 = boost
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.01765375 = queryNorm
              0.27980784 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1630807 = idf(docFreq=89, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.11303844 = weight(abstract_txt:repräsentieren in 3055) [ClassicSimilarity], result of:
            0.11303844 = score(doc=3055,freq=2.0), product of:
              0.23469931 = queryWeight, product of:
                1.5248793 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.01765375 = queryNorm
              0.48163092 = fieldWeight in 3055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
          0.04377358 = weight(abstract_txt:finden in 3055) [ClassicSimilarity], result of:
            0.04377358 = score(doc=3055,freq=1.0), product of:
              0.19793491 = queryWeight, product of:
                1.980413 = boost
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.01765375 = queryNorm
              0.2211514 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3055)
        0.28 = coord(7/25)
    
  3. Schiffhauer, N.: Microsofts Encarta ist eine zuverlässige Enzyklopädie auf CD-ROM - Die Suchfunktionen sind noch verbesserungswürdig : ¬Ein Suchspiel mit 14 Millionen Wörtern (2001) 0.07
    0.06830935 = sum of:
      0.06830935 = product of:
        0.2846223 = sum of:
          0.023322405 = weight(abstract_txt:beiden in 599) [ClassicSimilarity], result of:
            0.023322405 = score(doc=599,freq=1.0), product of:
              0.11981005 = queryWeight, product of:
                1.0894974 = boost
                6.2291684 = idf(docFreq=228, maxDocs=42740)
                0.01765375 = queryNorm
              0.19466151 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2291684 = idf(docFreq=228, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
          0.045282 = weight(abstract_txt:englisch in 599) [ClassicSimilarity], result of:
            0.045282 = score(doc=599,freq=1.0), product of:
              0.18646389 = queryWeight, product of:
                1.35918 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.01765375 = queryNorm
              0.24284594 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
          0.057288084 = weight(abstract_txt:passende in 599) [ClassicSimilarity], result of:
            0.057288084 = score(doc=599,freq=1.0), product of:
              0.21811585 = queryWeight, product of:
                1.4700198 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.01765375 = queryNorm
              0.2626498 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
          0.057288084 = weight(abstract_txt:geprüft in 599) [ClassicSimilarity], result of:
            0.057288084 = score(doc=599,freq=1.0), product of:
              0.21811585 = queryWeight, product of:
                1.4700198 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.01765375 = queryNorm
              0.2626498 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
          0.06642287 = weight(abstract_txt:geliefert in 599) [ClassicSimilarity], result of:
            0.06642287 = score(doc=599,freq=1.0), product of:
              0.24072589 = queryWeight, product of:
                1.5443331 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.01765375 = queryNorm
              0.27592742 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
          0.035018865 = weight(abstract_txt:finden in 599) [ClassicSimilarity], result of:
            0.035018865 = score(doc=599,freq=1.0), product of:
              0.19793491 = queryWeight, product of:
                1.980413 = boost
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.01765375 = queryNorm
              0.17692111 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6614757 = idf(docFreq=403, maxDocs=42740)
                0.03125 = fieldNorm(doc=599)
        0.24 = coord(6/25)
    
  4. Gesell, J.: Neuauflage der Internationalen Patentklassifikation : incompatibility issues of library classification systems and subject headings in subject cataloguing (1986) 0.06
    0.063908815 = sum of:
      0.063908815 = product of:
        0.39943013 = sum of:
          0.051305324 = weight(abstract_txt:möglich in 3645) [ClassicSimilarity], result of:
            0.051305324 = score(doc=3645,freq=1.0), product of:
              0.110016875 = queryWeight, product of:
                1.044021 = boost
                5.969158 = idf(docFreq=296, maxDocs=42740)
                0.01765375 = queryNorm
              0.46634048 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.969158 = idf(docFreq=296, maxDocs=42740)
                0.078125 = fieldNorm(doc=3645)
          0.09169961 = weight(abstract_txt:deutsch in 3645) [ClassicSimilarity], result of:
            0.09169961 = score(doc=3645,freq=1.0), product of:
              0.16203022 = queryWeight, product of:
                1.2670028 = boost
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.01765375 = queryNorm
              0.5659414 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24405 = idf(docFreq=82, maxDocs=42740)
                0.078125 = fieldNorm(doc=3645)
          0.113205 = weight(abstract_txt:englisch in 3645) [ClassicSimilarity], result of:
            0.113205 = score(doc=3645,freq=1.0), product of:
              0.18646389 = queryWeight, product of:
                1.35918 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.01765375 = queryNorm
              0.60711485 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.078125 = fieldNorm(doc=3645)
          0.14322022 = weight(abstract_txt:passende in 3645) [ClassicSimilarity], result of:
            0.14322022 = score(doc=3645,freq=1.0), product of:
              0.21811585 = queryWeight, product of:
                1.4700198 = boost
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.01765375 = queryNorm
              0.6566245 = fieldWeight in 3645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.404794 = idf(docFreq=25, maxDocs=42740)
                0.078125 = fieldNorm(doc=3645)
        0.16 = coord(4/25)
    
  5. Stock, W.G.: Informationelle Städte im 21. Jahrhundert (2011) 0.06
    0.060180224 = sum of:
      0.060180224 = product of:
        0.3761264 = sum of:
          0.05455381 = weight(abstract_txt:cluster in 1512) [ClassicSimilarity], result of:
            0.05455381 = score(doc=1512,freq=1.0), product of:
              0.13299677 = queryWeight, product of:
                1.1478896 = boost
                6.563024 = idf(docFreq=163, maxDocs=42740)
                0.01765375 = queryNorm
              0.410189 = fieldWeight in 1512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.563024 = idf(docFreq=163, maxDocs=42740)
                0.0625 = fieldNorm(doc=1512)
          0.067273766 = weight(abstract_txt:solcher in 1512) [ClassicSimilarity], result of:
            0.067273766 = score(doc=1512,freq=1.0), product of:
              0.15294015 = queryWeight, product of:
                1.2309498 = boost
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.01765375 = queryNorm
              0.43986985 = fieldWeight in 1512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0379176 = idf(docFreq=101, maxDocs=42740)
                0.0625 = fieldNorm(doc=1512)
          0.13857558 = weight(abstract_txt:hypothese in 1512) [ClassicSimilarity], result of:
            0.13857558 = score(doc=1512,freq=1.0), product of:
              0.24759898 = queryWeight, product of:
                1.5662245 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.01765375 = queryNorm
              0.55967754 = fieldWeight in 1512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.0625 = fieldNorm(doc=1512)
          0.11572324 = weight(abstract_txt:mittels in 1512) [ClassicSimilarity], result of:
            0.11572324 = score(doc=1512,freq=1.0), product of:
              0.27663985 = queryWeight, product of:
                2.3412724 = boost
                6.693077 = idf(docFreq=143, maxDocs=42740)
                0.01765375 = queryNorm
              0.41831732 = fieldWeight in 1512, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.693077 = idf(docFreq=143, maxDocs=42740)
                0.0625 = fieldNorm(doc=1512)
        0.16 = coord(4/25)