Document (#35916)

Author
Nhongkai, S.N.
Bentz, H.-J.
Title
Bilinguale Suche mittels Konzeptnetzen
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Ed. by T. Mandl and C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
pp. 203-222
Series
Schriften zur Informationswissenschaft; Vol. 45
Abstract
A new method of full-text search in bilingual text collections is presented and tested on a parallel text corpus (English-German). The bridge between the two languages is provided by matching word clusters, which are derived from a co-occurrence analysis and supplied by the novel search engine SENTRAX (Essence Extractor Engine). These clusters represent concepts that occur in both text collections. The hypothesis is that retrieval by means of such structural comparisons can succeed.
Theme
Computerlinguistik
Sprachretrieval
Object
SENTRAX

Similar documents (content)

  1. Glogau, R.: Suchmaschine mit Köpfchen (1996) 0.12
    0.116032906 = sum of:
      0.116032906 = product of:
        0.72520566 = sum of:
          0.0897519 = weight(abstract_txt:anhand in 5904) [ClassicSimilarity], result of:
            0.0897519 = score(doc=5904,freq=1.0), product of:
              0.10057528 = queryWeight, product of:
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.017609982 = queryNorm
              0.89238524 = fieldWeight in 5904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.15625 = fieldNorm(doc=5904)
          0.12763608 = weight(abstract_txt:suchmaschine in 5904) [ClassicSimilarity], result of:
            0.12763608 = score(doc=5904,freq=1.0), product of:
              0.1271875 = queryWeight, product of:
                1.1245444 = boost
                6.4225717 = idf(docFreq=190, maxDocs=43254)
                0.017609982 = queryNorm
              1.0035268 = fieldWeight in 5904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4225717 = idf(docFreq=190, maxDocs=43254)
                0.15625 = fieldNorm(doc=5904)
          0.33299974 = weight(abstract_txt:volltextsuche in 5904) [ClassicSimilarity], result of:
            0.33299974 = score(doc=5904,freq=1.0), product of:
              0.24104127 = queryWeight, product of:
                1.5481038 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.017609982 = queryNorm
              1.381505 = fieldWeight in 5904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.15625 = fieldNorm(doc=5904)
          0.17481792 = weight(abstract_txt:finden in 5904) [ClassicSimilarity], result of:
            0.17481792 = score(doc=5904,freq=1.0), product of:
              0.19763452 = queryWeight, product of:
                1.9824432 = boost
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.017609982 = queryNorm
              0.8845515 = fieldWeight in 5904, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.15625 = fieldNorm(doc=5904)
        0.16 = coord(4/25)
    
  2. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.11
    0.105047606 = sum of:
      0.105047606 = product of:
        0.37517002 = sum of:
          0.038863715 = weight(abstract_txt:anhand in 2519) [ClassicSimilarity], result of:
            0.038863715 = score(doc=2519,freq=3.0), product of:
              0.10057528 = queryWeight, product of:
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.017609982 = queryNorm
              0.38641417 = fieldWeight in 2519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7112656 = idf(docFreq=388, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.036081403 = weight(abstract_txt:möglich in 2519) [ClassicSimilarity], result of:
            0.036081403 = score(doc=2519,freq=2.0), product of:
              0.10956734 = queryWeight, product of:
                1.0437462 = boost
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.017609982 = queryNorm
              0.32930803 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.057560403 = weight(abstract_txt:methode in 2519) [ClassicSimilarity], result of:
            0.057560403 = score(doc=2519,freq=2.0), product of:
              0.1495919 = queryWeight, product of:
                1.2195747 = boost
                6.965315 = idf(docFreq=110, maxDocs=43254)
                0.017609982 = queryNorm
              0.3847829 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.965315 = idf(docFreq=110, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.043110814 = weight(abstract_txt:liefern in 2519) [ClassicSimilarity], result of:
            0.043110814 = score(doc=2519,freq=1.0), product of:
              0.15543887 = queryWeight, product of:
                1.2431805 = boost
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.017609982 = queryNorm
              0.277349 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.100134 = idf(docFreq=96, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.044489626 = weight(abstract_txt:erfolgreich in 2519) [ClassicSimilarity], result of:
            0.044489626 = score(doc=2519,freq=1.0), product of:
              0.15873572 = queryWeight, product of:
                1.2562952 = boost
                7.1750355 = idf(docFreq=89, maxDocs=43254)
                0.017609982 = queryNorm
              0.2802748 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1750355 = idf(docFreq=89, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.11135957 = weight(abstract_txt:repräsentieren in 2519) [ClassicSimilarity], result of:
            0.11135957 = score(doc=2519,freq=2.0), product of:
              0.23226148 = queryWeight, product of:
                1.5196478 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.017609982 = queryNorm
              0.47945774 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.04370448 = weight(abstract_txt:finden in 2519) [ClassicSimilarity], result of:
            0.04370448 = score(doc=2519,freq=1.0), product of:
              0.19763452 = queryWeight, product of:
                1.9824432 = boost
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.017609982 = queryNorm
              0.22113788 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
        0.28 = coord(7/25)
    
  3. Schiffhauer, N.: Microsofts Encarta ist eine zuverlässige Enzyklopädie auf CD-ROM - Die Suchfunktionen sind noch verbesserungswürdig : Ein Suchspiel mit 14 Millionen Wörtern (2001) 0.07
    0.0685767 = sum of:
      0.0685767 = product of:
        0.28573626 = sum of:
          0.023229763 = weight(abstract_txt:beiden in 684) [ClassicSimilarity], result of:
            0.023229763 = score(doc=684,freq=1.0), product of:
              0.11943694 = queryWeight, product of:
                1.0897421 = boost
                6.2238064 = idf(docFreq=232, maxDocs=43254)
                0.017609982 = queryNorm
              0.19449395 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2238064 = idf(docFreq=232, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
          0.04542781 = weight(abstract_txt:englisch in 684) [ClassicSimilarity], result of:
            0.04542781 = score(doc=684,freq=1.0), product of:
              0.186777 = queryWeight, product of:
                1.3627496 = boost
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.017609982 = queryNorm
              0.24321952 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
          0.05668321 = weight(abstract_txt:passende in 684) [ClassicSimilarity], result of:
            0.05668321 = score(doc=684,freq=1.0), product of:
              0.216477 = queryWeight, product of:
                1.4671018 = boost
                8.379008 = idf(docFreq=26, maxDocs=43254)
                0.017609982 = queryNorm
              0.261844 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.379008 = idf(docFreq=26, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
          0.057452578 = weight(abstract_txt:geprüft in 684) [ClassicSimilarity], result of:
            0.057452578 = score(doc=684,freq=1.0), product of:
              0.21843146 = queryWeight, product of:
                1.4737098 = boost
                8.416748 = idf(docFreq=25, maxDocs=43254)
                0.017609982 = queryNorm
              0.26302338 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.416748 = idf(docFreq=25, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
          0.06797932 = weight(abstract_txt:geliefert in 684) [ClassicSimilarity], result of:
            0.06797932 = score(doc=684,freq=1.0), product of:
              0.24435809 = queryWeight, product of:
                1.5587187 = boost
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.017609982 = queryNorm
              0.2781955 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.902256 = idf(docFreq=15, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
          0.034963585 = weight(abstract_txt:finden in 684) [ClassicSimilarity], result of:
            0.034963585 = score(doc=684,freq=1.0), product of:
              0.19763452 = queryWeight, product of:
                1.9824432 = boost
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.017609982 = queryNorm
              0.17691031 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.66113 = idf(docFreq=408, maxDocs=43254)
                0.03125 = fieldNorm(doc=684)
        0.24 = coord(6/25)
    
  4. Gesell, J.: Neuauflage der Internationalen Patentklassifikation : incompatibility issues of library classification systems and subject headings in subject cataloguing (1986) 0.06
    0.06373281 = sum of:
      0.06373281 = product of:
        0.39833006 = sum of:
          0.05102681 = weight(abstract_txt:möglich in 4645) [ClassicSimilarity], result of:
            0.05102681 = score(doc=4645,freq=1.0), product of:
              0.10956734 = queryWeight, product of:
                1.0437462 = boost
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.017609982 = queryNorm
              0.4657119 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.961112 = idf(docFreq=302, maxDocs=43254)
                0.078125 = fieldNorm(doc=4645)
          0.09202571 = weight(abstract_txt:deutsch in 4645) [ClassicSimilarity], result of:
            0.09202571 = score(doc=4645,freq=1.0), product of:
              0.16233854 = queryWeight, product of:
                1.2704723 = boost
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.017609982 = queryNorm
              0.56687534 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2560043 = idf(docFreq=82, maxDocs=43254)
                0.078125 = fieldNorm(doc=4645)
          0.11356953 = weight(abstract_txt:englisch in 4645) [ClassicSimilarity], result of:
            0.11356953 = score(doc=4645,freq=1.0), product of:
              0.186777 = queryWeight, product of:
                1.3627496 = boost
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.017609982 = queryNorm
              0.6080488 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.783025 = idf(docFreq=48, maxDocs=43254)
                0.078125 = fieldNorm(doc=4645)
          0.14170802 = weight(abstract_txt:passende in 4645) [ClassicSimilarity], result of:
            0.14170802 = score(doc=4645,freq=1.0), product of:
              0.216477 = queryWeight, product of:
                1.4671018 = boost
                8.379008 = idf(docFreq=26, maxDocs=43254)
                0.017609982 = queryNorm
              0.65461004 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.379008 = idf(docFreq=26, maxDocs=43254)
                0.078125 = fieldNorm(doc=4645)
        0.16 = coord(4/25)
    
  5. Stock, W.G.: Informationelle Städte im 21. Jahrhundert (2011) 0.06
    0.060019094 = sum of:
      0.060019094 = product of:
        0.37511936 = sum of:
          0.054175843 = weight(abstract_txt:cluster in 976) [ClassicSimilarity], result of:
            0.054175843 = score(doc=976,freq=1.0), product of:
              0.13232014 = queryWeight, product of:
                1.1470103 = boost
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.017609982 = queryNorm
              0.40943006 = fieldWeight in 976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.550881 = idf(docFreq=167, maxDocs=43254)
                0.0625 = fieldNorm(doc=976)
          0.06724283 = weight(abstract_txt:solcher in 976) [ClassicSimilarity], result of:
            0.06724283 = score(doc=976,freq=1.0), product of:
              0.1528221 = queryWeight, product of:
                1.2326717 = boost
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.017609982 = queryNorm
              0.44000724 = fieldWeight in 976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.0625 = fieldNorm(doc=976)
          0.13893712 = weight(abstract_txt:hypothese in 976) [ClassicSimilarity], result of:
            0.13893712 = score(doc=976,freq=1.0), product of:
              0.24791399 = queryWeight, product of:
                1.5700189 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.017609982 = queryNorm
              0.5604247 = fieldWeight in 976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.0625 = fieldNorm(doc=976)
          0.114763595 = weight(abstract_txt:mittels in 976) [ClassicSimilarity], result of:
            0.114763595 = score(doc=976,freq=1.0), product of:
              0.2749803 = queryWeight, product of:
                2.3384073 = boost
                6.677633 = idf(docFreq=147, maxDocs=43254)
                0.017609982 = queryNorm
              0.41735205 = fieldWeight in 976, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.677633 = idf(docFreq=147, maxDocs=43254)
                0.0625 = fieldNorm(doc=976)
        0.16 = coord(4/25)
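
The score trees above all follow the same pattern, so the arithmetic can be checked by hand. The sketch below recomputes the total for the first similar document (doc=5904) from the raw inputs shown in its tree: freq, docFreq, maxDocs, boost, queryNorm, fieldNorm and coord. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are inferred from the numbers in the listing and agree with classic TF-IDF scoring. The function and variable names are illustrative assumptions, not part of any library API. The search engine computes in single precision, so the last digits may differ slightly.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency as it appears in the listing,
    # e.g. docFreq=388, maxDocs=43254 gives about 5.7113.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Term-frequency factor: freq=3.0 gives about 1.7320508.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the explain tree of doc=5904.
MAX_DOCS = 43254
QUERY_NORM = 0.017609982
FIELD_NORM = 0.15625

# (term, freq, docFreq, boost); "anhand" shows no boost line, so 1.0 is assumed.
terms = [
    ("anhand", 1.0, 388, 1.0),
    ("suchmaschine", 1.0, 190, 1.1245444),
    ("volltextsuche", 1.0, 16, 1.5481038),
    ("finden", 1.0, 408, 1.9824432),
]

raw_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, QUERY_NORM, FIELD_NORM, boost)
    for _, freq, doc_freq, boost in terms
)
coord = 4 / 25  # 4 of the 25 query terms matched this document
total = raw_sum * coord

print(round(total, 3))  # 0.116, the value shown next to entry 1
```

Because coord scales the whole sum, a document that matches more query terms can outrank one with higher per-term weights. Entry 2 matches 7 of 25 terms (coord 0.28) and reaches 0.105 even though its raw sum of 0.375 is about half that of entry 1.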