Document (#30966)

Author
Jensen, N.
Title
Evaluierung von mehrsprachigem Web-Retrieval : Experimente mit dem EuroGOV-Korpus im Rahmen des Cross Language Evaluation Forum (CLEF)
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
pp. 235-244
Series
Schriften zur Informationswissenschaft; vol. 45
Abstract
This article describes the experiments conducted by the University of Hildesheim in the first Web Track of the CLEF initiative (WebCLEF) in 2005. Participation yielded experience with a multilingual web corpus (EuroGOV) in preprocessing, topic and query development, language-independent indexing methods, and multilingual retrieval strategies. Because of the large size of the corpus and the time constraints, multilingual indexes were built. The article describes the University of Hildesheim's approach to participation and the results of the officially submitted runs as well as further experiments. For the Multilingual Task, the best result in CLEF was achieved.
Theme
Computerlinguistik
Multilinguale Probleme
Sprachretrieval
Object
CLEF

Similar documents (author)

  1. Jensen, E.A.: Systematischer oder alphabetischer Sachkatalog (1957) 5.33
    5.327513 = sum of:
      5.327513 = weight(author_txt:jensen in 805) [ClassicSimilarity], result of:
        5.327513 = fieldWeight in 805, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.524021 = idf(docFreq=22, maxDocs=42596)
          0.625 = fieldNorm(doc=805)
    
  2. Jensen, P.E.: Three methods of teaching basic subject cataloging (1985) 5.33
    5.327513 = sum of:
      5.327513 = weight(author_txt:jensen in 1739) [ClassicSimilarity], result of:
        5.327513 = fieldWeight in 1739, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.524021 = idf(docFreq=22, maxDocs=42596)
          0.625 = fieldNorm(doc=1739)
    
  3. Jensen, A.: INSPEC: Window on the world of the physical and applied sciences (1993) 5.33
    5.327513 = sum of:
      5.327513 = weight(author_txt:jensen in 5016) [ClassicSimilarity], result of:
        5.327513 = fieldWeight in 5016, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.524021 = idf(docFreq=22, maxDocs=42596)
          0.625 = fieldNorm(doc=5016)
    
  4. Jensen, M.B.: Enhancing catalogs with tables of contents and internal indexes (1993) 5.33
    5.327513 = sum of:
      5.327513 = weight(author_txt:jensen in 5551) [ClassicSimilarity], result of:
        5.327513 = fieldWeight in 5551, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.524021 = idf(docFreq=22, maxDocs=42596)
          0.625 = fieldNorm(doc=5551)
    
  5. Jensen, N.: Library cooperation in Denmark : a new model (1995) 5.33
    5.327513 = sum of:
      5.327513 = weight(author_txt:jensen in 2697) [ClassicSimilarity], result of:
        5.327513 = fieldWeight in 2697, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.524021 = idf(docFreq=22, maxDocs=42596)
          0.625 = fieldNorm(doc=2697)
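
The author scores above all reduce to the same three factors. As a rough sketch, assuming the textbook ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))) and taking fieldNorm as given from the index, the value can be recomputed as follows. The function names are illustrative and not part of any Lucene API.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity is assumed to define it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw in-field frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, matching the explain tree layout."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explain output for author_txt:jensen.
author_score = field_weight(freq=1.0, doc_freq=22, max_docs=42596, field_norm=0.625)
print(f"{author_score:.4f}")
```

Lucene computes with 32-bit floats, so the last digits may differ slightly from a double-precision recomputation.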
    

Similar documents (content)

  1. Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.22
    0.21671776 = sum of:
      0.21671776 = product of:
        0.9029907 = sum of:
          0.0818173 = weight(abstract_txt:evaluierung in 153) [ClassicSimilarity], result of:
            0.0818173 = score(doc=153,freq=1.0), product of:
              0.110165976 = queryWeight, product of:
                1.1226218 = boost
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.012387613 = queryNorm
              0.74267304 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
          0.14470136 = weight(abstract_txt:multilinguale in 153) [ClassicSimilarity], result of:
            0.14470136 = score(doc=153,freq=1.0), product of:
              0.1611137 = queryWeight, product of:
                1.3576128 = boost
                9.580074 = idf(docFreq=7, maxDocs=42596)
                0.012387613 = queryNorm
              0.89813197 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.580074 = idf(docFreq=7, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
          0.06953472 = weight(abstract_txt:universität in 153) [ClassicSimilarity], result of:
            0.06953472 = score(doc=153,freq=1.0), product of:
              0.12453608 = queryWeight, product of:
                1.6879995 = boost
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.012387613 = queryNorm
              0.55834997 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
          0.080797985 = weight(abstract_txt:beschreibt in 153) [ClassicSimilarity], result of:
            0.080797985 = score(doc=153,freq=1.0), product of:
              0.1376452 = queryWeight, product of:
                1.7746196 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.012387613 = queryNorm
              0.58700186 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
          0.21045572 = weight(abstract_txt:hildesheim in 153) [ClassicSimilarity], result of:
            0.21045572 = score(doc=153,freq=1.0), product of:
              0.26057607 = queryWeight, product of:
                2.4416983 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              0.8076556 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
          0.3156836 = weight(abstract_txt:clef in 153) [ClassicSimilarity], result of:
            0.3156836 = score(doc=153,freq=1.0), product of:
              0.39086413 = queryWeight, product of:
                3.6625473 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              0.8076556 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.09375 = fieldNorm(doc=153)
        0.24 = coord(6/25)
    
  2. Becks, D.; Mandl, T.; Womser-Hacker, C.: Spezielle Anforderungen bei der Evaluierung von Patent-Retrieval-Systemen (2010) 0.17
    0.16940194 = sum of:
      0.16940194 = product of:
        0.7058414 = sum of:
          0.016866755 = weight(abstract_txt:werden in 668) [ClassicSimilarity], result of:
            0.016866755 = score(doc=668,freq=1.0), product of:
              0.043706954 = queryWeight, product of:
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.012387613 = queryNorm
              0.38590553 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
          0.09545352 = weight(abstract_txt:evaluierung in 668) [ClassicSimilarity], result of:
            0.09545352 = score(doc=668,freq=1.0), product of:
              0.110165976 = queryWeight, product of:
                1.1226218 = boost
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.012387613 = queryNorm
              0.8664519 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
          0.06185271 = weight(abstract_txt:rahmen in 668) [ClassicSimilarity], result of:
            0.06185271 = score(doc=668,freq=1.0), product of:
              0.103936635 = queryWeight, product of:
                1.5420877 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.012387613 = queryNorm
              0.59510016 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
          0.06910657 = weight(abstract_txt:artikel in 668) [ClassicSimilarity], result of:
            0.06910657 = score(doc=668,freq=1.0), product of:
              0.111911766 = queryWeight, product of:
                1.600157 = boost
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.012387613 = queryNorm
              0.6175094 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
          0.09426432 = weight(abstract_txt:beschreibt in 668) [ClassicSimilarity], result of:
            0.09426432 = score(doc=668,freq=1.0), product of:
              0.1376452 = queryWeight, product of:
                1.7746196 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.012387613 = queryNorm
              0.6848355 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
          0.36829755 = weight(abstract_txt:clef in 668) [ClassicSimilarity], result of:
            0.36829755 = score(doc=668,freq=1.0), product of:
              0.39086413 = queryWeight, product of:
                3.6625473 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              0.94226485 = fieldWeight in 668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.109375 = fieldNorm(doc=668)
        0.24 = coord(6/25)
    
  3. Elbeshausen, S.; Geist, K.; Pätsch, G.: Akzeptanz und Lernförderlichkeit von Microblogging im Hochschulkontext (2010) 0.16
    0.15715349 = sum of:
      0.15715349 = product of:
        0.5612624 = sum of:
          0.020445593 = weight(abstract_txt:werden in 22) [ClassicSimilarity], result of:
            0.020445593 = score(doc=22,freq=2.0), product of:
              0.043706954 = queryWeight, product of:
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.012387613 = queryNorm
              0.4677881 = fieldWeight in 22, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.067777604 = weight(abstract_txt:weiterer in 22) [ClassicSimilarity], result of:
            0.067777604 = score(doc=22,freq=1.0), product of:
              0.09717208 = queryWeight, product of:
                1.0543395 = boost
                7.440008 = idf(docFreq=67, maxDocs=42596)
                0.012387613 = queryNorm
              0.69750077 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.440008 = idf(docFreq=67, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.053016603 = weight(abstract_txt:rahmen in 22) [ClassicSimilarity], result of:
            0.053016603 = score(doc=22,freq=1.0), product of:
              0.103936635 = queryWeight, product of:
                1.5420877 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.012387613 = queryNorm
              0.5100858 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.059234202 = weight(abstract_txt:artikel in 22) [ClassicSimilarity], result of:
            0.059234202 = score(doc=22,freq=1.0), product of:
              0.111911766 = queryWeight, product of:
                1.600157 = boost
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.012387613 = queryNorm
              0.5292938 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.06953472 = weight(abstract_txt:universität in 22) [ClassicSimilarity], result of:
            0.06953472 = score(doc=22,freq=1.0), product of:
              0.12453608 = queryWeight, product of:
                1.6879995 = boost
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.012387613 = queryNorm
              0.55834997 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.080797985 = weight(abstract_txt:beschreibt in 22) [ClassicSimilarity], result of:
            0.080797985 = score(doc=22,freq=1.0), product of:
              0.1376452 = queryWeight, product of:
                1.7746196 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.012387613 = queryNorm
              0.58700186 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
          0.21045572 = weight(abstract_txt:hildesheim in 22) [ClassicSimilarity], result of:
            0.21045572 = score(doc=22,freq=1.0), product of:
              0.26057607 = queryWeight, product of:
                2.4416983 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              0.8076556 = fieldWeight in 22, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.09375 = fieldNorm(doc=22)
        0.28 = coord(7/25)
    
  4. Heinz, S.: Realisierung und Evaluierung eines virtuellen Bibliotheksregals für die Informationswissenschaft an der Universitätsbibliothek Hildesheim (2003) 0.14
    0.13598572 = sum of:
      0.13598572 = product of:
        0.6799286 = sum of:
          0.09545352 = weight(abstract_txt:evaluierung in 162) [ClassicSimilarity], result of:
            0.09545352 = score(doc=162,freq=1.0), product of:
              0.110165976 = queryWeight, product of:
                1.1226218 = boost
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.012387613 = queryNorm
              0.8664519 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.921846 = idf(docFreq=41, maxDocs=42596)
                0.109375 = fieldNorm(doc=162)
          0.06185271 = weight(abstract_txt:rahmen in 162) [ClassicSimilarity], result of:
            0.06185271 = score(doc=162,freq=1.0), product of:
              0.103936635 = queryWeight, product of:
                1.5420877 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.012387613 = queryNorm
              0.59510016 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.109375 = fieldNorm(doc=162)
          0.08112384 = weight(abstract_txt:universität in 162) [ClassicSimilarity], result of:
            0.08112384 = score(doc=162,freq=1.0), product of:
              0.12453608 = queryWeight, product of:
                1.6879995 = boost
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.012387613 = queryNorm
              0.6514083 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.109375 = fieldNorm(doc=162)
          0.09426432 = weight(abstract_txt:beschreibt in 162) [ClassicSimilarity], result of:
            0.09426432 = score(doc=162,freq=1.0), product of:
              0.1376452 = queryWeight, product of:
                1.7746196 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.012387613 = queryNorm
              0.6848355 = fieldWeight in 162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.109375 = fieldNorm(doc=162)
          0.34723422 = weight(abstract_txt:hildesheim in 162) [ClassicSimilarity], result of:
            0.34723422 = score(doc=162,freq=2.0), product of:
              0.26057607 = queryWeight, product of:
                2.4416983 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              1.3325638 = fieldWeight in 162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.109375 = fieldNorm(doc=162)
        0.2 = coord(5/25)
    
  5. Caroli, F.; Görtz, M.; Kölle, R.: Informationswissenschaftliche Lehre in den Bachelor und Masterstudiengängen "Internationales Informationsmanagement" an der Universität Hildesheim (2010) 0.11
    0.11066379 = sum of:
      0.11066379 = product of:
        0.5533189 = sum of:
          0.012047682 = weight(abstract_txt:werden in 13) [ClassicSimilarity], result of:
            0.012047682 = score(doc=13,freq=1.0), product of:
              0.043706954 = queryWeight, product of:
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.012387613 = queryNorm
              0.2756468 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.528279 = idf(docFreq=3398, maxDocs=42596)
                0.078125 = fieldNorm(doc=13)
          0.069808185 = weight(abstract_txt:artikel in 13) [ClassicSimilarity], result of:
            0.069808185 = score(doc=13,freq=2.0), product of:
              0.111911766 = queryWeight, product of:
                1.600157 = boost
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.012387613 = queryNorm
              0.62377876 = fieldWeight in 13, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6458006 = idf(docFreq=408, maxDocs=42596)
                0.078125 = fieldNorm(doc=13)
          0.10036472 = weight(abstract_txt:universität in 13) [ClassicSimilarity], result of:
            0.10036472 = score(doc=13,freq=3.0), product of:
              0.12453608 = queryWeight, product of:
                1.6879995 = boost
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.012387613 = queryNorm
              0.8059088 = fieldWeight in 13, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9557333 = idf(docFreq=299, maxDocs=42596)
                0.078125 = fieldNorm(doc=13)
          0.06733166 = weight(abstract_txt:beschreibt in 13) [ClassicSimilarity], result of:
            0.06733166 = score(doc=13,freq=1.0), product of:
              0.1376452 = queryWeight, product of:
                1.7746196 = boost
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.012387613 = queryNorm
              0.4891682 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.261353 = idf(docFreq=220, maxDocs=42596)
                0.078125 = fieldNorm(doc=13)
          0.30376667 = weight(abstract_txt:hildesheim in 13) [ClassicSimilarity], result of:
            0.30376667 = score(doc=13,freq=3.0), product of:
              0.26057607 = queryWeight, product of:
                2.4416983 = boost
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.012387613 = queryNorm
              1.1657504 = fieldWeight in 13, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.614993 = idf(docFreq=20, maxDocs=42596)
                0.078125 = fieldNorm(doc=13)
        0.2 = coord(5/25)
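
The content scores combine several term clauses. As a hedged sketch, assuming each clause score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm) and the clause sum is then scaled by coord (matched clauses out of 25), the first content hit (doc=153) can be recomputed from the numbers printed in its explain tree. The helper names are hypothetical; only the input values come from the listing.

```python
import math

QUERY_NORM = 0.012387613  # queryNorm as printed in the explain output
MAX_DOCS = 42596          # maxDocs as printed in the explain output
TOTAL_CLAUSES = 25        # denominator of coord(6/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """Score of one term clause: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (boost, freq, docFreq, fieldNorm) for the six terms matched in doc=153:
# evaluierung, multilinguale, universität, beschreibt, hildesheim, clef.
clauses = [
    (1.1226218, 1.0, 41, 0.09375),
    (1.3576128, 1.0, 7, 0.09375),
    (1.6879995, 1.0, 299, 0.09375),
    (1.7746196, 1.0, 220, 0.09375),
    (2.4416983, 1.0, 20, 0.09375),
    (3.6625473, 1.0, 20, 0.09375),
]

clause_sum = sum(term_score(*clause) for clause in clauses)
coord = len(clauses) / TOTAL_CLAUSES
final_score = clause_sum * coord
print(f"sum={clause_sum:.4f} coord={coord:.2f} score={final_score:.4f}")
```

The coord factor explains why a hit with strong individual term weights can still rank low: matching only 5 or 6 of 25 query terms scales the sum down to roughly a fifth or a quarter.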