Document (#30966)

Author
Jensen, N.
Title
Evaluierung von mehrsprachigem Web-Retrieval : Experimente mit dem EuroGOV-Korpus im Rahmen des Cross Language Evaluation Forum (CLEF)
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.235-244
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
Der vorliegende Artikel beschreibt die Experimente der Universität Hildesheim im Rahmen des ersten Web Track der CLEF-Initiative (WebCLEF) im Jahr 2005. Bei der Teilnahme konnten Erfahrungen mit einem multilingualen Web-Korpus (EuroGOV) bei der Vorverarbeitung, der Topic- bzw. Query-Entwicklung, bei sprachunabhängigen Indexierungsmethoden und multilingualen Retrieval-Strategien gesammelt werden. Aufgrund des großen Um-fangs des Korpus und der zeitlichen Einschränkungen wurden multilinguale Indizes aufgebaut. Der Artikel beschreibt die Vorgehensweise bei der Teilnahme der Universität Hildesheim und die Ergebnisse der offiziell eingereichten sowie weiterer Experimente. Für den Multilingual Task konnte das beste Ergebnis in CLEF erzielt werden.
Theme
Computerlinguistik
Multilinguale Probleme
Sprachretrieval
Object
CLEF

Similar documents (author)

  1. Jensen, E.A.: Systematischer oder alphabetischer Sachkatalog (1957) 5.34
    5.3370943 = sum of:
      5.3370943 = weight(author_txt:jensen in 805) [ClassicSimilarity], result of:
        5.3370943 = fieldWeight in 805, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.5393505 = idf(docFreq=22, maxDocs=43254)
          0.625 = fieldNorm(doc=805)
    
  2. Jensen, P.E.: Three methods of teaching basic subject cataloging (1985) 5.34
    5.3370943 = sum of:
      5.3370943 = weight(author_txt:jensen in 1739) [ClassicSimilarity], result of:
        5.3370943 = fieldWeight in 1739, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.5393505 = idf(docFreq=22, maxDocs=43254)
          0.625 = fieldNorm(doc=1739)
    
  3. Jensen, A.: INSPEC: Window on the world of the physical and applied sciences (1993) 5.34
    5.3370943 = sum of:
      5.3370943 = weight(author_txt:jensen in 5016) [ClassicSimilarity], result of:
        5.3370943 = fieldWeight in 5016, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.5393505 = idf(docFreq=22, maxDocs=43254)
          0.625 = fieldNorm(doc=5016)
    
  4. Jensen, M.B.: Enhancing catalogs with tables of contents and internal indexes (1993) 5.34
    5.3370943 = sum of:
      5.3370943 = weight(author_txt:jensen in 5551) [ClassicSimilarity], result of:
        5.3370943 = fieldWeight in 5551, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.5393505 = idf(docFreq=22, maxDocs=43254)
          0.625 = fieldNorm(doc=5551)
    
  5. Jensen, N.: Library cooperation in Denmark : a new model (1995) 5.34
    5.3370943 = sum of:
      5.3370943 = weight(author_txt:jensen in 3697) [ClassicSimilarity], result of:
        5.3370943 = fieldWeight in 3697, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.5393505 = idf(docFreq=22, maxDocs=43254)
          0.625 = fieldNorm(doc=3697)
    

Similar documents (content)

  1. Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.22
    0.21654178 = sum of:
      0.21654178 = product of:
        0.90225744 = sum of:
          0.082217485 = weight(abstract_txt:evaluierung in 974) [ClassicSimilarity], result of:
            0.082217485 = score(doc=974,freq=1.0), product of:
              0.11049101 = queryWeight, product of:
                1.1273292 = boost
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.012348386 = queryNorm
              0.74411017 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
          0.14526334 = weight(abstract_txt:multilinguale in 974) [ClassicSimilarity], result of:
            0.14526334 = score(doc=974,freq=1.0), product of:
              0.16148102 = queryWeight, product of:
                1.36285 = boost
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.012348386 = queryNorm
              0.8995691 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.595404 = idf(docFreq=7, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
          0.069428764 = weight(abstract_txt:universität in 974) [ClassicSimilarity], result of:
            0.069428764 = score(doc=974,freq=1.0), product of:
              0.12437138 = queryWeight, product of:
                1.6914631 = boost
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.012348386 = queryNorm
              0.5582375 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
          0.08028132 = weight(abstract_txt:beschreibt in 974) [ClassicSimilarity], result of:
            0.08028132 = score(doc=974,freq=1.0), product of:
              0.13701575 = queryWeight, product of:
                1.7753645 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.012348386 = queryNorm
              0.58592767 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
          0.20798664 = weight(abstract_txt:hildesheim in 974) [ClassicSimilarity], result of:
            0.20798664 = score(doc=974,freq=1.0), product of:
              0.2584547 = queryWeight, product of:
                2.4383414 = boost
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.012348386 = queryNorm
              0.8047315 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
          0.31707987 = weight(abstract_txt:clef in 974) [ClassicSimilarity], result of:
            0.31707987 = score(doc=974,freq=1.0), product of:
              0.39189556 = queryWeight, product of:
                3.6773343 = boost
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.012348386 = queryNorm
              0.80909276 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.09375 = fieldNorm(doc=974)
        0.24 = coord(6/25)
    
  2. Becks, D.; Mandl, T.; Womser-Hacker, C.: Spezielle Anforderungen bei der Evaluierung von Patent-Retrieval-Systemen (2010) 0.17
    0.16961826 = sum of:
      0.16961826 = product of:
        0.70674276 = sum of:
          0.016737811 = weight(abstract_txt:werden in 1132) [ClassicSimilarity], result of:
            0.016737811 = score(doc=1132,freq=1.0), product of:
              0.043470576 = queryWeight, product of:
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.012348386 = queryNorm
              0.38503772 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
          0.0959204 = weight(abstract_txt:evaluierung in 1132) [ClassicSimilarity], result of:
            0.0959204 = score(doc=1132,freq=1.0), product of:
              0.11049101 = queryWeight, product of:
                1.1273292 = boost
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.012348386 = queryNorm
              0.86812854 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
          0.061513655 = weight(abstract_txt:rahmen in 1132) [ClassicSimilarity], result of:
            0.061513655 = score(doc=1132,freq=1.0), product of:
              0.1035247 = queryWeight, product of:
                1.5432074 = boost
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.012348386 = queryNorm
              0.59419304 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
          0.068982825 = weight(abstract_txt:artikel in 1132) [ClassicSimilarity], result of:
            0.068982825 = score(doc=1132,freq=1.0), product of:
              0.111743845 = queryWeight, product of:
                1.6032975 = boost
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.012348386 = queryNorm
              0.61732996 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
          0.09366154 = weight(abstract_txt:beschreibt in 1132) [ClassicSimilarity], result of:
            0.09366154 = score(doc=1132,freq=1.0), product of:
              0.13701575 = queryWeight, product of:
                1.7753645 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.012348386 = queryNorm
              0.6835823 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
          0.3699265 = weight(abstract_txt:clef in 1132) [ClassicSimilarity], result of:
            0.3699265 = score(doc=1132,freq=1.0), product of:
              0.39189556 = queryWeight, product of:
                3.6773343 = boost
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.012348386 = queryNorm
              0.94394153 = fieldWeight in 1132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.109375 = fieldNorm(doc=1132)
        0.24 = coord(6/25)
    
  3. Elbeshausen, S.; Geist, K.; Pätsch, G.: Akzeptanz und Lernförderlichkeit von Microblogging im Hochschulkontext (2010) 0.16
    0.15612109 = sum of:
      0.15612109 = product of:
        0.55757535 = sum of:
          0.020289289 = weight(abstract_txt:werden in 486) [ClassicSimilarity], result of:
            0.020289289 = score(doc=486,freq=2.0), product of:
              0.043470576 = queryWeight, product of:
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.012348386 = queryNorm
              0.46673614 = fieldWeight in 486, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.0677352 = weight(abstract_txt:weiterer in 486) [ClassicSimilarity], result of:
            0.0677352 = score(doc=486,freq=1.0), product of:
              0.09710176 = queryWeight, product of:
                1.0568196 = boost
                7.4407387 = idf(docFreq=68, maxDocs=43254)
                0.012348386 = queryNorm
              0.69756925 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4407387 = idf(docFreq=68, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.052725993 = weight(abstract_txt:rahmen in 486) [ClassicSimilarity], result of:
            0.052725993 = score(doc=486,freq=1.0), product of:
              0.1035247 = queryWeight, product of:
                1.5432074 = boost
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.012348386 = queryNorm
              0.50930834 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.05912814 = weight(abstract_txt:artikel in 486) [ClassicSimilarity], result of:
            0.05912814 = score(doc=486,freq=1.0), product of:
              0.111743845 = queryWeight, product of:
                1.6032975 = boost
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.012348386 = queryNorm
              0.52914 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.069428764 = weight(abstract_txt:universität in 486) [ClassicSimilarity], result of:
            0.069428764 = score(doc=486,freq=1.0), product of:
              0.12437138 = queryWeight, product of:
                1.6914631 = boost
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.012348386 = queryNorm
              0.5582375 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.08028132 = weight(abstract_txt:beschreibt in 486) [ClassicSimilarity], result of:
            0.08028132 = score(doc=486,freq=1.0), product of:
              0.13701575 = queryWeight, product of:
                1.7753645 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.012348386 = queryNorm
              0.58592767 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
          0.20798664 = weight(abstract_txt:hildesheim in 486) [ClassicSimilarity], result of:
            0.20798664 = score(doc=486,freq=1.0), product of:
              0.2584547 = queryWeight, product of:
                2.4383414 = boost
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.012348386 = queryNorm
              0.8047315 = fieldWeight in 486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.09375 = fieldNorm(doc=486)
        0.28 = coord(7/25)
    
  4. Heinz, S.: Realisierung und Evaluierung eines virtuellen Bibliotheksregals für die Informationswissenschaft an der Universitätsbibliothek Hildesheim (2003) 0.14
    0.13505125 = sum of:
      0.13505125 = product of:
        0.67525625 = sum of:
          0.0959204 = weight(abstract_txt:evaluierung in 983) [ClassicSimilarity], result of:
            0.0959204 = score(doc=983,freq=1.0), product of:
              0.11049101 = queryWeight, product of:
                1.1273292 = boost
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.012348386 = queryNorm
              0.86812854 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9371753 = idf(docFreq=41, maxDocs=43254)
                0.109375 = fieldNorm(doc=983)
          0.061513655 = weight(abstract_txt:rahmen in 983) [ClassicSimilarity], result of:
            0.061513655 = score(doc=983,freq=1.0), product of:
              0.1035247 = queryWeight, product of:
                1.5432074 = boost
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.012348386 = queryNorm
              0.59419304 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.432622 = idf(docFreq=513, maxDocs=43254)
                0.109375 = fieldNorm(doc=983)
          0.081000224 = weight(abstract_txt:universität in 983) [ClassicSimilarity], result of:
            0.081000224 = score(doc=983,freq=1.0), product of:
              0.12437138 = queryWeight, product of:
                1.6914631 = boost
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.012348386 = queryNorm
              0.65127707 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.109375 = fieldNorm(doc=983)
          0.09366154 = weight(abstract_txt:beschreibt in 983) [ClassicSimilarity], result of:
            0.09366154 = score(doc=983,freq=1.0), product of:
              0.13701575 = queryWeight, product of:
                1.7753645 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.012348386 = queryNorm
              0.6835823 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.109375 = fieldNorm(doc=983)
          0.34316042 = weight(abstract_txt:hildesheim in 983) [ClassicSimilarity], result of:
            0.34316042 = score(doc=983,freq=2.0), product of:
              0.2584547 = queryWeight, product of:
                2.4383414 = boost
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.012348386 = queryNorm
              1.3277391 = fieldWeight in 983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.109375 = fieldNorm(doc=983)
        0.2 = coord(5/25)
    
  5. Caroli, F.; Görtz, M.; Kölle, R.: Informationswissenschaftliche Lehre in den Bachelor und Masterstudiengängen "Internationales Informationsmanagement" an der Universität Hildesheim (2010) 0.11
    0.1097909 = sum of:
      0.1097909 = product of:
        0.5489545 = sum of:
          0.011955579 = weight(abstract_txt:werden in 477) [ClassicSimilarity], result of:
            0.011955579 = score(doc=477,freq=1.0), product of:
              0.043470576 = queryWeight, product of:
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.012348386 = queryNorm
              0.27502692 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.06968318 = weight(abstract_txt:artikel in 477) [ClassicSimilarity], result of:
            0.06968318 = score(doc=477,freq=2.0), product of:
              0.111743845 = queryWeight, product of:
                1.6032975 = boost
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.012348386 = queryNorm
              0.62359744 = fieldWeight in 477, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.64416 = idf(docFreq=415, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.10021179 = weight(abstract_txt:universität in 477) [ClassicSimilarity], result of:
            0.10021179 = score(doc=477,freq=3.0), product of:
              0.12437138 = queryWeight, product of:
                1.6914631 = boost
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.012348386 = queryNorm
              0.8057464 = fieldWeight in 477, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.954533 = idf(docFreq=304, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.066901095 = weight(abstract_txt:beschreibt in 477) [ClassicSimilarity], result of:
            0.066901095 = score(doc=477,freq=1.0), product of:
              0.13701575 = queryWeight, product of:
                1.7753645 = boost
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.012348386 = queryNorm
              0.48827305 = fieldWeight in 477, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.249895 = idf(docFreq=226, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
          0.30020285 = weight(abstract_txt:hildesheim in 477) [ClassicSimilarity], result of:
            0.30020285 = score(doc=477,freq=3.0), product of:
              0.2584547 = queryWeight, product of:
                2.4383414 = boost
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.012348386 = queryNorm
              1.1615298 = fieldWeight in 477, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.583802 = idf(docFreq=21, maxDocs=43254)
                0.078125 = fieldNorm(doc=477)
        0.2 = coord(5/25)