Document (#30966)

Author
Jensen, N.
Title
Evaluierung von mehrsprachigem Web-Retrieval : Experimente mit dem EuroGOV-Korpus im Rahmen des Cross Language Evaluation Forum (CLEF)
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.235-244
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
Der vorliegende Artikel beschreibt die Experimente der Universität Hildesheim im Rahmen des ersten Web Track der CLEF-Initiative (WebCLEF) im Jahr 2005. Bei der Teilnahme konnten Erfahrungen mit einem multilingualen Web-Korpus (EuroGOV) bei der Vorverarbeitung, der Topic- bzw. Query-Entwicklung, bei sprachunabhängigen Indexierungsmethoden und multilingualen Retrieval-Strategien gesammelt werden. Aufgrund des großen Um-fangs des Korpus und der zeitlichen Einschränkungen wurden multilinguale Indizes aufgebaut. Der Artikel beschreibt die Vorgehensweise bei der Teilnahme der Universität Hildesheim und die Ergebnisse der offiziell eingereichten sowie weiterer Experimente. Für den Multilingual Task konnte das beste Ergebnis in CLEF erzielt werden.
Theme
Computerlinguistik
Multilinguale Probleme
Sprachretrieval
Object
CLEF

Similar documents (author)

  1. Jensen, E.A.: Systematischer oder alphabetischer Sachkatalog (1957) 5.33
    5.3296227 = sum of:
      5.3296227 = weight(author_txt:jensen in 805) [ClassicSimilarity], result of:
        5.3296227 = fieldWeight in 805, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.527396 = idf(docFreq=22, maxDocs=42740)
          0.625 = fieldNorm(doc=805)
    
  2. Jensen, P.E.: Three methods of teaching basic subject cataloging (1985) 5.33
    5.3296227 = sum of:
      5.3296227 = weight(author_txt:jensen in 1739) [ClassicSimilarity], result of:
        5.3296227 = fieldWeight in 1739, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.527396 = idf(docFreq=22, maxDocs=42740)
          0.625 = fieldNorm(doc=1739)
    
  3. Jensen, A.: INSPEC: Window on the world of the physical and applied sciences (1993) 5.33
    5.3296227 = sum of:
      5.3296227 = weight(author_txt:jensen in 5016) [ClassicSimilarity], result of:
        5.3296227 = fieldWeight in 5016, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.527396 = idf(docFreq=22, maxDocs=42740)
          0.625 = fieldNorm(doc=5016)
    
  4. Jensen, M.B.: Enhancing catalogs with tables of contents and internal indexes (1993) 5.33
    5.3296227 = sum of:
      5.3296227 = weight(author_txt:jensen in 5551) [ClassicSimilarity], result of:
        5.3296227 = fieldWeight in 5551, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.527396 = idf(docFreq=22, maxDocs=42740)
          0.625 = fieldNorm(doc=5551)
    
  5. Jensen, N.: Library cooperation in Denmark : a new model (1995) 5.33
    5.3296227 = sum of:
      5.3296227 = weight(author_txt:jensen in 2697) [ClassicSimilarity], result of:
        5.3296227 = fieldWeight in 2697, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.527396 = idf(docFreq=22, maxDocs=42740)
          0.625 = fieldNorm(doc=2697)
    

Similar documents (content)

  1. Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.22
    0.2168212 = sum of:
      0.2168212 = product of:
        0.90342164 = sum of:
          0.08190116 = weight(abstract_txt:evaluierung in 974) [ClassicSimilarity], result of:
            0.08190116 = score(doc=974,freq=1.0), product of:
              0.11023193 = queryWeight, product of:
                1.1242666 = boost
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.012371623 = queryNorm
              0.7429895 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
          0.14481764 = weight(abstract_txt:multilinguale in 974) [ClassicSimilarity], result of:
            0.14481764 = score(doc=974,freq=1.0), product of:
              0.16118638 = queryWeight, product of:
                1.3595018 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.012371623 = queryNorm
              0.89844835 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
          0.06951873 = weight(abstract_txt:universität in 974) [ClassicSimilarity], result of:
            0.06951873 = score(doc=974,freq=1.0), product of:
              0.124506466 = queryWeight, product of:
                1.6897662 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.012371623 = queryNorm
              0.5583544 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
          0.08055965 = weight(abstract_txt:beschreibt in 974) [ClassicSimilarity], result of:
            0.08055965 = score(doc=974,freq=1.0), product of:
              0.13736278 = queryWeight, product of:
                1.7748643 = boost
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.012371623 = queryNorm
              0.58647364 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
          0.2106498 = weight(abstract_txt:hildesheim in 974) [ClassicSimilarity], result of:
            0.2106498 = score(doc=974,freq=1.0), product of:
              0.26071423 = queryWeight, product of:
                2.445192 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              0.807972 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
          0.31597468 = weight(abstract_txt:clef in 974) [ClassicSimilarity], result of:
            0.31597468 = score(doc=974,freq=1.0), product of:
              0.39107132 = queryWeight, product of:
                3.6677883 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              0.807972 = fieldWeight in 974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.09375 = fieldNorm(doc=974)
        0.24 = coord(6/25)
    
  2. Becks, D.; Mandl, T.; Womser-Hacker, C.: Spezielle Anforderungen bei der Evaluierung von Patent-Retrieval-Systemen (2010) 0.17
    0.16931982 = sum of:
      0.16931982 = product of:
        0.7054993 = sum of:
          0.016810045 = weight(abstract_txt:werden in 1668) [ClassicSimilarity], result of:
            0.016810045 = score(doc=1668,freq=1.0), product of:
              0.043605246 = queryWeight, product of:
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.012371623 = queryNorm
              0.38550508 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
          0.09555136 = weight(abstract_txt:evaluierung in 1668) [ClassicSimilarity], result of:
            0.09555136 = score(doc=1668,freq=1.0), product of:
              0.11023193 = queryWeight, product of:
                1.1242666 = boost
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.012371623 = queryNorm
              0.86682105 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
          0.061480626 = weight(abstract_txt:rahmen in 1668) [ClassicSimilarity], result of:
            0.061480626 = score(doc=1668,freq=1.0), product of:
              0.10351065 = queryWeight, product of:
                1.5407181 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.012371623 = queryNorm
              0.5939546 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
          0.069033876 = weight(abstract_txt:artikel in 1668) [ClassicSimilarity], result of:
            0.069033876 = score(doc=1668,freq=1.0), product of:
              0.11182383 = queryWeight, product of:
                1.6013926 = boost
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.012371623 = queryNorm
              0.617345 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
          0.09398626 = weight(abstract_txt:beschreibt in 1668) [ClassicSimilarity], result of:
            0.09398626 = score(doc=1668,freq=1.0), product of:
              0.13736278 = queryWeight, product of:
                1.7748643 = boost
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.012371623 = queryNorm
              0.68421924 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
          0.3686371 = weight(abstract_txt:clef in 1668) [ClassicSimilarity], result of:
            0.3686371 = score(doc=1668,freq=1.0), product of:
              0.39107132 = queryWeight, product of:
                3.6677883 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              0.942634 = fieldWeight in 1668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.109375 = fieldNorm(doc=1668)
        0.24 = coord(6/25)
    
  3. Elbeshausen, S.; Geist, K.; Pätsch, G.: Akzeptanz und Lernförderlichkeit von Microblogging im Hochschulkontext (2010) 0.16
    0.15692006 = sum of:
      0.15692006 = product of:
        0.5604288 = sum of:
          0.02037685 = weight(abstract_txt:werden in 1022) [ClassicSimilarity], result of:
            0.02037685 = score(doc=1022,freq=2.0), product of:
              0.043605246 = queryWeight, product of:
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.012371623 = queryNorm
              0.46730274 = fieldWeight in 1022, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.06745421 = weight(abstract_txt:weiterer in 1022) [ClassicSimilarity], result of:
            0.06745421 = score(doc=1022,freq=1.0), product of:
              0.09685456 = queryWeight, product of:
                1.0538424 = boost
                7.428784 = idf(docFreq=68, maxDocs=42740)
                0.012371623 = queryNorm
              0.6964485 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.428784 = idf(docFreq=68, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.05269768 = weight(abstract_txt:rahmen in 1022) [ClassicSimilarity], result of:
            0.05269768 = score(doc=1022,freq=1.0), product of:
              0.10351065 = queryWeight, product of:
                1.5407181 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.012371623 = queryNorm
              0.50910395 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.0591719 = weight(abstract_txt:artikel in 1022) [ClassicSimilarity], result of:
            0.0591719 = score(doc=1022,freq=1.0), product of:
              0.11182383 = queryWeight, product of:
                1.6013926 = boost
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.012371623 = queryNorm
              0.52915287 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.06951873 = weight(abstract_txt:universität in 1022) [ClassicSimilarity], result of:
            0.06951873 = score(doc=1022,freq=1.0), product of:
              0.124506466 = queryWeight, product of:
                1.6897662 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.012371623 = queryNorm
              0.5583544 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.08055965 = weight(abstract_txt:beschreibt in 1022) [ClassicSimilarity], result of:
            0.08055965 = score(doc=1022,freq=1.0), product of:
              0.13736278 = queryWeight, product of:
                1.7748643 = boost
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.012371623 = queryNorm
              0.58647364 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
          0.2106498 = weight(abstract_txt:hildesheim in 1022) [ClassicSimilarity], result of:
            0.2106498 = score(doc=1022,freq=1.0), product of:
              0.26071423 = queryWeight, product of:
                2.445192 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              0.807972 = fieldWeight in 1022, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.09375 = fieldNorm(doc=1022)
        0.28 = coord(7/25)
    
  4. Heinz, S.: Realisierung und Evaluierung eines virtuellen Bibliotheksregals für die Informationswissenschaft an der Universitätsbibliothek Hildesheim (2003) 0.14
    0.13593557 = sum of:
      0.13593557 = product of:
        0.67967784 = sum of:
          0.09555136 = weight(abstract_txt:evaluierung in 983) [ClassicSimilarity], result of:
            0.09555136 = score(doc=983,freq=1.0), product of:
              0.11023193 = queryWeight, product of:
                1.1242666 = boost
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.012371623 = queryNorm
              0.86682105 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.925221 = idf(docFreq=41, maxDocs=42740)
                0.109375 = fieldNorm(doc=983)
          0.061480626 = weight(abstract_txt:rahmen in 983) [ClassicSimilarity], result of:
            0.061480626 = score(doc=983,freq=1.0), product of:
              0.10351065 = queryWeight, product of:
                1.5407181 = boost
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.012371623 = queryNorm
              0.5939546 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4304423 = idf(docFreq=508, maxDocs=42740)
                0.109375 = fieldNorm(doc=983)
          0.08110519 = weight(abstract_txt:universität in 983) [ClassicSimilarity], result of:
            0.08110519 = score(doc=983,freq=1.0), product of:
              0.124506466 = queryWeight, product of:
                1.6897662 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.012371623 = queryNorm
              0.65141344 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.109375 = fieldNorm(doc=983)
          0.09398626 = weight(abstract_txt:beschreibt in 983) [ClassicSimilarity], result of:
            0.09398626 = score(doc=983,freq=1.0), product of:
              0.13736278 = queryWeight, product of:
                1.7748643 = boost
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.012371623 = queryNorm
              0.68421924 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.109375 = fieldNorm(doc=983)
          0.34755445 = weight(abstract_txt:hildesheim in 983) [ClassicSimilarity], result of:
            0.34755445 = score(doc=983,freq=2.0), product of:
              0.26071423 = queryWeight, product of:
                2.445192 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              1.3330858 = fieldWeight in 983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.109375 = fieldNorm(doc=983)
        0.2 = coord(5/25)
    
  5. Caroli, F.; Görtz, M.; Kölle, R.: Informationswissenschaftliche Lehre in den Bachelor und Masterstudiengängen "Internationales Informationsmanagement" an der Universität Hildesheim (2010) 0.11
    0.110652685 = sum of:
      0.110652685 = product of:
        0.5532634 = sum of:
          0.012007174 = weight(abstract_txt:werden in 1013) [ClassicSimilarity], result of:
            0.012007174 = score(doc=1013,freq=1.0), product of:
              0.043605246 = queryWeight, product of:
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.012371623 = queryNorm
              0.27536076 = fieldWeight in 1013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.524618 = idf(docFreq=3422, maxDocs=42740)
                0.078125 = fieldNorm(doc=1013)
          0.06973475 = weight(abstract_txt:artikel in 1013) [ClassicSimilarity], result of:
            0.06973475 = score(doc=1013,freq=2.0), product of:
              0.11182383 = queryWeight, product of:
                1.6013926 = boost
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.012371623 = queryNorm
              0.62361264 = fieldWeight in 1013, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.644297 = idf(docFreq=410, maxDocs=42740)
                0.078125 = fieldNorm(doc=1013)
          0.10034164 = weight(abstract_txt:universität in 1013) [ClassicSimilarity], result of:
            0.10034164 = score(doc=1013,freq=3.0), product of:
              0.124506466 = queryWeight, product of:
                1.6897662 = boost
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.012371623 = queryNorm
              0.8059151 = fieldWeight in 1013, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.95578 = idf(docFreq=300, maxDocs=42740)
                0.078125 = fieldNorm(doc=1013)
          0.06713304 = weight(abstract_txt:beschreibt in 1013) [ClassicSimilarity], result of:
            0.06713304 = score(doc=1013,freq=1.0), product of:
              0.13736278 = queryWeight, product of:
                1.7748643 = boost
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.012371623 = queryNorm
              0.48872802 = fieldWeight in 1013, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2557187 = idf(docFreq=222, maxDocs=42740)
                0.078125 = fieldNorm(doc=1013)
          0.30404678 = weight(abstract_txt:hildesheim in 1013) [ClassicSimilarity], result of:
            0.30404678 = score(doc=1013,freq=3.0), product of:
              0.26071423 = queryWeight, product of:
                2.445192 = boost
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.012371623 = queryNorm
              1.1662071 = fieldWeight in 1013, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.618368 = idf(docFreq=20, maxDocs=42740)
                0.078125 = fieldNorm(doc=1013)
        0.2 = coord(5/25)