Document (#30965)

Author
Jensen, N.
Title
Evaluierung von mehrsprachigem Web-Retrieval : Experimente mit dem EuroGOV-Korpus im Rahmen des Cross Language Evaluation Forum (CLEF)
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.235-244
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
Der vorliegende Artikel beschreibt die Experimente der Universität Hildesheim im Rahmen des ersten Web Track der CLEF-Initiative (WebCLEF) im Jahr 2005. Bei der Teilnahme konnten Erfahrungen mit einem multilingualen Web-Korpus (EuroGOV) bei der Vorverarbeitung, der Topic- bzw. Query-Entwicklung, bei sprachunabhängigen Indexierungsmethoden und multilingualen Retrieval-Strategien gesammelt werden. Aufgrund des großen Um-fangs des Korpus und der zeitlichen Einschränkungen wurden multilinguale Indizes aufgebaut. Der Artikel beschreibt die Vorgehensweise bei der Teilnahme der Universität Hildesheim und die Ergebnisse der offiziell eingereichten sowie weiterer Experimente. Für den Multilingual Task konnte das beste Ergebnis in CLEF erzielt werden.
Theme
Computerlinguistik
Multilinguale Probleme
Sprachretrieval
Object
CLEF

Similar documents (author)

  1. Jensen, E.A.: Systematischer oder alphabetischer Sachkatalog (1957) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:jensen in 805) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 805, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=805)
    
  2. Jensen, P.E.: Three methods of teaching basic subject cataloging (1985) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:jensen in 1739) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 1739, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=1739)
    
  3. Jensen, A.: INSPEC: Window on the world of the physical and applied sciences (1993) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:jensen in 5016) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 5016, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=5016)
    
  4. Jensen, M.B.: Enhancing catalogs with tables of contents and internal indexes (1993) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:jensen in 5551) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 5551, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=5551)
    
  5. Jensen, N.: Library cooperation in Denmark : a new model (1995) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:jensen in 2628) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 2628, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=2628)
    

Similar documents (content)

  1. Effektive Information Retrieval Verfahren in Theorie und Praxis : ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005 (2006) 0.21
    0.21376456 = sum of:
      0.21376456 = product of:
        0.8906857 = sum of:
          0.080636136 = weight(abstract_txt:evaluierung in 5973) [ClassicSimilarity], result of:
            0.080636136 = score(doc=5973,freq=1.0), product of:
              0.10901068 = queryWeight, product of:
                1.1233143 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012299243 = queryNorm
              0.7397086 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
          0.14072981 = weight(abstract_txt:multilinguale in 5973) [ClassicSimilarity], result of:
            0.14072981 = score(doc=5973,freq=1.0), product of:
              0.15801804 = queryWeight, product of:
                1.3524464 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.012299243 = queryNorm
              0.89059335 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
          0.06974503 = weight(abstract_txt:universität in 5973) [ClassicSimilarity], result of:
            0.06974503 = score(doc=5973,freq=1.0), product of:
              0.124681324 = queryWeight, product of:
                1.6989572 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.012299243 = queryNorm
              0.5593863 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
          0.07966763 = weight(abstract_txt:beschreibt in 5973) [ClassicSimilarity], result of:
            0.07966763 = score(doc=5973,freq=1.0), product of:
              0.13624288 = queryWeight, product of:
                1.7759824 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.012299243 = queryNorm
              0.5847471 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
          0.2060273 = weight(abstract_txt:hildesheim in 5973) [ClassicSimilarity], result of:
            0.2060273 = score(doc=5973,freq=1.0), product of:
              0.25669008 = queryWeight, product of:
                2.437734 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012299243 = queryNorm
              0.80263054 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
          0.31387976 = weight(abstract_txt:clef in 5973) [ClassicSimilarity], result of:
            0.31387976 = score(doc=5973,freq=1.0), product of:
              0.38904384 = queryWeight, product of:
                3.6755865 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012299243 = queryNorm
              0.8067979 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.09375 = fieldNorm(doc=5973)
        0.24 = coord(6/25)
    
  2. Elbeshausen, S.; Geist, K.; Pätsch, G.: Akzeptanz und Lernförderlichkeit von Microblogging im Hochschulkontext (2010) 0.17
    0.16530249 = sum of:
      0.16530249 = product of:
        0.590366 = sum of:
          0.05688868 = weight(abstract_txt:konnten in 4021) [ClassicSimilarity], result of:
            0.05688868 = score(doc=4021,freq=1.0), product of:
              0.0863906 = queryWeight, product of:
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.012299243 = queryNorm
              0.65850544 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.06744746 = weight(abstract_txt:weiterer in 4021) [ClassicSimilarity], result of:
            0.06744746 = score(doc=4021,freq=1.0), product of:
              0.09677421 = queryWeight, product of:
                1.058392 = boost
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.012299243 = queryNorm
              0.69695693 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.05233596 = weight(abstract_txt:rahmen in 4021) [ClassicSimilarity], result of:
            0.05233596 = score(doc=4021,freq=1.0), product of:
              0.10295782 = queryWeight, product of:
                1.5438725 = boost
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.012299243 = queryNorm
              0.50832427 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.058253963 = weight(abstract_txt:artikel in 4021) [ClassicSimilarity], result of:
            0.058253963 = score(doc=4021,freq=1.0), product of:
              0.11057991 = queryWeight, product of:
                1.5999995 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.012299243 = queryNorm
              0.5268042 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.06974503 = weight(abstract_txt:universität in 4021) [ClassicSimilarity], result of:
            0.06974503 = score(doc=4021,freq=1.0), product of:
              0.124681324 = queryWeight, product of:
                1.6989572 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.012299243 = queryNorm
              0.5593863 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.07966763 = weight(abstract_txt:beschreibt in 4021) [ClassicSimilarity], result of:
            0.07966763 = score(doc=4021,freq=1.0), product of:
              0.13624288 = queryWeight, product of:
                1.7759824 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.012299243 = queryNorm
              0.5847471 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
          0.2060273 = weight(abstract_txt:hildesheim in 4021) [ClassicSimilarity], result of:
            0.2060273 = score(doc=4021,freq=1.0), product of:
              0.25669008 = queryWeight, product of:
                2.437734 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012299243 = queryNorm
              0.80263054 = fieldWeight in 4021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.09375 = fieldNorm(doc=4021)
        0.28 = coord(7/25)
    
  3. Becks, D.; Mandl, T.; Womser-Hacker, C.: Spezielle Anforderungen bei der Evaluierung von Patent-Retrieval-Systemen (2010) 0.14
    0.13644715 = sum of:
      0.13644715 = product of:
        0.6822357 = sum of:
          0.09407549 = weight(abstract_txt:evaluierung in 4667) [ClassicSimilarity], result of:
            0.09407549 = score(doc=4667,freq=1.0), product of:
              0.10901068 = queryWeight, product of:
                1.1233143 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012299243 = queryNorm
              0.86299336 = fieldWeight in 4667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.109375 = fieldNorm(doc=4667)
          0.061058614 = weight(abstract_txt:rahmen in 4667) [ClassicSimilarity], result of:
            0.061058614 = score(doc=4667,freq=1.0), product of:
              0.10295782 = queryWeight, product of:
                1.5438725 = boost
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.012299243 = queryNorm
              0.59304494 = fieldWeight in 4667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.109375 = fieldNorm(doc=4667)
          0.06796296 = weight(abstract_txt:artikel in 4667) [ClassicSimilarity], result of:
            0.06796296 = score(doc=4667,freq=1.0), product of:
              0.11057991 = queryWeight, product of:
                1.5999995 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.012299243 = queryNorm
              0.61460495 = fieldWeight in 4667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.109375 = fieldNorm(doc=4667)
          0.09294556 = weight(abstract_txt:beschreibt in 4667) [ClassicSimilarity], result of:
            0.09294556 = score(doc=4667,freq=1.0), product of:
              0.13624288 = queryWeight, product of:
                1.7759824 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.012299243 = queryNorm
              0.6822049 = fieldWeight in 4667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.109375 = fieldNorm(doc=4667)
          0.36619306 = weight(abstract_txt:clef in 4667) [ClassicSimilarity], result of:
            0.36619306 = score(doc=4667,freq=1.0), product of:
              0.38904384 = queryWeight, product of:
                3.6755865 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012299243 = queryNorm
              0.9412643 = fieldWeight in 4667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.109375 = fieldNorm(doc=4667)
        0.2 = coord(5/25)
    
  4. Heinz, S.: Realisierung und Evaluierung eines virtuellen Bibliotheksregals für die Informationswissenschaft an der Universitätsbibliothek Hildesheim (2003) 0.13
    0.13387533 = sum of:
      0.13387533 = product of:
        0.6693766 = sum of:
          0.09407549 = weight(abstract_txt:evaluierung in 5982) [ClassicSimilarity], result of:
            0.09407549 = score(doc=5982,freq=1.0), product of:
              0.10901068 = queryWeight, product of:
                1.1233143 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.012299243 = queryNorm
              0.86299336 = fieldWeight in 5982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.109375 = fieldNorm(doc=5982)
          0.061058614 = weight(abstract_txt:rahmen in 5982) [ClassicSimilarity], result of:
            0.061058614 = score(doc=5982,freq=1.0), product of:
              0.10295782 = queryWeight, product of:
                1.5438725 = boost
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.012299243 = queryNorm
              0.59304494 = fieldWeight in 5982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4221253 = idf(docFreq=530, maxDocs=44218)
                0.109375 = fieldNorm(doc=5982)
          0.08136919 = weight(abstract_txt:universität in 5982) [ClassicSimilarity], result of:
            0.08136919 = score(doc=5982,freq=1.0), product of:
              0.124681324 = queryWeight, product of:
                1.6989572 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.012299243 = queryNorm
              0.65261734 = fieldWeight in 5982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.109375 = fieldNorm(doc=5982)
          0.09294556 = weight(abstract_txt:beschreibt in 5982) [ClassicSimilarity], result of:
            0.09294556 = score(doc=5982,freq=1.0), product of:
              0.13624288 = queryWeight, product of:
                1.7759824 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.012299243 = queryNorm
              0.6822049 = fieldWeight in 5982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.109375 = fieldNorm(doc=5982)
          0.3399277 = weight(abstract_txt:hildesheim in 5982) [ClassicSimilarity], result of:
            0.3399277 = score(doc=5982,freq=2.0), product of:
              0.25669008 = queryWeight, product of:
                2.437734 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012299243 = queryNorm
              1.3242729 = fieldWeight in 5982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.109375 = fieldNorm(doc=5982)
        0.2 = coord(5/25)
    
  5. Caroli, F.; Görtz, M.; Kölle, R.: Informationswissenschaftliche Lehre in den Bachelor und Masterstudiengängen "Internationales Informationsmanagement" an der Universität Hildesheim (2010) 0.09
    0.08529371 = sum of:
      0.08529371 = product of:
        0.5330857 = sum of:
          0.06865295 = weight(abstract_txt:artikel in 4012) [ClassicSimilarity], result of:
            0.06865295 = score(doc=4012,freq=2.0), product of:
              0.11057991 = queryWeight, product of:
                1.5999995 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.012299243 = queryNorm
              0.6208447 = fieldWeight in 4012, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.078125 = fieldNorm(doc=4012)
          0.100668274 = weight(abstract_txt:universität in 4012) [ClassicSimilarity], result of:
            0.100668274 = score(doc=4012,freq=3.0), product of:
              0.124681324 = queryWeight, product of:
                1.6989572 = boost
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.012299243 = queryNorm
              0.8074046 = fieldWeight in 4012, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9667873 = idf(docFreq=307, maxDocs=44218)
                0.078125 = fieldNorm(doc=4012)
          0.06638968 = weight(abstract_txt:beschreibt in 4012) [ClassicSimilarity], result of:
            0.06638968 = score(doc=4012,freq=1.0), product of:
              0.13624288 = queryWeight, product of:
                1.7759824 = boost
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.012299243 = queryNorm
              0.4872892 = fieldWeight in 4012, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.237302 = idf(docFreq=234, maxDocs=44218)
                0.078125 = fieldNorm(doc=4012)
          0.2973748 = weight(abstract_txt:hildesheim in 4012) [ClassicSimilarity], result of:
            0.2973748 = score(doc=4012,freq=3.0), product of:
              0.25669008 = queryWeight, product of:
                2.437734 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.012299243 = queryNorm
              1.1584975 = fieldWeight in 4012, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.078125 = fieldNorm(doc=4012)
        0.16 = coord(4/25)