Document (#23784)

Author
Danowski, P.
Voß, J.
Title
Wikipedia sammelt Metadaten
Source
Bibliotheksdienst. 39(2005) H.3, S.385
Year
2005
Abstract
Im Rahmen der Vorbereitung auf die Wikipedia-DVD, die zur Buchmesse in Leipzig erscheinen soll, wurden fast 30.000 Artikel der freien Enzyklopädie Wikipedia mit Personendaten versehen. Damit sind die biographischen Artikel erstmals mit strukturierten Metadaten versehen, die wie alle Inhalte des Projekts unter den Bedingungen der GFDL frei weiterverwendet werden können. Die Personendaten umfassen Angaben zu Namen, Geburtsdatum, Geburtsort, Sterbedatum und Sterbeort. Gleichzeitig wird eine Kurzbeschreibung zu den einzelnen Personen gespeichert. Bisher waren diese Daten nur im Fließtext und Personennamen nur in der Form "Vorname Nachname" abgespeichert. Da auf der DVD jedoch eine gezielte Suche nach Personen möglich sein soll, müssen die Namen und anderen Angaben einheitlich, wie es in bibliothekarischen Datenbanken die Regel ist, in der Form "Nachname, Vorname" angesetzt werden. Ziel der Sammlung von Personendaten ist die dokumentarische Erschließung aller biographischen Artikel. Da wie an der gesamten Wikipedia viele Freiwillige an diesem Prozess beteiligt sind, entsprechen die Ergebnisse sicherlich noch nicht professionellen Regelwerken wie RAK. Sie sind ein erster Schritt um die Wikipedia besser automatisch weiterverwendbar zu machen und somit neue Möglichkeiten der Anwendung zu erschließen. Die Personendaten wurden zum größten Teil in einer vom Verlag Directmedia Publishing ausgerichteten "Tagging-Party" vom 28. bis 30. Januar mit Hilfe eines selbst entwickelten Softwaretools direkt in Online-Enzyklopädie eingetragen. Dazu wurden alle Artikel angeschaut und Fehler in den Datenfeldern korrigiert. Die Strukturierung der Personendaten könnte noch wesentlich durch bestehende bibliothekarische Datenbanken wie die Personennormdatei (PND) verbessert werden. Bibliotheken könnten im Gegenzug die Informationen aus der Wikipedia zur Kataloganreicherung nutzen - beispielsweise zur Anzeige von Kurzbiographien zu einzelnen Autoren. Auch weitere Kooperationsmöglichkeiten sind denkbar. Bei Interesse können Sie sich an Jakob Voss oder Patrick Danowski wenden.
Footnote
Ansprechpartner: zu den Personendaten wikipedia@nichtich.de (Jakob Voß), patrick.danowski@web.de (Patrick Danowski); zur DVD info@directmedia.de; zu allgemeinen Fragen über die Wikipedia info@wikipedia.de. Informationen zur deutschsprachigen Wikipedia: http://www.wikipedia.de/
Theme
Informationsmittel
Internet
Object
Wikipedia

Similar documents (author)

  1. Danowski, J.A.: Network analysis of message content (1993) 5.55
    5.55006 = sum of:
      5.55006 = weight(author_txt:danowski in 908) [ClassicSimilarity], result of:
        5.55006 = fieldWeight in 908, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.8800955 = idf(docFreq=15, maxDocs=42306)
          0.625 = fieldNorm(doc=908)
    
  2. Danowski, P.: Kontext Open Access : Creative Commons (2012) 5.55
    5.55006 = sum of:
      5.55006 = weight(author_txt:danowski in 829) [ClassicSimilarity], result of:
        5.55006 = fieldWeight in 829, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.8800955 = idf(docFreq=15, maxDocs=42306)
          0.625 = fieldNorm(doc=829)
    
  3. Danowski, P.: Authority files and Web 2.0 : Wikipedia and the PND. An Example (2007) 5.55
    5.55006 = sum of:
      5.55006 = weight(author_txt:danowski in 3292) [ClassicSimilarity], result of:
        5.55006 = fieldWeight in 3292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.8800955 = idf(docFreq=15, maxDocs=42306)
          0.625 = fieldNorm(doc=3292)
    
  4. Danowski, P.: Step one: blow up the silo! : Open bibliographic data, the first step towards Linked Open Data (2010) 5.55
    5.55006 = sum of:
      5.55006 = weight(author_txt:danowski in 963) [ClassicSimilarity], result of:
        5.55006 = fieldWeight in 963, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.8800955 = idf(docFreq=15, maxDocs=42306)
          0.625 = fieldNorm(doc=963)
    
  5. Voß, J.; Danowski, P.: Bibliothek, Information und Dokumentation in der Wikipedia (2004) 4.44
    4.4400477 = sum of:
      4.4400477 = weight(author_txt:danowski in 4047) [ClassicSimilarity], result of:
        4.4400477 = fieldWeight in 4047, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.8800955 = idf(docFreq=15, maxDocs=42306)
          0.5 = fieldNorm(doc=4047)
    

Similar documents (content)

  1. Online-Enzyklopädie Wikipedia (2003) 0.23
    0.23445122 = sum of:
      0.23445122 = product of:
        0.73266006 = sum of:
          0.028781801 = weight(abstract_txt:alle in 2411) [ClassicSimilarity], result of:
            0.028781801 = score(doc=2411,freq=2.0), product of:
              0.10195393 = queryWeight, product of:
                1.1246394 = boost
                5.1102123 = idf(docFreq=693, maxDocs=42306)
                0.017739924 = queryNorm
              0.28230202 = fieldWeight in 2411, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1102123 = idf(docFreq=693, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.01743322 = weight(abstract_txt:werden in 2411) [ClassicSimilarity], result of:
            0.01743322 = score(doc=2411,freq=3.0), product of:
              0.07298671 = queryWeight, product of:
                1.1654102 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.017739924 = queryNorm
              0.23885474 = fieldWeight in 2411, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.05165208 = weight(abstract_txt:personen in 2411) [ClassicSimilarity], result of:
            0.05165208 = score(doc=2411,freq=1.0), product of:
              0.18969704 = queryWeight, product of:
                1.5340573 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.017739924 = queryNorm
              0.27228722 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.058019612 = weight(abstract_txt:enzyklopädie in 2411) [ClassicSimilarity], result of:
            0.058019612 = score(doc=2411,freq=1.0), product of:
              0.20498334 = queryWeight, product of:
                1.5946691 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.017739924 = queryNorm
              0.2830455 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.05831486 = weight(abstract_txt:angaben in 2411) [ClassicSimilarity], result of:
            0.05831486 = score(doc=2411,freq=1.0), product of:
              0.20567815 = queryWeight, product of:
                1.5973694 = boost
                7.258235 = idf(docFreq=80, maxDocs=42306)
                0.017739924 = queryNorm
              0.2835248 = fieldWeight in 2411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.258235 = idf(docFreq=80, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.042037286 = weight(abstract_txt:sind in 2411) [ClassicSimilarity], result of:
            0.042037286 = score(doc=2411,freq=5.0), product of:
              0.12183678 = queryWeight, product of:
                1.7386633 = boost
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.017739924 = queryNorm
              0.3450295 = fieldWeight in 2411, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.17362614 = weight(abstract_txt:artikel in 2411) [ClassicSimilarity], result of:
            0.17362614 = score(doc=2411,freq=10.0), product of:
              0.24893658 = queryWeight, product of:
                2.4852533 = boost
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.017739924 = queryNorm
              0.6974714 = fieldWeight in 2411, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
          0.30279508 = weight(abstract_txt:wikipedia in 2411) [ClassicSimilarity], result of:
            0.30279508 = score(doc=2411,freq=7.0), product of:
              0.4649886 = queryWeight, product of:
                4.1599975 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.017739924 = queryNorm
              0.6511882 = fieldWeight in 2411, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.0390625 = fieldNorm(doc=2411)
        0.32 = coord(8/25)
    
  2. Kleinz, T.: Wikipedia professionalisiert sich : Das Büro der deutschen Sektion soll im Oktober in Frankfurt eröffnen - Schreiber und Spender werden umworben (2006) 0.21
    0.20883515 = sum of:
      0.20883515 = product of:
        0.8701465 = sum of:
          0.02415618 = weight(abstract_txt:werden in 691) [ClassicSimilarity], result of:
            0.02415618 = score(doc=691,freq=1.0), product of:
              0.07298671 = queryWeight, product of:
                1.1654102 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.017739924 = queryNorm
              0.33096683 = fieldWeight in 691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
          0.05443904 = weight(abstract_txt:soll in 691) [ClassicSimilarity], result of:
            0.05443904 = score(doc=691,freq=1.0), product of:
              0.10959771 = queryWeight, product of:
                1.1660362 = boost
                5.298314 = idf(docFreq=574, maxDocs=42306)
                0.017739924 = queryNorm
              0.49671695 = fieldWeight in 691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.298314 = idf(docFreq=574, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
          0.13924707 = weight(abstract_txt:enzyklopädie in 691) [ClassicSimilarity], result of:
            0.13924707 = score(doc=691,freq=1.0), product of:
              0.20498334 = queryWeight, product of:
                1.5946691 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.017739924 = queryNorm
              0.67930925 = fieldWeight in 691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
          0.07750727 = weight(abstract_txt:wurden in 691) [ClassicSimilarity], result of:
            0.07750727 = score(doc=691,freq=1.0), product of:
              0.15877663 = queryWeight, product of:
                1.7188984 = boost
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.017739924 = queryNorm
              0.48815292 = fieldWeight in 691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2069645 = idf(docFreq=629, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
          0.18635511 = weight(abstract_txt:artikel in 691) [ClassicSimilarity], result of:
            0.18635511 = score(doc=691,freq=2.0), product of:
              0.24893658 = queryWeight, product of:
                2.4852533 = boost
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.017739924 = queryNorm
              0.7486048 = fieldWeight in 691, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
          0.38844186 = weight(abstract_txt:wikipedia in 691) [ClassicSimilarity], result of:
            0.38844186 = score(doc=691,freq=2.0), product of:
              0.4649886 = queryWeight, product of:
                4.1599975 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.017739924 = queryNorm
              0.83537936 = fieldWeight in 691, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.09375 = fieldNorm(doc=691)
        0.24 = coord(6/25)
    
  3. Ersch, J.S.; Gruber, J.G.: Allgemeine Encyclopädie der Wissenschaften und Künste (1996) 0.20
    0.19697607 = sum of:
      0.19697607 = product of:
        0.9848803 = sum of:
          0.09524195 = weight(abstract_txt:einzelnen in 1928) [ClassicSimilarity], result of:
            0.09524195 = score(doc=1928,freq=1.0), product of:
              0.1313573 = queryWeight, product of:
                1.2765517 = boost
                5.800482 = idf(docFreq=347, maxDocs=42306)
                0.017739924 = queryNorm
              0.7250602 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.800482 = idf(docFreq=347, maxDocs=42306)
                0.125 = fieldNorm(doc=1928)
          0.22994462 = weight(abstract_txt:versehen in 1928) [ClassicSimilarity], result of:
            0.22994462 = score(doc=1928,freq=1.0), product of:
              0.23640184 = queryWeight, product of:
                1.7125242 = boost
                7.781483 = idf(docFreq=47, maxDocs=42306)
                0.017739924 = queryNorm
              0.9726854 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.781483 = idf(docFreq=47, maxDocs=42306)
                0.125 = fieldNorm(doc=1928)
          0.08507748 = weight(abstract_txt:sind in 1928) [ClassicSimilarity], result of:
            0.08507748 = score(doc=1928,freq=2.0), product of:
              0.12183678 = queryWeight, product of:
                1.7386633 = boost
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.017739924 = queryNorm
              0.6982906 = fieldWeight in 1928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.125 = fieldNorm(doc=1928)
          0.39891905 = weight(abstract_txt:biographischen in 1928) [ClassicSimilarity], result of:
            0.39891905 = score(doc=1928,freq=1.0), product of:
              0.34131747 = queryWeight, product of:
                2.0577402 = boost
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.017739924 = queryNorm
              1.1687624 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.125 = fieldNorm(doc=1928)
          0.17569728 = weight(abstract_txt:artikel in 1928) [ClassicSimilarity], result of:
            0.17569728 = score(doc=1928,freq=1.0), product of:
              0.24893658 = queryWeight, product of:
                2.4852533 = boost
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.017739924 = queryNorm
              0.70579135 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.125 = fieldNorm(doc=1928)
        0.2 = coord(5/25)
    
  4. Portal "Bibliothek Information Dokumentation" eingestellt (2004) 0.19
    0.18905888 = sum of:
      0.18905888 = product of:
        1.1816181 = sum of:
          0.27849415 = weight(abstract_txt:enzyklopädie in 4294) [ClassicSimilarity], result of:
            0.27849415 = score(doc=4294,freq=1.0), product of:
              0.20498334 = queryWeight, product of:
                1.5946691 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.017739924 = queryNorm
              1.3586185 = fieldWeight in 4294, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.1875 = fieldNorm(doc=4294)
          0.09023829 = weight(abstract_txt:sind in 4294) [ClassicSimilarity], result of:
            0.09023829 = score(doc=4294,freq=1.0), product of:
              0.12183678 = queryWeight, product of:
                1.7386633 = boost
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.017739924 = queryNorm
              0.740649 = fieldWeight in 4294, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.1875 = fieldNorm(doc=4294)
          0.2635459 = weight(abstract_txt:artikel in 4294) [ClassicSimilarity], result of:
            0.2635459 = score(doc=4294,freq=1.0), product of:
              0.24893658 = queryWeight, product of:
                2.4852533 = boost
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.017739924 = queryNorm
              1.058687 = fieldWeight in 4294, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.646331 = idf(docFreq=405, maxDocs=42306)
                0.1875 = fieldNorm(doc=4294)
          0.5493398 = weight(abstract_txt:wikipedia in 4294) [ClassicSimilarity], result of:
            0.5493398 = score(doc=4294,freq=1.0), product of:
              0.4649886 = queryWeight, product of:
                4.1599975 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.017739924 = queryNorm
              1.1814048 = fieldWeight in 4294, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.1875 = fieldNorm(doc=4294)
        0.16 = coord(4/25)
    
  5. Stöcklin, N.: Wikipedia clever nutzen : in Schule und Beruf (2010) 0.17
    0.17310162 = sum of:
      0.17310162 = product of:
        0.72125673 = sum of:
          0.03256289 = weight(abstract_txt:alle in 1532) [ClassicSimilarity], result of:
            0.03256289 = score(doc=1532,freq=1.0), product of:
              0.10195393 = queryWeight, product of:
                1.1246394 = boost
                5.1102123 = idf(docFreq=693, maxDocs=42306)
                0.017739924 = queryNorm
              0.31938827 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1102123 = idf(docFreq=693, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
          0.022774665 = weight(abstract_txt:werden in 1532) [ClassicSimilarity], result of:
            0.022774665 = score(doc=1532,freq=2.0), product of:
              0.07298671 = queryWeight, product of:
                1.1654102 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.017739924 = queryNorm
              0.31203854 = fieldWeight in 1532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
          0.08264333 = weight(abstract_txt:personen in 1532) [ClassicSimilarity], result of:
            0.08264333 = score(doc=1532,freq=1.0), product of:
              0.18969704 = queryWeight, product of:
                1.5340573 = boost
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.017739924 = queryNorm
              0.43565956 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.970553 = idf(docFreq=107, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
          0.1312834 = weight(abstract_txt:enzyklopädie in 1532) [ClassicSimilarity], result of:
            0.1312834 = score(doc=1532,freq=2.0), product of:
              0.20498334 = queryWeight, product of:
                1.5946691 = boost
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.017739924 = queryNorm
              0.6404589 = fieldWeight in 1532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.245965 = idf(docFreq=81, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
          0.04253874 = weight(abstract_txt:sind in 1532) [ClassicSimilarity], result of:
            0.04253874 = score(doc=1532,freq=2.0), product of:
              0.12183678 = queryWeight, product of:
                1.7386633 = boost
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.017739924 = queryNorm
              0.3491453 = fieldWeight in 1532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.950128 = idf(docFreq=2213, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
          0.40945372 = weight(abstract_txt:wikipedia in 1532) [ClassicSimilarity], result of:
            0.40945372 = score(doc=1532,freq=5.0), product of:
              0.4649886 = queryWeight, product of:
                4.1599975 = boost
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.017739924 = queryNorm
              0.88056725 = fieldWeight in 1532, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.300826 = idf(docFreq=210, maxDocs=42306)
                0.0625 = fieldNorm(doc=1532)
        0.24 = coord(6/25)