Document (#40082)

Author
Tavakolizadeh-Ravari, M.
Title
Analysis of the long term dynamics in thesaurus developments and its consequences
Source
http://edoc.hu-berlin.de/docviews/abstract.php?id=28144
Imprint
Berlin : Humboldt-Universität zu Berlin / Institut für Bibliotheks- und Informationswissenschaft
Year
2017
Pages
128 S
Abstract
Die Arbeit analysiert die dynamische Entwicklung und den Gebrauch von Thesaurusbegriffen. Zusätzlich konzentriert sie sich auf die Faktoren, die die Zahl von Indexbegriffen pro Dokument oder Zeitschrift beeinflussen. Als Untersuchungsobjekt dienten der MeSH und die entsprechende Datenbank "MEDLINE". Die wichtigsten Konsequenzen sind: 1. Der MeSH-Thesaurus hat sich durch drei unterschiedliche Phasen jeweils logarithmisch entwickelt. Solch einen Thesaurus sollte folgenden Gleichung folgen: "T = 3.076,6 Ln (d) - 22.695 + 0,0039d" (T = Begriffe, Ln = natürlicher Logarithmus und d = Dokumente). Um solch einen Thesaurus zu konstruieren, muss man demnach etwa 1.600 Dokumente von unterschiedlichen Themen des Bereiches des Thesaurus haben. Die dynamische Entwicklung von Thesauri wie MeSH erfordert die Einführung eines neuen Begriffs pro Indexierung von 256 neuen Dokumenten. 2. Die Verteilung der Thesaurusbegriffe erbrachte drei Kategorien: starke, normale und selten verwendete Headings. Die letzte Gruppe ist in einer Testphase, während in der ersten und zweiten Kategorie die neu hinzukommenden Deskriptoren zu einem Thesauruswachstum führen. 3. Es gibt ein logarithmisches Verhältnis zwischen der Zahl von Index-Begriffen pro Aufsatz und dessen Seitenzahl für die Artikeln zwischen einer und einundzwanzig Seiten. 4. Zeitschriftenaufsätze, die in MEDLINE mit Abstracts erscheinen erhalten fast zwei Deskriptoren mehr. 5. Die Findablity der nicht-englisch sprachigen Dokumente in MEDLINE ist geringer als die englische Dokumente. 6. Aufsätze der Zeitschriften mit einem Impact Factor 0 bis fünfzehn erhalten nicht mehr Indexbegriffe als die der anderen von MEDINE erfassten Zeitschriften. 7. In einem Indexierungssystem haben unterschiedliche Zeitschriften mehr oder weniger Gewicht in ihrem Findability. Die Verteilung der Indexbegriffe pro Seite hat gezeigt, dass es bei MEDLINE drei Kategorien der Publikationen gibt. Außerdem gibt es wenige stark bevorzugten Zeitschriften."
Content
Vgl.: https://www.ibi.hu-berlin.de/de/archiv/forschung/prom_habil/dissertationen/Tavakolizadeh-Ravari2007. Vgl. auch: http://mravari.blogfa.com/post-20.aspxgl.
Footnote
Dissertation, Humboldt-Universität zu Berlin - Institut für Bibliotheks- und Informationswissenschaft.
Theme
Konzeption und Anwendung des Prinzips Thesaurus
Informetrie
Automatisches Indexieren
Object
MEDLINE
MeSH

Similar documents (content)

  1. Berg-Schorn, E.: MeSH 2006: Deutsche Version lieferbar (2006) 0.21
    0.21440387 = sum of:
      0.21440387 = product of:
        0.7657281 = sum of:
          0.029460313 = weight(abstract_txt:neuen in 5959) [ClassicSimilarity], result of:
            0.029460313 = score(doc=5959,freq=1.0), product of:
              0.09225561 = queryWeight, product of:
                1.0625144 = boost
                5.1093373 = idf(docFreq=725, maxDocs=44218)
                0.016993912 = queryNorm
              0.31933358 = fieldWeight in 5959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1093373 = idf(docFreq=725, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.071323216 = weight(abstract_txt:zahl in 5959) [ClassicSimilarity], result of:
            0.071323216 = score(doc=5959,freq=1.0), product of:
              0.16633685 = queryWeight, product of:
                1.4267 = boost
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.016993912 = queryNorm
              0.42878783 = fieldWeight in 5959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.07259353 = weight(abstract_txt:erhalten in 5959) [ClassicSimilarity], result of:
            0.07259353 = score(doc=5959,freq=1.0), product of:
              0.16830608 = queryWeight, product of:
                1.4351205 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.016993912 = queryNorm
              0.43131855 = fieldWeight in 5959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.09705552 = weight(abstract_txt:kategorien in 5959) [ClassicSimilarity], result of:
            0.09705552 = score(doc=5959,freq=1.0), product of:
              0.20425907 = queryWeight, product of:
                1.58099 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.016993912 = queryNorm
              0.47515893 = fieldWeight in 5959, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.18190062 = weight(abstract_txt:deskriptoren in 5959) [ClassicSimilarity], result of:
            0.18190062 = score(doc=5959,freq=3.0), product of:
              0.21528655 = queryWeight, product of:
                1.623106 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.016993912 = queryNorm
              0.84492326 = fieldWeight in 5959, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.18153584 = weight(abstract_txt:mesh in 5959) [ClassicSimilarity], result of:
            0.18153584 = score(doc=5959,freq=2.0), product of:
              0.2817279 = queryWeight, product of:
                2.2740448 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.016993912 = queryNorm
              0.64436585 = fieldWeight in 5959, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
          0.13185903 = weight(abstract_txt:thesaurus in 5959) [ClassicSimilarity], result of:
            0.13185903 = score(doc=5959,freq=3.0), product of:
              0.23578383 = queryWeight, product of:
                2.6857493 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.016993912 = queryNorm
              0.55923694 = fieldWeight in 5959, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=5959)
        0.28 = coord(7/25)
    
  2. Plott, C.; Ball, R.: Mit Sicherheit zum Dokument : Die Identifizierung von Online-Publikationen (2004) 0.12
    0.12107842 = sum of:
      0.12107842 = product of:
        0.5044934 = sum of:
          0.02701139 = weight(abstract_txt:einem in 2549) [ClassicSimilarity], result of:
            0.02701139 = score(doc=2549,freq=1.0), product of:
              0.09966964 = queryWeight, product of:
                1.3525879 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.016993912 = queryNorm
              0.2710092 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
          0.0644952 = weight(abstract_txt:unterschiedliche in 2549) [ClassicSimilarity], result of:
            0.0644952 = score(doc=2549,freq=1.0), product of:
              0.15554383 = queryWeight, product of:
                1.379637 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.016993912 = queryNorm
              0.41464326 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
          0.071323216 = weight(abstract_txt:zahl in 2549) [ClassicSimilarity], result of:
            0.071323216 = score(doc=2549,freq=1.0), product of:
              0.16633685 = queryWeight, product of:
                1.4267 = boost
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.016993912 = queryNorm
              0.42878783 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
          0.037746314 = weight(abstract_txt:mehr in 2549) [ClassicSimilarity], result of:
            0.037746314 = score(doc=2549,freq=1.0), product of:
              0.1245799 = queryWeight, product of:
                1.512196 = boost
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.016993912 = queryNorm
              0.3029888 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
          0.062243238 = weight(abstract_txt:gibt in 2549) [ClassicSimilarity], result of:
            0.062243238 = score(doc=2549,freq=2.0), product of:
              0.13801186 = queryWeight, product of:
                1.5916306 = boost
                5.1024737 = idf(docFreq=730, maxDocs=44218)
                0.016993912 = queryNorm
              0.4509992 = fieldWeight in 2549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1024737 = idf(docFreq=730, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
          0.24167408 = weight(abstract_txt:dokumente in 2549) [ClassicSimilarity], result of:
            0.24167408 = score(doc=2549,freq=5.0), product of:
              0.2764869 = queryWeight, product of:
                2.6013017 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016993912 = queryNorm
              0.8740887 = fieldWeight in 2549, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=2549)
        0.24 = coord(6/25)
    
  3. Mayr, P.; Walter, A.-K.: Abdeckung und Aktualität des Suchdienstes Google Scholar (2006) 0.11
    0.11352674 = sum of:
      0.11352674 = product of:
        0.5676337 = sum of:
          0.05207897 = weight(abstract_txt:neuen in 5131) [ClassicSimilarity], result of:
            0.05207897 = score(doc=5131,freq=2.0), product of:
              0.09225561 = queryWeight, product of:
                1.0625144 = boost
                5.1093373 = idf(docFreq=725, maxDocs=44218)
                0.016993912 = queryNorm
              0.56450737 = fieldWeight in 5131, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1093373 = idf(docFreq=725, maxDocs=44218)
                0.078125 = fieldNorm(doc=5131)
          0.03376424 = weight(abstract_txt:einem in 5131) [ClassicSimilarity], result of:
            0.03376424 = score(doc=5131,freq=1.0), product of:
              0.09966964 = queryWeight, product of:
                1.3525879 = boost
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.016993912 = queryNorm
              0.3387615 = fieldWeight in 5131, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3361473 = idf(docFreq=1572, maxDocs=44218)
                0.078125 = fieldNorm(doc=5131)
          0.08220118 = weight(abstract_txt:drei in 5131) [ClassicSimilarity], result of:
            0.08220118 = score(doc=5131,freq=1.0), product of:
              0.18037526 = queryWeight, product of:
                1.8195859 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.016993912 = queryNorm
              0.45572314 = fieldWeight in 5131, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=5131)
          0.1350999 = weight(abstract_txt:dokumente in 5131) [ClassicSimilarity], result of:
            0.1350999 = score(doc=5131,freq=1.0), product of:
              0.2764869 = queryWeight, product of:
                2.6013017 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.016993912 = queryNorm
              0.4886304 = fieldWeight in 5131, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=5131)
          0.2644894 = weight(abstract_txt:zeitschriften in 5131) [ClassicSimilarity], result of:
            0.2644894 = score(doc=5131,freq=3.0), product of:
              0.30001038 = queryWeight, product of:
                2.7097023 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.016993912 = queryNorm
              0.88160086 = fieldWeight in 5131, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=5131)
        0.2 = coord(5/25)
    
  4. Böll, S.K.: Informations- und bibliothekswissenschaftliche Zeitschriften in Literaturdatenbanken (2010) 0.11
    0.10694114 = sum of:
      0.10694114 = product of:
        0.66838217 = sum of:
          0.1213194 = weight(abstract_txt:kategorien in 3234) [ClassicSimilarity], result of:
            0.1213194 = score(doc=3234,freq=1.0), product of:
              0.20425907 = queryWeight, product of:
                1.58099 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.016993912 = queryNorm
              0.59394866 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=3234)
          0.055015773 = weight(abstract_txt:gibt in 3234) [ClassicSimilarity], result of:
            0.055015773 = score(doc=3234,freq=1.0), product of:
              0.13801186 = queryWeight, product of:
                1.5916306 = boost
                5.1024737 = idf(docFreq=730, maxDocs=44218)
                0.016993912 = queryNorm
              0.39863077 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1024737 = idf(docFreq=730, maxDocs=44218)
                0.078125 = fieldNorm(doc=3234)
          0.15059264 = weight(abstract_txt:verteilung in 3234) [ClassicSimilarity], result of:
            0.15059264 = score(doc=3234,freq=1.0), product of:
              0.23591942 = queryWeight, product of:
                1.6991053 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016993912 = queryNorm
              0.63832235 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.078125 = fieldNorm(doc=3234)
          0.3414544 = weight(abstract_txt:zeitschriften in 3234) [ClassicSimilarity], result of:
            0.3414544 = score(doc=3234,freq=5.0), product of:
              0.30001038 = queryWeight, product of:
                2.7097023 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.016993912 = queryNorm
              1.1381419 = fieldWeight in 3234, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=3234)
        0.16 = coord(4/25)
    
  5. Chen, X.: Indexing consistency between online catalogues (2008) 0.10
    0.102730066 = sum of:
      0.102730066 = product of:
        0.42804196 = sum of:
          0.028193591 = weight(abstract_txt:entwicklung in 2209) [ClassicSimilarity], result of:
            0.028193591 = score(doc=2209,freq=1.0), product of:
              0.08959177 = queryWeight, product of:
                1.0470623 = boost
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.016993912 = queryNorm
              0.31468952 = fieldWeight in 2209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0350323 = idf(docFreq=781, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
          0.04017829 = weight(abstract_txt:zwischen in 2209) [ClassicSimilarity], result of:
            0.04017829 = score(doc=2209,freq=2.0), product of:
              0.09005037 = queryWeight, product of:
                1.0497386 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.016993912 = queryNorm
              0.44617575 = fieldWeight in 2209, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
          0.037746314 = weight(abstract_txt:mehr in 2209) [ClassicSimilarity], result of:
            0.037746314 = score(doc=2209,freq=1.0), product of:
              0.1245799 = queryWeight, product of:
                1.512196 = boost
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.016993912 = queryNorm
              0.3029888 = fieldWeight in 2209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8478208 = idf(docFreq=942, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
          0.105020374 = weight(abstract_txt:deskriptoren in 2209) [ClassicSimilarity], result of:
            0.105020374 = score(doc=2209,freq=1.0), product of:
              0.21528655 = queryWeight, product of:
                1.623106 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.016993912 = queryNorm
              0.4878167 = fieldWeight in 2209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
          0.14077455 = weight(abstract_txt:solch in 2209) [ClassicSimilarity], result of:
            0.14077455 = score(doc=2209,freq=1.0), product of:
              0.26172826 = queryWeight, product of:
                1.789632 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.016993912 = queryNorm
              0.5378653 = fieldWeight in 2209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
          0.07612884 = weight(abstract_txt:thesaurus in 2209) [ClassicSimilarity], result of:
            0.07612884 = score(doc=2209,freq=1.0), product of:
              0.23578383 = queryWeight, product of:
                2.6857493 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.016993912 = queryNorm
              0.3228756 = fieldWeight in 2209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=2209)
        0.24 = coord(6/25)