Document (#40083)

Author
Tavakolizadeh-Ravari, M.
Title
Analysis of the long term dynamics in thesaurus developments and its consequences
Source
http://edoc.hu-berlin.de/docviews/abstract.php?id=28144
Imprint
Berlin : Humboldt-Universität zu Berlin / Institut für Bibliotheks- und Informationswissenschaft
Year
2017
Pages
128 pp.
Abstract
This thesis analyzes the dynamic development and use of thesaurus terms. It also examines the factors that influence the number of index terms per document or journal. MeSH and the corresponding database MEDLINE served as the objects of study. The main findings are: 1. The MeSH thesaurus has developed through three distinct phases, each of them logarithmic. Such a thesaurus should follow the equation "T = 3,076.6 ln(d) - 22,695 + 0.0039d" (T = terms, ln = natural logarithm, d = documents). Constructing such a thesaurus therefore requires about 1,600 documents on different topics within the thesaurus's subject area. The dynamic development of thesauri such as MeSH requires the introduction of one new term for every 256 newly indexed documents. 2. The distribution of thesaurus terms yielded three categories: heavily used, normally used, and rarely used headings. The last group is in a test phase, whereas in the first and second categories newly added descriptors lead to thesaurus growth. 3. There is a logarithmic relationship between the number of index terms per article and its page count, for articles of between one and twenty-one pages. 4. Journal articles that appear in MEDLINE with abstracts receive almost two more descriptors. 5. The findability of non-English documents in MEDLINE is lower than that of English documents. 6. Articles from journals with an impact factor of 0 to 15 do not receive more index terms than articles from the other journals covered by MEDLINE. 7. In an indexing system, different journals carry more or less weight in their findability. The distribution of index terms per page showed that MEDLINE has three categories of publications. In addition, there are a few strongly favored journals.
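The growth equation quoted in the abstract can be checked numerically. The following is a minimal sketch, assuming the coefficients are read from German number notation (3.076,6 = 3076.6; 22.695 = 22695; 0,0039 = 0.0039). The function names `thesaurus_terms` and `find_zero` are illustrative and do not come from the thesis. The sketch looks for the document count at which T crosses zero, which should lie near the "about 1,600 documents" stated in the abstract, and it derives the long-run rate of one new term per 1/0.0039 documents.

```python
import math

# Coefficients from the abstract, converted from German number notation.
A = 3076.6   # weight of the logarithmic term
B = 22695.0  # constant offset
C = 0.0039   # linear growth in terms per indexed document

def thesaurus_terms(d: float) -> float:
    """Evaluate T = A * ln(d) - B + C * d for d indexed documents."""
    return A * math.log(d) - B + C * d

def find_zero(lo: float = 1000.0, hi: float = 3000.0, tol: float = 1e-6) -> float:
    """Bisection search for the d where T(d) = 0 (T is increasing in d)."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if thesaurus_terms(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

# For large d the logarithmic term flattens out, so growth approaches
# C terms per document, i.e. one new term per 1 / C documents.
docs_per_new_term = 1.0 / C

if __name__ == "__main__":
    print("T reaches zero near d =", round(find_zero()))
    print("documents per new term (asymptotic):", round(docs_per_new_term))
```

Under this reading, the zero crossing and the asymptotic rate both agree with the figures given in the abstract (roughly 1,600 documents and 256 documents per new term).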
Content
Cf.: https://www.ibi.hu-berlin.de/de/archiv/forschung/prom_habil/dissertationen/Tavakolizadeh-Ravari2007. Cf. also: http://mravari.blogfa.com/post-20.aspxgl.
Footnote
Dissertation, Humboldt-Universität zu Berlin - Institut für Bibliotheks- und Informationswissenschaft.
Theme
Design and application of the thesaurus principle
Informetrics
Automatic indexing
Object
MEDLINE
MeSH

Similar documents (content)

  1. Berg-Schorn, E.: MeSH 2006: Deutsche Version lieferbar (2006) 0.22
    0.2152698 = sum of:
      0.2152698 = product of:
        0.7688207 = sum of:
          0.029536994 = weight(abstract_txt:neuen in 960) [ClassicSimilarity], result of:
            0.029536994 = score(doc=960,freq=1.0), product of:
              0.09236359 = queryWeight, product of:
                1.0664788 = boost
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.01692634 = queryNorm
              0.31979045 = fieldWeight in 960, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.071266815 = weight(abstract_txt:zahl in 960) [ClassicSimilarity], result of:
            0.071266815 = score(doc=960,freq=1.0), product of:
              0.16615555 = queryWeight, product of:
                1.4304057 = boost
                6.8626604 = idf(docFreq=122, maxDocs=43254)
                0.01692634 = queryNorm
              0.42891628 = fieldWeight in 960, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8626604 = idf(docFreq=122, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.072836235 = weight(abstract_txt:erhalten in 960) [ClassicSimilarity], result of:
            0.072836235 = score(doc=960,freq=1.0), product of:
              0.16858603 = queryWeight, product of:
                1.4408296 = boost
                6.912671 = idf(docFreq=116, maxDocs=43254)
                0.01692634 = queryNorm
              0.43204194 = fieldWeight in 960, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.912671 = idf(docFreq=116, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.09669165 = weight(abstract_txt:kategorien in 960) [ClassicSimilarity], result of:
            0.09669165 = score(doc=960,freq=1.0), product of:
              0.20363352 = queryWeight, product of:
                1.5835305 = boost
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.01692634 = queryNorm
              0.47483173 = fieldWeight in 960, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.18447997 = weight(abstract_txt:deskriptoren in 960) [ClassicSimilarity], result of:
            0.18447997 = score(doc=960,freq=3.0), product of:
              0.21719459 = queryWeight, product of:
                1.6354088 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.01692634 = queryNorm
              0.84937644 = fieldWeight in 960, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.18236756 = weight(abstract_txt:mesh in 960) [ClassicSimilarity], result of:
            0.18236756 = score(doc=960,freq=2.0), product of:
              0.28242865 = queryWeight, product of:
                2.2840302 = boost
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.01692634 = queryNorm
              0.64571196 = fieldWeight in 960, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
          0.13164149 = weight(abstract_txt:thesaurus in 960) [ClassicSimilarity], result of:
            0.13164149 = score(doc=960,freq=3.0), product of:
              0.23539184 = queryWeight, product of:
                2.6919534 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.01692634 = queryNorm
              0.5592441 = fieldWeight in 960, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0625 = fieldNorm(doc=960)
        0.28 = coord(7/25)
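The score tree above can be reproduced by hand. The following is a minimal sketch that recomputes the explanation for document 960 from the values listed in it (freq, docFreq, boost, queryNorm, fieldNorm, coord). It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that match the numbers shown here. The helper names `idf`, `term_score`, and `document_score` are illustrative and are not Lucene API calls.

```python
import math

MAX_DOCS = 43254
QUERY_NORM = 0.01692634
FIELD_NORM_960 = 0.0625  # fieldNorm(doc=960), taken from the listing

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as it appears in the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float,
               field_norm: float = FIELD_NORM_960) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * i * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight

# (term, freq, docFreq, boost) copied from the tree for document 960.
TERMS_960 = [
    ("neuen", 1.0, 704, 1.0664788),
    ("zahl", 1.0, 122, 1.4304057),
    ("erhalten", 1.0, 116, 1.4408296),
    ("kategorien", 1.0, 58, 1.5835305),
    ("deskriptoren", 3.0, 45, 1.6354088),
    ("mesh", 2.0, 78, 2.2840302),
    ("thesaurus", 3.0, 670, 2.6919534),
]

def document_score(terms, matched: int, total_clauses: int) -> float:
    """Sum the per-term scores, then scale by coord(matched/total_clauses)."""
    raw = sum(term_score(f, df, b) for _, f, df, b in terms)
    return raw * (matched / total_clauses)

if __name__ == "__main__":
    print(document_score(TERMS_960, matched=7, total_clauses=25))
```

The same arithmetic applies to the other entries in this list; only the matched terms, the fieldNorm of the document, and the coord factor change.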
    
  2. Plott, C.; Ball, R.: Mit Sicherheit zum Dokument : Die Identifizierung von Online-Publikationen (2004) 0.12
    0.12189511 = sum of:
      0.12189511 = product of:
        0.5078963 = sum of:
          0.02730622 = weight(abstract_txt:einem in 4550) [ClassicSimilarity], result of:
            0.02730622 = score(doc=4550,freq=1.0), product of:
              0.100337066 = queryWeight, product of:
                1.3613762 = boost
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.01692634 = queryNorm
              0.27214488 = fieldWeight in 4550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
          0.06526132 = weight(abstract_txt:unterschiedliche in 4550) [ClassicSimilarity], result of:
            0.06526132 = score(doc=4550,freq=1.0), product of:
              0.1566849 = queryWeight, product of:
                1.3890421 = boost
                6.66421 = idf(docFreq=149, maxDocs=43254)
                0.01692634 = queryNorm
              0.41651312 = fieldWeight in 4550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.66421 = idf(docFreq=149, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
          0.071266815 = weight(abstract_txt:zahl in 4550) [ClassicSimilarity], result of:
            0.071266815 = score(doc=4550,freq=1.0), product of:
              0.16615555 = queryWeight, product of:
                1.4304057 = boost
                6.8626604 = idf(docFreq=122, maxDocs=43254)
                0.01692634 = queryNorm
              0.42891628 = fieldWeight in 4550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8626604 = idf(docFreq=122, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
          0.037974395 = weight(abstract_txt:mehr in 4550) [ClassicSimilarity], result of:
            0.037974395 = score(doc=4550,freq=1.0), product of:
              0.12501082 = queryWeight, product of:
                1.5195719 = boost
                4.860302 = idf(docFreq=910, maxDocs=43254)
                0.01692634 = queryNorm
              0.30376887 = fieldWeight in 4550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.860302 = idf(docFreq=910, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
          0.06329023 = weight(abstract_txt:gibt in 4550) [ClassicSimilarity], result of:
            0.06329023 = score(doc=4550,freq=2.0), product of:
              0.13947664 = queryWeight, product of:
                1.6050856 = boost
                5.133815 = idf(docFreq=692, maxDocs=43254)
                0.01692634 = queryNorm
              0.4537694 = fieldWeight in 4550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.133815 = idf(docFreq=692, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
          0.24279729 = weight(abstract_txt:dokumente in 4550) [ClassicSimilarity], result of:
            0.24279729 = score(doc=4550,freq=5.0), product of:
              0.27718675 = queryWeight, product of:
                2.6127813 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.01692634 = queryNorm
              0.87593395 = fieldWeight in 4550, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.0625 = fieldNorm(doc=4550)
        0.24 = coord(6/25)
    
  3. Mayr, P.; Walter, A.-K.: Abdeckung und Aktualität des Suchdienstes Google Scholar (2006) 0.11
    0.11375971 = sum of:
      0.11375971 = product of:
        0.56879854 = sum of:
          0.052214522 = weight(abstract_txt:neuen in 132) [ClassicSimilarity], result of:
            0.052214522 = score(doc=132,freq=2.0), product of:
              0.09236359 = queryWeight, product of:
                1.0664788 = boost
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.01692634 = queryNorm
              0.565315 = fieldWeight in 132, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1166472 = idf(docFreq=704, maxDocs=43254)
                0.078125 = fieldNorm(doc=132)
          0.034132775 = weight(abstract_txt:einem in 132) [ClassicSimilarity], result of:
            0.034132775 = score(doc=132,freq=1.0), product of:
              0.100337066 = queryWeight, product of:
                1.3613762 = boost
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.01692634 = queryNorm
              0.3401811 = fieldWeight in 132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.354318 = idf(docFreq=1510, maxDocs=43254)
                0.078125 = fieldNorm(doc=132)
          0.08259717 = weight(abstract_txt:drei in 132) [ClassicSimilarity], result of:
            0.08259717 = score(doc=132,freq=1.0), product of:
              0.18085219 = queryWeight, product of:
                1.8277187 = boost
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.01692634 = queryNorm
              0.4567109 = fieldWeight in 132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8458996 = idf(docFreq=339, maxDocs=43254)
                0.078125 = fieldNorm(doc=132)
          0.13572781 = weight(abstract_txt:dokumente in 132) [ClassicSimilarity], result of:
            0.13572781 = score(doc=132,freq=1.0), product of:
              0.27718675 = queryWeight, product of:
                2.6127813 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.01692634 = queryNorm
              0.48966196 = fieldWeight in 132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.078125 = fieldNorm(doc=132)
          0.26412627 = weight(abstract_txt:zeitschriften in 132) [ClassicSimilarity], result of:
            0.26412627 = score(doc=132,freq=3.0), product of:
              0.29956695 = queryWeight, product of:
                2.7162127 = boost
                6.5157895 = idf(docFreq=173, maxDocs=43254)
                0.01692634 = queryNorm
              0.8816936 = fieldWeight in 132, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5157895 = idf(docFreq=173, maxDocs=43254)
                0.078125 = fieldNorm(doc=132)
        0.2 = coord(5/25)
    
  4. Böll, S.K.: Informations- und bibliothekswissenschaftliche Zeitschriften in Literaturdatenbanken (2010) 0.11
    0.10696981 = sum of:
      0.10696981 = product of:
        0.66856134 = sum of:
          0.12086457 = weight(abstract_txt:kategorien in 235) [ClassicSimilarity], result of:
            0.12086457 = score(doc=235,freq=1.0), product of:
              0.20363352 = queryWeight, product of:
                1.5835305 = boost
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.01692634 = queryNorm
              0.59353966 = fieldWeight in 235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5973077 = idf(docFreq=58, maxDocs=43254)
                0.078125 = fieldNorm(doc=235)
          0.055941194 = weight(abstract_txt:gibt in 235) [ClassicSimilarity], result of:
            0.055941194 = score(doc=235,freq=1.0), product of:
              0.13947664 = queryWeight, product of:
                1.6050856 = boost
                5.133815 = idf(docFreq=692, maxDocs=43254)
                0.01692634 = queryNorm
              0.4010793 = fieldWeight in 235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.133815 = idf(docFreq=692, maxDocs=43254)
                0.078125 = fieldNorm(doc=235)
          0.15076998 = weight(abstract_txt:verteilung in 235) [ClassicSimilarity], result of:
            0.15076998 = score(doc=235,freq=1.0), product of:
              0.23597164 = queryWeight, product of:
                1.7046363 = boost
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.01692634 = queryNorm
              0.6389326 = fieldWeight in 235, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178337 = idf(docFreq=32, maxDocs=43254)
                0.078125 = fieldNorm(doc=235)
          0.3409856 = weight(abstract_txt:zeitschriften in 235) [ClassicSimilarity], result of:
            0.3409856 = score(doc=235,freq=5.0), product of:
              0.29956695 = queryWeight, product of:
                2.7162127 = boost
                6.5157895 = idf(docFreq=173, maxDocs=43254)
                0.01692634 = queryNorm
              1.1382617 = fieldWeight in 235, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5157895 = idf(docFreq=173, maxDocs=43254)
                0.078125 = fieldNorm(doc=235)
        0.16 = coord(4/25)
    
  5. Chen, X.: Indexing consistency between online catalogues (2008) 0.10
    0.10348781 = sum of:
      0.10348781 = product of:
        0.43119922 = sum of:
          0.028343864 = weight(abstract_txt:entwicklung in 3674) [ClassicSimilarity], result of:
            0.028343864 = score(doc=3674,freq=1.0), product of:
              0.08985922 = queryWeight, product of:
                1.051921 = boost
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.01692634 = queryNorm
              0.31542522 = fieldWeight in 3674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0468035 = idf(docFreq=755, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
          0.040628646 = weight(abstract_txt:zwischen in 3674) [ClassicSimilarity], result of:
            0.040628646 = score(doc=3674,freq=2.0), product of:
              0.09067095 = queryWeight, product of:
                1.0566616 = boost
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.01692634 = queryNorm
              0.44808888 = fieldWeight in 3674, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
          0.037974395 = weight(abstract_txt:mehr in 3674) [ClassicSimilarity], result of:
            0.037974395 = score(doc=3674,freq=1.0), product of:
              0.12501082 = queryWeight, product of:
                1.5195719 = boost
                4.860302 = idf(docFreq=910, maxDocs=43254)
                0.01692634 = queryNorm
              0.30376887 = fieldWeight in 3674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.860302 = idf(docFreq=910, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
          0.10650956 = weight(abstract_txt:deskriptoren in 3674) [ClassicSimilarity], result of:
            0.10650956 = score(doc=3674,freq=1.0), product of:
              0.21719459 = queryWeight, product of:
                1.6354088 = boost
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.01692634 = queryNorm
              0.49038774 = fieldWeight in 3674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.846204 = idf(docFreq=45, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
          0.1417395 = weight(abstract_txt:solch in 3674) [ClassicSimilarity], result of:
            0.1417395 = score(doc=3674,freq=1.0), product of:
              0.26277488 = queryWeight, product of:
                1.798845 = boost
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.01692634 = queryNorm
              0.53939515 = fieldWeight in 3674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.630322 = idf(docFreq=20, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
          0.07600325 = weight(abstract_txt:thesaurus in 3674) [ClassicSimilarity], result of:
            0.07600325 = score(doc=3674,freq=1.0), product of:
              0.23539184 = queryWeight, product of:
                2.6919534 = boost
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.01692634 = queryNorm
              0.32287973 = fieldWeight in 3674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1660757 = idf(docFreq=670, maxDocs=43254)
                0.0625 = fieldNorm(doc=3674)
        0.24 = coord(6/25)