Document (#37402)

Author
Glaesener, L.
Title
Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen
Imprint
Köln : Fachhochschule / Fakultät für Informations- und Kommunikationswissenschaften
Year
2012
Pages
III, 34, VII S
Abstract
Ein Bericht über die Ergebnisse und die Prozessanalyse einer automatischen Indexierung mit Mehrwortgruppen. Diese Bachelorarbeit beschreibt, inwieweit der Inhalt informationswissenschaftlicher Fachtexte durch informationswissenschaftliches Fachvokabular erschlossen werden kann und sollte und dass in diesen wissenschaftlichen Texten ein Großteil der fachlichen Inhalte in Mehrwortgruppen vorkommt. Die Ergebnisse wurden durch eine automatische Indexierung mit Mehrwortgruppen mithilfe des Programme Lingo an einer informationswissenschaftlichen Datenbank ermittelt.
Content
Bachelorarbeit im Studiengang Bibliothekswesen der Fakultät für Informations- und Kommunikationswissenschaften an der Fachhochschule Köln.
Theme
Automatisches Indexieren

Similar documents (content)

  1. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.36
    0.36404547 = sum of:
      0.36404547 = product of:
        1.3001624 = sum of:
          0.023265427 = weight(abstract_txt:inhalt in 1054) [ClassicSimilarity], result of:
            0.023265427 = score(doc=1054,freq=1.0), product of:
              0.08798038 = queryWeight, product of:
                1.0853465 = boost
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.011974359 = queryNorm
              0.2644388 = fieldWeight in 1054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.769634 = idf(docFreq=137, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.027956586 = weight(abstract_txt:texten in 1054) [ClassicSimilarity], result of:
            0.027956586 = score(doc=1054,freq=1.0), product of:
              0.09944155 = queryWeight, product of:
                1.1538768 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.011974359 = queryNorm
              0.28113586 = fieldWeight in 1054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.016252747 = weight(abstract_txt:durch in 1054) [ClassicSimilarity], result of:
            0.016252747 = score(doc=1054,freq=2.0), product of:
              0.069267526 = queryWeight, product of:
                1.3619317 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.011974359 = queryNorm
              0.23463732 = fieldWeight in 1054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.11724778 = weight(abstract_txt:lingo in 1054) [ClassicSimilarity], result of:
            0.11724778 = score(doc=1054,freq=4.0), product of:
              0.16291519 = queryWeight, product of:
                1.4769176 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011974359 = queryNorm
              0.71968603 = fieldWeight in 1054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.03491155 = weight(abstract_txt:ergebnisse in 1054) [ClassicSimilarity], result of:
            0.03491155 = score(doc=1054,freq=2.0), product of:
              0.11531644 = queryWeight, product of:
                1.7572603 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.011974359 = queryNorm
              0.30274564 = fieldWeight in 1054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          0.029468555 = weight(abstract_txt:einer in 1054) [ClassicSimilarity], result of:
            0.029468555 = score(doc=1054,freq=5.0), product of:
              0.08686966 = queryWeight, product of:
                1.8679712 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.011974359 = queryNorm
              0.33922726 = fieldWeight in 1054, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
          1.0510598 = weight(abstract_txt:mehrwortgruppen in 1054) [ClassicSimilarity], result of:
            1.0510598 = score(doc=1054,freq=13.0), product of:
              0.75341743 = queryWeight, product of:
                6.3521876 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.011974359 = queryNorm
              1.3950564 = fieldWeight in 1054, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1054)
        0.28 = coord(7/25)
    
  2. Bredack, J.; Lepsky, K.: Automatische Extraktion von Fachterminologie aus Volltexten (2014) 0.31
    0.30707744 = sum of:
      0.30707744 = product of:
        1.5353872 = sum of:
          0.06951376 = weight(abstract_txt:automatische in 4872) [ClassicSimilarity], result of:
            0.06951376 = score(doc=4872,freq=1.0), product of:
              0.091872804 = queryWeight, product of:
                1.1090956 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.011974359 = queryNorm
              0.7566304 = fieldWeight in 4872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.109375 = fieldNorm(doc=4872)
          0.07827844 = weight(abstract_txt:texten in 4872) [ClassicSimilarity], result of:
            0.07827844 = score(doc=4872,freq=1.0), product of:
              0.09944155 = queryWeight, product of:
                1.1538768 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.011974359 = queryNorm
              0.78718036 = fieldWeight in 4872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.109375 = fieldNorm(doc=4872)
          0.16414689 = weight(abstract_txt:lingo in 4872) [ClassicSimilarity], result of:
            0.16414689 = score(doc=4872,freq=1.0), product of:
              0.16291519 = queryWeight, product of:
                1.4769176 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011974359 = queryNorm
              1.0075604 = fieldWeight in 4872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.109375 = fieldNorm(doc=4872)
          0.06912134 = weight(abstract_txt:ergebnisse in 4872) [ClassicSimilarity], result of:
            0.06912134 = score(doc=4872,freq=1.0), product of:
              0.11531644 = queryWeight, product of:
                1.7572603 = boost
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.011974359 = queryNorm
              0.59940577 = fieldWeight in 4872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4802814 = idf(docFreq=500, maxDocs=44218)
                0.109375 = fieldNorm(doc=4872)
          1.1543268 = weight(abstract_txt:mehrwortgruppen in 4872) [ClassicSimilarity], result of:
            1.1543268 = score(doc=4872,freq=2.0), product of:
              0.75341743 = queryWeight, product of:
                6.3521876 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.011974359 = queryNorm
              1.5321212 = fieldWeight in 4872, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.109375 = fieldNorm(doc=4872)
        0.2 = coord(5/25)
    
  3. Lepsky, K.: Automatisches Indexieren (2023) 0.27
    0.26514295 = sum of:
      0.26514295 = product of:
        1.3257147 = sum of:
          0.08276832 = weight(abstract_txt:automatischen in 781) [ClassicSimilarity], result of:
            0.08276832 = score(doc=781,freq=2.0), product of:
              0.090782836 = queryWeight, product of:
                1.1024969 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.011974359 = queryNorm
              0.9117177 = fieldWeight in 781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.059583224 = weight(abstract_txt:automatische in 781) [ClassicSimilarity], result of:
            0.059583224 = score(doc=781,freq=1.0), product of:
              0.091872804 = queryWeight, product of:
                1.1090956 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.011974359 = queryNorm
              0.6485404 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.082420565 = weight(abstract_txt:ermittelt in 781) [ClassicSimilarity], result of:
            0.082420565 = score(doc=781,freq=1.0), product of:
              0.1140586 = queryWeight, product of:
                1.2357752 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.011974359 = queryNorm
              0.72261596 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.85282075 = weight(title_txt:automatisches in 781) [ClassicSimilarity], result of:
            0.85282075 = score(doc=781,freq=1.0), product of:
              0.15289865 = queryWeight, product of:
                1.4307947 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.011974359 = queryNorm
              5.5776863 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.625 = fieldNorm(doc=781)
          0.24812184 = weight(abstract_txt:indexierung in 781) [ClassicSimilarity], result of:
            0.24812184 = score(doc=781,freq=5.0), product of:
              0.17521353 = queryWeight, product of:
                2.166079 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.011974359 = queryNorm
              1.4161112 = fieldWeight in 781, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
        0.2 = coord(5/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.24
    0.2360021 = sum of:
      0.2360021 = product of:
        0.84286463 = sum of:
          0.06828038 = weight(abstract_txt:automatischen in 38) [ClassicSimilarity], result of:
            0.06828038 = score(doc=38,freq=4.0), product of:
              0.090782836 = queryWeight, product of:
                1.1024969 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.011974359 = queryNorm
              0.7521287 = fieldWeight in 38, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.03475688 = weight(abstract_txt:automatische in 38) [ClassicSimilarity], result of:
            0.03475688 = score(doc=38,freq=1.0), product of:
              0.091872804 = queryWeight, product of:
                1.1090956 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.011974359 = queryNorm
              0.3783152 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.03913922 = weight(abstract_txt:texten in 38) [ClassicSimilarity], result of:
            0.03913922 = score(doc=38,freq=1.0), product of:
              0.09944155 = queryWeight, product of:
                1.1538768 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.011974359 = queryNorm
              0.39359018 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.050723746 = weight(abstract_txt:mithilfe in 38) [ClassicSimilarity], result of:
            0.050723746 = score(doc=38,freq=1.0), product of:
              0.118204504 = queryWeight, product of:
                1.2580343 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.011974359 = queryNorm
              0.42911857 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.016089398 = weight(abstract_txt:durch in 38) [ClassicSimilarity], result of:
            0.016089398 = score(doc=38,freq=1.0), product of:
              0.069267526 = queryWeight, product of:
                1.3619317 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.011974359 = queryNorm
              0.2322791 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.59697455 = weight(title_txt:automatisches in 38) [ClassicSimilarity], result of:
            0.59697455 = score(doc=38,freq=1.0), product of:
              0.15289865 = queryWeight, product of:
                1.4307947 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.011974359 = queryNorm
              3.9043806 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.4375 = fieldNorm(doc=38)
          0.036900464 = weight(abstract_txt:einer in 38) [ClassicSimilarity], result of:
            0.036900464 = score(doc=38,freq=4.0), product of:
              0.08686966 = queryWeight, product of:
                1.8679712 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.011974359 = queryNorm
              0.42477968 = fieldWeight in 38, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
        0.28 = coord(7/25)
    
  5. Grün, S.: Bildung von Komposita-Indextermen auf der Basis einer algorithmischen Mehrwortgruppenanalyse mit Lingo (2015) 0.22
    0.22402044 = sum of:
      0.22402044 = product of:
        1.4001278 = sum of:
          0.07702275 = weight(abstract_txt:großteil in 1335) [ClassicSimilarity], result of:
            0.07702275 = score(doc=1335,freq=1.0), product of:
              0.12311317 = queryWeight, product of:
                1.2838898 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.011974359 = queryNorm
              0.6256256 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.078125 = fieldNorm(doc=1335)
          0.039810937 = weight(abstract_txt:durch in 1335) [ClassicSimilarity], result of:
            0.039810937 = score(doc=1335,freq=3.0), product of:
              0.069267526 = queryWeight, product of:
                1.3619317 = boost
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.011974359 = queryNorm
              0.5747417 = fieldWeight in 1335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2473893 = idf(docFreq=1718, maxDocs=44218)
                0.078125 = fieldNorm(doc=1335)
          0.11724778 = weight(abstract_txt:lingo in 1335) [ClassicSimilarity], result of:
            0.11724778 = score(doc=1335,freq=1.0), product of:
              0.16291519 = queryWeight, product of:
                1.4769176 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011974359 = queryNorm
              0.71968603 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=1335)
          1.1660463 = weight(abstract_txt:mehrwortgruppen in 1335) [ClassicSimilarity], result of:
            1.1660463 = score(doc=1335,freq=4.0), product of:
              0.75341743 = queryWeight, product of:
                6.3521876 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.011974359 = queryNorm
              1.5476762 = fieldWeight in 1335, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=1335)
        0.16 = coord(4/25)