Document (#37403)

Author
Glaesener, L.
Title
Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen
Imprint
Köln : Fachhochschule / Fakultät für Informations- und Kommunikationswissenschaften
Year
2012
Pages
III, 34, VII pp.
Abstract
A report on the results and the process analysis of automatic indexing with multi-word groups (Mehrwortgruppen). This bachelor's thesis examines the extent to which the content of information science texts can and should be indexed with information science terminology, and shows that a large part of the subject content of these scholarly texts occurs in multi-word groups. The results were obtained by automatically indexing an information science database with multi-word groups using the program Lingo.
Content
Bachelor's thesis in the Library Science (Bibliothekswesen) degree program of the Fakultät für Informations- und Kommunikationswissenschaften at the Fachhochschule Köln.
Theme
Automatisches Indexieren

Similar documents (content)

  1. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.36
    0.36113024 = sum of:
      0.36113024 = product of:
        1.2897508 = sum of:
          0.023045303 = weight(abstract_txt:inhalt in 2055) [ClassicSimilarity], result of:
            0.023045303 = score(doc=2055,freq=1.0), product of:
              0.08744211 = queryWeight, product of:
                1.0775404 = boost
                6.746861 = idf(docFreq=135, maxDocs=42596)
                0.012027775 = queryNorm
              0.26354927 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.746861 = idf(docFreq=135, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          0.02892157 = weight(abstract_txt:texten in 2055) [ClassicSimilarity], result of:
            0.02892157 = score(doc=2055,freq=1.0), product of:
              0.10173731 = queryWeight, product of:
                1.162287 = boost
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.012027775 = queryNorm
              0.28427693 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          0.016544621 = weight(abstract_txt:durch in 2055) [ClassicSimilarity], result of:
            0.016544621 = score(doc=2055,freq=2.0), product of:
              0.07010843 = queryWeight, product of:
                1.3644994 = boost
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.012027775 = queryNorm
              0.23598619 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          0.115896374 = weight(abstract_txt:lingo in 2055) [ClassicSimilarity], result of:
            0.115896374 = score(doc=2055,freq=4.0), product of:
              0.16169338 = queryWeight, product of:
                1.4652758 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.012027775 = queryNorm
              0.71676636 = fieldWeight in 2055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          0.03552422 = weight(abstract_txt:ergebnisse in 2055) [ClassicSimilarity], result of:
            0.03552422 = score(doc=2055,freq=2.0), product of:
              0.11668509 = queryWeight, product of:
                1.7603376 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.012027775 = queryNorm
              0.30444524 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          0.029984783 = weight(abstract_txt:einer in 2055) [ClassicSimilarity], result of:
            0.029984783 = score(doc=2055,freq=5.0), product of:
              0.08789889 = queryWeight, product of:
                1.8712231 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.012027775 = queryNorm
              0.34112814 = fieldWeight in 2055, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
          1.0398339 = weight(abstract_txt:mehrwortgruppen in 2055) [ClassicSimilarity], result of:
            1.0398339 = score(doc=2055,freq=13.0), product of:
              0.7481934 = queryWeight, product of:
                6.3039126 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.012027775 = queryNorm
              1.389793 = fieldWeight in 2055, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.0390625 = fieldNorm(doc=2055)
        0.28 = coord(7/25)
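
The explain tree above can be checked by hand. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF form in which tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the idf values listed. The function names are hypothetical; the inputs are copied from the `abstract_txt:inhalt` entry for doc 2055.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces the idf values shown in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # tf is the square root of the raw term frequency.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm              # "queryWeight" line
    field_weight = classic_tf(freq) * idf * field_norm   # "fieldWeight" line
    return query_weight * field_weight                   # "score" line

# Values taken from the abstract_txt:inhalt entry for doc 2055.
inhalt_score = term_score(
    freq=1.0, doc_freq=135, max_docs=42596,
    boost=1.0775404, query_norm=0.012027775, field_norm=0.0390625,
)
print(inhalt_score)
```

Under these assumptions the result agrees with the listed 0.023045303 up to single-precision rounding. The same function applied to the `mehrwortgruppen` entry (freq 13, docFreq 5, boost 6.3039126) gives a value close to the listed 1.0398339.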
    
  2. Bredack, J.; Lepsky, K.: Automatische Extraktion von Fachterminologie aus Volltexten (2014) 0.31
    0.3051654 = sum of:
      0.3051654 = product of:
        1.525827 = sum of:
          0.0702594 = weight(abstract_txt:automatische in 873) [ClassicSimilarity], result of:
            0.0702594 = score(doc=873,freq=1.0), product of:
              0.0925472 = queryWeight, product of:
                1.108549 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.012027775 = queryNorm
              0.7591737 = fieldWeight in 873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.109375 = fieldNorm(doc=873)
          0.0809804 = weight(abstract_txt:texten in 873) [ClassicSimilarity], result of:
            0.0809804 = score(doc=873,freq=1.0), product of:
              0.10173731 = queryWeight, product of:
                1.162287 = boost
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.012027775 = queryNorm
              0.7959754 = fieldWeight in 873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.109375 = fieldNorm(doc=873)
          0.16225493 = weight(abstract_txt:lingo in 873) [ClassicSimilarity], result of:
            0.16225493 = score(doc=873,freq=1.0), product of:
              0.16169338 = queryWeight, product of:
                1.4652758 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.012027775 = queryNorm
              1.0034729 = fieldWeight in 873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.109375 = fieldNorm(doc=873)
          0.07033437 = weight(abstract_txt:ergebnisse in 873) [ClassicSimilarity], result of:
            0.07033437 = score(doc=873,freq=1.0), product of:
              0.11668509 = queryWeight, product of:
                1.7603376 = boost
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.012027775 = queryNorm
              0.6027708 = fieldWeight in 873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5110474 = idf(docFreq=467, maxDocs=42596)
                0.109375 = fieldNorm(doc=873)
          1.1419979 = weight(abstract_txt:mehrwortgruppen in 873) [ClassicSimilarity], result of:
            1.1419979 = score(doc=873,freq=2.0), product of:
              0.7481934 = queryWeight, product of:
                6.3039126 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.012027775 = queryNorm
              1.5263406 = fieldWeight in 873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.109375 = fieldNorm(doc=873)
        0.2 = coord(5/25)
    
  3. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.24
    0.2384644 = sum of:
      0.2384644 = product of:
        0.8516586 = sum of:
          0.06843444 = weight(abstract_txt:automatischen in 343) [ClassicSimilarity], result of:
            0.06843444 = score(doc=343,freq=4.0), product of:
              0.0909376 = queryWeight, product of:
                1.0988667 = boost
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.012027775 = queryNorm
              0.75254285 = fieldWeight in 343, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
          0.0351297 = weight(abstract_txt:automatische in 343) [ClassicSimilarity], result of:
            0.0351297 = score(doc=343,freq=1.0), product of:
              0.0925472 = queryWeight, product of:
                1.108549 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.012027775 = queryNorm
              0.37958685 = fieldWeight in 343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
          0.0404902 = weight(abstract_txt:texten in 343) [ClassicSimilarity], result of:
            0.0404902 = score(doc=343,freq=1.0), product of:
              0.10173731 = queryWeight, product of:
                1.162287 = boost
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.012027775 = queryNorm
              0.3979877 = fieldWeight in 343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
          0.05087303 = weight(abstract_txt:mithilfe in 343) [ClassicSimilarity], result of:
            0.05087303 = score(doc=343,freq=1.0), product of:
              0.11846009 = queryWeight, product of:
                1.2541783 = boost
                7.8528533 = idf(docFreq=44, maxDocs=42596)
                0.012027775 = queryNorm
              0.42945293 = fieldWeight in 343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8528533 = idf(docFreq=44, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
          0.01637834 = weight(abstract_txt:durch in 343) [ClassicSimilarity], result of:
            0.01637834 = score(doc=343,freq=1.0), product of:
              0.07010843 = queryWeight, product of:
                1.3644994 = boost
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.012027775 = queryNorm
              0.2336144 = fieldWeight in 343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
          0.60280603 = weight(title_txt:automatisches in 343) [ClassicSimilarity], result of:
            0.60280603 = score(doc=343,freq=1.0), product of:
              0.15392366 = queryWeight, product of:
                1.4296376 = boost
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.012027775 = queryNorm
              3.9162662 = fieldWeight in 343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.4375 = fieldNorm(doc=343)
          0.03754689 = weight(abstract_txt:einer in 343) [ClassicSimilarity], result of:
            0.03754689 = score(doc=343,freq=4.0), product of:
              0.08789889 = queryWeight, product of:
                1.8712231 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.012027775 = queryNorm
              0.42716002 = fieldWeight in 343, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.0546875 = fieldNorm(doc=343)
        0.28 = coord(7/25)
    
  4. Grün, S.: Bildung von Komposita-Indextermen auf der Basis einer algorithmischen Mehrwortgruppenanalyse mit Lingo (2015) 0.22
    0.22187775 = sum of:
      0.22187775 = product of:
        1.386736 = sum of:
          0.07672166 = weight(abstract_txt:großteil in 2336) [ClassicSimilarity], result of:
            0.07672166 = score(doc=2336,freq=1.0), product of:
              0.12281677 = queryWeight, product of:
                1.277033 = boost
                7.995954 = idf(docFreq=38, maxDocs=42596)
                0.012027775 = queryNorm
              0.6246839 = fieldWeight in 2336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.995954 = idf(docFreq=38, maxDocs=42596)
                0.078125 = fieldNorm(doc=2336)
          0.040525876 = weight(abstract_txt:durch in 2336) [ClassicSimilarity], result of:
            0.040525876 = score(doc=2336,freq=3.0), product of:
              0.07010843 = queryWeight, product of:
                1.3644994 = boost
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.012027775 = queryNorm
              0.5780457 = fieldWeight in 2336, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.078125 = fieldNorm(doc=2336)
          0.115896374 = weight(abstract_txt:lingo in 2336) [ClassicSimilarity], result of:
            0.115896374 = score(doc=2336,freq=1.0), product of:
              0.16169338 = queryWeight, product of:
                1.4652758 = boost
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.012027775 = queryNorm
              0.71676636 = fieldWeight in 2336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.174609 = idf(docFreq=11, maxDocs=42596)
                0.078125 = fieldNorm(doc=2336)
          1.1535921 = weight(abstract_txt:mehrwortgruppen in 2336) [ClassicSimilarity], result of:
            1.1535921 = score(doc=2336,freq=4.0), product of:
              0.7481934 = queryWeight, product of:
                6.3039126 = boost
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.012027775 = queryNorm
              1.5418369 = fieldWeight in 2336, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.867756 = idf(docFreq=5, maxDocs=42596)
                0.078125 = fieldNorm(doc=2336)
        0.16 = coord(4/25)
    
  5. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.21
    0.21435218 = sum of:
      0.21435218 = product of:
        0.76554346 = sum of:
          0.06843444 = weight(abstract_txt:automatischen in 3488) [ClassicSimilarity], result of:
            0.06843444 = score(doc=3488,freq=4.0), product of:
              0.0909376 = queryWeight, product of:
                1.0988667 = boost
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.012027775 = queryNorm
              0.75254285 = fieldWeight in 3488, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.880392 = idf(docFreq=118, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
          0.0351297 = weight(abstract_txt:automatische in 3488) [ClassicSimilarity], result of:
            0.0351297 = score(doc=3488,freq=1.0), product of:
              0.0925472 = queryWeight, product of:
                1.108549 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.012027775 = queryNorm
              0.37958685 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
          0.0404902 = weight(abstract_txt:texten in 3488) [ClassicSimilarity], result of:
            0.0404902 = score(doc=3488,freq=1.0), product of:
              0.10173731 = queryWeight, product of:
                1.162287 = boost
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.012027775 = queryNorm
              0.3979877 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.277489 = idf(docFreq=79, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
          0.05087303 = weight(abstract_txt:mithilfe in 3488) [ClassicSimilarity], result of:
            0.05087303 = score(doc=3488,freq=1.0), product of:
              0.11846009 = queryWeight, product of:
                1.2541783 = boost
                7.8528533 = idf(docFreq=44, maxDocs=42596)
                0.012027775 = queryNorm
              0.42945293 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8528533 = idf(docFreq=44, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
          0.01637834 = weight(abstract_txt:durch in 3488) [ClassicSimilarity], result of:
            0.01637834 = score(doc=3488,freq=1.0), product of:
              0.07010843 = queryWeight, product of:
                1.3644994 = boost
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.012027775 = queryNorm
              0.2336144 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2718062 = idf(docFreq=1615, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
          0.5166909 = weight(title_txt:automatisches in 3488) [ClassicSimilarity], result of:
            0.5166909 = score(doc=3488,freq=1.0), product of:
              0.15392366 = queryWeight, product of:
                1.4296376 = boost
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.012027775 = queryNorm
              3.3567996 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.951466 = idf(docFreq=14, maxDocs=42596)
                0.375 = fieldNorm(doc=3488)
          0.03754689 = weight(abstract_txt:einer in 3488) [ClassicSimilarity], result of:
            0.03754689 = score(doc=3488,freq=4.0), product of:
              0.08789889 = queryWeight, product of:
                1.8712231 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.012027775 = queryNorm
              0.42716002 = fieldWeight in 3488, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3488)
        0.28 = coord(7/25)
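
The displayed similarity of each hit is the sum of its matched term scores multiplied by the coordination factor, coord(matched/25). The sketch below recomputes the five final scores from the sums and coord values listed above. The short labels and function names are hypothetical, chosen only for illustration.

```python
QUERY_CLAUSES = 25  # denominator of every coord(...) line in the listing

# (label, sum of matched term scores, number of matched clauses),
# copied from the explain output in listing order.
hits = [
    ("Bredack 2013", 1.2897508, 7),
    ("Bredack/Lepsky 2014", 1.525827, 5),
    ("Oberhauser 2005", 0.8516586, 7),
    ("Grün 2015", 1.386736, 4),
    ("Oberhauser 2004", 0.76554346, 7),
]

def coord(matched, total=QUERY_CLAUSES):
    # Fraction of query clauses that the document matched.
    return matched / total

def final_score(term_sum, matched):
    return term_sum * coord(matched)

# Sort by the recomputed score, highest first.
ranked = sorted(hits, key=lambda h: final_score(h[1], h[2]), reverse=True)
for label, term_sum, matched in ranked:
    print(f"{label}: {final_score(term_sum, matched):.4f}")
```

The coord factor explains why hit 2 ranks below hit 1 despite a larger term-score sum (1.525827 versus 1.2897508): it matches only 5 of the 25 query clauses, while hit 1 matches 7.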