Document (#43783)

Author
Lepsky, K.
Title
Automatisches Indexieren
Source
Grundlagen der Informationswissenschaft. Ed. by Rainer Kuhlen, Dirk Lewandowski, Wolfgang Semar and Christa Womser-Hacker. 7th, completely rewritten ed.
Imprint
Berlin : DeGruyter
Year
2023
Pages
pp. 171-182
Abstract
Indexing is the assignment of content-describing expressions (index terms, indexates, indexing features) to documents. The assigned index terms are meant to make targeted retrieval of the documents possible. Index terms can be content-describing features such as notations, descriptors, and controlled or free subject headings; they can also be plain keywords taken from the text of the document. Indexing can be performed intellectually, with computer support, or automatically. Computer-assisted indexing methods combine intellectual indexing with automatic preprocessing. In automatic indexing, the index terms are derived automatically from the document text and assigned to the document. To process the character strings in the document, automatic indexing draws on linguistic and statistical methods.
Footnote
Cf.: https://doi.org/10.1515/9783110769043.
Theme
Automatisches Indexieren

Similar documents (author)

  1. Lepsky, K.: Art and language : Ernst H. Gombrich and Karl Bühler's theory of language (1996) 5.04
    5.0370636 = sum of:
      5.0370636 = weight(author_txt:lepsky in 5229) [ClassicSimilarity], result of:
        5.0370636 = fieldWeight in 5229, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.059301 = idf(docFreq=37, maxDocs=44218)
          0.625 = fieldNorm(doc=5229)
    
  2. Lepsky, K.: Maschinelle Indexierung von Titelaufnahmen zur Verbesserung der sachlichen Erschließung in Online-Publikumskatalogen (1994) 5.04
    5.0370636 = sum of:
      5.0370636 = weight(author_txt:lepsky in 7064) [ClassicSimilarity], result of:
        5.0370636 = fieldWeight in 7064, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.059301 = idf(docFreq=37, maxDocs=44218)
          0.625 = fieldNorm(doc=7064)
    
  3. Lepsky, K.: RSWK - und was noch? : Stellungnahme zum Bericht 'Sacherschließung in Online-Katalogen' der Expertengruppe Online-Kataloge (1995) 5.04
    5.0370636 = sum of:
      5.0370636 = weight(author_txt:lepsky in 772) [ClassicSimilarity], result of:
        5.0370636 = fieldWeight in 772, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.059301 = idf(docFreq=37, maxDocs=44218)
          0.625 = fieldNorm(doc=772)
    
  4. Lepsky, K.: Bild und Wirklichkeit : die Wirklichkeit im Bild (1987) 5.04
    5.0370636 = sum of:
      5.0370636 = weight(author_txt:lepsky in 1346) [ClassicSimilarity], result of:
        5.0370636 = fieldWeight in 1346, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.059301 = idf(docFreq=37, maxDocs=44218)
          0.625 = fieldNorm(doc=1346)
    
  5. Lepsky, K.: Ernst H. Gombrich : Theorie und Methode (1991) 5.04
    5.0370636 = sum of:
      5.0370636 = weight(author_txt:lepsky in 1685) [ClassicSimilarity], result of:
        5.0370636 = fieldWeight in 1685, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.059301 = idf(docFreq=37, maxDocs=44218)
          0.625 = fieldNorm(doc=1685)
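
The five author matches above all carry the same score because each is a single-term fieldWeight: tf × idf × fieldNorm. The following is a minimal sketch of that arithmetic, not the search engine's actual code. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduces the values shown here; other Lucene versions use a slightly different idf variant. The function names are hypothetical.

```python
import math


def classic_tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf variant that matches the figures in this listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as laid out in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:lepsky entries above.
idf_lepsky = classic_idf(doc_freq=37, max_docs=44218)
author_score = field_weight(freq=1.0, doc_freq=37, max_docs=44218, field_norm=0.625)

print(f"idf         = {idf_lepsky:.6f}")
print(f"fieldWeight = {author_score:.6f}")
```

Because freq, docFreq and fieldNorm are identical for all five records, the ranking among them is a tie.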
    

Similar documents (content)

  1. Glaesener, L.: Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen (2012) 0.26
    0.25710234 = sum of:
      0.25710234 = product of:
        1.2855117 = sum of:
          0.51966166 = weight(title_txt:automatisches in 401) [ClassicSimilarity], result of:
            0.51966166 = score(doc=401,freq=1.0), product of:
              0.15527993 = queryWeight, product of:
                1.1109096 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.015662553 = queryNorm
              3.346612 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.375 = fieldNorm(doc=401)
          0.044600718 = weight(abstract_txt:kann in 401) [ClassicSimilarity], result of:
            0.044600718 = score(doc=401,freq=1.0), product of:
              0.079180874 = queryWeight, product of:
                1.1218795 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.015662553 = queryNorm
              0.5632764 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.031515863 = weight(abstract_txt:werden in 401) [ClassicSimilarity], result of:
            0.031515863 = score(doc=401,freq=1.0), product of:
              0.07190774 = queryWeight, product of:
                1.3093914 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015662553 = queryNorm
              0.43828195 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.1585001 = weight(abstract_txt:automatischen in 401) [ClassicSimilarity], result of:
            0.1585001 = score(doc=401,freq=1.0), product of:
              0.18439342 = queryWeight, product of:
                1.7120197 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015662553 = queryNorm
              0.8595757 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
          0.5312335 = weight(abstract_txt:indexierung in 401) [ClassicSimilarity], result of:
            0.5312335 = score(doc=401,freq=2.0), product of:
              0.4448559 = queryWeight, product of:
                4.204513 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015662553 = queryNorm
              1.1941698 = fieldWeight in 401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.125 = fieldNorm(doc=401)
        0.2 = coord(5/25)
    
  2. Busch, D.: Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten (2019) 0.22
    0.22371241 = sum of:
      0.22371241 = product of:
        0.9321351 = sum of:
          0.07896657 = weight(abstract_txt:intellektuell in 5628) [ClassicSimilarity], result of:
            0.07896657 = score(doc=5628,freq=1.0), product of:
              0.12582238 = queryWeight, product of:
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.015662553 = queryNorm
              0.62760353 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.08494523 = weight(abstract_txt:zuordnung in 5628) [ClassicSimilarity], result of:
            0.08494523 = score(doc=5628,freq=1.0), product of:
              0.1320956 = queryWeight, product of:
                1.0246257 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.015662553 = queryNorm
              0.6430587 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.034116924 = weight(abstract_txt:werden in 5628) [ClassicSimilarity], result of:
            0.034116924 = score(doc=5628,freq=3.0), product of:
              0.07190774 = queryWeight, product of:
                1.3093914 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015662553 = queryNorm
              0.47445413 = fieldWeight in 5628, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.14009562 = weight(abstract_txt:automatischen in 5628) [ClassicSimilarity], result of:
            0.14009562 = score(doc=5628,freq=2.0), product of:
              0.18439342 = queryWeight, product of:
                1.7120197 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015662553 = queryNorm
              0.7597648 = fieldWeight in 5628, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.124462284 = weight(abstract_txt:dokument in 5628) [ClassicSimilarity], result of:
            0.124462284 = score(doc=5628,freq=1.0), product of:
              0.21469943 = queryWeight, product of:
                1.8473599 = boost
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.015662553 = queryNorm
              0.57970476 = fieldWeight in 5628, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4202213 = idf(docFreq=71, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
          0.4695485 = weight(abstract_txt:indexierung in 5628) [ClassicSimilarity], result of:
            0.4695485 = score(doc=5628,freq=4.0), product of:
              0.4448559 = queryWeight, product of:
                4.204513 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015662553 = queryNorm
              1.055507 = fieldWeight in 5628, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.078125 = fieldNorm(doc=5628)
        0.24 = coord(6/25)
    
  3. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.21
    0.2063312 = sum of:
      0.2063312 = product of:
        0.8597133 = sum of:
          0.05946166 = weight(abstract_txt:zuordnung in 38) [ClassicSimilarity], result of:
            0.05946166 = score(doc=38,freq=1.0), product of:
              0.1320956 = queryWeight, product of:
                1.0246257 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.015662553 = queryNorm
              0.4501411 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.016279787 = weight(abstract_txt:oder in 38) [ClassicSimilarity], result of:
            0.016279787 = score(doc=38,freq=1.0), product of:
              0.07017345 = queryWeight, product of:
                1.0561423 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.015662553 = queryNorm
              0.23199353 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.606272 = weight(title_txt:automatisches in 38) [ClassicSimilarity], result of:
            0.606272 = score(doc=38,freq=1.0), product of:
              0.15527993 = queryWeight, product of:
                1.1109096 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.015662553 = queryNorm
              3.9043806 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.4375 = fieldNorm(doc=38)
          0.019512815 = weight(abstract_txt:kann in 38) [ClassicSimilarity], result of:
            0.019512815 = score(doc=38,freq=1.0), product of:
              0.079180874 = queryWeight, product of:
                1.1218795 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.015662553 = queryNorm
              0.24643344 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.019499445 = weight(abstract_txt:werden in 38) [ClassicSimilarity], result of:
            0.019499445 = score(doc=38,freq=2.0), product of:
              0.07190774 = queryWeight, product of:
                1.3093914 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015662553 = queryNorm
              0.27117312 = fieldWeight in 38, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.13868758 = weight(abstract_txt:automatischen in 38) [ClassicSimilarity], result of:
            0.13868758 = score(doc=38,freq=4.0), product of:
              0.18439342 = queryWeight, product of:
                1.7120197 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015662553 = queryNorm
              0.7521287 = fieldWeight in 38, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
        0.24 = coord(6/25)
    
  4. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.19
    0.18554471 = sum of:
      0.18554471 = product of:
        0.773103 = sum of:
          0.05946166 = weight(abstract_txt:zuordnung in 2487) [ClassicSimilarity], result of:
            0.05946166 = score(doc=2487,freq=1.0), product of:
              0.1320956 = queryWeight, product of:
                1.0246257 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.015662553 = queryNorm
              0.4501411 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.016279787 = weight(abstract_txt:oder in 2487) [ClassicSimilarity], result of:
            0.016279787 = score(doc=2487,freq=1.0), product of:
              0.07017345 = queryWeight, product of:
                1.0561423 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.015662553 = queryNorm
              0.23199353 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.51966166 = weight(title_txt:automatisches in 2487) [ClassicSimilarity], result of:
            0.51966166 = score(doc=2487,freq=1.0), product of:
              0.15527993 = queryWeight, product of:
                1.1109096 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.015662553 = queryNorm
              3.346612 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.375 = fieldNorm(doc=2487)
          0.019512815 = weight(abstract_txt:kann in 2487) [ClassicSimilarity], result of:
            0.019512815 = score(doc=2487,freq=1.0), product of:
              0.079180874 = queryWeight, product of:
                1.1218795 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.015662553 = queryNorm
              0.24643344 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.019499445 = weight(abstract_txt:werden in 2487) [ClassicSimilarity], result of:
            0.019499445 = score(doc=2487,freq=2.0), product of:
              0.07190774 = queryWeight, product of:
                1.3093914 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015662553 = queryNorm
              0.27117312 = fieldWeight in 2487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.13868758 = weight(abstract_txt:automatischen in 2487) [ClassicSimilarity], result of:
            0.13868758 = score(doc=2487,freq=4.0), product of:
              0.18439342 = queryWeight, product of:
                1.7120197 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015662553 = queryNorm
              0.7521287 = fieldWeight in 2487, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
        0.24 = coord(6/25)
    
  5. Larroche-Boutet, V.; Pöhl, K.: ¬Das Nominalsyntagma : über die Nutzbarmachung eines logico-semantischen Konzeptes für dokumentarische Fragestellungen (1993) 0.18
    0.18085788 = sum of:
      0.18085788 = product of:
        0.645921 = sum of:
          0.03222562 = weight(abstract_txt:oder in 5282) [ClassicSimilarity], result of:
            0.03222562 = score(doc=5282,freq=3.0), product of:
              0.07017345 = queryWeight, product of:
                1.0561423 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.015662553 = queryNorm
              0.4592281 = fieldWeight in 5282, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.08322611 = weight(abstract_txt:kontrollierte in 5282) [ClassicSimilarity], result of:
            0.08322611 = score(doc=5282,freq=1.0), product of:
              0.15120824 = queryWeight, product of:
                1.0962479 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.015662553 = queryNorm
              0.55040723 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.022300359 = weight(abstract_txt:kann in 5282) [ClassicSimilarity], result of:
            0.022300359 = score(doc=5282,freq=1.0), product of:
              0.079180874 = queryWeight, product of:
                1.1218795 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.015662553 = queryNorm
              0.2816382 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.12806626 = weight(abstract_txt:indexierungsverfahren in 5282) [ClassicSimilarity], result of:
            0.12806626 = score(doc=5282,freq=2.0), product of:
              0.15996152 = queryWeight, product of:
                1.1275319 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.015662553 = queryNorm
              0.8006066 = fieldWeight in 5282, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.035235807 = weight(abstract_txt:werden in 5282) [ClassicSimilarity], result of:
            0.035235807 = score(doc=5282,freq=5.0), product of:
              0.07190774 = queryWeight, product of:
                1.3093914 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.015662553 = queryNorm
              0.49001414 = fieldWeight in 5282, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.07925005 = weight(abstract_txt:automatischen in 5282) [ClassicSimilarity], result of:
            0.07925005 = score(doc=5282,freq=1.0), product of:
              0.18439342 = queryWeight, product of:
                1.7120197 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.015662553 = queryNorm
              0.42978784 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
          0.26561674 = weight(abstract_txt:indexierung in 5282) [ClassicSimilarity], result of:
            0.26561674 = score(doc=5282,freq=2.0), product of:
              0.4448559 = queryWeight, product of:
                4.204513 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.015662553 = queryNorm
              0.5970849 = fieldWeight in 5282, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=5282)
        0.28 = coord(7/25)
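
The content-based scores above combine several matching terms. As a rough sketch under stated assumptions, the first entry (Glaesener, doc=401) can be recomputed from the numbers in its explain tree: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms / query terms). The idf formula is the same assumed variant, 1 + ln(maxDocs / (docFreq + 1)); the helper names and the tuple layout are hypothetical, and queryNorm is taken from the listing rather than derived.

```python
import math

QUERY_NORM = 0.015662553  # copied from the explain output, not recomputed
MAX_DOCS = 44218
QUERY_TERMS = 25          # denominator of coord(5/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed idf variant that matches the figures in this listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (term, boost, freq, docFreq, fieldNorm) for doc=401, copied from the tree.
matches = [
    ("title_txt:automatisches", 1.1109096, 1.0, 15, 0.375),
    ("abstract_txt:kann", 1.1218795, 1.0, 1326, 0.125),
    ("abstract_txt:werden", 1.3093914, 1.0, 3606, 0.125),
    ("abstract_txt:automatischen", 1.7120197, 1.0, 123, 0.125),
    ("abstract_txt:indexierung", 4.204513, 2.0, 139, 0.125),
]

coord = len(matches) / QUERY_TERMS
raw_sum = sum(term_score(b, f, df, n) for _, b, f, df, n in matches)
total = coord * raw_sum

print(f"sum of term scores = {raw_sum:.6f}")
print(f"coord              = {coord:.2f}")
print(f"final score        = {total:.6f}")
```

The sketch makes two effects in the listing visible: rare title terms such as "automatisches" dominate because idf enters each term score twice, and coord penalizes documents that match only a few of the 25 query terms.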