Document (#23283)

Author
Larroche-Boutet, V.
Pöhl, K.
Title
Das Nominalsyntagma : über die Nutzbarmachung eines logico-semantischen Konzeptes für dokumentarische Fragestellungen
Source
Nachrichten für Dokumentation. 44(1993) no.5, pp.269-276
Year
1993
Abstract
The paper begins by setting out the strategic decisions required for indexing large volumes of text: both the indexing method (human or automatic indexing) and the indexing language (free, controlled, or natural language) must be chosen. The SYDO-LYON research group has opted for automatic full indexing in natural language. Based on the distinction between predicative and referential parts of a text, the noun phrase (Nominalsyntagma) is defined as the smallest referential text unit. The paper then explains actualization, the phenomenon that is decisive for constituting a noun phrase, and finally points to the morphological means of recognizing noun phrases. All noun phrases of a text are extracted as its potential descriptors, and aids for users of a database that works with this indexing method are presented. In addition, the concept of anaphora (i.e., the resumption of noun phrases by pronouns) is briefly defined, its use as a means of weighting descriptor terms (by counting their frequency in the text) is shown, and morphological and syntactic rules are set up for automatically determining which noun phrase an anaphoric pronoun takes up. Before the goals and limits of the work are discussed in conclusion, a difference between noun phrase and descriptor term is pointed out: the noun phrase refers to an object, which may be an individual object or a class, whereas the descriptor term always refers to a class.
Theme
Automatisches Indexieren
Computerlinguistik

Similar documents (content)

  1. Panyr, J.: Automatische Indexierung und Klassifikation (1983) 0.16
    0.16330495 = sum of:
      0.16330495 = product of:
        0.81652474 = sum of:
          0.064536214 = weight(abstract_txt:zwischen in 7692) [ClassicSimilarity], result of:
            0.064536214 = score(doc=7692,freq=1.0), product of:
              0.10227807 = queryWeight, product of:
                1.095943 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.018487731 = queryNorm
              0.6309878 = fieldWeight in 7692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.125 = fieldNorm(doc=7692)
          0.057173967 = weight(abstract_txt:wird in 7692) [ClassicSimilarity], result of:
            0.057173967 = score(doc=7692,freq=2.0), product of:
              0.08571684 = queryWeight, product of:
                1.2287836 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018487731 = queryNorm
              0.6670097 = fieldWeight in 7692, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.125 = fieldNorm(doc=7692)
          0.26788875 = weight(abstract_txt:indexierung in 7692) [ClassicSimilarity], result of:
            0.26788875 = score(doc=7692,freq=3.0), product of:
              0.18316512 = queryWeight, product of:
                1.4666215 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018487731 = queryNorm
              1.4625534 = fieldWeight in 7692, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.125 = fieldNorm(doc=7692)
          0.05406813 = weight(abstract_txt:werden in 7692) [ClassicSimilarity], result of:
            0.05406813 = score(doc=7692,freq=1.0), product of:
              0.12336381 = queryWeight, product of:
                1.9030955 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018487731 = queryNorm
              0.43828195 = fieldWeight in 7692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=7692)
          0.3728577 = weight(abstract_txt:indexierungsverfahren in 7692) [ClassicSimilarity], result of:
            0.3728577 = score(doc=7692,freq=1.0), product of:
              0.32931304 = queryWeight, product of:
                1.9665325 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.018487731 = queryNorm
              1.1322287 = fieldWeight in 7692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.125 = fieldNorm(doc=7692)
        0.2 = coord(5/25)
    
  2. Fuhr, N.: Modelle im Information Retrieval (2023) 0.16
    0.16082092 = sum of:
      0.16082092 = product of:
        0.5025654 = sum of:
          0.121896386 = weight(abstract_txt:natürlichsprachige in 800) [ClassicSimilarity], result of:
            0.121896386 = score(doc=800,freq=1.0), product of:
              0.19690228 = queryWeight, product of:
                1.0752441 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.018487731 = queryNorm
              0.6190705 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.032268107 = weight(abstract_txt:zwischen in 800) [ClassicSimilarity], result of:
            0.032268107 = score(doc=800,freq=1.0), product of:
              0.10227807 = queryWeight, product of:
                1.095943 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.018487731 = queryNorm
              0.3154939 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.040428102 = weight(abstract_txt:wird in 800) [ClassicSimilarity], result of:
            0.040428102 = score(doc=800,freq=4.0), product of:
              0.08571684 = queryWeight, product of:
                1.2287836 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018487731 = queryNorm
              0.4716471 = fieldWeight in 800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.057454664 = weight(abstract_txt:oder in 800) [ClassicSimilarity], result of:
            0.057454664 = score(doc=800,freq=4.0), product of:
              0.108349636 = queryWeight, product of:
                1.3815163 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.018487731 = queryNorm
              0.53027093 = fieldWeight in 800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.077332824 = weight(abstract_txt:indexierung in 800) [ClassicSimilarity], result of:
            0.077332824 = score(doc=800,freq=1.0), product of:
              0.18316512 = queryWeight, product of:
                1.4666215 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018487731 = queryNorm
              0.4222028 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.036067493 = weight(abstract_txt:eines in 800) [ClassicSimilarity], result of:
            0.036067493 = score(doc=800,freq=1.0), product of:
              0.12609792 = queryWeight, product of:
                1.4903774 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018487731 = queryNorm
              0.28602767 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.08304964 = weight(abstract_txt:automatische in 800) [ClassicSimilarity], result of:
            0.08304964 = score(doc=800,freq=1.0), product of:
              0.19208437 = queryWeight, product of:
                1.5019058 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018487731 = queryNorm
              0.43236023 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
          0.05406813 = weight(abstract_txt:werden in 800) [ClassicSimilarity], result of:
            0.05406813 = score(doc=800,freq=4.0), product of:
              0.12336381 = queryWeight, product of:
                1.9030955 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018487731 = queryNorm
              0.43828195 = fieldWeight in 800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=800)
        0.32 = coord(8/25)
    
  3. Lepsky, K.: Automatisches Indexieren (2023) 0.16
    0.15895526 = sum of:
      0.15895526 = product of:
        0.7947763 = sum of:
          0.06093987 = weight(abstract_txt:oder in 781) [ClassicSimilarity], result of:
            0.06093987 = score(doc=781,freq=2.0), product of:
              0.108349636 = queryWeight, product of:
                1.3815163 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.018487731 = queryNorm
              0.56243724 = fieldWeight in 781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.2593822 = weight(abstract_txt:indexierung in 781) [ClassicSimilarity], result of:
            0.2593822 = score(doc=781,freq=5.0), product of:
              0.18316512 = queryWeight, product of:
                1.4666215 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018487731 = queryNorm
              1.4161112 = fieldWeight in 781, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.12457447 = weight(abstract_txt:automatische in 781) [ClassicSimilarity], result of:
            0.12457447 = score(doc=781,freq=1.0), product of:
              0.19208437 = queryWeight, product of:
                1.5019058 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018487731 = queryNorm
              0.6485404 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.070236556 = weight(abstract_txt:werden in 781) [ClassicSimilarity], result of:
            0.070236556 = score(doc=781,freq=3.0), product of:
              0.12336381 = queryWeight, product of:
                1.9030955 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018487731 = queryNorm
              0.56934494 = fieldWeight in 781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.27964327 = weight(abstract_txt:indexierungsverfahren in 781) [ClassicSimilarity], result of:
            0.27964327 = score(doc=781,freq=1.0), product of:
              0.32931304 = queryWeight, product of:
                1.9665325 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.018487731 = queryNorm
              0.8491715 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
        0.2 = coord(5/25)
    
  4. Jüngling, H.: Verbesserung der sachlichen Erschließung von Bibliotheksbeständen durch Automatisierung der DK-Nutzung (1983) 0.13
    0.12880845 = sum of:
      0.12880845 = product of:
        0.64404225 = sum of:
          0.040428102 = weight(abstract_txt:wird in 1541) [ClassicSimilarity], result of:
            0.040428102 = score(doc=1541,freq=1.0), product of:
              0.08571684 = queryWeight, product of:
                1.2287836 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018487731 = queryNorm
              0.4716471 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.125 = fieldNorm(doc=1541)
          0.14692669 = weight(abstract_txt:aufgezeigt in 1541) [ClassicSimilarity], result of:
            0.14692669 = score(doc=1541,freq=1.0), product of:
              0.17700301 = queryWeight, product of:
                1.4417402 = boost
                6.640641 = idf(docFreq=156, maxDocs=44218)
                0.018487731 = queryNorm
              0.83008015 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.640641 = idf(docFreq=156, maxDocs=44218)
                0.125 = fieldNorm(doc=1541)
          0.12494146 = weight(abstract_txt:eines in 1541) [ClassicSimilarity], result of:
            0.12494146 = score(doc=1541,freq=3.0), product of:
              0.12609792 = queryWeight, product of:
                1.4903774 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018487731 = queryNorm
              0.9908289 = fieldWeight in 1541, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.125 = fieldNorm(doc=1541)
          0.25528213 = weight(abstract_txt:hingewiesen in 1541) [ClassicSimilarity], result of:
            0.25528213 = score(doc=1541,freq=1.0), product of:
              0.25581565 = queryWeight, product of:
                1.7332461 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.018487731 = queryNorm
              0.9979144 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.125 = fieldNorm(doc=1541)
          0.076463886 = weight(abstract_txt:werden in 1541) [ClassicSimilarity], result of:
            0.076463886 = score(doc=1541,freq=2.0), product of:
              0.12336381 = queryWeight, product of:
                1.9030955 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018487731 = queryNorm
              0.6198243 = fieldWeight in 1541, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=1541)
        0.2 = coord(5/25)
    
  5. Manecke, H.-J.: Klassifikation, Klassieren (2004) 0.12
    0.12421486 = sum of:
      0.12421486 = product of:
        0.5175619 = sum of:
          0.034225494 = weight(abstract_txt:zwischen in 2902) [ClassicSimilarity], result of:
            0.034225494 = score(doc=2902,freq=2.0), product of:
              0.10227807 = queryWeight, product of:
                1.095943 = boost
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.018487731 = queryNorm
              0.3346318 = fieldWeight in 2902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0479026 = idf(docFreq=771, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.021440236 = weight(abstract_txt:wird in 2902) [ClassicSimilarity], result of:
            0.021440236 = score(doc=2902,freq=2.0), product of:
              0.08571684 = queryWeight, product of:
                1.2287836 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018487731 = queryNorm
              0.25012863 = fieldWeight in 2902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.021545498 = weight(abstract_txt:oder in 2902) [ClassicSimilarity], result of:
            0.021545498 = score(doc=2902,freq=1.0), product of:
              0.108349636 = queryWeight, product of:
                1.3815163 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.018487731 = queryNorm
              0.1988516 = fieldWeight in 2902, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.038255356 = weight(abstract_txt:eines in 2902) [ClassicSimilarity], result of:
            0.038255356 = score(doc=2902,freq=2.0), product of:
              0.12609792 = queryWeight, product of:
                1.4903774 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.018487731 = queryNorm
              0.30337816 = fieldWeight in 2902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.040551096 = weight(abstract_txt:werden in 2902) [ClassicSimilarity], result of:
            0.040551096 = score(doc=2902,freq=4.0), product of:
              0.12336381 = queryWeight, product of:
                1.9030955 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018487731 = queryNorm
              0.32871145 = fieldWeight in 2902, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.3615442 = weight(abstract_txt:klasse in 2902) [ClassicSimilarity], result of:
            0.3615442 = score(doc=2902,freq=7.0), product of:
              0.32431543 = queryWeight, product of:
                1.9515536 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.018487731 = queryNorm
              1.1147919 = fieldWeight in 2902, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
        0.24 = coord(6/25)
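
The score trees above all follow the same arithmetic, which can be sketched in a few lines of Python. This is a minimal sketch and not the search engine's own code. The formulas `tf = sqrt(freq)` and `idf = 1 + ln(maxDocs / (docFreq + 1))` are inferred from the numbers shown in the trees. The function names and the `TERMS` table are hypothetical, and the input values are copied from the tree for document 7692 (result 1).

```python
import math

# Constants copied from the explain tree of result 1 (doc=7692).
MAX_DOCS = 44218          # maxDocs
QUERY_NORM = 0.018487731  # queryNorm
FIELD_NORM = 0.125        # fieldNorm(doc=7692)
QUERY_TERMS = 25          # denominator of coord(5/25)

# (term, boost, docFreq, termFreq) for each matching abstract_txt term.
TERMS = [
    ("zwischen", 1.095943, 771, 1.0),
    ("wird", 1.2287836, 2761, 2.0),
    ("indexierung", 1.4666215, 139, 3.0),
    ("werden", 1.9030955, 3606, 1.0),
    ("indexierungsverfahren", 1.9665325, 13, 1.0),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as it appears in the trees (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm=FIELD_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, query_terms=QUERY_TERMS):
    """Sum of the term scores, scaled by coord = matching terms / query terms."""
    coord = len(terms) / query_terms
    return coord * sum(term_score(b, df, f) for _, b, df, f in terms)


if __name__ == "__main__":
    for name, boost, doc_freq, freq in TERMS:
        print(f"{name}: {term_score(boost, doc_freq, freq):.6f}")
    print(f"total: {document_score(TERMS):.6f}")
```

Small deviations in the last digits are expected, because the trees were computed in single precision. The same functions reproduce the other results if the per-document `fieldNorm`, the term frequencies, and the `coord` numerator are taken from the respective tree.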