Document (#23284)

Author
Larroche-Boutet, V.
Pöhl, K.
Title
¬Das Nominalsyntagma : über die Nutzbarmachung eines logico-semantischen Konzeptes für dokumentarische Fragestellungen
Source
Nachrichten für Dokumentation. 44(1993) H.5, S.269-276
Year
1993
Abstract
The paper begins by setting out the strategic decisions required when indexing large volumes of text: both the indexing method (human or automatic indexing) and the indexing language (free, controlled, or natural language) must be chosen. The SYDO-LYON research group opted for automatic full indexing in natural language. Based on the distinction between predicative and referential parts of a text, the noun phrase (Nominalsyntagma) is defined as the smallest referential unit of text; the paper then explains the phenomenon of actualization, which is decisive for constituting a noun phrase, and finally points to the morphological means of recognizing noun phrases. All noun phrases in a text are extracted as its potential descriptors, and aids for users of a database that works with this indexing method are presented. In addition, the concept of anaphora (that is, the resumption of noun phrases by pronouns) is briefly defined, its use as a means of weighting descriptor terms (by counting their frequency in the text) is shown, and morphological and syntactic rules are set out for automatically determining the noun phrase taken up by an anaphoric pronoun. Before the aims and limits of the work are discussed in conclusion, a difference between noun phrase and descriptor term is pointed out: the noun phrase refers to an object, which may be an individual object or a class, whereas the descriptor term always refers to a class.
Theme
Automatisches Indexieren
Computerlinguistik

Similar documents (content)

  1. Panyr, J.: Automatische Indexierung und Klassifikation (1983) 0.17
    0.16565977 = sum of:
      0.16565977 = product of:
        0.82829887 = sum of:
          0.065614246 = weight(abstract_txt:zwischen in 762) [ClassicSimilarity], result of:
            0.065614246 = score(doc=762,freq=1.0), product of:
              0.103542574 = queryWeight, product of:
                1.122104 = boost
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.018201897 = queryNorm
              0.6336934 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.058171038 = weight(abstract_txt:wird in 762) [ClassicSimilarity], result of:
            0.058171038 = score(doc=762,freq=2.0), product of:
              0.08681841 = queryWeight, product of:
                1.2584189 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018201897 = queryNorm
              0.67003113 = fieldWeight in 762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.2688445 = weight(abstract_txt:indexierung in 762) [ClassicSimilarity], result of:
            0.2688445 = score(doc=762,freq=3.0), product of:
              0.18382894 = queryWeight, product of:
                1.4951357 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018201897 = queryNorm
              1.462471 = fieldWeight in 762, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.05492704 = weight(abstract_txt:werden in 762) [ClassicSimilarity], result of:
            0.05492704 = score(doc=762,freq=1.0), product of:
              0.12482195 = queryWeight, product of:
                1.9480009 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.018201897 = queryNorm
              0.4400431 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
          0.38074204 = weight(abstract_txt:indexierungsverfahren in 762) [ClassicSimilarity], result of:
            0.38074204 = score(doc=762,freq=1.0), product of:
              0.3343547 = queryWeight, product of:
                2.0164032 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.018201897 = queryNorm
              1.138737 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.125 = fieldNorm(doc=762)
        0.2 = coord(5/25)
    
  2. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.16
    0.16311121 = sum of:
      0.16311121 = product of:
        0.45308667 = sum of:
          0.082096316 = weight(abstract_txt:extrahiert in 2519) [ClassicSimilarity], result of:
            0.082096316 = score(doc=2519,freq=2.0), product of:
              0.16446847 = queryWeight, product of:
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.018201897 = queryNorm
              0.49916145 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.061072897 = weight(abstract_txt:syntaktische in 2519) [ClassicSimilarity], result of:
            0.061072897 = score(doc=2519,freq=1.0), product of:
              0.17012803 = queryWeight, product of:
                1.01706 = boost
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.018201897 = queryNorm
              0.35898197 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.189939 = idf(docFreq=11, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.0628241 = weight(abstract_txt:arbeitenden in 2519) [ClassicSimilarity], result of:
            0.0628241 = score(doc=2519,freq=1.0), product of:
              0.17336485 = queryWeight, product of:
                1.0266896 = boost
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.018201897 = queryNorm
              0.36238086 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.27695 = idf(docFreq=10, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.020504452 = weight(abstract_txt:zwischen in 2519) [ClassicSimilarity], result of:
            0.020504452 = score(doc=2519,freq=1.0), product of:
              0.103542574 = queryWeight, product of:
                1.122104 = boost
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.018201897 = queryNorm
              0.19802919 = fieldWeight in 2519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.02874265 = weight(abstract_txt:wird in 2519) [ClassicSimilarity], result of:
            0.02874265 = score(doc=2519,freq=5.0), product of:
              0.08681841 = queryWeight, product of:
                1.2584189 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018201897 = queryNorm
              0.3310663 = fieldWeight in 2519, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.02601291 = weight(abstract_txt:oder in 2519) [ClassicSimilarity], result of:
            0.02601291 = score(doc=2519,freq=2.0), product of:
              0.11024695 = queryWeight, product of:
                1.418086 = boost
                4.271175 = idf(docFreq=1641, maxDocs=43254)
                0.018201897 = queryNorm
              0.2359513 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.271175 = idf(docFreq=1641, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.045624703 = weight(abstract_txt:eines in 2519) [ClassicSimilarity], result of:
            0.045624703 = score(doc=2519,freq=4.0), product of:
              0.12726158 = queryWeight, product of:
                1.5235894 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018201897 = queryNorm
              0.3585112 = fieldWeight in 2519, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.077659585 = weight(abstract_txt:definiert in 2519) [ClassicSimilarity], result of:
            0.077659585 = score(doc=2519,freq=2.0), product of:
              0.19968261 = queryWeight, product of:
                1.5582739 = boost
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.018201897 = queryNorm
              0.38891512 = fieldWeight in 2519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.040116 = idf(docFreq=102, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
          0.0485491 = weight(abstract_txt:werden in 2519) [ClassicSimilarity], result of:
            0.0485491 = score(doc=2519,freq=8.0), product of:
              0.12482195 = queryWeight, product of:
                1.9480009 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.018201897 = queryNorm
              0.3889468 = fieldWeight in 2519, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.0390625 = fieldNorm(doc=2519)
        0.36 = coord(9/25)
    
  3. Jüngling, H.: Verbesserung der sachlichen Erschließung von Bibliotheksbeständen durch Automatisierung der DK-Nutzung (1983) 0.13
    0.13047077 = sum of:
      0.13047077 = product of:
        0.6523538 = sum of:
          0.041133136 = weight(abstract_txt:wird in 1541) [ClassicSimilarity], result of:
            0.041133136 = score(doc=1541,freq=1.0), product of:
              0.08681841 = queryWeight, product of:
                1.2584189 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018201897 = queryNorm
              0.47378355 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.125 = fieldNorm(doc=1541)
          0.1481646 = weight(abstract_txt:aufgezeigt in 1541) [ClassicSimilarity], result of:
            0.1481646 = score(doc=1541,freq=1.0), product of:
              0.17821729 = queryWeight, product of:
                1.4721383 = boost
                6.6509643 = idf(docFreq=151, maxDocs=43254)
                0.018201897 = queryNorm
              0.83137053 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6509643 = idf(docFreq=151, maxDocs=43254)
                0.125 = fieldNorm(doc=1541)
          0.12643889 = weight(abstract_txt:eines in 1541) [ClassicSimilarity], result of:
            0.12643889 = score(doc=1541,freq=3.0), product of:
              0.12726158 = queryWeight, product of:
                1.5235894 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018201897 = queryNorm
              0.9935354 = fieldWeight in 1541, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.125 = fieldNorm(doc=1541)
          0.25893864 = weight(abstract_txt:hingewiesen in 1541) [ClassicSimilarity], result of:
            0.25893864 = score(doc=1541,freq=1.0), product of:
              0.25857395 = queryWeight, product of:
                1.773234 = boost
                8.011283 = idf(docFreq=38, maxDocs=43254)
                0.018201897 = queryNorm
              1.0014104 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.011283 = idf(docFreq=38, maxDocs=43254)
                0.125 = fieldNorm(doc=1541)
          0.07767856 = weight(abstract_txt:werden in 1541) [ClassicSimilarity], result of:
            0.07767856 = score(doc=1541,freq=2.0), product of:
              0.12482195 = queryWeight, product of:
                1.9480009 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.018201897 = queryNorm
              0.6223149 = fieldWeight in 1541, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.125 = fieldNorm(doc=1541)
        0.2 = coord(5/25)
    
  4. Kaufmann, E.: ¬Das Indexieren von natürlichsprachlichen Dokumenten und die inverse Seitenhäufigkeit (2001) 0.13
    0.1252801 = sum of:
      0.1252801 = product of:
        0.6264005 = sum of:
          0.09288138 = weight(abstract_txt:extrahiert in 2319) [ClassicSimilarity], result of:
            0.09288138 = score(doc=2319,freq=1.0), product of:
              0.16446847 = queryWeight, product of:
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.018201897 = queryNorm
              0.5647367 = fieldWeight in 2319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.035788 = idf(docFreq=13, maxDocs=43254)
                0.0625 = fieldNorm(doc=2319)
          0.07760872 = weight(abstract_txt:indexierung in 2319) [ClassicSimilarity], result of:
            0.07760872 = score(doc=2319,freq=1.0), product of:
              0.18382894 = queryWeight, product of:
                1.4951357 = boost
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.018201897 = queryNorm
              0.422179 = fieldWeight in 2319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.754864 = idf(docFreq=136, maxDocs=43254)
                0.0625 = fieldNorm(doc=2319)
          0.11941347 = weight(abstract_txt:automatische in 2319) [ClassicSimilarity], result of:
            0.11941347 = score(doc=2319,freq=2.0), product of:
              0.19446096 = queryWeight, product of:
                1.5377647 = boost
                6.9474573 = idf(docFreq=112, maxDocs=43254)
                0.018201897 = queryNorm
              0.6140743 = fieldWeight in 2319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9474573 = idf(docFreq=112, maxDocs=43254)
                0.0625 = fieldNorm(doc=2319)
          0.06727161 = weight(abstract_txt:werden in 2319) [ClassicSimilarity], result of:
            0.06727161 = score(doc=2319,freq=6.0), product of:
              0.12482195 = queryWeight, product of:
                1.9480009 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.018201897 = queryNorm
              0.53894055 = fieldWeight in 2319, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.0625 = fieldNorm(doc=2319)
          0.2692253 = weight(abstract_txt:indexierungsverfahren in 2319) [ClassicSimilarity], result of:
            0.2692253 = score(doc=2319,freq=2.0), product of:
              0.3343547 = queryWeight, product of:
                2.0164032 = boost
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.018201897 = queryNorm
              0.8052086 = fieldWeight in 2319, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.109896 = idf(docFreq=12, maxDocs=43254)
                0.0625 = fieldNorm(doc=2319)
        0.2 = coord(5/25)
    
  5. Manecke, H.-J.: Klassifikation, Klassieren (2004) 0.12
    0.1245181 = sum of:
      0.1245181 = product of:
        0.5188254 = sum of:
          0.034797207 = weight(abstract_txt:zwischen in 4903) [ClassicSimilarity], result of:
            0.034797207 = score(doc=4903,freq=2.0), product of:
              0.103542574 = queryWeight, product of:
                1.122104 = boost
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.018201897 = queryNorm
              0.33606666 = fieldWeight in 4903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.069547 = idf(docFreq=738, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
          0.02181414 = weight(abstract_txt:wird in 4903) [ClassicSimilarity], result of:
            0.02181414 = score(doc=4903,freq=2.0), product of:
              0.08681841 = queryWeight, product of:
                1.2584189 = boost
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.018201897 = queryNorm
              0.25126168 = fieldWeight in 4903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7902684 = idf(docFreq=2655, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
          0.022072686 = weight(abstract_txt:oder in 4903) [ClassicSimilarity], result of:
            0.022072686 = score(doc=4903,freq=1.0), product of:
              0.11024695 = queryWeight, product of:
                1.418086 = boost
                4.271175 = idf(docFreq=1641, maxDocs=43254)
                0.018201897 = queryNorm
              0.20021132 = fieldWeight in 4903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.271175 = idf(docFreq=1641, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
          0.038713843 = weight(abstract_txt:eines in 4903) [ClassicSimilarity], result of:
            0.038713843 = score(doc=4903,freq=2.0), product of:
              0.12726158 = queryWeight, product of:
                1.5235894 = boost
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.018201897 = queryNorm
              0.30420685 = fieldWeight in 4903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5889435 = idf(docFreq=1194, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
          0.041195277 = weight(abstract_txt:werden in 4903) [ClassicSimilarity], result of:
            0.041195277 = score(doc=4903,freq=4.0), product of:
              0.12482195 = queryWeight, product of:
                1.9480009 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.018201897 = queryNorm
              0.33003232 = fieldWeight in 4903, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
          0.3602323 = weight(abstract_txt:klasse in 4903) [ClassicSimilarity], result of:
            0.3602323 = score(doc=4903,freq=7.0), product of:
              0.32393292 = queryWeight, product of:
                1.984729 = boost
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.018201897 = queryNorm
              1.1120583 = fieldWeight in 4903, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.966795 = idf(docFreq=14, maxDocs=43254)
                0.046875 = fieldNorm(doc=4903)
        0.24 = coord(6/25)
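
The score trees above follow the ClassicSimilarity (TF-IDF) pattern: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is multiplied by the coord factor. The following is a minimal sketch, not the catalog's actual code, that recomputes the score of the first entry (doc=762) from the numbers printed in its tree. The function names `idf`, `term_score`, and `document_score` are illustrative assumptions, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is inferred from the values shown. Small deviations from the listed figures are expected because the listing uses single-precision floats.

```python
import math

QUERY_NORM = 0.018201897   # queryNorm as printed in the listing
MAX_DOCS = 43254           # maxDocs as printed in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces the idf values shown in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    # Sum of the term scores, scaled by coord(matched/total).
    return sum(term_score(f, df, b, field_norm) for f, df, b in terms) * matched / total


# (freq, docFreq, boost) for the five matching terms of doc=762
terms_762 = [
    (1.0, 738, 1.122104),    # zwischen
    (2.0, 2655, 1.2584189),  # wird
    (3.0, 136, 1.4951357),   # indexierung
    (1.0, 3478, 1.9480009),  # werden
    (1.0, 12, 2.0164032),    # indexierungsverfahren
]

score_762 = document_score(terms_762, field_norm=0.125, matched=5, total=25)
print(round(score_762, 4))
```

The result agrees with the 0.16565977 listed for entry 1 to about four decimal places. The same functions can be applied to the other entries by substituting their freq, docFreq, boost, fieldNorm, and coord values; terms listed without a boost line use a boost of 1.0.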