Search (76 results, page 1 of 4)

RIAO 91 : Computer aided information retrieval. Conference, Barcelona, 2.-4.5.1991 (1991) 0.05

0.047308575 = product of:
  0.23654287 = sum of:
    0.23654287 = weight(_text_:91 in 4651) [ClassicSimilarity], result of:
      0.23654287 = score(doc=4651,freq=2.0), product of:
        0.19210906 = queryWeight, product of:
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.034476317 = queryNorm
        1.2312946 = fieldWeight in 4651, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.15625 = fieldNorm(doc=4651)
  0.2 = coord(1/5)
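
The arithmetic in the explain tree above can be retraced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas that the output itself suggests (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); the function names are hypothetical and are not part of any library API. Small differences in the last digits are expected because the search engine works in single precision.

```python
import math

def classic_tf(freq: float) -> float:
    # tf(freq=2.0) is shown as 1.4142135 in the listing, i.e. sqrt(freq)
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf formula; it reproduces 5.5722036 for docFreq=456, maxDocs=44218
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_weight(freq, doc_freq, max_docs, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                     # queryWeight line
    field_weight = classic_tf(freq) * idf * field_norm  # fieldWeight line
    return query_weight * field_weight

# Values copied from the explain tree for doc 4651 (term "91")
weight = term_weight(freq=2.0, doc_freq=456, max_docs=44218,
                     query_norm=0.034476317, field_norm=0.15625)
total = weight * (1 / 5)  # coord(1/5): one of five query clauses matched

print(f"weight={weight:.6f} total={total:.6f}")
```

The coord factor scales the summed clause weights by the fraction of query clauses that matched, which is why a single-term hit ends up at one fifth of its raw weight.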

Silvester, J.P.: Computer supported indexing : a history and evaluation of NASA's MAI system (1998) 0.03

0.033116 = product of:
  0.16558 = sum of:
    0.16558 = weight(_text_:91 in 1302) [ClassicSimilarity], result of:
      0.16558 = score(doc=1302,freq=2.0), product of:
        0.19210906 = queryWeight, product of:
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.034476317 = queryNorm
        0.86190623 = fieldWeight in 1302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.109375 = fieldNorm(doc=1302)
  0.2 = coord(1/5)

Pages: pp. 76-91

Nohr, H.: Grundlagen der automatischen Indexierung : ein Lehrbuch (2003) 0.02
```
0.020791857 = product of:
  0.05197964 = sum of:
    0.04730857 = weight(_text_:91 in 1767) [ClassicSimilarity], result of:
      0.04730857 = score(doc=1767,freq=2.0), product of:
        0.19210906 = queryWeight, product of:
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.034476317 = queryNorm
        0.24625893 = fieldWeight in 1767, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.03125 = fieldNorm(doc=1767)
    0.004671065 = product of:
      0.01868426 = sum of:
        0.01868426 = weight(_text_:22 in 1767) [ClassicSimilarity], result of:
          0.01868426 = score(doc=1767,freq=2.0), product of:
            0.12073019 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034476317 = queryNorm
            0.15476047 = fieldWeight in 1767, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1767)
      0.25 = coord(1/4)
  0.4 = coord(2/5)
```
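
The tree above nests one boolean clause inside another, so two coord factors apply. As a rough sketch (the helper name `coord` is hypothetical, and the two term weights are copied from the listing rather than recomputed), the final score combines them like this:

```python
def coord(overlap: int, max_overlap: int) -> float:
    # Fraction of clauses that matched, as in "coord(1/4)" and "coord(2/5)"
    return overlap / max_overlap

# Term weights copied from the explain tree for doc 1767
w_91 = 0.04730857   # weight(_text_:91 in 1767)
w_22 = 0.01868426   # weight(_text_:22 in 1767)

# Inner clause: only one of its four sub-clauses matched
inner = w_22 * coord(1, 4)

# Outer query: two of five top-level clauses matched
total = (w_91 + inner) * coord(2, 5)

print(f"inner={inner:.9f} total={total:.9f}")
```

The inner coord(1/4) cuts the weight of the "22" match to a quarter before it is added to the "91" weight, so the second term contributes only a small share of the final score.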
Date

22. 6.2009 12:46:51

Footnote

Review in: nfd 54(2003) no.5, p.314 (W. Ratzek): "To extract decision-relevant data from the steadily growing flood of more or less relevant documents, companies, public administrations, and specialized information institutions must develop, deploy, and maintain effective and efficient filter systems. This textbook by Holger Nohr offers, for the first time, a basic introduction to the topic of "automatic indexing". For, as the opening puts it: "How you gather, manage, and use information will determine whether you win or lose" (Bill Gates). The first chapter, "Introduction", focuses on the fundamentals. It describes the relationships between document management systems, information retrieval, and indexing for planning, decision-making, or innovation processes in both profit and non-profit organizations. At the end of the introductory chapter, Nohr addresses the debate over intellectual versus automatic indexing and thereby leads into the second chapter, "Automatic indexing". Here the author gives an overview of, among other things, problems of automatic language processing and indexing; various methods of automatic indexing, e.g. simple keyword extraction / full-text inversion; and statistical and pattern-matching methods. Nohr then treats the "methods of automatic indexing" in depth, with many examples, in the third and most extensive chapter. The fourth chapter, "Keyphrase Extraction", takes on a passe-partout status: "An intermediate stage on the way from automatic indexing to the automatic generation of textual summaries (Automatic Text Summarization) is represented by approaches that extract key phrases from documents (Keyphrase Extraction). The boundaries between the automatic methods of indexing and those of text summarization are fluid." (p. 91).
Nohr describes how this works using the example of NCR's Extractor/Copernic Summarizer.

Clavel, G.; Walther, F.; Walther, J.: Indexation automatique de fonds bibliotheconomiques (1993) 0.02
```
0.016558 = product of:
  0.08279 = sum of:
    0.08279 = weight(_text_:91 in 6610) [ClassicSimilarity], result of:
      0.08279 = score(doc=6610,freq=2.0), product of:
        0.19210906 = queryWeight, product of:
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.034476317 = queryNorm
        0.43095312 = fieldWeight in 6610, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6610)
  0.2 = coord(1/5)
```
Abstract

A discussion of developments to date in the field of computerized indexing, based on presentations given at a seminar held at the Institute of Policy Studies in Paris in Nov 91. The methods tested so far, based on a linguistic approach, whether using natural language or special thesauri, encounter the same central problem: they are only successful when applied to collections of similar types of documents covering very specific subject areas. Despite this, the search for some sort of universal indexing metalanguage continues. In the end, computerized indexing works best when used in conjunction with manual indexing, ideally in the hands of a trained library science professional who can extract the maximum value from a collection of documents for a particular user population.

Schneider, C.; Womser-Hacker, C.: Inhaltserschließungssysteme für Patenttexte : Test und Systemvergleich im Projekt PADOK (1986) 0.02

0.015382982 = product of:
  0.07691491 = sum of:
    0.07691491 = weight(_text_:c in 2648) [ClassicSimilarity], result of:
      0.07691491 = score(doc=2648,freq=4.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.64676344 = fieldWeight in 2648, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.09375 = fieldNorm(doc=2648)
  0.2 = coord(1/5)

Jones, K.P.: Natural-language processing and automatic indexing : a reply (1990) 0.01

0.014503214 = product of:
  0.07251607 = sum of:
    0.07251607 = weight(_text_:c in 394) [ClassicSimilarity], result of:
      0.07251607 = score(doc=394,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.6097744 = fieldWeight in 394, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.125 = fieldNorm(doc=394)
  0.2 = coord(1/5)

Footnote: Reply to: Korycinski, C. and A.F. Newell

Gibb, F.; Smart, G.: Knowledge-based indexing : the view from SIMPR (1991) 0.01

0.012690312 = product of:
  0.06345156 = sum of:
    0.06345156 = weight(_text_:c in 4424) [ClassicSimilarity], result of:
      0.06345156 = score(doc=4424,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.5335526 = fieldWeight in 4424, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.109375 = fieldNorm(doc=4424)
  0.2 = coord(1/5)

Source: Libraries and expert systems. Ed. C. MacDonald et al
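
Several hits for the term "c" share the same term frequency and idf, so their ranking is driven by fieldNorm alone. The sketch below illustrates this under the same assumed TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); `score_for` is a hypothetical helper, and the fieldNorm values are taken as given from the listing rather than derived from field lengths.

```python
import math

QUERY_NORM = 0.034476317                         # queryNorm from the listing
IDF_C = 1.0 + math.log(44218 / (3817 + 1))       # idf for term "c", about 3.4494

def score_for(field_norm: float, freq: float = 2.0, coord: float = 1 / 5) -> float:
    query_weight = IDF_C * QUERY_NORM
    field_weight = math.sqrt(freq) * IDF_C * field_norm
    return query_weight * field_weight * coord

jones = score_for(0.125)       # doc 394, fieldNorm=0.125
gibb = score_for(0.109375)     # doc 4424, fieldNorm=0.109375

# With freq, idf and coord equal, the score ratio is just the fieldNorm ratio
ratio = jones / gibb
print(f"jones={jones:.9f} gibb={gibb:.9f} ratio={ratio:.6f}")
```

A larger fieldNorm generally corresponds to a shorter indexed field, which is why brief records with the same match can rank above longer ones.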

Schwarz, C.: Komplexe Nominalgruppen als Indexierungseinheiten am Beispiel des Projekte CONDOR (1982) 0.01

0.012690312 = product of:
  0.06345156 = sum of:
    0.06345156 = weight(_text_:c in 435) [ClassicSimilarity], result of:
      0.06345156 = score(doc=435,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.5335526 = fieldWeight in 435, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.109375 = fieldNorm(doc=435)
  0.2 = coord(1/5)

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.01

0.012690312 = product of:
  0.06345156 = sum of:
    0.06345156 = weight(_text_:c in 1168) [ClassicSimilarity], result of:
      0.06345156 = score(doc=1168,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.5335526 = fieldWeight in 1168, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.109375 = fieldNorm(doc=1168)
  0.2 = coord(1/5)

Schröther, C.: Automatische Indexierung, Kategorisierung und inhaltliche Erschließung von Textnachrichten (2003) 0.01

0.012690312 = product of:
  0.06345156 = sum of:
    0.06345156 = weight(_text_:c in 521) [ClassicSimilarity], result of:
      0.06345156 = score(doc=521,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.5335526 = fieldWeight in 521, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.109375 = fieldNorm(doc=521)
  0.2 = coord(1/5)

Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.01
```
0.012123748 = product of:
  0.03030937 = sum of:
    0.025638305 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
      0.025638305 = score(doc=1441,freq=4.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.21558782 = fieldWeight in 1441, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.03125 = fieldNorm(doc=1441)
    0.004671065 = product of:
      0.01868426 = sum of:
        0.01868426 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
          0.01868426 = score(doc=1441,freq=2.0), product of:
            0.12073019 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034476317 = queryNorm
            0.15476047 = fieldWeight in 1441, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1441)
      0.25 = coord(1/4)
  0.4 = coord(2/5)
```
Abstract

This paper presents research on syntactic structures known as noun phrases (NPs), applied to increase the effectiveness and efficiency of document classification mechanisms. Our hypothesis is that NPs can be used instead of single words as semantic aggregators, reducing the number of words the classification system uses without losing semantic coverage and thereby increasing its efficiency. The experiment divided the document classification process into three phases: a) NP preprocessing; b) system training; and c) classification experiments. In the first phase, a corpus of digitized texts was submitted to a natural language processing platform in which part-of-speech tagging was done, and then Perl scripts from the PALAVRAS package were used to extract the noun phrases. The preprocessing also involved a) removing low-meaning NP pre-modifiers, such as quantifiers; b) identifying synonyms and substituting common hypernyms for them; and c) stemming the relevant words contained in each NP, for similarity checking against other NPs. The first tests with the resulting documents demonstrated the effectiveness of this preprocessing. We compared the structural similarity of the documents before and after all the preprocessing steps of phase one. The texts remained consistent with the originals and kept their readability. The second phase involves submitting the modified documents to an SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.

Source

Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Plaunt, C.; Norgard, B.A.: An association-based method for automatic indexing with a controlled vocabulary (1998) 0.01

0.011400042 = product of:
  0.028500104 = sum of:
    0.022661272 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
      0.022661272 = score(doc=1794,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.1905545 = fieldWeight in 1794, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1794)
    0.0058388314 = product of:
      0.023355326 = sum of:
        0.023355326 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
          0.023355326 = score(doc=1794,freq=2.0), product of:
            0.12073019 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.034476317 = queryNorm
            0.19345059 = fieldWeight in 1794, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1794)
      0.25 = coord(1/4)
  0.4 = coord(2/5)

Date: 11. 9.2000 19:53:22

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.01

0.0090645095 = product of:
  0.045322545 = sum of:
    0.045322545 = weight(_text_:c in 1949) [ClassicSimilarity], result of:
      0.045322545 = score(doc=1949,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.381109 = fieldWeight in 1949, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.078125 = fieldNorm(doc=1949)
  0.2 = coord(1/5)

Siebenkäs, A.; Markscheffel, B.: Conception of a workflow for the semi-automatic construction of a thesaurus for the German printing industry (2015) 0.01

0.008973407 = product of:
  0.04486703 = sum of:
    0.04486703 = weight(_text_:c in 2091) [ClassicSimilarity], result of:
      0.04486703 = score(doc=2091,freq=4.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3772787 = fieldWeight in 2091, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2091)
  0.2 = coord(1/5)

Source: Re:inventing information science in the networked society: Proceedings of the 14th International Symposium on Information Science, Zadar/Croatia, 19th-21st May 2015. Eds.: F. Pehar, C. Schloegl and C. Wolff

Rasmussen, E.M.: Indexing and retrieval for the Web (2002) 0.01

0.008279 = product of:
  0.041395 = sum of:
    0.041395 = weight(_text_:91 in 4285) [ClassicSimilarity], result of:
      0.041395 = score(doc=4285,freq=2.0), product of:
        0.19210906 = queryWeight, product of:
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.034476317 = queryNorm
        0.21547656 = fieldWeight in 4285, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.5722036 = idf(docFreq=456, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4285)
  0.2 = coord(1/5)

Source: Annual review of information science and technology. 37(2003), pp. 91-126

Korycinski, C.; Newell, A.F.: Natural-language processing and automatic indexing (1990) 0.01

0.007251607 = product of:
  0.036258034 = sum of:
    0.036258034 = weight(_text_:c in 2313) [ClassicSimilarity], result of:
      0.036258034 = score(doc=2313,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3048872 = fieldWeight in 2313, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0625 = fieldNorm(doc=2313)
  0.2 = coord(1/5)

Salton, G.; Buckley, C.; Allan, J.: Automatic structuring of text files (1992) 0.01

0.007251607 = product of:
  0.036258034 = sum of:
    0.036258034 = weight(_text_:c in 6507) [ClassicSimilarity], result of:
      0.036258034 = score(doc=6507,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3048872 = fieldWeight in 6507, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0625 = fieldNorm(doc=6507)
  0.2 = coord(1/5)

Abdul, H.; Khoo, C.: Automatic indexing of medical literature using phrase matching : an exploratory study 0.01

0.007251607 = product of:
  0.036258034 = sum of:
    0.036258034 = weight(_text_:c in 3601) [ClassicSimilarity], result of:
      0.036258034 = score(doc=3601,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3048872 = fieldWeight in 3601, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0625 = fieldNorm(doc=3601)
  0.2 = coord(1/5)

Krause, J.; Womser-Hacker, C.: PADOK-II : Retrievaltests zur Bewertung von Volltextindexierungsvarianten für das deutsche Patentinformationssystem (1990) 0.01

0.007251607 = product of:
  0.036258034 = sum of:
    0.036258034 = weight(_text_:c in 2653) [ClassicSimilarity], result of:
      0.036258034 = score(doc=2653,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3048872 = fieldWeight in 2653, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0625 = fieldNorm(doc=2653)
  0.2 = coord(1/5)

Fox, C.: Lexical analysis and stoplists (1992) 0.01

0.007251607 = product of:
  0.036258034 = sum of:
    0.036258034 = weight(_text_:c in 3502) [ClassicSimilarity], result of:
      0.036258034 = score(doc=3502,freq=2.0), product of:
        0.118922785 = queryWeight, product of:
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.034476317 = queryNorm
        0.3048872 = fieldWeight in 3502, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.4494052 = idf(docFreq=3817, maxDocs=44218)
          0.0625 = fieldNorm(doc=3502)
  0.2 = coord(1/5)
