Search (169 results, page 1 of 9)

Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.00

0.0036952826 = product of:
  0.033257544 = sum of:
    0.005429798 = weight(_text_:in in 2673) [ClassicSimilarity], result of:
      0.005429798 = score(doc=2673,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.1822149 = fieldWeight in 2673, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2673)
    0.027827747 = product of:
      0.04174162 = sum of:
        0.020965107 = weight(_text_:29 in 2673) [ClassicSimilarity], result of:
          0.020965107 = score(doc=2673,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.27205724 = fieldWeight in 2673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2673)
        0.020776514 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
          0.020776514 = score(doc=2673,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.2708308 = fieldWeight in 2673, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2673)
      0.6666667 = coord(2/3)
  0.11111111 = coord(2/18)

Abstract: Examines techniques that discover features in sets of pre-categorized documents, such that similar documents can be found on the WWW. Examines techniques which will classifiy training examples with high accuracy, then explains why this is not necessarily useful. Describes a method for extracting word clusters from the raw document features. Results show that the clustering technique is successful in discovering word groups in personal Web pages which can be used to find similar information on the WWW
Date: 1. 8.1996 22:08:06
Source: Computer networks and ISDN systems. 29(1997) no.8, S.1147-1156

Salton, G.: Fast document classification in automatic information retrieval (1978) 0.00

0.0028365264 = product of:
  0.025528736 = sum of:
    0.0062054833 = weight(_text_:in in 2331) [ClassicSimilarity], result of:
      0.0062054833 = score(doc=2331,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.2082456 = fieldWeight in 2331, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=2331)
    0.019323254 = weight(_text_:der in 2331) [ClassicSimilarity], result of:
      0.019323254 = score(doc=2331,freq=8.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.3948779 = fieldWeight in 2331, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0625 = fieldNorm(doc=2331)
  0.11111111 = coord(2/18)

Abstract: A classified or clustered file is one where related or similar records are grouped into classes or clusters of items in such a way that all itmes within a cluster are jointly retrievable. Clustered files are easily adapted to to broad and narrow search strategies, and simple file updating methods are available. An inexpensive file clustering method applicable to large files is given together with appropriate file search methods
Source: Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg

Munkelt, J.: Erstellung einer DNB-Retrieval-Testkollektion (2018) 0.00
```
0.0025929953 = product of:
  0.023336958 = sum of:
    0.004433411 = weight(_text_:in in 4310) [ClassicSimilarity], result of:
      0.004433411 = score(doc=4310,freq=4.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.14877784 = fieldWeight in 4310, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4310)
    0.018903548 = weight(_text_:der in 4310) [ClassicSimilarity], result of:
      0.018903548 = score(doc=4310,freq=10.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.38630107 = fieldWeight in 4310, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4310)
  0.11111111 = coord(2/18)
```
Abstract

Seit Herbst 2017 findet in der Deutschen Nationalbibliothek die Inhaltserschließung bestimmter Medienwerke rein maschinell statt. Die Qualität dieses Verfahrens, das die Prozessorganisation von Bibliotheken maßgeblich prägen kann, wird unter Fachleuten kontrovers diskutiert. Ihre Standpunkte werden zunächst hinreichend erläutert, ehe die Notwendigkeit einer Qualitätsprüfung des Verfahrens und dessen Grundlagen dargelegt werden. Zentraler Bestandteil einer künftigen Prüfung ist eine Testkollektion. Ihre Erstellung und deren Dokumentation steht im Fokus dieser Arbeit. In diesem Zusammenhang werden auch die Entstehungsgeschichte und Anforderungen an gelungene Testkollektionen behandelt. Abschließend wird ein Retrievaltest durchgeführt, der die Einsatzfähigkeit der erarbeiteten Testkollektion belegt. Seine Ergebnisse dienen ausschließlich der Funktionsüberprüfung. Eine Qualitätsbeurteilung maschineller Inhaltserschließung im Speziellen sowie im Allgemeinen findet nicht statt und ist nicht Ziel der Ausarbeitung.

Salton, G.; Yang, C.S.: On the specification of term values in automatic indexing (1973) 0.00

0.0025709877 = product of:
  0.023138888 = sum of:
    0.007165474 = weight(_text_:in in 5476) [ClassicSimilarity], result of:
      0.007165474 = score(doc=5476,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.24046129 = fieldWeight in 5476, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.125 = fieldNorm(doc=5476)
    0.015973415 = product of:
      0.047920246 = sum of:
        0.047920246 = weight(_text_:29 in 5476) [ClassicSimilarity], result of:
          0.047920246 = score(doc=5476,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.6218451 = fieldWeight in 5476, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.125 = fieldNorm(doc=5476)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Source: Journal of documentation. 29(1973), S.351-372

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.00

0.0025550222 = product of:
  0.0229952 = sum of:
    0.007165474 = weight(_text_:in in 402) [ClassicSimilarity], result of:
      0.007165474 = score(doc=402,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.24046129 = fieldWeight in 402, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.125 = fieldNorm(doc=402)
    0.015829725 = product of:
      0.047489174 = sum of:
        0.047489174 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.047489174 = score(doc=402,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Source: Information processing and management. 22(1986) no.6, S.465-476

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.00

0.002441179 = product of:
  0.021970611 = sum of:
    0.0120770335 = weight(_text_:der in 4157) [ClassicSimilarity], result of:
      0.0120770335 = score(doc=4157,freq=2.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.2467987 = fieldWeight in 4157, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.078125 = fieldNorm(doc=4157)
    0.0098935785 = product of:
      0.029680735 = sum of:
        0.029680735 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.029680735 = score(doc=4157,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Lepsky, K.: Automatische Indexierung in der Inhaltserschließung (1998) 0.00

0.002207394 = product of:
  0.019866545 = sum of:
    0.0053741056 = weight(_text_:in in 1283) [ClassicSimilarity], result of:
      0.0053741056 = score(doc=1283,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.18034597 = fieldWeight in 1283, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=1283)
    0.01449244 = weight(_text_:der in 1283) [ClassicSimilarity], result of:
      0.01449244 = score(doc=1283,freq=2.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.29615843 = fieldWeight in 1283, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.09375 = fieldNorm(doc=1283)
  0.11111111 = coord(2/18)

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.00

0.0019611593 = product of:
  0.017650433 = sum of:
    0.007756854 = weight(_text_:in in 1952) [ClassicSimilarity], result of:
      0.007756854 = score(doc=1952,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.260307 = fieldWeight in 1952, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=1952)
    0.0098935785 = product of:
      0.029680735 = sum of:
        0.029680735 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
          0.029680735 = score(doc=1952,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.38690117 = fieldWeight in 1952, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1952)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Date: 16. 8.1998 12:51:22
Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.513-517.
Source: Proceedings of the 11th annual conference on research and development in information retrieval. Ed.: Y. Chiaramella

Salton, G.; Allan, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine readable texts (1994) 0.00

0.0018129811 = product of:
  0.01631683 = sum of:
    0.0063334443 = weight(_text_:in in 1949) [ClassicSimilarity], result of:
      0.0063334443 = score(doc=1949,freq=4.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.21253976 = fieldWeight in 1949, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=1949)
    0.009983385 = product of:
      0.029950155 = sum of:
        0.029950155 = weight(_text_:29 in 1949) [ClassicSimilarity], result of:
          0.029950155 = score(doc=1949,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.38865322 = fieldWeight in 1949, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.078125 = fieldNorm(doc=1949)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Date: 16. 8.1998 12:30:29
Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.478-483.

Williams, R.V.: Hans Peter Luhn and Herbert M. Ohlman : their roles in the origins of keyword-in-context/permutation automatic indexing (2010) 0.00

0.0017630123 = product of:
  0.01586711 = sum of:
    0.0062054833 = weight(_text_:in in 3440) [ClassicSimilarity], result of:
      0.0062054833 = score(doc=3440,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.2082456 = fieldWeight in 3440, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=3440)
    0.009661627 = weight(_text_:der in 3440) [ClassicSimilarity], result of:
      0.009661627 = score(doc=3440,freq=2.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.19743896 = fieldWeight in 3440, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0625 = fieldNorm(doc=3440)
  0.11111111 = coord(2/18)

Abstract: The invention of automatic indexing using a keyword-in-context approach has generally been attributed solely to Hans Peter Luhn of IBM. This article shows that credit for this invention belongs equally to Luhn and Herbert Ohlman of the System Development Corporation. It also traces the origins of title derivative automatic indexing, its development and implementation, and current status.
Theme: Geschichte der Sacherschließung

Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.00

0.001754703 = product of:
  0.015792327 = sum of:
    0.008866822 = weight(_text_:in in 5001) [ClassicSimilarity], result of:
      0.008866822 = score(doc=5001,freq=16.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.29755569 = fieldWeight in 5001, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5001)
    0.0069255047 = product of:
      0.020776514 = sum of:
        0.020776514 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
          0.020776514 = score(doc=5001,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.2708308 = fieldWeight in 5001, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5001)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: A study was done to test the effectiveness of retrieval using title word searching. It was based on actual search profiles used in the Mechanized Information Center at Ohio State University, in order ro replicate as closely as possible actual searching conditions. Fewer than 50% of the relevant titles were retrieved by keywords in titles. The low rate of retrieval can be attributes to three sources: titles themselves, user and information specialist ignorance of the subject vocabulary in use, and to general language problems. Across fields it was found that the social sciences had the best retrieval rate, with science having the next best, and arts and humanities the lowest. Ways to enhance and supplement keyword in title searching on the computer and in printed indexes are discussed.
Date: 14. 3.1996 13:22:21

Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.00

0.001596889 = product of:
  0.0143720005 = sum of:
    0.0044784215 = weight(_text_:in in 2759) [ClassicSimilarity], result of:
      0.0044784215 = score(doc=2759,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.15028831 = fieldWeight in 2759, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.078125 = fieldNorm(doc=2759)
    0.0098935785 = product of:
      0.029680735 = sum of:
        0.029680735 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
          0.029680735 = score(doc=2759,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.38690117 = fieldWeight in 2759, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2759)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Date: 1. 2.2016 18:25:22
Series: Lecture notes in computer science ; 9398

Molto, M.: Improving full text search performance through textual analysis (1993) 0.00

0.0015769101 = product of:
  0.014192191 = sum of:
    0.0062054833 = weight(_text_:in in 5099) [ClassicSimilarity], result of:
      0.0062054833 = score(doc=5099,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.2082456 = fieldWeight in 5099, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=5099)
    0.007986708 = product of:
      0.023960123 = sum of:
        0.023960123 = weight(_text_:29 in 5099) [ClassicSimilarity], result of:
          0.023960123 = score(doc=5099,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.31092256 = fieldWeight in 5099, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=5099)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: Explores the potential of text analysis as a tool in full text search and design improvement. Reports on a trial analysis performed in the domain of family history. The findings offered insights into possible gains and losses in using one search or design strategy versus another and strong evidence was provided to the potential of text analysis. Makes search and design recommendation
Source: Information processing and management. 29(1993) no.5, S.614-632

Junger, U.: Can indexing be automated? : the example of the Deutsche Nationalbibliothek (2014) 0.00

0.0015426357 = product of:
  0.013883721 = sum of:
    0.005429798 = weight(_text_:in in 1969) [ClassicSimilarity], result of:
      0.005429798 = score(doc=1969,freq=6.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.1822149 = fieldWeight in 1969, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1969)
    0.008453923 = weight(_text_:der in 1969) [ClassicSimilarity], result of:
      0.008453923 = score(doc=1969,freq=2.0), product of:
        0.048934754 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.021906832 = queryNorm
        0.17275909 = fieldWeight in 1969, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1969)
  0.11111111 = coord(2/18)

Abstract: The German Integrated Authority File (Gemeinsame Normdatei, GND), provides a broad controlled vocabulary for indexing documents on all subjects. Traditionally used for intellectual subject cataloging primarily for books, the Deutsche Nationalbibliothek (DNB, German National Library) has been working on developing and implementing procedures for automated assignment of subject headings for online publications. This project, its results, and problems are outlined in this article.
Footnote: Contribution in a special issue "Beyond libraries: Subject metadata in the digital environment and Semantic Web" - Enthält Beiträge der gleichnamigen IFLA Satellite Post-Conference, 17-18 August 2012, Tallinn.

Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.00

0.0014661439 = product of:
  0.013195295 = sum of:
    0.0062697898 = weight(_text_:in in 530) [ClassicSimilarity], result of:
      0.0062697898 = score(doc=530,freq=8.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.21040362 = fieldWeight in 530, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=530)
    0.0069255047 = product of:
      0.020776514 = sum of:
        0.020776514 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
          0.020776514 = score(doc=530,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.2708308 = fieldWeight in 530, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=530)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: Describes an application of Natural Language Processing (NLP) techniques, in HIRMA (Hypertextual Information Retrieval Managed by ARIOSTO), to the problem of document indexing by referring to a system which incorporates natural language processing techniques to determine the subject of the text of documents and to associate them with relevant semantic indexes. Describes briefly the overall system, details of its implementation on a corpus of scientific abstracts related to environmental topics and experimental evidence of the system's behaviour. Analyzes in detail an experiment designed to evaluate the system's retrieval ability in terms of recall and precision
Source: International forum on information and documentation. 22(1997) no.1, S.17-28

Souza, R.R.; Gil-Leiva, I.: Automatic indexing of scientific texts : a methodological comparison (2016) 0.00

0.0014503849 = product of:
  0.013053464 = sum of:
    0.0050667557 = weight(_text_:in in 4913) [ClassicSimilarity], result of:
      0.0050667557 = score(doc=4913,freq=4.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.17003182 = fieldWeight in 4913, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=4913)
    0.007986708 = product of:
      0.023960123 = sum of:
        0.023960123 = weight(_text_:29 in 4913) [ClassicSimilarity], result of:
          0.023960123 = score(doc=4913,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.31092256 = fieldWeight in 4913, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=4913)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Series: Advances in knowledge organization; vol.15
Source: Knowledge organization for a sustainable world: challenges and perspectives for cultural, scientific, and technological sharing in a connected society : proceedings of the Fourteenth International ISKO Conference 27-29 September 2016, Rio de Janeiro, Brazil / organized by International Society for Knowledge Organization (ISKO), ISKO-Brazil, São Paulo State University ; edited by José Augusto Chaves Guimarães, Suellen Oliveira Milani, Vera Dodebei

Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.00

0.001442402 = product of:
  0.012981618 = sum of:
    0.0050667557 = weight(_text_:in in 6752) [ClassicSimilarity], result of:
      0.0050667557 = score(doc=6752,freq=4.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.17003182 = fieldWeight in 6752, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=6752)
    0.007914863 = product of:
      0.023744587 = sum of:
        0.023744587 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
          0.023744587 = score(doc=6752,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.30952093 = fieldWeight in 6752, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6752)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: AutoSlog is a system that addresses the knowledge engineering bottleneck for information extraction. AutoSlog automatically creates domain specific dictionaries for information extraction, given an appropriate training corpus. Describes experiments with AutoSlog in terrorism, joint ventures and microelectronics domains. Compares the performance of AutoSlog across the 3 domains, discusses the lessons learned and presents results from 2 experiments which demonstrate that novice users can generate effective dictionaries using AutoSlog
Date: 6. 3.1997 16:22:15

Ward, M.L.: ¬The future of the human indexer (1996) 0.00

0.0013271755 = product of:
  0.01194458 = sum of:
    0.006008433 = weight(_text_:in in 7244) [ClassicSimilarity], result of:
      0.006008433 = score(doc=7244,freq=10.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.20163295 = fieldWeight in 7244, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=7244)
    0.0059361467 = product of:
      0.01780844 = sum of:
        0.01780844 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
          0.01780844 = score(doc=7244,freq=2.0), product of:
            0.076713994 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.021906832 = queryNorm
            0.23214069 = fieldWeight in 7244, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=7244)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: Considers the principles of indexing and the intellectual skills involved in order to determine what automatic indexing systems would be required in order to supplant or complement the human indexer. Good indexing requires: considerable prior knowledge of the literature; judgement as to what to index and what depth to index; reading skills; abstracting skills; and classification skills, Illustrates these features with a detailed description of abstracting and indexing processes involved in generating entries for the mechanical engineering database POWERLINK. Briefly assesses the possibility of replacing human indexers with specialist indexing software, with particular reference to the Object Analyzer from the InTEXT automatic indexing system and using the criteria described for human indexers. At present, it is unlikely that the automatic indexer will replace the human indexer, but when more primary texts are available in electronic form, it may be a useful productivity tool for dealing with large quantities of low grade texts (should they be wanted in the database)
Date: 9. 2.1997 18:44:22

Koryconski, C.; Newell, A.F.: Natural-language processing and automatic indexing (1990) 0.00

0.0012854938 = product of:
  0.011569444 = sum of:
    0.003582737 = weight(_text_:in in 2313) [ClassicSimilarity], result of:
      0.003582737 = score(doc=2313,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.120230645 = fieldWeight in 2313, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=2313)
    0.007986708 = product of:
      0.023960123 = sum of:
        0.023960123 = weight(_text_:29 in 2313) [ClassicSimilarity], result of:
          0.023960123 = score(doc=2313,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.31092256 = fieldWeight in 2313, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=2313)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: The task of producing satisfactory indexes by automatic means has been tackled on two fronts: by statistical analysis of text and by attempting content analysis of the text in much the same way as a human indexer does. Though statistical techniques have a lot to offer for free-text database systems, neither method has had much success with back-of-the-book indexing. This review examines some problems associated with the application of natural-language processing techniques to book texts. - Vgl. auch die Erwiderung von K.P. Jones
Source: Indexer. 17(1990), S.21-29

Frants, V.I.; Kamenoff, N.I.; Shapiro, J.: ¬One approach to classification of users and automatic clustering of documents (1993) 0.00

0.0012854938 = product of:
  0.011569444 = sum of:
    0.003582737 = weight(_text_:in in 4569) [ClassicSimilarity], result of:
      0.003582737 = score(doc=4569,freq=2.0), product of:
        0.029798867 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.021906832 = queryNorm
        0.120230645 = fieldWeight in 4569, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=4569)
    0.007986708 = product of:
      0.023960123 = sum of:
        0.023960123 = weight(_text_:29 in 4569) [ClassicSimilarity], result of:
          0.023960123 = score(doc=4569,freq=2.0), product of:
            0.077061385 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.021906832 = queryNorm
            0.31092256 = fieldWeight in 4569, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=4569)
      0.33333334 = coord(1/3)
  0.11111111 = coord(2/18)

Abstract: Shows how to automatically construct a classification of users and a clustering of documents on the basis of users' information needs by creating clusters of documents and cross-references among clusters using users' search requests. Examines feedback in the construction of this classification and clustering so that the classification can be changed over time to reflect the changing needs of the users
Source: Information processing and management. 29(1993) no.2, S.187-195

Search (169 results, page 1 of 9)

Authors

Years

Themes