Search (269 results, page 1 of 14)

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.07

0.067112364 = product of:
  0.13422473 = sum of:
    0.13422473 = sum of:
      0.019353455 = weight(_text_:e in 402) [ClassicSimilarity], result of:
        0.019353455 = score(doc=402,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.2540935 = fieldWeight in 402, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.125 = fieldNorm(doc=402)
      0.11487127 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
        0.11487127 = score(doc=402,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.61904186 = fieldWeight in 402, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.125 = fieldNorm(doc=402)
  0.5 = coord(1/2)

Language: e
Source: Information processing and management. 22(1986) no.6, S.465-476

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.06

0.058723316 = product of:
  0.11744663 = sum of:
    0.11744663 = sum of:
      0.016934272 = weight(_text_:e in 6265) [ClassicSimilarity], result of:
        0.016934272 = score(doc=6265,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.2223318 = fieldWeight in 6265, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.109375 = fieldNorm(doc=6265)
      0.10051236 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
        0.10051236 = score(doc=6265,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.5416616 = fieldWeight in 6265, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.109375 = fieldNorm(doc=6265)
  0.5 = coord(1/2)

Language: e
Source: Information outlook. 9(2005) no.8, S.22-23

Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.06

0.057154737 = sum of:
  0.015209509 = product of:
    0.060838036 = sum of:
      0.060838036 = weight(_text_:authors in 1794) [ClassicSimilarity], result of:
        0.060838036 = score(doc=1794,freq=2.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.25184128 = fieldWeight in 1794, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1794)
    0.25 = coord(1/4)
  0.041945226 = sum of:
    0.006047955 = weight(_text_:e in 1794) [ClassicSimilarity], result of:
      0.006047955 = score(doc=1794,freq=2.0), product of:
        0.07616667 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.052990302 = queryNorm
        0.07940422 = fieldWeight in 1794, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1794)
    0.035897274 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
      0.035897274 = score(doc=1794,freq=2.0), product of:
        0.18556301 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052990302 = queryNorm
        0.19345059 = fieldWeight in 1794, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1794)

Abstract: In this article, we describe and test a two-stage algorithm based on a lexical collocation technique which maps from the lexical clues contained in a document representation into a controlled vocabulary list of subject headings. Using a collection of 4.626 INSPEC documents, we create a 'dictionary' of associations between the lexical items contained in the titles, authors, and abstracts, and controlled vocabulary subject headings assigned to those records by human indexers using a likelihood ratio statistic as the measure of association. In the deployment stage, we use the dictiony to predict which of the controlled vocabulary subject headings best describe new documents when they are presented to the system. Our evaluation of this algorithm, in which we compare the automatically assigned subject headings to the subject headings assigned to the test documents by human catalogers, shows that we can obtain results comparable to, and consistent with, human cataloging. In effect we have cast this as a classic partial match information retrieval problem. We consider the problem to be one of 'retrieving' (or assigning) the most probably 'relevant' (or correct) controlled vocabulary subject headings to a document based on the clues contained in that document
Date: 11. 9.2000 19:53:22
Language: e

Greiner-Petter, A.; Schubotz, M.; Cohl, H.S.; Gipp, B.: Semantic preserving bijective mappings for expressions involving special functions between computer algebra systems and document preparation systems (2019) 0.05
```
0.0546311 = sum of:
  0.021074915 = product of:
    0.08429966 = sum of:
      0.08429966 = weight(_text_:authors in 5499) [ClassicSimilarity], result of:
        0.08429966 = score(doc=5499,freq=6.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.34896153 = fieldWeight in 5499, product of:
            2.4494898 = tf(freq=6.0), with freq of:
              6.0 = termFreq=6.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.03125 = fieldNorm(doc=5499)
    0.25 = coord(1/4)
  0.033556182 = sum of:
    0.0048383637 = weight(_text_:e in 5499) [ClassicSimilarity], result of:
      0.0048383637 = score(doc=5499,freq=2.0), product of:
        0.07616667 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.052990302 = queryNorm
        0.063523374 = fieldWeight in 5499, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03125 = fieldNorm(doc=5499)
    0.028717818 = weight(_text_:22 in 5499) [ClassicSimilarity], result of:
      0.028717818 = score(doc=5499,freq=2.0), product of:
        0.18556301 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052990302 = queryNorm
        0.15476047 = fieldWeight in 5499, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.03125 = fieldNorm(doc=5499)
```
Abstract

Purpose Modern mathematicians and scientists of math-related disciplines often use Document Preparation Systems (DPS) to write and Computer Algebra Systems (CAS) to calculate mathematical expressions. Usually, they translate the expressions manually between DPS and CAS. This process is time-consuming and error-prone. The purpose of this paper is to automate this translation. This paper uses Maple and Mathematica as the CAS, and LaTeX as the DPS. Design/methodology/approach Bruce Miller at the National Institute of Standards and Technology (NIST) developed a collection of special LaTeX macros that create links from mathematical symbols to their definitions in the NIST Digital Library of Mathematical Functions (DLMF). The authors are using these macros to perform rule-based translations between the formulae in the DLMF and CAS. Moreover, the authors develop software to ease the creation of new rules and to discover inconsistencies. Findings The authors created 396 mappings and translated 58.8 percent of DLMF formulae (2,405 expressions) successfully between Maple and DLMF. For a significant percentage, the special function definitions in Maple and the DLMF were different. An atomic symbol in one system maps to a composite expression in the other system. The translator was also successfully used for automatic verification of mathematical online compendia and CAS. The evaluation techniques discovered two errors in the DLMF and one defect in Maple. Originality/value This paper introduces the first translation tool for special functions between LaTeX and CAS. The approach improves error-prone manual translations and can be used to verify mathematical online compendia and CAS.

Date

20. 1.2015 18:30:22

Language

e
Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.05
```
0.04572379 = sum of:
  0.0121676065 = product of:
    0.048670426 = sum of:
      0.048670426 = weight(_text_:authors in 1442) [ClassicSimilarity], result of:
        0.048670426 = score(doc=1442,freq=2.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.20147301 = fieldWeight in 1442, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.03125 = fieldNorm(doc=1442)
    0.25 = coord(1/4)
  0.033556182 = sum of:
    0.0048383637 = weight(_text_:e in 1442) [ClassicSimilarity], result of:
      0.0048383637 = score(doc=1442,freq=2.0), product of:
        0.07616667 = queryWeight, product of:
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.052990302 = queryNorm
        0.063523374 = fieldWeight in 1442, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.43737 = idf(docFreq=28552, maxDocs=44218)
          0.03125 = fieldNorm(doc=1442)
    0.028717818 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
      0.028717818 = score(doc=1442,freq=2.0), product of:
        0.18556301 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.052990302 = queryNorm
        0.15476047 = fieldWeight in 1442, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.03125 = fieldNorm(doc=1442)
```
Abstract

The main objective of this research was to analyze whether there was a characteristic distribution behavior of relevant terms over a scientific text that could contribute as a criterion for their process of automatic indexing. The terms considered in this study were only full noun phrases contained in the texts themselves. The texts were considered a total of 98 doctoral theses of the eight areas of knowledge in a same university. Initially, 20 full noun phrases were automatically extracted from each text as candidates to be the most relevant terms, and each author of each text assigned a relevance value 0-6 (not relevant and highly relevant, respectively) for each of the 20 noun phrases sent. Only, 22.1 % of noun phrases were considered not relevant. A relevance values of the terms assigned by the authors were associated with their positions in the text. Each full noun phrases found in the text was considered as a valid linear position. The results that were obtained showed values resulting from this distribution by considering two types of position: linear, with values consolidated into ten equal consecutive parts; and structural, considering parts of the text (such as introduction, development and conclusion). As a result of considerable importance, all areas of knowledge related to the Natural Sciences showed a characteristic behavior in the distribution of relevant terms, as well as all areas of knowledge related to Social Sciences showed the same characteristic behavior of distribution, but distinct from the Natural Sciences. The difference of the distribution behavior between the Natural and Social Sciences can be clearly visualized through graphs. All behaviors, including the general behavior of all areas of knowledge together, were characterized in polynomial equations and can be applied in future as criteria for automatic indexing. Until the present date this work has become inedited of for two reasons: to present a method for characterizing the distribution of relevant terms in a scientific text, and also, through this method, pointing out a quantitative trait difference between the Natural and Social Sciences.

Language

e

Source

Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.04

0.041945226 = product of:
  0.08389045 = sum of:
    0.08389045 = sum of:
      0.01209591 = weight(_text_:e in 1952) [ClassicSimilarity], result of:
        0.01209591 = score(doc=1952,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.15880844 = fieldWeight in 1952, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.078125 = fieldNorm(doc=1952)
      0.07179455 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
        0.07179455 = score(doc=1952,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.38690117 = fieldWeight in 1952, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.078125 = fieldNorm(doc=1952)
  0.5 = coord(1/2)

Date: 16. 8.1998 12:51:22
Language: e

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.04

0.041945226 = product of:
  0.08389045 = sum of:
    0.08389045 = sum of:
      0.01209591 = weight(_text_:e in 4157) [ClassicSimilarity], result of:
        0.01209591 = score(doc=4157,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.15880844 = fieldWeight in 4157, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.078125 = fieldNorm(doc=4157)
      0.07179455 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
        0.07179455 = score(doc=4157,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.38690117 = fieldWeight in 4157, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.078125 = fieldNorm(doc=4157)
  0.5 = coord(1/2)

Language: e
Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.04

0.041945226 = product of:
  0.08389045 = sum of:
    0.08389045 = sum of:
      0.01209591 = weight(_text_:e in 2759) [ClassicSimilarity], result of:
        0.01209591 = score(doc=2759,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.15880844 = fieldWeight in 2759, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.078125 = fieldNorm(doc=2759)
      0.07179455 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
        0.07179455 = score(doc=2759,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.38690117 = fieldWeight in 2759, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.078125 = fieldNorm(doc=2759)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22
Language: e

Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.04

0.0355603 = product of:
  0.0711206 = sum of:
    0.0711206 = sum of:
      0.01368496 = weight(_text_:e in 6752) [ClassicSimilarity], result of:
        0.01368496 = score(doc=6752,freq=4.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.17967124 = fieldWeight in 6752, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0625 = fieldNorm(doc=6752)
      0.057435635 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
        0.057435635 = score(doc=6752,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.30952093 = fieldWeight in 6752, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=6752)
  0.5 = coord(1/2)

Date: 6. 3.1997 16:22:15
Language: e

Pulgarin, A.; Gil-Leiva, I.: Bibliometric analysis of the automatic indexing literature : 1956-2000 (2004) 0.03

0.03434686 = sum of:
  0.030113291 = product of:
    0.120453164 = sum of:
      0.120453164 = weight(_text_:authors in 2566) [ClassicSimilarity], result of:
        0.120453164 = score(doc=2566,freq=4.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.49862027 = fieldWeight in 2566, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0546875 = fieldNorm(doc=2566)
    0.25 = coord(1/4)
  0.004233568 = product of:
    0.008467136 = sum of:
      0.008467136 = weight(_text_:e in 2566) [ClassicSimilarity], result of:
        0.008467136 = score(doc=2566,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 2566, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=2566)
    0.5 = coord(1/2)

Abstract: We present a bibliometric study of a corpus of 839 bibliographic references about automatic indexing, covering the period 1956-2000. We analyse the distribution of authors and works, the obsolescence and its dispersion, and the distribution of the literature by topic, year, and source type. We conclude that: (i) there has been a constant interest on the part of researchers; (ii) the most studied topics were the techniques and methods employed and the general aspects of automatic indexing; (iii) the productivity of the authors does fit a Lotka distribution (Dmax=0.02 and critical value=0.054); (iv) the annual aging factor is 95%; and (v) the dispersion of the literature is low.
Language: e

Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.03

0.033556182 = product of:
  0.067112364 = sum of:
    0.067112364 = sum of:
      0.0096767275 = weight(_text_:e in 4709) [ClassicSimilarity], result of:
        0.0096767275 = score(doc=4709,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.12704675 = fieldWeight in 4709, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0625 = fieldNorm(doc=4709)
      0.057435635 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
        0.057435635 = score(doc=4709,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.30952093 = fieldWeight in 4709, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0625 = fieldNorm(doc=4709)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19
Language: e

Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.03

0.029361658 = product of:
  0.058723316 = sum of:
    0.058723316 = sum of:
      0.008467136 = weight(_text_:e in 5001) [ClassicSimilarity], result of:
        0.008467136 = score(doc=5001,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 5001, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5001)
      0.05025618 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
        0.05025618 = score(doc=5001,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.2708308 = fieldWeight in 5001, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5001)
  0.5 = coord(1/2)

Date: 14. 3.1996 13:22:21
Language: e

Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.03

0.029361658 = product of:
  0.058723316 = sum of:
    0.058723316 = sum of:
      0.008467136 = weight(_text_:e in 530) [ClassicSimilarity], result of:
        0.008467136 = score(doc=530,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 530, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=530)
      0.05025618 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
        0.05025618 = score(doc=530,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.2708308 = fieldWeight in 530, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=530)
  0.5 = coord(1/2)

Language: e
Source: International forum on information and documentation. 22(1997) no.1, S.17-28

Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.03

0.029361658 = product of:
  0.058723316 = sum of:
    0.058723316 = sum of:
      0.008467136 = weight(_text_:e in 2673) [ClassicSimilarity], result of:
        0.008467136 = score(doc=2673,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 2673, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=2673)
      0.05025618 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
        0.05025618 = score(doc=2673,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.2708308 = fieldWeight in 2673, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=2673)
  0.5 = coord(1/2)

Date: 1. 8.1996 22:08:06
Language: e

Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.03

0.029361658 = product of:
  0.058723316 = sum of:
    0.058723316 = sum of:
      0.008467136 = weight(_text_:e in 5291) [ClassicSimilarity], result of:
        0.008467136 = score(doc=5291,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 5291, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5291)
      0.05025618 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
        0.05025618 = score(doc=5291,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.2708308 = fieldWeight in 5291, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5291)
  0.5 = coord(1/2)

Date: 22. 7.2006 17:32:00
Language: e

Oliver, C.: Leveraging KOS to extend our reach with automated processes (2021) 0.03

0.029173577 = sum of:
  0.024335213 = product of:
    0.09734085 = sum of:
      0.09734085 = weight(_text_:authors in 722) [ClassicSimilarity], result of:
        0.09734085 = score(doc=722,freq=2.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.40294603 = fieldWeight in 722, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0625 = fieldNorm(doc=722)
    0.25 = coord(1/4)
  0.0048383637 = product of:
    0.0096767275 = sum of:
      0.0096767275 = weight(_text_:e in 722) [ClassicSimilarity], result of:
        0.0096767275 = score(doc=722,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.12704675 = fieldWeight in 722, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0625 = fieldNorm(doc=722)
    0.5 = coord(1/2)

Abstract: This article provides a conclusion to the special issue on Artificial Intelligence (AI) and Automated Processes for Subject Access. The authors who contributed to this special issue have provoked interesting questions as well as bringing attention to important issues. This concluding article looks at common themes and highlights some of the questions raised.
Language: e

Wolfe, EW.: a case study in automated metadata enhancement : Natural Language Processing in the humanities (2019) 0.03

0.025526881 = sum of:
  0.021293312 = product of:
    0.08517325 = sum of:
      0.08517325 = weight(_text_:authors in 5236) [ClassicSimilarity], result of:
        0.08517325 = score(doc=5236,freq=2.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.35257778 = fieldWeight in 5236, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5236)
    0.25 = coord(1/4)
  0.004233568 = product of:
    0.008467136 = sum of:
      0.008467136 = weight(_text_:e in 5236) [ClassicSimilarity], result of:
        0.008467136 = score(doc=5236,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 5236, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=5236)
    0.5 = coord(1/2)

Abstract: The Black Book Interactive Project at the University of Kansas (KU) is developing an expanded corpus of novels by African American authors, with an emphasis on lesser known writers and a goal of expanding research in this field. Using a custom metadata schema with an emphasis on race-related elements, each novel is analyzed for a variety of elements such as literary style, targeted content analysis, historical context, and other areas. Librarians at KU have worked to develop a variety of computational text analysis processes designed to assist with specific aspects of this metadata collection, including text mining and natural language processing, automated subject extraction based on word sense disambiguation, harvesting data from Wikidata, and other actions.
Language: e

Chou, C.; Chu, T.: ¬An analysis of BERT (NLP) for assisted subject indexing for Project Gutenberg (2022) 0.03

0.025526881 = sum of:
  0.021293312 = product of:
    0.08517325 = sum of:
      0.08517325 = weight(_text_:authors in 1139) [ClassicSimilarity], result of:
        0.08517325 = score(doc=1139,freq=2.0), product of:
          0.24157293 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.052990302 = queryNorm
          0.35257778 = fieldWeight in 1139, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1139)
    0.25 = coord(1/4)
  0.004233568 = product of:
    0.008467136 = sum of:
      0.008467136 = weight(_text_:e in 1139) [ClassicSimilarity], result of:
        0.008467136 = score(doc=1139,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.1111659 = fieldWeight in 1139, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.0546875 = fieldNorm(doc=1139)
    0.5 = coord(1/2)

Abstract: In light of AI (Artificial Intelligence) and NLP (Natural language processing) technologies, this article examines the feasibility of using AI/NLP models to enhance the subject indexing of digital resources. While BERT (Bidirectional Encoder Representations from Transformers) models are widely used in scholarly communities, the authors assess whether BERT models can be used in machine-assisted indexing in the Project Gutenberg collection, through suggesting Library of Congress subject headings filtered by certain Library of Congress Classification subclass labels. The findings of this study are informative for further research on BERT models to assist with automatic subject indexing for digital library collections.
Language: e

Ward, M.L.: ¬The future of the human indexer (1996) 0.03

0.025167137 = product of:
  0.050334275 = sum of:
    0.050334275 = sum of:
      0.0072575454 = weight(_text_:e in 7244) [ClassicSimilarity], result of:
        0.0072575454 = score(doc=7244,freq=2.0), product of:
          0.07616667 = queryWeight, product of:
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.052990302 = queryNorm
          0.09528506 = fieldWeight in 7244, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.43737 = idf(docFreq=28552, maxDocs=44218)
            0.046875 = fieldNorm(doc=7244)
      0.043076728 = weight(_text_:22 in 7244) [ClassicSimilarity], result of:
        0.043076728 = score(doc=7244,freq=2.0), product of:
          0.18556301 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.052990302 = queryNorm
          0.23214069 = fieldWeight in 7244, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=7244)
  0.5 = coord(1/2)

Date: 9. 2.1997 18:44:22
Language: e

Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.03

0.02512809 = product of:
  0.05025618 = sum of:
    0.05025618 = product of:
      0.10051236 = sum of:
        0.10051236 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
          0.10051236 = score(doc=262,freq=2.0), product of:
            0.18556301 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.052990302 = queryNorm
            0.5416616 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 20.10.2000 12:22:23

Search (269 results, page 1 of 14)

Authors

Years

Languages

Types

Themes

Subjects

Classifications