Search (27 results, page 1 of 2)

  • × language_ss:"e"
  • × theme_ss:"Automatisches Indexieren"
  • × type_ss:"a"
  1. Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.08
    0.07934082 = product of:
      0.15868165 = sum of:
        0.15868165 = sum of:
          0.10820947 = weight(_text_:n in 1952) [ClassicSimilarity], result of:
            0.10820947 = score(doc=1952,freq=4.0), product of:
              0.16062054 = queryWeight, product of:
                4.3116565 = idf(docFreq=1611, maxDocs=44218)
                0.03725263 = queryNorm
              0.67369634 = fieldWeight in 1952, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3116565 = idf(docFreq=1611, maxDocs=44218)
                0.078125 = fieldNorm(doc=1952)
          0.050472174 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
            0.050472174 = score(doc=1952,freq=2.0), product of:
              0.13045236 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.03725263 = queryNorm
              0.38690117 = fieldWeight in 1952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.078125 = fieldNorm(doc=1952)
      0.5 = coord(1/2)
    
    Date
    16. 8.1998 12:51:22
  2. Jardine, N.; Rijsbergen, C.J. van: ¬The use of hierarchic clustering in information retrieval (1971) 0.03
    0.030606259 = product of:
      0.061212517 = sum of:
        0.061212517 = product of:
          0.122425035 = sum of:
            0.122425035 = weight(_text_:n in 5170) [ClassicSimilarity], result of:
              0.122425035 = score(doc=5170,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.76220036 = fieldWeight in 5170, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.125 = fieldNorm(doc=5170)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  3. Garfield, E.; Sager, N.: Mechanical indexing, structural linguistics and information retrieval (1993) 0.03
    0.030606259 = product of:
      0.061212517 = sum of:
        0.061212517 = product of:
          0.122425035 = sum of:
            0.122425035 = weight(_text_:n in 5900) [ClassicSimilarity], result of:
              0.122425035 = score(doc=5900,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.76220036 = fieldWeight in 5900, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.125 = fieldNorm(doc=5900)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Fuhr, N.; Knorz, G.: Retrieval test evaluation of a rule based automatic indexing (AIR/PHYS) (1984) 0.02
    0.022954693 = product of:
      0.045909386 = sum of:
        0.045909386 = product of:
          0.09181877 = sum of:
            0.09181877 = weight(_text_:n in 2321) [ClassicSimilarity], result of:
              0.09181877 = score(doc=2321,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.57165027 = fieldWeight in 2321, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2321)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02
    0.020188868 = product of:
      0.040377736 = sum of:
        0.040377736 = product of:
          0.08075547 = sum of:
            0.08075547 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
              0.08075547 = score(doc=402,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.61904186 = fieldWeight in 402, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=402)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information processing and management. 22(1986) no.6, S.465-476
  6. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02
    0.01766526 = product of:
      0.03533052 = sum of:
        0.03533052 = product of:
          0.07066104 = sum of:
            0.07066104 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
              0.07066104 = score(doc=6265,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.5416616 = fieldWeight in 6265, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6265)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information outlook. 9(2005) no.8, S.22-23
  7. Dolamic, L.; Savoy, J.: Indexing and searching strategies for the Russian language (2009) 0.01
    0.013526184 = product of:
      0.027052367 = sum of:
        0.027052367 = product of:
          0.054104734 = sum of:
            0.054104734 = weight(_text_:n in 3301) [ClassicSimilarity], result of:
              0.054104734 = score(doc=3301,freq=4.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.33684817 = fieldWeight in 3301, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3301)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This paper describes and evaluates various stemming and indexing strategies for the Russian language. We design and evaluate two stemming approaches, a light and a more aggressive one, and compare these stemmers to the Snowball stemmer, to no stemming, and also to a language-independent approach (n-gram). To evaluate the suggested stemming strategies we apply various probabilistic information retrieval (IR) models, including the Okapi, the Divergence from Randomness (DFR), a statistical language model (LM), as well as two vector-space approaches, namely, the classical tf idf scheme and the dtu-dtn model. We find that the vector-space dtu-dtn and the DFR models tend to result in better retrieval effectiveness than the Okapi, LM, or tf idf models, while only the latter two IR approaches result in statistically significant performance differences. Ignoring stemming generally reduces the MAP by more than 50%, and these differences are always significant. When applying an n-gram approach, performance differences are usually lower than an approach involving stemming. Finally, our light stemmer tends to perform best, although performance differences between the light, aggressive, and Snowball stemmers are not statistically significant.
  8. Cohen, J.D.: Highlights: language- and domain-independent automatic indexing terms for abstracting (1995) 0.01
    0.013390238 = product of:
      0.026780477 = sum of:
        0.026780477 = product of:
          0.053560954 = sum of:
            0.053560954 = weight(_text_:n in 1793) [ClassicSimilarity], result of:
              0.053560954 = score(doc=1793,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.33346266 = fieldWeight in 1793, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1793)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Presents a model of drawing index terms from text. The approach uses no stop list, stemmer, or other language and domain specific component, allowing operation in any language or domain with only trivial modification. The method uses n-grams counts, achieving a function similar to, but more general than, a stemmer. The generated index terms, called 'highlights', are suitable for identifying the topic for perusal and selection. An extension is also described and demonstrated which selects index terms to represent a subset of documents, distinguishing them from the corpus. Presents some experimental results, showing operation in English, Spanish, German, Georgian, Russian and Japanese
  9. Pfeifer, U.; Fuhr, N.; Huynh, T.: Searching structured documents with the enhanced retrieval functionality of freeWAIS-sf and SFgate (1995) 0.01
    0.013390238 = product of:
      0.026780477 = sum of:
        0.026780477 = product of:
          0.053560954 = sum of:
            0.053560954 = weight(_text_:n in 2214) [ClassicSimilarity], result of:
              0.053560954 = score(doc=2214,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.33346266 = fieldWeight in 2214, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2214)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  10. Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.01
    0.012618043 = product of:
      0.025236087 = sum of:
        0.025236087 = product of:
          0.050472174 = sum of:
            0.050472174 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
              0.050472174 = score(doc=4157,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.38690117 = fieldWeight in 4157, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4157)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill
  11. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.01
    0.012618043 = product of:
      0.025236087 = sum of:
        0.025236087 = product of:
          0.050472174 = sum of:
            0.050472174 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
              0.050472174 = score(doc=2759,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.38690117 = fieldWeight in 2759, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2759)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  12. Wacholder, N.; Byrd, R.J.: Retrieving information from full text using linguistic knowledge (1994) 0.01
    0.0114773465 = product of:
      0.022954693 = sum of:
        0.022954693 = product of:
          0.045909386 = sum of:
            0.045909386 = weight(_text_:n in 8524) [ClassicSimilarity], result of:
              0.045909386 = score(doc=8524,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.28582513 = fieldWeight in 8524, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=8524)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  13. Mansour, N.; Haraty, R.A.; Daher, W.; Houri, M.: ¬An auto-indexing method for Arabic text (2008) 0.01
    0.0114773465 = product of:
      0.022954693 = sum of:
        0.022954693 = product of:
          0.045909386 = sum of:
            0.045909386 = weight(_text_:n in 2103) [ClassicSimilarity], result of:
              0.045909386 = score(doc=2103,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.28582513 = fieldWeight in 2103, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2103)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  14. Fauzi, F.; Belkhatir, M.: Multifaceted conceptual image indexing on the world wide web (2013) 0.01
    0.0114773465 = product of:
      0.022954693 = sum of:
        0.022954693 = product of:
          0.045909386 = sum of:
            0.045909386 = weight(_text_:n in 2721) [ClassicSimilarity], result of:
              0.045909386 = score(doc=2721,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.28582513 = fieldWeight in 2721, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2721)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    In this paper, we describe a user-centered design of an automated multifaceted concept-based indexing framework which analyzes the semantics of the Web image contextual information and classifies it into five broad semantic concept facets: signal, object, abstract, scene, and relational; and identifies the semantic relationships between the concepts. An important aspect of our indexing model is that it relates to the users' levels of image descriptions. Also, a major contribution relies on the fact that the classification is performed automatically with the raw image contextual information extracted from any general webpage and is not solely based on image tags like state-of-the-art solutions. Human Language Technology techniques and an external knowledge base are used to analyze the information both syntactically and semantically. Experimental results on a human-annotated Web image collection and corresponding contextual information indicate that our method outperforms empirical frameworks employing tf-idf and location-based tf-idf weighting schemes as well as n-gram indexing in a recall/precision based evaluation framework.
  15. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01
    0.010094434 = product of:
      0.020188868 = sum of:
        0.020188868 = product of:
          0.040377736 = sum of:
            0.040377736 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
              0.040377736 = score(doc=4709,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.30952093 = fieldWeight in 4709, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4709)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19
  16. Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01
    0.010094434 = product of:
      0.020188868 = sum of:
        0.020188868 = product of:
          0.040377736 = sum of:
            0.040377736 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
              0.040377736 = score(doc=6752,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.30952093 = fieldWeight in 6752, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6752)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    6. 3.1997 16:22:15
  17. Vledutz-Stokolov, N.: Concept recognition in an automatic text-processing system for the life sciences (1987) 0.01
    0.009564456 = product of:
      0.019128911 = sum of:
        0.019128911 = product of:
          0.038257822 = sum of:
            0.038257822 = weight(_text_:n in 2849) [ClassicSimilarity], result of:
              0.038257822 = score(doc=2849,freq=2.0), product of:
                0.16062054 = queryWeight, product of:
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.03725263 = queryNorm
                0.23818761 = fieldWeight in 2849, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.3116565 = idf(docFreq=1611, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2849)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  18. Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.01
    0.00883263 = product of:
      0.01766526 = sum of:
        0.01766526 = product of:
          0.03533052 = sum of:
            0.03533052 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
              0.03533052 = score(doc=5001,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.2708308 = fieldWeight in 5001, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5001)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 3.1996 13:22:21
  19. Bordoni, L.; Pazienza, M.T.: Documents automatic indexing in an environmental domain (1997) 0.01
    0.00883263 = product of:
      0.01766526 = sum of:
        0.01766526 = product of:
          0.03533052 = sum of:
            0.03533052 = weight(_text_:22 in 530) [ClassicSimilarity], result of:
              0.03533052 = score(doc=530,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.2708308 = fieldWeight in 530, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=530)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    International forum on information and documentation. 22(1997) no.1, S.17-28
  20. Wolfekuhler, M.R.; Punch, W.F.: Finding salient features for personal Web pages categories (1997) 0.01
    0.00883263 = product of:
      0.01766526 = sum of:
        0.01766526 = product of:
          0.03533052 = sum of:
            0.03533052 = weight(_text_:22 in 2673) [ClassicSimilarity], result of:
              0.03533052 = score(doc=2673,freq=2.0), product of:
                0.13045236 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03725263 = queryNorm
                0.2708308 = fieldWeight in 2673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2673)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06