Search (39 results, page 1 of 2)

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.03

0.027614072 = product of:
  0.055228144 = sum of:
    0.055228144 = product of:
      0.11045629 = sum of:
        0.11045629 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.11045629 = score(doc=402,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information processing and management. 22(1986) no.6, S.465-476
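
Note: the explain tree above is plain arithmetic, and a minimal sketch can recompute it. The function name `classic_term_score` and its parameters are illustrative assumptions, not part of the search system. The idf formula used here, `1 + ln(maxDocs / (docFreq + 1))`, is the variant that reproduces the idf values shown in this listing. `queryNorm` depends on the whole query, so it is taken as given rather than derived.

```python
import math

def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=()):
    """Recompute one term's score contribution as laid out in the explain tree."""
    tf = math.sqrt(freq)                              # tf = sqrt(termFreq)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # reproduces idf(docFreq, maxDocs)
    query_weight = idf * query_norm                   # queryWeight = idf * queryNorm
    field_weight = tf * idf * field_norm              # fieldWeight = tf * idf * fieldNorm
    score = query_weight * field_weight
    for matched, total in coords:                     # each coord(m/n) scales by m/n
        score *= matched / total
    return score

# Values copied from the first result (doc=402, term "22").
voorhees = classic_term_score(
    freq=2.0, doc_freq=3622, max_docs=44218,
    query_norm=0.050953664, field_norm=0.125,
    coords=[(1, 2), (1, 2)],
)
print(f"{voorhees:.6f}")
```

The result should agree with the listed 0.027614072 up to floating-point rounding, since the listing uses single-precision values. The same function applied to a later entry with freq=4.0 and fieldNorm=0.03125 shows why a higher term frequency can still rank lower: the smaller field norm outweighs the larger tf.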

Fuhr, N.; Niewelt, B.: Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.02

0.024162313 = product of:
  0.048324626 = sum of:
    0.048324626 = product of:
      0.09664925 = sum of:
        0.09664925 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
          0.09664925 = score(doc=262,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.5416616 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 20.10.2000 12:22:23

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02

0.024162313 = product of:
  0.048324626 = sum of:
    0.048324626 = product of:
      0.09664925 = sum of:
        0.09664925 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
          0.09664925 = score(doc=6265,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.5416616 = fieldWeight in 6265, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6265)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information outlook. 9(2005) no.8, S.22-23

Mansour, N.; Haraty, R.A.; Daher, W.; Houri, M.: An auto-indexing method for Arabic text (2008) 0.02
```
0.023132125 = product of:
  0.04626425 = sum of:
    0.04626425 = product of:
      0.0925285 = sum of:
        0.0925285 = weight(_text_:64 in 2103) [ClassicSimilarity], result of:
          0.0925285 = score(doc=2103,freq=2.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.34695902 = fieldWeight in 2103, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.046875 = fieldNorm(doc=2103)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This work addresses the information retrieval problem of auto-indexing Arabic documents. Auto-indexing a text document refers to automatically extracting words that are suitable for building an index for the document. In this paper, we propose an auto-indexing method for Arabic text documents. This method is mainly based on morphological analysis and on a technique for assigning weights to words. The morphological analysis uses a number of grammatical rules to extract stem words that become candidate index words. The weight assignment technique computes weights for these words relative to the container document. The weight is based on how widely the word is spread across the document, not only on its rate of occurrence. The candidate index words are then sorted in descending order by weight so that information retrievers can select the more important index words. We empirically verify the usefulness of our method using several examples. For these examples, we obtained an average recall of 46% and an average precision of 64%.

Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.02

0.023132125 = product of:
  0.04626425 = sum of:
    0.04626425 = product of:
      0.0925285 = sum of:
        0.0925285 = weight(_text_:64 in 903) [ClassicSimilarity], result of:
          0.0925285 = score(doc=903,freq=2.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.34695902 = fieldWeight in 903, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.046875 = fieldNorm(doc=903)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Cf.: https://edoccluster.cms.hu-berlin.de/docviews/abstract.php?lang=ger&id=39543; http://edoc.hu-berlin.de/series/berliner-handreichungen/2012-329/PDF/329.pdf. See also the article in: iwp 64(2013) H.2/3, S. 96-106.

Willis, C.; Losee, R.M.: A random walk on an ontology : using thesaurus structure for automatic subject indexing (2013) 0.02

0.021809177 = product of:
  0.043618355 = sum of:
    0.043618355 = product of:
      0.08723671 = sum of:
        0.08723671 = weight(_text_:64 in 1016) [ClassicSimilarity], result of:
          0.08723671 = score(doc=1016,freq=4.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.3271161 = fieldWeight in 1016, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.03125 = fieldNorm(doc=1016)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Correction of a reference in: JASIST 64(2013) no.8, S.1757.
Source: Journal of the American Society for Information Science and Technology. 64(2013) no.7, S.1330-1344

Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.02

0.020710554 = product of:
  0.041421108 = sum of:
    0.041421108 = product of:
      0.082842216 = sum of:
        0.082842216 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
          0.082842216 = score(doc=58,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.46428138 = fieldWeight in 58, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=58)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 14. 6.2015 22:12:44

Hauer, M.: Automatische Indexierung (2000) 0.02

0.020710554 = product of:
  0.041421108 = sum of:
    0.041421108 = product of:
      0.082842216 = sum of:
        0.082842216 = weight(_text_:22 in 5887) [ClassicSimilarity], result of:
          0.082842216 = score(doc=5887,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.46428138 = fieldWeight in 5887, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=5887)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Wissen in Aktion: Wege des Knowledge Managements. 22. Online-Tagung der DGI, Frankfurt am Main, 2.-4.5.2000. Proceedings. Hrsg.: R. Schmidt

Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.02

0.020710554 = product of:
  0.041421108 = sum of:
    0.041421108 = product of:
      0.082842216 = sum of:
        0.082842216 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
          0.082842216 = score(doc=2051,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.46428138 = fieldWeight in 2051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=2051)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 14. 6.2015 22:12:56

Hauer, M.: Tiefenindexierung im Bibliothekskatalog : 17 Jahre intelligentCAPTURE (2019) 0.02

0.020710554 = product of:
  0.041421108 = sum of:
    0.041421108 = product of:
      0.082842216 = sum of:
        0.082842216 = weight(_text_:22 in 5629) [ClassicSimilarity], result of:
          0.082842216 = score(doc=5629,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.46428138 = fieldWeight in 5629, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=5629)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: B.I.T.online. 22(2019) H.2, S.163-166

Kempf, A.O.: Automatische Inhaltserschließung in der Fachinformation (2013) 0.02

0.019276772 = product of:
  0.038553543 = sum of:
    0.038553543 = product of:
      0.07710709 = sum of:
        0.07710709 = weight(_text_:64 in 905) [ClassicSimilarity], result of:
          0.07710709 = score(doc=905,freq=2.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.28913254 = fieldWeight in 905, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.0390625 = fieldNorm(doc=905)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information - Wissenschaft und Praxis. 64(2013) H.2/3, S.96-106

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: The automatic indexing system AIR/PHYS : from research to application (1988) 0.02

0.017258795 = product of:
  0.03451759 = sum of:
    0.03451759 = product of:
      0.06903518 = sum of:
        0.06903518 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
          0.06903518 = score(doc=1952,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.38690117 = fieldWeight in 1952, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1952)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 16. 8.1998 12:51:22

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02

0.017258795 = product of:
  0.03451759 = sum of:
    0.03451759 = product of:
      0.06903518 = sum of:
        0.06903518 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.06903518 = score(doc=4157,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Tsareva, P.V.: Algoritmy dlya raspoznavaniya pozitivnykh i negativnykh vkhozdenii deskriptorov v tekst i protsedura avtomaticheskoi klassifikatsii tekstov (1999) 0.02

0.017258795 = product of:
  0.03451759 = sum of:
    0.03451759 = product of:
      0.06903518 = sum of:
        0.06903518 = weight(_text_:22 in 374) [ClassicSimilarity], result of:
          0.06903518 = score(doc=374,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.38690117 = fieldWeight in 374, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=374)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 4.2002 10:22:41

Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.02

0.017258795 = product of:
  0.03451759 = sum of:
    0.03451759 = product of:
      0.06903518 = sum of:
        0.06903518 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
          0.06903518 = score(doc=2759,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.38690117 = fieldWeight in 2759, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2759)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22

Schneider, A.: Moderne Retrievalverfahren in klassischen bibliotheksbezogenen Anwendungen : Projekte und Perspektiven (2008) 0.02

0.015421417 = product of:
  0.030842833 = sum of:
    0.030842833 = product of:
      0.061685666 = sum of:
        0.061685666 = weight(_text_:64 in 4031) [ClassicSimilarity], result of:
          0.061685666 = score(doc=4031,freq=2.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.23130602 = fieldWeight in 4031, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.03125 = fieldNorm(doc=4031)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: 64

Kajanan, S.; Bao, Y.; Datta, A.; VanderMeer, D.; Dutta, K.: Efficient automatic search query formulation using phrase-level analysis (2014) 0.02
```
0.015421417 = product of:
  0.030842833 = sum of:
    0.030842833 = product of:
      0.061685666 = sum of:
        0.061685666 = weight(_text_:64 in 1264) [ClassicSimilarity], result of:
          0.061685666 = score(doc=1264,freq=2.0), product of:
            0.26668423 = queryWeight, product of:
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.050953664 = queryNorm
            0.23130602 = fieldWeight in 1264, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.2338576 = idf(docFreq=640, maxDocs=44218)
              0.03125 = fieldNorm(doc=1264)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Over the past decade, the volume of information available digitally over the Internet has grown enormously. Technical developments in the area of search, such as Google's PageRank algorithm, have proved so good at serving relevant results that Internet search has become integrated into daily human activity. One can endlessly explore topics of interest simply by querying and reading through the resulting links. Yet, although search engines are well known for providing relevant results based on users' queries, users do not always receive the results they are looking for. Google's Director of Research describes clickstream evidence of frustrated users repeatedly reformulating queries and searching through page after page of results. Given the general quality of search engine results, one must consider the possibility that the frustrated user's query is not effective; that is, it does not describe the essence of the user's interest. Indeed, extensive research into human search behavior has found that humans are not very effective at formulating good search queries that describe what they are interested in. Ideally, the user should simply point to a portion of text that sparked the user's interest, and a system should automatically formulate a search query that captures the essence of the text. In this paper, we describe an implemented system that provides this capability. We first describe how our work differs from existing work in automatic query formulation, and propose a new method for improved quantification of the relevance of candidate search terms drawn from input text using phrase-level analysis. We then propose an implementable method designed to provide relevant queries based on a user's text input. We demonstrate the quality of our results and performance of our system through experimental studies.
Our results demonstrate that our system produces relevant search terms with roughly two-thirds precision and recall compared to search terms selected by experts, and that typical users find significantly more relevant results (31% more relevant) more quickly (64% faster) using our system than self-formulated search queries. Further, we show that our implementation can scale to request loads of up to 10 requests per second within current online responsiveness expectations (<2-second response times at the highest loads tested).

Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.01

0.013807036 = product of:
  0.027614072 = sum of:
    0.027614072 = product of:
      0.055228144 = sum of:
        0.055228144 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
          0.055228144 = score(doc=4709,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.30952093 = fieldWeight in 4709, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=4709)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19

Riloff, E.: An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.01

0.013807036 = product of:
  0.027614072 = sum of:
    0.027614072 = product of:
      0.055228144 = sum of:
        0.055228144 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
          0.055228144 = score(doc=6752,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.30952093 = fieldWeight in 6752, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6752)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 6. 3.1997 16:22:15

Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006) 0.01

0.013807036 = product of:
  0.027614072 = sum of:
    0.027614072 = product of:
      0.055228144 = sum of:
        0.055228144 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
          0.055228144 = score(doc=3581,freq=2.0), product of:
            0.17843105 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050953664 = queryNorm
            0.30952093 = fieldWeight in 3581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=3581)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 24. 3.2006 12:22:02
