Search (26 results, page 1 of 2)

Salton, G.: Another look at automatic text-retrieval systems (1986) 0.08

0.07970953 = product of:
  0.119564295 = sum of:
    0.099504575 = weight(_text_:retrieval in 1356) [ClassicSimilarity], result of:
      0.099504575 = score(doc=1356,freq=10.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.74731416 = fieldWeight in 1356, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=1356)
    0.020059723 = product of:
      0.060179166 = sum of:
        0.060179166 = weight(_text_:29 in 1356) [ClassicSimilarity], result of:
          0.060179166 = score(doc=1356,freq=2.0), product of:
            0.15484026 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.04401763 = queryNorm
            0.38865322 = fieldWeight in 1356, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.078125 = fieldNorm(doc=1356)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)

Footnote: Bezugnahme auf: Blair, D.C.: An evaluation of retrieval effectiveness for a full-text document-retrieval system. Comm. ACM 28(1985) S.280-299. - Vgl. auch: Blair, D.C.: Full text retrieval ... Int. Class. 13(1986) S.18-23; Blair, D.C., M.E. Maron: full-text information retrieval ... Inf. Proc. Man. 26(1990) S.437-447.
Source: Communications of the Association for Computing Machinery. 29(1986), S.648-656

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.07

0.06867101 = product of:
  0.10300651 = sum of:
    0.07119968 = weight(_text_:retrieval in 402) [ClassicSimilarity], result of:
      0.07119968 = score(doc=402,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5347345 = fieldWeight in 402, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.125 = fieldNorm(doc=402)
    0.031806834 = product of:
      0.0954205 = sum of:
        0.0954205 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.0954205 = score(doc=402,freq=2.0), product of:
            0.15414225 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04401763 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)

Source: Information processing and management. 22(1986) no.6, S.465-476

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.06

0.055207662 = product of:
  0.08281149 = sum of:
    0.062932216 = weight(_text_:retrieval in 1952) [ClassicSimilarity], result of:
      0.062932216 = score(doc=1952,freq=4.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.47264296 = fieldWeight in 1952, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=1952)
    0.019879272 = product of:
      0.059637815 = sum of:
        0.059637815 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
          0.059637815 = score(doc=1952,freq=2.0), product of:
            0.15414225 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04401763 = queryNorm
            0.38690117 = fieldWeight in 1952, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1952)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)

Date: 16. 8.1998 12:51:22
Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.513-517.
Source: Proceedings of the 11th annual conference on research and development in information retrieval. Ed.: Y. Chiaramella

Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.05

0.05081014 = product of:
  0.07621521 = sum of:
    0.06229972 = weight(_text_:retrieval in 5001) [ClassicSimilarity], result of:
      0.06229972 = score(doc=5001,freq=8.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.46789268 = fieldWeight in 5001, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5001)
    0.01391549 = product of:
      0.04174647 = sum of:
        0.04174647 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
          0.04174647 = score(doc=5001,freq=2.0), product of:
            0.15414225 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04401763 = queryNorm
            0.2708308 = fieldWeight in 5001, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5001)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)

Abstract: A study was done to test the effectiveness of retrieval using title word searching. It was based on actual search profiles used in the Mechanized Information Center at Ohio State University, in order ro replicate as closely as possible actual searching conditions. Fewer than 50% of the relevant titles were retrieved by keywords in titles. The low rate of retrieval can be attributes to three sources: titles themselves, user and information specialist ignorance of the subject vocabulary in use, and to general language problems. Across fields it was found that the social sciences had the best retrieval rate, with science having the next best, and arts and humanities the lowest. Ways to enhance and supplement keyword in title searching on the computer and in printed indexes are discussed.
Date: 14. 3.1996 13:22:21

Salton, G.; McGill, M. J.: Information Retrieval: Grundlegendes für Informationswissenschaftler (1987) 0.03

0.025691971 = product of:
  0.07707591 = sum of:
    0.07707591 = weight(_text_:retrieval in 8648) [ClassicSimilarity], result of:
      0.07707591 = score(doc=8648,freq=6.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5788671 = fieldWeight in 8648, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=8648)
  0.33333334 = coord(1/3)

Content: Enthält die Kapitel: Information Retrieval: eine Einführung; Invertierte Dateisysteme; Textanalyse und automatisches Indexieren; Die experimentellen Retrievalsysteme SMART und SIRE; Die Bewertung von Retrievalsystemen; Fortgeschrittene Retrievaltechniken; Verarbeitung natürlicher Sprache; Informationstechnologie: Hardware und Software; Datenbankmanagementsysteme; Zukünftige Entwicklungen im Information Retrieval

Salton, G.: Automatic text processing : the transformation, analysis, and retrieval of information by computer (1989) 0.03

0.025691971 = product of:
  0.07707591 = sum of:
    0.07707591 = weight(_text_:retrieval in 1307) [ClassicSimilarity], result of:
      0.07707591 = score(doc=1307,freq=6.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5788671 = fieldWeight in 1307, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=1307)
  0.33333334 = coord(1/3)

COMPASS: Information retrieval / Use of / On-line computers
Subject: Information retrieval / Use of / On-line computers

Fuhr, N.; Knorz, G.: Retrieval test evaluation of a rule based automatic indexing (AIR/PHYS) (1984) 0.03

0.02517289 = product of:
  0.07551867 = sum of:
    0.07551867 = weight(_text_:retrieval in 2321) [ClassicSimilarity], result of:
      0.07551867 = score(doc=2321,freq=4.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5671716 = fieldWeight in 2321, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.09375 = fieldNorm(doc=2321)
  0.33333334 = coord(1/3)

Source: Research and development in information retrieval. Proc. of the 3rd joint BCS and ACM symp., Cambridge, 2.-6.7.1984. Ed.: C.J. van Rijsbergen

Salton, G.; McGill, M. J.: Introduction to modern information retrieval (1983) 0.02

0.023733227 = product of:
  0.07119968 = sum of:
    0.07119968 = weight(_text_:retrieval in 2328) [ClassicSimilarity], result of:
      0.07119968 = score(doc=2328,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5347345 = fieldWeight in 2328, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.125 = fieldNorm(doc=2328)
  0.33333334 = coord(1/3)

Research and development in information retrieval : Proc., Berlin, 18.-20.5.1982 (1983) 0.02

0.023733227 = product of:
  0.07119968 = sum of:
    0.07119968 = weight(_text_:retrieval in 2332) [ClassicSimilarity], result of:
      0.07119968 = score(doc=2332,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5347345 = fieldWeight in 2332, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.125 = fieldNorm(doc=2332)
  0.33333334 = coord(1/3)

Advances in intelligent retrieval: Proc. of a conference ... Wadham College, Oxford, 16.-17.4.1985 (1986) 0.02
```
0.023547081 = product of:
  0.07064124 = sum of:
    0.07064124 = weight(_text_:retrieval in 1384) [ClassicSimilarity], result of:
      0.07064124 = score(doc=1384,freq=14.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.5305404 = fieldWeight in 1384, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=1384)
  0.33333334 = coord(1/3)
```
Content

Enthält die Beiträge: ADDIS, T.: Extended relational analysis: a design approach to knowledge-based systems; PARKINSON, D.: Supercomputers and non-numeric processing; McGREGOR, D.R. u. J.R. MALONE: An architectural approach to advances in information retrieval; ALLEN, M.J. u. O.S. HARRISON: Word processing and information retrieval: some practical problems; MURTAGH, F.: Clustering and nearest neighborhood searching; ENSER, P.G.B.: Experimenting with the automatic classification of books; TESKEY, N. u. Z. RAZAK: An analysis of ranking for free text retrieval systems; ZARRI, G.P.: Interactive information retrieval: an artificial intelligence approach to deal with biographical data; HANCOX, P. u. F. SMITH: A case system processor for the PRECIS indexing language; ROUAULT, J.: Linguistic methods in information retrieval systems; ARAGON-RAMIREZ, V. u. C.D. PAICE: Design of a system for the online elucidation of natural language search statements; BROOKS, H.M., P.J. DANIELS u. N.J. BELKIN: Problem descriptions and user models: developing an intelligent interface for document retrieval systems; BLACK, W.J., P. HARGREAVES u. P.B. MAYES: HEADS: a cataloguing advisory system; BELL, D.A.: An architecture for integrating data, knowledge, and information bases

Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 0.02

0.020766575 = product of:
  0.06229972 = sum of:
    0.06229972 = weight(_text_:retrieval in 2415) [ClassicSimilarity], result of:
      0.06229972 = score(doc=2415,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.46789268 = fieldWeight in 2415, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.109375 = fieldNorm(doc=2415)
  0.33333334 = coord(1/3)

Fuhr, N.: Probabilistisches Indexing and Retrieval (1988) 0.02

0.020766575 = product of:
  0.06229972 = sum of:
    0.06229972 = weight(_text_:retrieval in 4829) [ClassicSimilarity], result of:
      0.06229972 = score(doc=4829,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.46789268 = fieldWeight in 4829, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.109375 = fieldNorm(doc=4829)
  0.33333334 = coord(1/3)

Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.02
```
0.020553578 = product of:
  0.061660733 = sum of:
    0.061660733 = weight(_text_:retrieval in 2322) [ClassicSimilarity], result of:
      0.061660733 = score(doc=2322,freq=6.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.46309367 = fieldWeight in 2322, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0625 = fieldNorm(doc=2322)
  0.33333334 = coord(1/3)
```
Abstract

Ausgehend von theoretischen Indexierungsansätzen wird das klassische Vektorraum-Modell für automatische Indexierung (mit dem Trennschärfen-Modell) erläutert. Das Clustering in Information-Retrieval-Systemem wird als eine natürliche logische Folge aus diesem Modell aufgefaßt und in allen seinen Ausprägungen (d.h. als Dokumenten-, Term- oder Dokumenten- und Termklassifikation) behandelt. Anschließend werden die Suchstrategien in vorklassifizierten Dokumentenbeständen (Clustersuche) detailliert beschrieben. Zum Schluß wird noch die sinnvolle Anwendung der Clusteranalyse in Information-Retrieval-Systemen kurz diskutiert

Porter, M.F.: ¬An algorithm for suffix stripping (1980) 0.02

0.017799921 = product of:
  0.05339976 = sum of:
    0.05339976 = weight(_text_:retrieval in 3122) [ClassicSimilarity], result of:
      0.05339976 = score(doc=3122,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.40105087 = fieldWeight in 3122, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.09375 = fieldNorm(doc=3122)
  0.33333334 = coord(1/3)

Footnote: Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.313-316.

Salton, G.: Automatic processing of foreign language documents (1985) 0.02
```
0.015698053 = product of:
  0.04709416 = sum of:
    0.04709416 = weight(_text_:retrieval in 3650) [ClassicSimilarity], result of:
      0.04709416 = score(doc=3650,freq=14.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.3536936 = fieldWeight in 3650, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.03125 = fieldNorm(doc=3650)
  0.33333334 = coord(1/3)
```
Abstract

The attempt to computerize a process, such as indexing, abstracting, classifying, or retrieving information, begins with an analysis of the process into its intellectual and nonintellectual components. That part of the process which is amenable to computerization is mechanical or algorithmic. What is not is intellectual or creative and requires human intervention. Gerard Salton has been an innovator, experimenter, and promoter in the area of mechanized information systems since the early 1960s. He has been particularly ingenious at analyzing the process of information retrieval into its algorithmic components. He received a doctorate in applied mathematics from Harvard University before moving to the computer science department at Cornell, where he developed a prototype automatic retrieval system called SMART. Working with this system he and his students contributed for over a decade to our theoretical understanding of the retrieval process. On a more practical level, they have contributed design criteria for operating retrieval systems. The following selection presents one of the early descriptions of the SMART system; it is valuable as it shows the direction automatic retrieval methods were to take beyond simple word-matching techniques. These include various word normalization techniques to improve recall, for instance, the separation of words into stems and affixes; the correlation and clustering, using statistical association measures, of related terms; and the identification, using a concept thesaurus, of synonymous, broader, narrower, and sibling terms. They include, as weIl, techniques, both linguistic and statistical, to deal with the thorny problem of how to automatically extract from texts index terms that consist of more than one word. They include weighting techniques and various documentrequest matching algorithms. Significant among the latter are those which produce a retrieval output of citations ranked in relevante order. During the 1970s, Salton and his students went an to further refine these various techniques, particularly the weighting and statistical association measures. Many of their early innovations seem commonplace today. Some of their later techniques are still ahead of their time and await technological developments for implementation. The particular focus of the selection that follows is an the evaluation of a particular component of the SMART system, a multilingual thesaurus. By mapping English language expressions and their German equivalents to a common concept number, the thesaurus permitted the automatic processing of German language documents against English language queries and vice versa. The results of the evaluation, as it turned out, were somewhat inconclusive. However, this SMART experiment suggested in a bold and optimistic way how one might proceed to answer such complex questions as What is meant by retrieval language compatability? How it is to be achieved, and how evaluated?
Lochbaum, K.E.; Streeter, A.R.: Comparing and combining the effectiveness of latent semantic indexing and the ordinary vector space model for information retrieval (1989) 0.02
```
0.015415184 = product of:
  0.046245553 = sum of:
    0.046245553 = weight(_text_:retrieval in 3458) [ClassicSimilarity], result of:
      0.046245553 = score(doc=3458,freq=6.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.34732026 = fieldWeight in 3458, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.046875 = fieldNorm(doc=3458)
  0.33333334 = coord(1/3)
```
Abstract

A retrievalsystem was built to find individuals with appropriate expertise within a large research establishment on the basis of their authored documents. The expert-locating system uses a new method for automatic indexing and retrieval based on singular value decomposition, a matrix decomposition technique related to the factor analysis. Organizational groups, represented by the documents they write, and the terms contained in these documents, are fit simultaneously into a 100-dimensional "semantic" space. User queries are positioned in the semantic space, and the most similar groups are returned to the user. Here we compared the standard vector-space model with this new technique and found that combining the two methods improved performance over either alone. We also examined the effects of various experimental variables on the system`s retrieval accuracy. In particular, the effects of: term weighting functions in the semantic space construction and in query construction, suffix stripping, and using lexical units larger than a a single word were studied.
Stock, M.: Textwortmethode und Übersetzungsrelation : Eine Methode zum Aufbau von kombinierten Literaturnachweis- und Terminologiedatenbanken (1989) 0.01
```
0.014833267 = product of:
  0.0444998 = sum of:
    0.0444998 = weight(_text_:retrieval in 3412) [ClassicSimilarity], result of:
      0.0444998 = score(doc=3412,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.33420905 = fieldWeight in 3412, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=3412)
  0.33333334 = coord(1/3)
```
Abstract

Geisteswissenschaftliche Fachinformation erfordert eine enge Kooperation zwischen Literaturnachweis- und Terminologieinformationssystemen. Eine geeignete Dokumentationsmethode für die Auswertung geisteswissen- schaftlicher Literatur ist die Textwortwethode. Dem originalsprachig aufgenommenen Begriffsrepertoire ist ein einheitssprachiger Zugriff beizuordnen, der einerseits ein vollständiges und genaues Retrieval garantiert und andererseits den Aufbau fachspezifischer Wörterbücher vorantreibt
Fagan, J.L.: ¬The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.01
```
0.014833267 = product of:
  0.0444998 = sum of:
    0.0444998 = weight(_text_:retrieval in 1845) [ClassicSimilarity], result of:
      0.0444998 = score(doc=1845,freq=8.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.33420905 = fieldWeight in 1845, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1845)
  0.33333334 = coord(1/3)
```
Abstract

It may be possible to improve the quality of automatic indexing systems by using complex descriptors, for example, phrases, in addition to the simple descriptors (words or word stems) that are normally used in automatically constructed representations of document content. This study is directed toward the goal of developing effective methods of identifying phrases in natural language text from which good quality phrase descriptors can be constructed. The effectiveness of one method, a simple nonsyntactic phrase indexing procedure, has been tested on five experimental document collections. The results have been analyzed in order to identify the inadequacies of the procedure, and to determine what kinds of information about text structure are needed in order to construct phrase descriptors that are good indicators of document content. Two primary conclusions have been reached: (1) In the retrieval experiments, the nonsyntactic phrase construction procedure did not consistently yield substantial improvements in effectiveness. It is therefore not likely that phrase indexing of this kind will prove to be an important method of enhancing the performance of automatic document indexing and retrieval systems in operational environments. (2) Many of the shortcomings of the nonsyntactic approach can be overcome by incorporating syntactic information into the phrase construction process. However, a general syntactic analysis facility may be required, since many useful sources of phrases cannot be exploited if only a limited inventory of syntactic patterns can be recognized. Further research should be conducted into methods of incorporating automatic syntactic analysis into content analysis for document retrieval.

Croft, W.B.: Automatic indexing : file organization and display for information retrieval (1989) 0.01

0.014833267 = product of:
  0.0444998 = sum of:
    0.0444998 = weight(_text_:retrieval in 2412) [ClassicSimilarity], result of:
      0.0444998 = score(doc=2412,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.33420905 = fieldWeight in 2412, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.078125 = fieldNorm(doc=2412)
  0.33333334 = coord(1/3)

Zimmermann, H.: Automatische Indexierung: Entwicklung und Perspektiven (1983) 0.01
```
0.011866613 = product of:
  0.03559984 = sum of:
    0.03559984 = weight(_text_:retrieval in 2318) [ClassicSimilarity], result of:
      0.03559984 = score(doc=2318,freq=2.0), product of:
        0.1331496 = queryWeight, product of:
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.04401763 = queryNorm
        0.26736724 = fieldWeight in 2318, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.024915 = idf(docFreq=5836, maxDocs=44218)
          0.0625 = fieldNorm(doc=2318)
  0.33333334 = coord(1/3)
```
Abstract

Die Automatische Indexierung als ein Teilgebiet der Inhaltserschließung wird inzwischen in einer Reihe von Gebieten, vor allem in der Fachinformation und Kommunikation praktisch eingesetzt. Dabei dominieren äußerst einfache Systeme, die (noch) erhebliche Anpassungen des Benutzers an die jeweilige Systemstrategie voraussetzen. Unter Berücksichtigung des Konzepts der Einheit von Informationserschließung und -retrieval werden höherwertige ("intelligentere") Verfahren vorgestellt, die der Entlastung des Informationssuchenden wie auch der Verbesserung der Rechercheergebnisse dienen sollen

Search (26 results, page 1 of 2)

Authors

Languages

Types

Themes