Search (27 results, page 1 of 2)

  • author_ss:"Ellis, D."
  1. Ellis, D.: Progress and problems in information retrieval (1996) 0.01
    0.007138887 = product of:
      0.028555548 = sum of:
        0.028555548 = weight(_text_:information in 789) [ClassicSimilarity], result of:
          0.028555548 = score(doc=789,freq=18.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.46549135 = fieldWeight in 789, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=789)
      0.25 = coord(1/4)
    
    Abstract
    An introduction to the principal generic approaches to information retrieval research with their associated concepts, models and systems, this text is designed to keep the information professional up to date with the major themes and developments that have preoccupied researchers in recent months in relation to textual and documentary retrieval systems.
    COMPASS
    Information retrieval
    Content
    First published 1991 as New horizons in information retrieval
    Footnote
    Review in: Managing information 3(1996) no.10, S.49 (D. Bawden); Program 32(1998) no.2, S.190-192 (C. Revie)
    LCSH
    Information retrieval
    Subject
    Information retrieval
  2. Ellis, D.: ¬A behavioral model for information retrieval system design (1989) 0.01
    0.0067306077 = product of:
      0.02692243 = sum of:
        0.02692243 = weight(_text_:information in 2707) [ClassicSimilarity], result of:
          0.02692243 = score(doc=2707,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.43886948 = fieldWeight in 2707, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=2707)
      0.25 = coord(1/4)
    
    Source
    Journal of information science. 15(1989) no.4, S.237-247
  3. Ellis, D.: Theory and explanation in information retrieval research (1984) 0.01
    0.0067306077 = product of:
      0.02692243 = sum of:
        0.02692243 = weight(_text_:information in 5337) [ClassicSimilarity], result of:
          0.02692243 = score(doc=5337,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.43886948 = fieldWeight in 5337, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=5337)
      0.25 = coord(1/4)
    
    Source
    Journal of information science. 8(1984), S.25-38
  4. Ellis, D.: Paradigms in information retrieval research (1994) 0.01
    0.0067306077 = product of:
      0.02692243 = sum of:
        0.02692243 = weight(_text_:information in 1261) [ClassicSimilarity], result of:
          0.02692243 = score(doc=1261,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.43886948 = fieldWeight in 1261, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=1261)
      0.25 = coord(1/4)
    
    Source
    Encyclopedia of library and information science. Vol.54, [=Suppl.17]
  5. Ellis, D.; Wilson, T.; Allen, D.: Information science and information systems : conjunct subjects - disjunct disciplines (1999) 0.01
    0.0066512655 = product of:
      0.026605062 = sum of:
        0.026605062 = weight(_text_:information in 4345) [ClassicSimilarity], result of:
          0.026605062 = score(doc=4345,freq=10.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.43369597 = fieldWeight in 4345, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=4345)
      0.25 = coord(1/4)
    
    Content
    Contribution to a special issue: The 50th Anniversary of the Journal of the American Society for Information Science. Pt.2: Paradigms, models, and methods of information science
    Source
    Journal of the American Society for Information Science. 50(1999) no.12, S.1095-1107
  6. Ellis, D.: New horizons in information retrieval (1990) 0.01
    0.0059490725 = product of:
      0.02379629 = sum of:
        0.02379629 = weight(_text_:information in 815) [ClassicSimilarity], result of:
          0.02379629 = score(doc=815,freq=8.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.38790947 = fieldWeight in 815, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=815)
      0.25 = coord(1/4)
    
    Footnote
    Review in: Canadian library journal. 48(1991) S.434 (E. Frick): "The book is full of information about the development of concepts and systems in this most fascinating part of professional work"
    PRECIS
    Information retrieval / Research
    Subject
    Information retrieval / Research
  7. Ellis, D.: Information retrieval research (1997) 0.01
    0.0050479556 = product of:
      0.020191822 = sum of:
        0.020191822 = weight(_text_:information in 1161) [ClassicSimilarity], result of:
          0.020191822 = score(doc=1161,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.3291521 = fieldWeight in 1161, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.09375 = fieldNorm(doc=1161)
      0.25 = coord(1/4)
    
    Abstract
    Presents a brief summary of the different approaches to research in the area of information retrieval and discusses the problem or undesirability of identifying a single approach as 'best' for the field
  8. Ellis, D.: ¬A behavioral approach to information system design (1989) 0.00
    0.004759258 = product of:
      0.019037032 = sum of:
        0.019037032 = weight(_text_:information in 2706) [ClassicSimilarity], result of:
          0.019037032 = score(doc=2706,freq=2.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.3103276 = fieldWeight in 2706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=2706)
      0.25 = coord(1/4)
    
  9. Spink, A.; Wilson, T.D.; Ford, N.; Foster, A.; Ellis, D.: Information seeking and mediated searching : Part 1: theoretical framework and research design (2002) 0.00
    0.004371658 = product of:
      0.017486632 = sum of:
        0.017486632 = weight(_text_:information in 5240) [ClassicSimilarity], result of:
          0.017486632 = score(doc=5240,freq=12.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.2850541 = fieldWeight in 5240, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=5240)
      0.25 = coord(1/4)
    
    Abstract
    In this issue we begin with the first of four parts of a five part series of papers by Spink, Wilson, Ford, Foster, and Ellis. Spink et al., in the first section of this report, set forth the design of a project to test whether existing models of the information search process are appropriate for an environment of mediated successive searching, which they believe characterizes much information seeking behavior. Their goal is to develop an integrated model of the process. Data were collected from 198 individuals, 87 in Texas and 111 in Sheffield in the U.K., all with real information needs and engaged in interaction with operational information retrieval systems, by use of transaction logs, recordings of interactions with intermediaries, pre- and post-search interviews, questionnaire responses, relevance judgments of retrieved text, and responses to a test of cognitive styles. Questionnaires were based upon the Kuhlthau model, the Saracevic model, and the Ellis model, and incorporated a visual analog scale to avoid a consistency bias.
    Source
    Journal of the American Society for Information Science and Technology. 53(2002) no.9, S.695-703
  10. Ellis, D.: Hypertext; origins and use (1991) 0.00
    0.0042066295 = product of:
      0.016826518 = sum of:
        0.016826518 = weight(_text_:information in 4916) [ClassicSimilarity], result of:
          0.016826518 = score(doc=4916,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.27429342 = fieldWeight in 4916, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=4916)
      0.25 = coord(1/4)
    
    Abstract
    Presents a brief introduction to the concept of hypertext illustrated with examples from experimental and operational systems. The origins of the hypertext concept are described and different generic types of hypertext systems outlined. The potential and problems of hypertext are discussed with particular reference to information retrieval
    Source
    International journal of information management. 11(1991) no.1, S.5-13
  11. Ellis, D.; Haugan, M.: Modelling the information seeking patterns of engineers and research scientists in an industrial environment (1997) 0.00
    0.0042066295 = product of:
      0.016826518 = sum of:
        0.016826518 = weight(_text_:information in 4713) [ClassicSimilarity], result of:
          0.016826518 = score(doc=4713,freq=16.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.27429342 = fieldWeight in 4713, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4713)
      0.25 = coord(1/4)
    
    Abstract
    The study explores the role of information and information seeking in the Research and Development Department of an international oil and gas company. The information seeking patterns of engineers and research scientists at Statoil's Research Centre, in Trondheim, Norway were studied in relation to their research activities in different phases and types of project. The project phases were evaluation of alternative solutions; development and testing; and summary of experiences. The project types were incremental; radical; and fundamental. Eight major characteristics were identified in the patterns: surveying; chaining; monitoring; browsing; distinguishing; filtering; extracting and ending. The study analyses the requirements for different types of information in an environment where the need for internal and external resources are intertwined; it also compares features of the information seeking patterns of engineers and research scientists from this and previous studies. It was found that, although there were differences in the features of the information seeking patterns of the research scientists and engineers, the behavioural characteristics were similar; and the study identified identical or very similar categories of information seeking behaviour to those of previous studies of academic researchers.
  12. Ellis, D.; Oldman, H.: ¬The English literature researcher in the age of the Internet (2005) 0.00
    0.004164351 = product of:
      0.016657405 = sum of:
        0.016657405 = weight(_text_:information in 4657) [ClassicSimilarity], result of:
          0.016657405 = score(doc=4657,freq=2.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.27153665 = fieldWeight in 4657, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.109375 = fieldNorm(doc=4657)
      0.25 = coord(1/4)
    
    Source
    Journal of information science. 31(2005) no.1, S.29-
  13. Ford, N.; Wilson, T.D.; Foster, A.; Ellis, D.; Spink, A.: Information seeking and mediated searching : Part 4: cognitive styles in information seeking (2002) 0.00
    0.0039907596 = product of:
      0.015963038 = sum of:
        0.015963038 = weight(_text_:information in 5239) [ClassicSimilarity], result of:
          0.015963038 = score(doc=5239,freq=10.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.2602176 = fieldWeight in 5239, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=5239)
      0.25 = coord(1/4)
    
    Abstract
    In "Part 4. Cognitive Styles in Information Seeking," where Ford is the primary author, the results of the application of Riding's Cognitive Styles Analysis and the Pask holist/serialist portion of Ford's Study Process Questionnaire to the 111 U.K. participants were correlated using Spearman's coefficient with reports of focused thinking, degree of change in the intermediary's perception of the problem and personal knowledge, problem stage, degree of differentiating activity, change in problem perception, engagement in exploring activity, changes in questioning, valuing of serendipitous information, and other variables. The results would indicate that field independent individuals report clearer, more focused thinking, see themselves in an earlier problem stage, and report higher levels of change in perception of the problem. Holists value serendipity and report engagement in Kuhlthau's exploring stage. They are seen by intermediaries as exhibiting fewer changes in questioning behavior. A fifth section will appear in a later issue.
    Source
    Journal of the American Society for Information Science and Technology. 53(2002) no.9, S.728-735
  14. Ellis, D.: ¬The dilemma of measurement in information retrieval research (1996) 0.00
    0.0039349417 = product of:
      0.015739767 = sum of:
        0.015739767 = weight(_text_:information in 3003) [ClassicSimilarity], result of:
          0.015739767 = score(doc=3003,freq=14.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.256578 = fieldWeight in 3003, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3003)
      0.25 = coord(1/4)
    
    Abstract
    The problem of measurement in information retrieval research is traced to its source in the first retrieval tests. The problem is seen as presenting a chronic dilemma for the field. This dilemma has taken 3 forms as the discipline has evolved: (1) the dilemma of measurement in the archetypal approach: stated relevance versus user relevance; (2) the dilemma of measurement in the probabilistic approach: realism versus formalism; and (3) the dilemma of measurement in the Information Retrieval-Expert System (IR-ES) approach: linear measures of relevance versus logarithmic measures of knowledge. It is argued that the dilemma of measurement has remained intractable even given the different assumptions of the different approaches for 3 connected reasons - the nature of the subject matter of the field; the nature of relevance judgement; and the nature of cognition and knowledge. Finally, it is concluded that the original vision of information retrieval research as a discipline founded on quantification proved restricting for its theoretical and methodological development and that increasing recognition of this is reflected in growing interest in qualitative methods in information retrieval research in relation to cognitive, behavioral, and affective aspects of the information retrieval interaction
    Source
    Journal of the American Society for Information Science. 47(1996) no.1, S.23-36
  15. Spink, A.; Wilson, T.; Ellis, D.; Ford, N.: Modeling users' successive searches in digital environments : a National Science Foundation/British Library funded study (1998) 0.00
    0.003753695 = product of:
      0.01501478 = sum of:
        0.01501478 = weight(_text_:information in 1255) [ClassicSimilarity], result of:
          0.01501478 = score(doc=1255,freq=26.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.2447598 = fieldWeight in 1255, product of:
              5.0990195 = tf(freq=26.0), with freq of:
                26.0 = termFreq=26.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1255)
      0.25 = coord(1/4)
    
    Abstract
    As digital libraries become a major source of information for many people, we need to know more about how people seek and retrieve information in digital environments. Quite commonly, users with a problem-at-hand and associated question-in-mind repeatedly search a literature for answers, and seek information in stages over extended periods from a variety of digital information resources. The process of repeatedly searching over time in relation to a specific, but possibly an evolving information problem (including changes or shifts in a variety of variables), is called the successive search phenomenon. The study outlined in this paper is currently investigating this new and little explored line of inquiry for information retrieval, Web searching, and digital libraries. The purpose of the research project is to investigate the nature, manifestations, and behavior of successive searching by users in digital environments, and to derive criteria for use in the design of information retrieval interfaces and systems supporting successive searching behavior. This study includes two related projects. The first project is based in the School of Library and Information Sciences at the University of North Texas and is funded by a National Science Foundation POWRE Grant <http://www.nsf.gov/cgi-bin/show?award=9753277>. The second project is based at the Department of Information Studies at the University of Sheffield (UK) and is funded by a grant from the British Library <http://www.shef.ac.uk/~is/research/imrg/uncerty.html> Research and Innovation Center. The broad objectives of each project are to examine the nature and extent of successive search episodes in digital environments by real users over time. The specific aim of the current project is twofold: (1) to characterize progressive changes and shifts that occur in: user situational context; user information problem; uncertainty reduction; user cognitive styles; cognitive and affective states of the user, and consequently in their queries; and (2) to characterize related changes over time in the type and use of information resources and search strategies particularly related to given capabilities of IR systems, and IR search engines, and examine changes in users' relevance judgments and criteria, and characterize their differences. The study is an observational, longitudinal data collection in the U.S. and U.K. Three questionnaires are used to collect data: reference, client post search and searcher post search questionnaires. Each successive search episode with a search intermediary for textual materials on the DIALOG Information Service is audiotaped and search transaction logs are recorded. Quantitative analysis includes statistical analysis using Likert scale data from the questionnaires and log-linear analysis of sequential data. Qualitative methods include: content analysis, structuring taxonomies; and diagrams to describe shifts and transitions within and between each search episode. Outcomes of the study are the development of appropriate model(s) for IR interactions in successive search episodes and the derivation of a set of design criteria for interfaces and systems supporting successive searching.
    Theme
    Information Gateway
  16. Foster, A.E.; Ellis, D.: Serendipity and its study (2014) 0.00
    0.0036430482 = product of:
      0.014572193 = sum of:
        0.014572193 = weight(_text_:information in 1794) [ClassicSimilarity], result of:
          0.014572193 = score(doc=1794,freq=12.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.23754507 = fieldWeight in 1794, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1794)
      0.25 = coord(1/4)
    
    Abstract
    Purpose - The purpose of this paper is to explore the concept of serendipity and approaches to its study particularly in relation to information studies. Design/methodology/approach - The origins of the term serendipity are described and its elaboration as an exploratory and explanatory concept in science and the social sciences is outlined. The distinction between serendipity and serendipity pattern is explained and theoretical and empirical studies of both serendipity and the serendipity patterns are explored. The relationship between serendipity and information encountering is described. Empirical studies of serendipity using Citation Classics and other research approaches in information studies are described. Findings - The discrepancy between occurrences of serendipity in studies using Citation Classics and reported serendipity in philosophy of science, research anecdotes, information encountering and information seeking by inter-disciplinary researchers is highlighted. A comparison between a process model of serendipity and serendipity as an emergent behavioural characteristic indicates directions for future research. Originality/value - The paper provides an original synthesis of the theoretical and empirical literature on serendipity with particular reference to work in information studies and an indication of the methodological difficulties involved in its study.
  17. Wilson, T.D.; Ford, N.; Ellis, D.; Foster, A.; Spink, A.: Information seeking and mediated searching : Part 2: uncertainty and Its correlates (2002) 0.00
    0.0035694437 = product of:
      0.014277775 = sum of:
        0.014277775 = weight(_text_:information in 5232) [ClassicSimilarity], result of:
          0.014277775 = score(doc=5232,freq=8.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.23274569 = fieldWeight in 5232, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=5232)
      0.25 = coord(1/4)
    
    Abstract
    In "Part 2. Uncertainty and Its Correlates," where Wilson is the primary author, after a review of uncertainty as a concept in information seeking and decision research, it is hypothesized that if the Kuhlthau problem solving stage model is appropriate the searchers will recognize the stage in which they currently are operating. Secondly, to test Wilson's contention that operationalized uncertainty would be useful in characterizing users, it is hypothesized that uncertainty will decrease as the searcher proceeds through problem stages and after the completion of the search. A review of pre- and post-search interviews reveals that uncertainty can be operationalized, and that academic researchers have no difficulty with a stage model of the information seeking process. Uncertainty is unrelated to sex, age, or discipline, but is related to problem stage and domain knowledge. Both concepts appear robust.
    Source
    Journal of the American Society for Information Science and Technology. 53(2002) no.9, S.704-715
  18. Ellis, D.: Is the manual creation of hypertext worth the effort? (1995) 0.00
    0.0029745363 = product of:
      0.011898145 = sum of:
        0.011898145 = weight(_text_:information in 4888) [ClassicSimilarity], result of:
          0.011898145 = score(doc=4888,freq=2.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.19395474 = fieldWeight in 4888, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=4888)
      0.25 = coord(1/4)
    
    Abstract
    Offers a definition of hypertext. Describes the range of uses to which hypertext systems may be put with particular attention to library and information service organisations. Discusses the evaluation of hypertext systems and experimental methodology
  19. Ellis, D.; Ford, N.; Furner, J.: In search of the unknown user : indexing, hypertext and the World Wide Web (1998) 0.00
    0.0029446408 = product of:
      0.011778563 = sum of:
        0.011778563 = weight(_text_:information in 4714) [ClassicSimilarity], result of:
          0.011778563 = score(doc=4714,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.1920054 = fieldWeight in 4714, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4714)
      0.25 = coord(1/4)
    
    Abstract
    For the purposes of this article, the indexing of information is interpreted as the pre-processing of information in order to enable its retrieval. The definition thus spans a dimension extending from classification-based approaches (pre-co-ordinate) to keyword searching (post-co-ordinate). In the first section we clarify our use of terminology, by briefly describing a framework for modelling IR systems in terms of sets of objects, relationships and functions. In the following 3 sections, we discuss the application of indexing functions to document collections of 3 specific types: (1) 'conventional' text databases; (2) hypertext databases; and (3) the World Wide Web, globally distributed across the Internet
  20. Ellis, D.; Furner-Hines, J.; Willett, P.: Measuring the degree of similarity between objects in text retrieval systems (1993) 0.00
    0.0025239778 = product of:
      0.010095911 = sum of:
        0.010095911 = weight(_text_:information in 6716) [ClassicSimilarity], result of:
          0.010095911 = score(doc=6716,freq=4.0), product of:
            0.06134496 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.034944877 = queryNorm
            0.16457605 = fieldWeight in 6716, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=6716)
      0.25 = coord(1/4)
    
    Abstract
    Describes the use of a variety of similarity coefficients in the measurement of the degree of similarity between objects that contain textual information, such as documents, paragraphs, index terms or queries. The work is intended as a preliminary to future investigation of the calculations involved in measuring the degree of similarity between structured objects that may be represented by graph theoretic forms. Discusses the role of similarity coefficients in text retrieval in terms of: document and query similarity; document and document similarity; cocitation analysis; term and term similarity; and the similarity between sets of judgements, such as relevance judgements. Describes several methods for expressing the formulae used to define similarity coefficients and compares their attributes. Concludes with details of the characteristics of similarity coefficients; equivalence and monotonicity; consideration of negative matches; geometric analyses; and the meaning of correlation coefficients
    Source
    Perspectives in information management. 3(1993) no.2, S.128-149
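
Note on the relevance scores: the score breakdown under each hit follows the tf-idf arithmetic of Lucene's ClassicSimilarity, and it can be reproduced by hand. The sketch below is an illustration only. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the figures printed in the listing, and it takes queryNorm, fieldNorm and coord as given rather than recomputing them. The function name classic_score is hypothetical and not part of any library.

```python
import math


def classic_score(freq, doc_freq, max_docs, field_norm, query_norm, coord):
    """Recompute one single-term score from the values in a breakdown.

    Hypothetical helper: mirrors the products shown in the listing,
    it is not a Lucene API.
    """
    tf = math.sqrt(freq)                              # tf(freq=...)
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # idf(docFreq, maxDocs)
    query_weight = idf * query_norm                   # queryWeight
    field_weight = tf * idf * field_norm              # fieldWeight
    return query_weight * field_weight * coord        # final product


# Values shared by every hit in the listing above.
DOC_FREQ = 20772
MAX_DOCS = 44218
QUERY_NORM = 0.034944877
COORD = 1 / 4  # coord(1/4): one of four query clauses matched

# Per-document values copied from hits 1, 2 and 11.
score_789 = classic_score(18.0, DOC_FREQ, MAX_DOCS, 0.0625, QUERY_NORM, COORD)
score_2707 = classic_score(4.0, DOC_FREQ, MAX_DOCS, 0.125, QUERY_NORM, COORD)
score_4713 = classic_score(16.0, DOC_FREQ, MAX_DOCS, 0.0390625, QUERY_NORM, COORD)

for doc_id, score in ((789, score_789), (2707, score_2707), (4713, score_4713)):
    print(f"doc={doc_id} score={score:.9f}")
```

The comparison of hits 2 and 11 shows why a higher term frequency does not guarantee a higher rank: doc 4713 contains the term 16 times against 4 for doc 2707, but its much smaller fieldNorm (a length normalization factor) outweighs the larger tf. Small differences in the last digits are to be expected, since the listing was presumably computed in single precision.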