Search (16 results, page 1 of 1)

  • × language_ss:"e"
  • × theme_ss:"Dokumentenmanagement"
  • × year_i:[2000 TO 2010}
  1. Mas, S.; Marleau, Y.: Proposition of a faceted classification model to support corporate information organization and digital records management (2009) 0.04
    0.044601 = sum of:
      0.039442163 = product of:
        0.1972108 = sum of:
          0.1972108 = weight(_text_:3a in 2918) [ClassicSimilarity], result of:
            0.1972108 = score(doc=2918,freq=2.0), product of:
              0.35089764 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.041389145 = queryNorm
              0.56201804 = fieldWeight in 2918, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=2918)
        0.2 = coord(1/5)
      0.005158836 = product of:
        0.010317672 = sum of:
          0.010317672 = weight(_text_:a in 2918) [ClassicSimilarity], result of:
            0.010317672 = score(doc=2918,freq=16.0), product of:
              0.04772363 = queryWeight, product of:
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.041389145 = queryNorm
              0.2161963 = fieldWeight in 2918, product of:
                4.0 = tf(freq=16.0), with freq of:
                  16.0 = termFreq=16.0
                1.153047 = idf(docFreq=37942, maxDocs=44218)
                0.046875 = fieldNorm(doc=2918)
        0.5 = coord(1/2)
    
    Abstract
    The employees of an organization often use a personal hierarchical classification scheme to organize digital documents that are stored on their own workstations. As this may make it hard for other employees to retrieve these documents, there is a risk that the organization will lose track of needed documentation. Furthermore, the inherent boundaries of such a hierarchical structure require making arbitrary decisions about which specific criteria the classification will b.e based on (for instance, the administrative activity or the document type, although a document can have several attributes and require classification in several classes).A faceted classification model to support corporate information organization is proposed. Partially based on Ranganathan's facets theory, this model aims not only to standardize the organization of digital documents, but also to simplify the management of a document throughout its life cycle for both individuals and organizations, while ensuring compliance to regulatory and policy requirements.
    Footnote
    Vgl.: http://ieeexplore.ieee.org/Xplore/login.jsp?reload=true&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F4755313%2F4755314%2F04755480.pdf%3Farnumber%3D4755480&authDecision=-203.
    Type
    a
  2. Bantin, P.: Electronic records management : a review of the work of a decade and a reflection on future directions (2002) 0.00
    0.003647848 = product of:
      0.007295696 = sum of:
        0.007295696 = product of:
          0.014591392 = sum of:
            0.014591392 = weight(_text_:a in 4255) [ClassicSimilarity], result of:
              0.014591392 = score(doc=4255,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.30574775 = fieldWeight in 4255, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.09375 = fieldNorm(doc=4255)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a
  3. Frohmann, B.: Revisiting "what is a document?" (2009) 0.00
    0.00252053 = product of:
      0.00504106 = sum of:
        0.00504106 = product of:
          0.01008212 = sum of:
            0.01008212 = weight(_text_:a in 2837) [ClassicSimilarity], result of:
              0.01008212 = score(doc=2837,freq=22.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.21126054 = fieldWeight in 2837, product of:
                  4.690416 = tf(freq=22.0), with freq of:
                    22.0 = termFreq=22.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2837)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Purpose - The purpose of this paper is to provide a reconsideration of Michael Buckland's important question, "What is a document?", analysing the point and purpose of definitions of "document" and "documentation". Design/methodology/approach - Two philosophical notions of the point of definitions are contrasted: John Stuart Mill's concept of a "real" definition, purporting to specify the nature of the definiendum; and a concept of definition based upon a foundationalist philosophy of language. Both conceptions assume that a general, philosophical justification for using words as we do is always in order. This assumption is criticized by deploying Hilary Putnam's arguments against the orthodox Wittgensteinian interpretation of criteria governing the use of language. The example of the cabinets of curiosities of the sixteenth-century English and European virtuosi is developed to show how one might productively think about what documents might be, but without a definition of a document. Findings - Other than for specific, instrumentalist purposes (often appropriate for specific case studies), there is no general philosophical reason for asking, what is a document? There are good reasons for pursuing studies of documentation without the impediments of definitions of "document" or "documentation". Originality/value - The paper makes an original contribution to the new interest in documentation studies by providing conceptual resources for multiplying, rather than restricting, the areas of application of the concepts of documents and documentation.
    Type
    a
  4. Mas, S.; Zaher, L'H.; Zacklad, M.: Design & evaluation of multi-viewed knowledge system for administrative electronic document organization (2008) 0.00
    0.0024318986 = product of:
      0.004863797 = sum of:
        0.004863797 = product of:
          0.009727594 = sum of:
            0.009727594 = weight(_text_:a in 2480) [ClassicSimilarity], result of:
              0.009727594 = score(doc=2480,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.20383182 = fieldWeight in 2480, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2480)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This communication describes part of a current research carried out at the Université de Technologie de Troyes and funded by a postdoctoral grant from the Fonds québécois de la recherche sur la société et la culture. Under the title "Design and evaluation of a faceted classification for uniform and personal organization of administrative electronic documents", our research investigates the feasibility of creating a faceted and multi-points-of-view classification scheme for administrative document organization and retrieval in online environments.
  5. Vasudevan, M.C.; Mohan, M.; Kapoor, A.: Information system for knowledge management in the specialized division of a hospital (2006) 0.00
    0.0023790773 = product of:
      0.0047581545 = sum of:
        0.0047581545 = product of:
          0.009516309 = sum of:
            0.009516309 = weight(_text_:a in 1499) [ClassicSimilarity], result of:
              0.009516309 = score(doc=1499,freq=10.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.19940455 = fieldWeight in 1499, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1499)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Information systems are essential support for knowledge management in all types of enterprises. This paper describes the evolution and development of a specialized hospital information system. The system is designed to integrate for access and retrieval from databases of patients' case records, and related images - CATSCAN, MRI, X-Ray - and to enable online access to full text of relevant papers on the Internet/WWW. The generation of information products and services from the system is briefly described.
    Source
    Knowledge organization, information systems and other essays: Professor A. Neelameghan Festschrift. Ed. by K.S. Raghavan and K.N. Prasad
    Type
    a
  6. Mas, S.; Zaher, L'H.; Zacklad, M.: Design and evaluation of multi-viewed knowledge system for administrative electronic document organization (2008) 0.00
    0.0021279112 = product of:
      0.0042558224 = sum of:
        0.0042558224 = product of:
          0.008511645 = sum of:
            0.008511645 = weight(_text_:a in 2256) [ClassicSimilarity], result of:
              0.008511645 = score(doc=2256,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.17835285 = fieldWeight in 2256, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2256)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    This paper describes part of a current research investigating the feasibility of creating a faceted and multi-viewed knowledge organization system (KOS) for administrative document organization in online environments. Preliminary findings support the faceted and multi-viewed classification as a promising altemative to the hierarchical paradigm for personal administrative electronic documents organization. Further analysis about identification of semantic relations between facets to reduce number of facet descriptors is required. Technical improvements are also needed to enhance the faceted navigation interface used within the pilot test.
    Type
    a
  7. Batley, S.: ¬The I in information architecture : the challenge of content management (2007) 0.00
    0.0021060861 = product of:
      0.0042121722 = sum of:
        0.0042121722 = product of:
          0.0084243445 = sum of:
            0.0084243445 = weight(_text_:a in 809) [ClassicSimilarity], result of:
              0.0084243445 = score(doc=809,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.17652355 = fieldWeight in 809, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0625 = fieldNorm(doc=809)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Purpose - The purpose of this paper is to provide a review of content management in the context of information architecture. Design/methodology/approach - The method adopted is a review of definitions of information architecture and an analysis of the importance of content and its management within information architecture. Findings - Concludes that reality will not necessarily match the vision of organisations investing in information architecture. Originality/value - The paper considers practical issues around content and records management.
    Type
    a
  8. Murthy, S.S.: ¬The National Tuberculosis Institute, Bangalore : recent development in library and information services (2006) 0.00
    0.0021060861 = product of:
      0.0042121722 = sum of:
        0.0042121722 = product of:
          0.0084243445 = sum of:
            0.0084243445 = weight(_text_:a in 1502) [ClassicSimilarity], result of:
              0.0084243445 = score(doc=1502,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.17652355 = fieldWeight in 1502, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1502)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Briefly describes the information products and services, the related databases, development of digital library and web-resources and web-based services, vocabulary control tools, networking, and other projects of the Library of the National Tuberculosis Institute (NTI), Bangalore. Acknowledges the involvement of and advice and assistance provided by Prof. A. Neelameghan to these programmes and projects.
    Source
    Knowledge organization, information systems and other essays: Professor A. Neelameghan Festschrift. Ed. by K.S. Raghavan and K.N. Prasad
    Type
    a
  9. Hare, C.E.; McLeod, J.: How to manage records in the e-environment : 2nd ed. (2006) 0.00
    0.0018428253 = product of:
      0.0036856506 = sum of:
        0.0036856506 = product of:
          0.0073713013 = sum of:
            0.0073713013 = weight(_text_:a in 1749) [ClassicSimilarity], result of:
              0.0073713013 = score(doc=1749,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.1544581 = fieldWeight in 1749, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1749)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    A practical approach to developing and operating an effective programme to manage hybrid records within an organization. This title positions records management as an integral business function linked to the organisation's business aims and objectives. The authors also address the records requirements of new and significant pieces of legislation, such as data protection and freedom of information, as well as exploring strategies for managing electronic records. Bullet points, checklists and examples assist the reader throughout, making this a one-stop resource for information in this area.
    Footnote
    1. Aufl. u.d.T.: Developing a records management programme
  10. Salminen, A.: Modeling documents in their context (2009) 0.00
    0.0018428253 = product of:
      0.0036856506 = sum of:
        0.0036856506 = product of:
          0.0073713013 = sum of:
            0.0073713013 = weight(_text_:a in 3847) [ClassicSimilarity], result of:
              0.0073713013 = score(doc=3847,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.1544581 = fieldWeight in 3847, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3847)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This entry describes notions and methods for analyzing and modeling documents in an organizational context. A model for the analysis process is provided and methods for data gathering, modeling, and user needs analysis described. The methods have been originally developed and tested during document standardization activities carried out in the Finnish Parliament and ministries. Later the methods have been adopted and adapted in other Finnish organizations in their document management development projects. The methods are intended especially for cases where the goal is to develop an Extensible Markup Language (XML)-based solution for document management. This entry emphasizes the importance of analyzing and describing documents in their organizational context.
    Type
    a
  11. Meer, K. van der: Document information systems (2009) 0.00
    0.001823924 = product of:
      0.003647848 = sum of:
        0.003647848 = product of:
          0.007295696 = sum of:
            0.007295696 = weight(_text_:a in 3771) [ClassicSimilarity], result of:
              0.007295696 = score(doc=3771,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.15287387 = fieldWeight in 3771, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3771)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    From characteristics of information, documents and document information systems (document IS), motives to use document IS are described. A few cases are presented. The functional aspects of document IS are described, derived from ISO standard 15489 on records management and the Sarbanes-Oxley law, and made operational in MoReq and DoD standard 5015.2. Explicit attention is given to related subjects from a viewpoint of document management: information sharing (workflow, knowledge management), and interoperability of Information and Communication Technology (ICT) tools; authenticity because of the possible evidential value of documents; and digital longevity because of the possible long-time function of archival documents. The technical aspects answer functional demands; important information science standards, and standard components for 12 characteristics of document IS are described, among others ODMA, the XML family, OAIS, and metadata schemes. The design methodological aspects answer functional demands and technical possibilities. Models are introduced and the way of working of, e.g., a digitization project is described.
    Type
    a
  12. Veal, D.C.: Techniques of document management : a review of text retrieval and related technologies (2001) 0.00
    0.001719612 = product of:
      0.003439224 = sum of:
        0.003439224 = product of:
          0.006878448 = sum of:
            0.006878448 = weight(_text_:a in 4150) [ClassicSimilarity], result of:
              0.006878448 = score(doc=4150,freq=4.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.14413087 = fieldWeight in 4150, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4150)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a
  13. Smith, C.: Building an Internet archive system for the British Broadcasting Corporation (2005) 0.00
    0.0015795645 = product of:
      0.003159129 = sum of:
        0.003159129 = product of:
          0.006318258 = sum of:
            0.006318258 = weight(_text_:a in 5593) [ClassicSimilarity], result of:
              0.006318258 = score(doc=5593,freq=6.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.13239266 = fieldWeight in 5593, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5593)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The amount of digital content produced at academic research institutions is large, and libraries and archives at these institutions have a responsibility to bring this digital material under curatorial control in order to manage and preserve it over time. But this is a daunting task with few proven models, requiring new technology, policies, procedures, core staff competencies, and cost models. The MIT Libraries are working with the DSpace(TM) open-source digital repository platform to explore the problem of capturing research and teaching material in any digital format and preserving it over time. By collaborating on this problem with other research institutions using the DSpace platform in the United States, the United Kingdom, Europe, and other parts of the world, as well as with other important efforts in the digital preservation arena, we are beginning to see ways of managing arbitrary digital content that might make digital preservation an achievable goal.
    Type
    a
  14. Myburgh, S.: Records organization and access (2009) 0.00
    0.001289709 = product of:
      0.002579418 = sum of:
        0.002579418 = product of:
          0.005158836 = sum of:
            0.005158836 = weight(_text_:a in 3868) [ClassicSimilarity], result of:
              0.005158836 = score(doc=3868,freq=4.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.10809815 = fieldWeight in 3868, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3868)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Records, as documents which provide evidence of business transactions that have taken place, are collected and preserved for as long as they are useful to the organization, or as is demanded by law. In order to be useful, however, they must be organized in such a way that they can easily be identified, located, accessed, and used, for whatever purpose. First, the records must be described by identifying the most useful salient characteristics; then, they are categorized in various ways, according to their age, function, level of confidentiality, privacy and security, and access to them controlled according to these categories. Records may be arranged by one of several ordinal systems, usually involving letters and numbers, but also color: these symbolically represent the characteristics that are considered as important descriptors. Thus, records can be accessed (or protected from access) by their category; they can be located by correspondence between terms (which may be words or numbers) used to describe characteristics and terms used in searching for particular records or records series. These principles apply to both physical and virtual records.
    Type
    a
  15. Hughes, C.: Modern records management : key skills and core competencies (2005) 0.00
    0.0012159493 = product of:
      0.0024318986 = sum of:
        0.0024318986 = product of:
          0.004863797 = sum of:
            0.004863797 = weight(_text_:a in 4516) [ClassicSimilarity], result of:
              0.004863797 = score(doc=4516,freq=8.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.10191591 = fieldWeight in 4516, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4516)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Discusses the skills and competencies required by contemporary records managers. The role of the records manager has recently gained a higher profile - due to increasing legislation (resulting from various corporate scandals worldwide) requiring organisations to focus an the integrity, currency and accessibility of their records. Case studies are included in the book to indicate the wide range of activities undertaken by records managers. The book addresses the skills and competencies required by records managers and considers how these are acquired and developed. Using guidance, actions plans can be developed by the reader. Using research undertaken by the author, a sample of roles currently undertaken is evaluated in detail to provide examples of the competencies and skills required for each role.
    Content
    Key Features - Fills the gap in the literature addressing competencies for records managers - Emerging legislation and regulatory frameworks and recent corporate scandals haue provided opportunities for records managers to increase their profile and involvement at the strategic level of their organisation. This book provides information an the required competencies to support and enable this higher profile - The author is a highly experienced practitioner, and also has a post-graduate qualification in the area - The use of case studies of practicing records managers provides real-life examples of the diversity of the contemporary records manager The Author Ceri Hughes is Associate Director of Global Knowledge Management at KPMG Financial Advisory Services. She is an active member of the Records Management Society of Great Britain, served four years an the Society's executive and is currently the Immediate Past Chair, having chaired the Society 2001-2003. Readership Records management and knowledge management practitioners. Data protection/Privacy/ Freedom of Information professionals. Information management sector training professionals. Records management students Contents Part 1: The role of the records manager - Records management in the modern environment: Records managers-'basement to boardroom': evaluating the range of roles: Records management in the knowledge economy Part 2: Skills and competencies - Core competencies: Core skills: Adding value Part 3: Records management and career progression - Continuing professional development: Higher education: Professional networks: Continuing career development Part 4: Conclusions - Conclusion: Useful directory
  16. Lam-Adesina, A.M.; Jones, G.J.F.: Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents (2006) 0.00
    0.0010747575 = product of:
      0.002149515 = sum of:
        0.002149515 = product of:
          0.00429903 = sum of:
            0.00429903 = weight(_text_:a in 977) [ClassicSimilarity], result of:
              0.00429903 = score(doc=977,freq=4.0), product of:
                0.04772363 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.041389145 = queryNorm
                0.090081796 = fieldWeight in 977, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=977)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Important legacy paper documents are digitized and collected in online accessible archives. This enables the preservation, sharing, and significantly the searching of these documents. The text contents of these document images can be transcribed automatically using OCR systems and then stored in an information retrieval system. However, OCR systems make errors in character recognition which have previously been shown to impact on document retrieval behaviour. In particular relevance feedback query-expansion methods, which are often effective for improving electronic text retrieval, are observed to be less reliable for retrieval of scanned document images. Our experimental examination of the effects of character recognition errors on an ad hoc OCR retrieval task demonstrates that, while baseline information retrieval can remain relatively unaffected by transcription errors, relevance feedback via query expansion becomes highly unstable. This paper examines the reason for this behaviour, and introduces novel modifications to standard relevance feedback methods. These methods are shown experimentally to improve the effectiveness of relevance feedback for errorful OCR transcriptions. The new methods combine similar recognised character strings based on term collection frequency and a string edit-distance measure. The techniques are domain independent and make no use of external resources such as dictionaries or training data.
    Type
    a