Search (8 results, page 1 of 1)

  • × theme_ss:"Automatisches Indexieren"
  • × theme_ss:"Retrievalstudien"
  1. Krause, J.; Womser-Hacker, C.: PADOK-II : Retrievaltests zur Bewertung von Volltextindexierungsvarianten für das deutsche Patentinformationssystem (1990) 0.01
    0.008658063 = product of:
      0.07792256 = sum of:
        0.07792256 = product of:
          0.15584512 = sum of:
            0.15584512 = weight(_text_:bewertung in 2653) [ClassicSimilarity], result of:
              0.15584512 = score(doc=2653,freq=4.0), product of:
                0.19736631 = queryWeight, product of:
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.031243727 = queryNorm
                0.78962374 = fieldWeight in 2653, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2653)
          0.5 = coord(1/2)
      0.11111111 = coord(1/9)
    
    Abstract
    Vorgestellt werden die Ergebnisse extensiver Retrievaltests von zwei Varianten von Inhalteserschließungen (Freitext und PASSAT) für das deutsche Patentinformationssystem auf der Basis von Volltexten. Die Tests führte die Fachgruppe Linguistische Informationswissenschaft der Universität Regensburg von 1986-1989 in Zusammenarbeit mit dem Deutschen Patentamt, dem Fachinformationszentrum Karlsruhe und meheren industrieellen Partnern durch. Der Schwerpunkt des Berichts liegt auf dem allgemeinen Ansatz der Bewertung der Ziele des Projekts und auf der Darstellung der statistischen Evaluierungsergebnisse.
  2. Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002) 0.00
    0.0045916312 = product of:
      0.04132468 = sum of:
        0.04132468 = product of:
          0.08264936 = sum of:
            0.08264936 = weight(_text_:bewertung in 883) [ClassicSimilarity], result of:
              0.08264936 = score(doc=883,freq=2.0), product of:
                0.19736631 = queryWeight, product of:
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.031243727 = queryNorm
                0.41876122 = fieldWeight in 883, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.31699 = idf(docFreq=216, maxDocs=44218)
                  0.046875 = fieldNorm(doc=883)
          0.5 = coord(1/2)
      0.11111111 = coord(1/9)
    
    Abstract
    In Anlehnung an die in den neunziger Jahren durchgeführten Erschließungsprojekte MILOS I und MILOS II, die die Eignung eines Verfahrens zur automatischen Indexierung für Bibliothekskataloge zum Thema hatten, wurde eine empirische Untersuchung anhand einer repräsentativen Stichprobe von Titelsätzen aus dem Österreichischen Verbundkatalog durchgeführt. Ziel war die Prüfung und Bewertung der Einsatzmöglichkeit dieses Verfahrens in den Online-Katalogen des Verbundes. Der Realsituation der OPAC-Benutzung gemäß wurde ausschließlich die Auswirkung auf den automatisch generierten Begriffen angereicherten Basic Index ("Alle Felder") untersucht. Dazu wurden 100 Suchanfragen zunächst im ursprünglichen Basic Index und sodann im angereicherten Basic Index in einem OPAC unter Aleph 500 durchgeführt. Die Tests erbrachten einen Zuwachs an relevanten Treffern bei nur leichten Verlusten an Precision, eine Reduktion der Nulltreffer-Ergebnisse sowie Aufschlüsse über die Auswirkung einer vorhandenen verbalen Sacherschließung.
  3. Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.00
    0.003292406 = product of:
      0.029631654 = sum of:
        0.029631654 = product of:
          0.059263308 = sum of:
            0.059263308 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
              0.059263308 = score(doc=262,freq=2.0), product of:
                0.109410204 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.031243727 = queryNorm
                0.5416616 = fieldWeight in 262, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=262)
          0.5 = coord(1/2)
      0.11111111 = coord(1/9)
    
    Date
    20.10.2000 12:22:23
  4. Toepfer, M.; Seifert, C.: Content-based quality estimation for automatic subject indexing of short texts under precision and recall constraints 0.00
    0.0029211747 = product of:
      0.026290573 = sum of:
        0.026290573 = product of:
          0.07887172 = sum of:
            0.07887172 = weight(_text_:quality in 4309) [ClassicSimilarity], result of:
              0.07887172 = score(doc=4309,freq=10.0), product of:
                0.14124142 = queryWeight, product of:
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.031243727 = queryNorm
                0.5584178 = fieldWeight in 4309, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4309)
          0.33333334 = coord(1/3)
      0.11111111 = coord(1/9)
    
    Abstract
    Semantic annotations have to satisfy quality constraints to be useful for digital libraries, which is particularly challenging on large and diverse datasets. Confidence scores of multi-label classification methods typically refer only to the relevance of particular subjects, disregarding indicators of insufficient content representation at the document-level. Therefore, we propose a novel approach that detects documents rather than concepts where quality criteria are met. Our approach uses a deep, multi-layered regression architecture, which comprises a variety of content-based indicators. We evaluated multiple configurations using text collections from law and economics, where the available content is restricted to very short texts. Notably, we demonstrate that the proposed quality estimation technique can determine subsets of the previously unseen data where considerable gains in document-level recall can be achieved, while upholding precision at the same time. Hence, the approach effectively performs a filtering that ensures high data quality standards in operative information retrieval systems.
  5. Munkelt, J.; Schaer, P.; Lepsky, K.: Towards an IR test collection for the German National Library (2018) 0.00
    0.0027152793 = product of:
      0.024437513 = sum of:
        0.024437513 = product of:
          0.073312536 = sum of:
            0.073312536 = weight(_text_:quality in 4311) [ClassicSimilarity], result of:
              0.073312536 = score(doc=4311,freq=6.0), product of:
                0.14124142 = queryWeight, product of:
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.031243727 = queryNorm
                0.51905835 = fieldWeight in 4311, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4311)
          0.33333334 = coord(1/3)
      0.11111111 = coord(1/9)
    
    Abstract
    Automatic content indexing is one of the innovations that are increasingly changing the way libraries work. In theory, it promises a cataloguing service that would hardly be possible with humans in terms of speed, quantity and maybe quality. The German National Library (DNB) has also recognised this potential and is increasingly relying on the automatic indexing of their catalogue content. The DNB took a major step in this direction in 2017, which was announced in two papers. The announcement was rather restrained, but the content of the papers is all the more explosive for the library community: Since September 2017, the DNB has discontinued the intellectual indexing of series Band H and has switched to an automatic process for these series. The subject indexing of online publications (series O) has been purely automatical since 2010; from September 2017, monographs and periodicals published outside the publishing industry and university publications will no longer be indexed by people. This raises the question: What is the quality of the automatic indexing compared to the manual work or in other words to which degree can the automatic indexing replace people without a signi cant drop in regards to quality?
  6. Chevallet, J.-P.; Bruandet, M.F.: Impact de l'utilisation de multi terms sur la qualité des résponses dùn système de recherche d'information a indexation automatique (1999) 0.00
    0.0020902227 = product of:
      0.018812004 = sum of:
        0.018812004 = product of:
          0.05643601 = sum of:
            0.05643601 = weight(_text_:quality in 6253) [ClassicSimilarity], result of:
              0.05643601 = score(doc=6253,freq=2.0), product of:
                0.14124142 = queryWeight, product of:
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.031243727 = queryNorm
                0.39957124 = fieldWeight in 6253, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6253)
          0.33333334 = coord(1/3)
      0.11111111 = coord(1/9)
    
    Footnote
    Übers. d. Titels: Impact of the use of multi-terms on the quality of the answers of an information retrieval system based on automatic indexing
  7. Gödert, W.; Liebig, M.: Maschinelle Indexierung auf dem Prüfstand : Ergebnisse eines Retrievaltests zum MILOS II Projekt (1997) 0.00
    0.0018289449 = product of:
      0.016460504 = sum of:
        0.016460504 = product of:
          0.04938151 = sum of:
            0.04938151 = weight(_text_:quality in 1174) [ClassicSimilarity], result of:
              0.04938151 = score(doc=1174,freq=2.0), product of:
                0.14124142 = queryWeight, product of:
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.031243727 = queryNorm
                0.34962484 = fieldWeight in 1174, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.5206327 = idf(docFreq=1307, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1174)
          0.33333334 = coord(1/3)
      0.11111111 = coord(1/9)
    
    Abstract
    The test ran between Nov 95-Aug 96 in Cologne Fachhochschule fur Bibliothekswesen (College of Librarianship).The test basis was a database of 190,000 book titles published between 1990-95. MILOS II mechanized indexing methods proved helpful in avoiding or reducing numbers of unsatisfied/no result retrieval searches. Retrieval from mechanised indexing is 3 times more successful than from title keyword data. MILOS II also used a standardized semantic vocabulary. Mechanised indexing demands high quality software and output data
  8. Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.00
    0.001646203 = product of:
      0.014815827 = sum of:
        0.014815827 = product of:
          0.029631654 = sum of:
            0.029631654 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
              0.029631654 = score(doc=5001,freq=2.0), product of:
                0.109410204 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.031243727 = queryNorm
                0.2708308 = fieldWeight in 5001, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5001)
          0.5 = coord(1/2)
      0.11111111 = coord(1/9)
    
    Date
    14. 3.1996 13:22:21