Search (9 results, page 1 of 1)

  • × theme_ss:"Volltextretrieval"
  1. Wenzel, F.: Semantische Eingrenzung im Freitext-Retrieval auf der Basis morphologischer Segmentierungen (1980) 0.02
    0.02041765 = product of:
      0.0408353 = sum of:
        0.0408353 = product of:
          0.12250589 = sum of:
            0.12250589 = weight(_text_:basis in 2037) [ClassicSimilarity], result of:
              0.12250589 = score(doc=2037,freq=2.0), product of:
                0.2492982 = queryWeight, product of:
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.056051385 = queryNorm
                0.491403 = fieldWeight in 2037, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2037)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  2. Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993) 0.02
    0.015188378 = product of:
      0.030376757 = sum of:
        0.030376757 = product of:
          0.060753513 = sum of:
            0.060753513 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
              0.060753513 = score(doc=7701,freq=2.0), product of:
                0.1962824 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.056051385 = queryNorm
                0.30952093 = fieldWeight in 7701, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7701)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    23. 1.1999 19:22:09
  3. Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.02
    0.015188378 = product of:
      0.030376757 = sum of:
        0.030376757 = product of:
          0.060753513 = sum of:
            0.060753513 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
              0.060753513 = score(doc=4919,freq=2.0), product of:
                0.1962824 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.056051385 = queryNorm
                0.30952093 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2000 17:48:06
  4. Zillmann, H.: OSIRIS und eLib : Information Retrieval und Search Engines in Full-text Databases (2001) 0.02
    0.015188378 = product of:
      0.030376757 = sum of:
        0.030376757 = product of:
          0.060753513 = sum of:
            0.060753513 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
              0.060753513 = score(doc=5937,freq=2.0), product of:
                0.1962824 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.056051385 = queryNorm
                0.30952093 = fieldWeight in 5937, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5937)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    14. 6.2001 12:22:31
  5. Dambeck, H.; Engler, T.: Gesucht und gefunden : Neun Volltext-Suchprogramme für den Desktop (2002) 0.02
    0.015188378 = product of:
      0.030376757 = sum of:
        0.030376757 = product of:
          0.060753513 = sum of:
            0.060753513 = weight(_text_:22 in 1169) [ClassicSimilarity], result of:
              0.060753513 = score(doc=1169,freq=2.0), product of:
                0.1962824 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.056051385 = queryNorm
                0.30952093 = fieldWeight in 1169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1169)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    c't. 2002, H.22, S.190-197
  6. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.01
    0.014437458 = product of:
      0.028874915 = sum of:
        0.028874915 = product of:
          0.08662474 = sum of:
            0.08662474 = weight(_text_:basis in 4088) [ClassicSimilarity], result of:
              0.08662474 = score(doc=4088,freq=4.0), product of:
                0.2492982 = queryWeight, product of:
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.056051385 = queryNorm
                0.3474744 = fieldWeight in 4088, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4088)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    So far, methods for ellipsis and anaphor resolution have been developed and the effects of anaphor resolution have been analyzed in the context of statistical information retrieval of scientific abstracts. No significant improvements has been observed. Analyzes the effects of ellipsis and anaphor resolution on proximity searching in a full text database. Anaphora and ellipsis are classified on the basis of the type of their correlates / antecedents rather than, as traditional, on the basis of their own linguistic type. The classification differentiates proper names and common nouns of basic words, compound words, and phrases. The study was carried out in a newspaper article database containing 55.000 full text articles. A set of 154 keyword pairs in different categories was created. Human resolution of keyword ellipsis and anaphora was performed to identify sentences and paragraphs which would match proximity searches after resolution. Findings indicate that ellipsis and anaphor resolution is most relevant for proper name phrases and only marginal in the other keyword categories. Therefore the recall effect of restricted resolution of proper name phrases only was analyzed for keyword pairs containing at least 1 proper name phrase. Findings indicate a recall increase of 38.2% in sentence searches, and 28.8% in paragraph searches when proper name ellipsis were resolved. The recall increase was 17.6% sentence searches, and 19.8% in paragraph searches when proper name anaphora were resolved. Some simple and computationally justifiable resolution method might be developed only for proper name phrases to support keyword based full text information retrieval. Discusses elements of such a method
  7. Couvreur, T.R.; Benzel, R.N.; Miller, S.F.; Zeitler, D.N.; Lee, D.L.; Singhal, M.; Shivaratri, N.; Wong, W.Y.P.: ¬An analysis of performance and cost factors in searching large text databases using parallel search systems (1994) 0.01
    0.014292354 = product of:
      0.028584708 = sum of:
        0.028584708 = product of:
          0.08575412 = sum of:
            0.08575412 = weight(_text_:basis in 7657) [ClassicSimilarity], result of:
              0.08575412 = score(doc=7657,freq=2.0), product of:
                0.2492982 = queryWeight, product of:
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.056051385 = queryNorm
                0.3439821 = fieldWeight in 7657, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=7657)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    The results of modelling the performance of searching large text databases (>10 GBytes) via various parallel hardware architectures and search algorithms are discussed. The performance under load and the cost of each configuration are compared. Strengths, weaknesses, performance sensitivities, and search features supported for each configuration are also addressed. In addition, a common search workload used in the modelling is described. The search workload is derived from a set of searches run against the Chemical Abstracts file of bibliographic and abstract text available on STN International. This common workload is applied to all configurations modelled to provide a common basis of comparison
  8. Kugler, A.: Automatisierte Volltexterschließung von Retrodigitalisaten am Beispiel historischer Zeitungen (2018) 0.01
    0.012250589 = product of:
      0.024501178 = sum of:
        0.024501178 = product of:
          0.07350353 = sum of:
            0.07350353 = weight(_text_:basis in 4595) [ClassicSimilarity], result of:
              0.07350353 = score(doc=4595,freq=2.0), product of:
                0.2492982 = queryWeight, product of:
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.056051385 = queryNorm
                0.2948418 = fieldWeight in 4595, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.4476724 = idf(docFreq=1406, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4595)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    Seit ein paar Jahren postuliert die DFG in ihren Praxisregeln "Digitalisierung", dass eine ausschließliche Bilddigitalisierung nicht mehr den wissenschaftlichen Ansprüchen Genüge leiste, sondern der digitale Volltext notwendig sei, da dieser die Basis für eine wissenschaftliche Nachnutzung darstellt. Um ein besseres Verständnis davon zu erlangen, was sich hinter dem Begriff "Volltext" verbirgt, wird im Folgenden ein kleiner Einblick in die technischen Verfahren zur automatisierten Volltexterschließung von Retrodigitalisaten geboten. Fortschritte und auch Grenzen der aktuellen Methoden werden vorgestellt und wie Qualität in diesem Zusammenhang überhaupt bemessen werden kann. Die automatisierten Verfahren zur Volltexterschließung werden am Beispiel historischer Zeitungen erläutert, da deren Zugänglichmachung gerade in den Geisteswissenschaften ein großes Desiderat ist und diese Quellengattung zugleich aufgrund der Spaltenstruktur besondere technische Herausforderungen mit sich bringt. 2016 wurde das DFG-Projekt zur Erstellung eines "Masterplan Zeitungsdigitalisierung" fertiggestellt, dessen Ergebnisse hier einfließen.
  9. Sievert, M.E.; McKinin, E.J.: Why full-text misses some relevant documents : an analysis of documents not retrieved by CCML or MEDIS (1989) 0.01
    0.011391283 = product of:
      0.022782566 = sum of:
        0.022782566 = product of:
          0.045565132 = sum of:
            0.045565132 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
              0.045565132 = score(doc=3564,freq=2.0), product of:
                0.1962824 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.056051385 = queryNorm
                0.23214069 = fieldWeight in 3564, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3564)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    9. 1.1996 10:22:31