Search (19 results, page 1 of 1)

  • × theme_ss:"Volltextretrieval"
  1. Kugler, A.: Automatisierte Volltexterschließung von Retrodigitalisaten am Beispiel historischer Zeitungen (2018) 0.01
    0.0148078 = product of:
      0.088846795 = sum of:
        0.088846795 = weight(_text_:digitalisierung in 4595) [ClassicSimilarity], result of:
          0.088846795 = score(doc=4595,freq=2.0), product of:
            0.2226278 = queryWeight, product of:
              6.0201335 = idf(docFreq=291, maxDocs=44218)
              0.036980543 = queryNorm
            0.3990822 = fieldWeight in 4595, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.0201335 = idf(docFreq=291, maxDocs=44218)
              0.046875 = fieldNorm(doc=4595)
      0.16666667 = coord(1/6)
    
    Abstract
    Seit ein paar Jahren postuliert die DFG in ihren Praxisregeln "Digitalisierung", dass eine ausschließliche Bilddigitalisierung nicht mehr den wissenschaftlichen Ansprüchen Genüge leiste, sondern der digitale Volltext notwendig sei, da dieser die Basis für eine wissenschaftliche Nachnutzung darstellt. Um ein besseres Verständnis davon zu erlangen, was sich hinter dem Begriff "Volltext" verbirgt, wird im Folgenden ein kleiner Einblick in die technischen Verfahren zur automatisierten Volltexterschließung von Retrodigitalisaten geboten. Fortschritte und auch Grenzen der aktuellen Methoden werden vorgestellt und wie Qualität in diesem Zusammenhang überhaupt bemessen werden kann. Die automatisierten Verfahren zur Volltexterschließung werden am Beispiel historischer Zeitungen erläutert, da deren Zugänglichmachung gerade in den Geisteswissenschaften ein großes Desiderat ist und diese Quellengattung zugleich aufgrund der Spaltenstruktur besondere technische Herausforderungen mit sich bringt. 2016 wurde das DFG-Projekt zur Erstellung eines "Masterplan Zeitungsdigitalisierung" fertiggestellt, dessen Ergebnisse hier einfließen.
  2. Reinisch, F.: Wer suchet - der findet? : oder Die Überwindung der sprachlichen Grenzen bei der Suche in Volltextdatenbanken (2000) 0.01
    0.008947723 = product of:
      0.053686332 = sum of:
        0.053686332 = product of:
          0.080529496 = sum of:
            0.04044667 = weight(_text_:29 in 4919) [ClassicSimilarity], result of:
              0.04044667 = score(doc=4919,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.31092256 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
            0.040082823 = weight(_text_:22 in 4919) [ClassicSimilarity], result of:
              0.040082823 = score(doc=4919,freq=2.0), product of:
                0.12949955 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.036980543 = queryNorm
                0.30952093 = fieldWeight in 4919, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=4919)
          0.6666667 = coord(2/3)
      0.16666667 = coord(1/6)
    
    Date
    22. 7.2000 17:48:06
    Source
    Dokumente und Datenbanken in elektronischen Netzen: Tagungsberichte vom 6. und 7. Österreichischen Online-Informationstreffen bzw. vom 7. und 8. Österreichischen Dokumentartag, Schloß Seggau, Seggauberg bei Leibnitz, 26.-29. September 1995, Congresszentrum Igls bei Innsbruck, 21.-24. Oktober 1997. Hrsg.: E. Pipp
  3. Tenopir, C.: Searching by controlled vocabulary or free text (1987) 0.00
    0.0044940747 = product of:
      0.026964447 = sum of:
        0.026964447 = product of:
          0.08089334 = sum of:
            0.08089334 = weight(_text_:29 in 1350) [ClassicSimilarity], result of:
              0.08089334 = score(doc=1350,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.6218451 = fieldWeight in 1350, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.125 = fieldNorm(doc=1350)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    Library journal. 112(1987) Nov.15, S.58-29
  4. Pipp, E.: Volltextdatenbanken im Vergleich (2001) 0.00
    0.0039323154 = product of:
      0.023593891 = sum of:
        0.023593891 = product of:
          0.07078167 = sum of:
            0.07078167 = weight(_text_:29 in 6509) [ClassicSimilarity], result of:
              0.07078167 = score(doc=6509,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.5441145 = fieldWeight in 6509, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6509)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    29. 9.2001 11:28:25
  5. Witt, M.: Au sujet des mots-clés (1997) 0.00
    0.0031777907 = product of:
      0.019066744 = sum of:
        0.019066744 = product of:
          0.05720023 = sum of:
            0.05720023 = weight(_text_:29 in 1666) [ClassicSimilarity], result of:
              0.05720023 = score(doc=1666,freq=4.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.43971092 = fieldWeight in 1666, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1666)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    29. 1.1996 16:50:24
    29. 7.1998 18:19:41
  6. Blair, D.C.; Maron, M.E.: ¬An evaluation of retrieval effectiveness for a full-text document-retrieval system (1985) 0.00
    0.002808797 = product of:
      0.016852781 = sum of:
        0.016852781 = product of:
          0.05055834 = sum of:
            0.05055834 = weight(_text_:29 in 1345) [ClassicSimilarity], result of:
              0.05055834 = score(doc=1345,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.38865322 = fieldWeight in 1345, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1345)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Footnote
    Vgl. auch : Salton, G.: Another look ... Comm. ACM 29(1986) S.S.648-656; Blair, D.C.: Full text retrieval ... Int. Class. 13(1986) S.18-23: Blair, D.C., M.E. Maron: Full-text information retrieval ... Inf. proc. man. 26(1990) S.437-447.
  7. Salton, G.: Another look at automatic text-retrieval systems (1986) 0.00
    0.002808797 = product of:
      0.016852781 = sum of:
        0.016852781 = product of:
          0.05055834 = sum of:
            0.05055834 = weight(_text_:29 in 1356) [ClassicSimilarity], result of:
              0.05055834 = score(doc=1356,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.38865322 = fieldWeight in 1356, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1356)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    Communications of the Association for Computing Machinery. 29(1986), S.648-656
  8. Wenzel, F.: Semantische Eingrenzung im Freitext-Retrieval auf der Basis morphologischer Segmentierungen (1980) 0.00
    0.002808797 = product of:
      0.016852781 = sum of:
        0.016852781 = product of:
          0.05055834 = sum of:
            0.05055834 = weight(_text_:29 in 2037) [ClassicSimilarity], result of:
              0.05055834 = score(doc=2037,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.38865322 = fieldWeight in 2037, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2037)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    Nachrichten für Dokumentation. 31(1980) H.1, S.29-35
  9. Molto, M.: Improving full text search performance through textual analysis (1993) 0.00
    0.0022470374 = product of:
      0.013482223 = sum of:
        0.013482223 = product of:
          0.04044667 = sum of:
            0.04044667 = weight(_text_:29 in 5099) [ClassicSimilarity], result of:
              0.04044667 = score(doc=5099,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.31092256 = fieldWeight in 5099, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5099)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    Information processing and management. 29(1993) no.5, S.614-632
  10. Kristensen, J.: Expanding end-users' query statements for free text searching with a search-aid thesaurus (1993) 0.00
    0.0022470374 = product of:
      0.013482223 = sum of:
        0.013482223 = product of:
          0.04044667 = sum of:
            0.04044667 = weight(_text_:29 in 6621) [ClassicSimilarity], result of:
              0.04044667 = score(doc=6621,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.31092256 = fieldWeight in 6621, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6621)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    Information processing and management. 29(1993) no.6, S.733-744
  11. Laegreid, J.A.: SIFT: a Norwegian information retrieval system (1993) 0.00
    0.0022268237 = product of:
      0.013360942 = sum of:
        0.013360942 = product of:
          0.040082823 = sum of:
            0.040082823 = weight(_text_:22 in 7701) [ClassicSimilarity], result of:
              0.040082823 = score(doc=7701,freq=2.0), product of:
                0.12949955 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.036980543 = queryNorm
                0.30952093 = fieldWeight in 7701, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7701)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    23. 1.1999 19:22:09
  12. Zillmann, H.: OSIRIS und eLib : Information Retrieval und Search Engines in Full-text Databases (2001) 0.00
    0.0022268237 = product of:
      0.013360942 = sum of:
        0.013360942 = product of:
          0.040082823 = sum of:
            0.040082823 = weight(_text_:22 in 5937) [ClassicSimilarity], result of:
              0.040082823 = score(doc=5937,freq=2.0), product of:
                0.12949955 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.036980543 = queryNorm
                0.30952093 = fieldWeight in 5937, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5937)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    14. 6.2001 12:22:31
  13. Dambeck, H.; Engler, T.: Gesucht und gefunden : Neun Volltext-Suchprogramme für den Desktop (2002) 0.00
    0.0022268237 = product of:
      0.013360942 = sum of:
        0.013360942 = product of:
          0.040082823 = sum of:
            0.040082823 = weight(_text_:22 in 1169) [ClassicSimilarity], result of:
              0.040082823 = score(doc=1169,freq=2.0), product of:
                0.12949955 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.036980543 = queryNorm
                0.30952093 = fieldWeight in 1169, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1169)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Source
    c't. 2002, H.22, S.190-197
  14. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998) 0.00
    0.0019661577 = product of:
      0.011796946 = sum of:
        0.011796946 = product of:
          0.035390835 = sum of:
            0.035390835 = weight(_text_:29 in 1682) [ClassicSimilarity], result of:
              0.035390835 = score(doc=1682,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.27205724 = fieldWeight in 1682, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1682)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    29. 7.1998 19:54:49
  15. Preston, L.A.; Ebbs, C.M.; Luther, J.: 'Full text' access evaluation : are we getting the real thing? (1998) 0.00
    0.0019661577 = product of:
      0.011796946 = sum of:
        0.011796946 = product of:
          0.035390835 = sum of:
            0.035390835 = weight(_text_:29 in 2695) [ClassicSimilarity], result of:
              0.035390835 = score(doc=2695,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.27205724 = fieldWeight in 2695, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2695)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Footnote
    Part of an issue devoted to 'Experimentation and collaboration: creating series for a new millenium', part 2, Proceedings of the North American Serials Interest Group, Inc.'s 12th annual conference, 29 May - 1 June 1997, University of Michigan Ann Arbor, Michigan
  16. Blair, D.C.: Full text retrieval : Evaluation and implications (1986) 0.00
    0.0016852779 = product of:
      0.010111667 = sum of:
        0.010111667 = product of:
          0.030335002 = sum of:
            0.030335002 = weight(_text_:29 in 2047) [ClassicSimilarity], result of:
              0.030335002 = score(doc=2047,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.23319192 = fieldWeight in 2047, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2047)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Footnote
    Vgl.: Blair, D.C., M.E. Maron: An evaluation ... Comm. ACM 28(1985) S.280-299; Salton, G.: Another look ... Comm. ACM 29(1986) S.648-656; Blair, D.C., M.E. Maron: Full-text information retrieval ... Inf. Proc. Man. 26(1990) S.437-447.
  17. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.00
    0.0016852779 = product of:
      0.010111667 = sum of:
        0.010111667 = product of:
          0.030335002 = sum of:
            0.030335002 = weight(_text_:29 in 27) [ClassicSimilarity], result of:
              0.030335002 = score(doc=27,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.23319192 = fieldWeight in 27, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.046875 = fieldNorm(doc=27)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    9.12.1997 18:33:29
  18. Sievert, M.E.; McKinin, E.J.: Why full-text misses some relevant documents : an analysis of documents not retrieved by CCML or MEDIS (1989) 0.00
    0.0016701177 = product of:
      0.010020706 = sum of:
        0.010020706 = product of:
          0.030062117 = sum of:
            0.030062117 = weight(_text_:22 in 3564) [ClassicSimilarity], result of:
              0.030062117 = score(doc=3564,freq=2.0), product of:
                0.12949955 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.036980543 = queryNorm
                0.23214069 = fieldWeight in 3564, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3564)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    9. 1.1996 10:22:31
  19. Rösener, C.: ¬Die Stecknadel im Heuhaufen : Natürlichsprachlicher Zugang zu Volltextdatenbanken (2005) 0.00
    0.0011235187 = product of:
      0.0067411116 = sum of:
        0.0067411116 = product of:
          0.020223334 = sum of:
            0.020223334 = weight(_text_:29 in 548) [ClassicSimilarity], result of:
              0.020223334 = score(doc=548,freq=2.0), product of:
                0.13008599 = queryWeight, product of:
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.036980543 = queryNorm
                0.15546128 = fieldWeight in 548, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5176873 = idf(docFreq=3565, maxDocs=44218)
                  0.03125 = fieldNorm(doc=548)
          0.33333334 = coord(1/3)
      0.16666667 = coord(1/6)
    
    Date
    29. 3.2009 11:11:45