Search (59 results, page 1 of 3)

  • × theme_ss:"Kataloganreicherung"
  1. Ingwersen, P.; Wormell, I.: Modern indexing and retrieval techniques matching different types of information needs (1989) 0.02
    0.015971016 = product of:
      0.074531406 = sum of:
        0.016539034 = weight(_text_:information in 7322) [ClassicSimilarity], result of:
          0.016539034 = score(doc=7322,freq=4.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.3840108 = fieldWeight in 7322, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.109375 = fieldNorm(doc=7322)
        0.034724083 = weight(_text_:retrieval in 7322) [ClassicSimilarity], result of:
          0.034724083 = score(doc=7322,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.46789268 = fieldWeight in 7322, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.109375 = fieldNorm(doc=7322)
        0.02326829 = product of:
          0.04653658 = sum of:
            0.04653658 = weight(_text_:22 in 7322) [ClassicSimilarity], result of:
              0.04653658 = score(doc=7322,freq=2.0), product of:
                0.085914485 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02453417 = queryNorm
                0.5416616 = fieldWeight in 7322, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7322)
          0.5 = coord(1/2)
      0.21428572 = coord(3/14)
    
    Source
    International forum on information and documentation. 14(1989), S.17-22
  2. Smith, I.A.; O'Brien, A.: Signposts to information : keywords, indexes and thesauri (1993) 0.01
    0.010886654 = product of:
      0.050804388 = sum of:
        0.021511177 = weight(_text_:system in 8116) [ClassicSimilarity], result of:
          0.021511177 = score(doc=8116,freq=2.0), product of:
            0.07727166 = queryWeight, product of:
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.02453417 = queryNorm
            0.27838376 = fieldWeight in 8116, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.0625 = fieldNorm(doc=8116)
        0.009450877 = weight(_text_:information in 8116) [ClassicSimilarity], result of:
          0.009450877 = score(doc=8116,freq=4.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.21943474 = fieldWeight in 8116, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0625 = fieldNorm(doc=8116)
        0.019842334 = weight(_text_:retrieval in 8116) [ClassicSimilarity], result of:
          0.019842334 = score(doc=8116,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.26736724 = fieldWeight in 8116, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0625 = fieldNorm(doc=8116)
      0.21428572 = coord(3/14)
    
    Abstract
    Shortened and revised version of a paper presented during an Education Librarian's Group session at the Under One Umbrella Two Conference, organized by the Library Association in Manchester, 9-11 Jul 93. With the possibility of free text searching and easy access to CD-ROM databases, users often do not realise that subject searching is enhanced by familiarity with the controlled vocabulary underlying the indexing system of the database. Discusses the construction of information retrieval thesauri, the problems of terminology and how the use of thesauri can make searching more effective, using education thesauri as examples
  3. Kantor, P.B.; Voorhees, E.: Information retrieval with scanned texts (2000) 0.01
    0.010717765 = product of:
      0.07502435 = sum of:
        0.018901754 = weight(_text_:information in 3901) [ClassicSimilarity], result of:
          0.018901754 = score(doc=3901,freq=4.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.43886948 = fieldWeight in 3901, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=3901)
        0.056122594 = weight(_text_:retrieval in 3901) [ClassicSimilarity], result of:
          0.056122594 = score(doc=3901,freq=4.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.75622874 = fieldWeight in 3901, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=3901)
      0.14285715 = coord(2/14)
    
    Source
    Information retrieval. 2(2000), S.165-176
  4. Gratch, B.; Settel, B.; Atherton, P.: Characteristics of book indexes for subject retrieval in the humanities and social sciences (1978) 0.01
    0.008284625 = product of:
      0.057992373 = sum of:
        0.034724083 = weight(_text_:retrieval in 1061) [ClassicSimilarity], result of:
          0.034724083 = score(doc=1061,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.46789268 = fieldWeight in 1061, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.109375 = fieldNorm(doc=1061)
        0.02326829 = product of:
          0.04653658 = sum of:
            0.04653658 = weight(_text_:22 in 1061) [ClassicSimilarity], result of:
              0.04653658 = score(doc=1061,freq=2.0), product of:
                0.085914485 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02453417 = queryNorm
                0.5416616 = fieldWeight in 1061, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=1061)
          0.5 = coord(1/2)
      0.14285715 = coord(2/14)
    
    Source
    Indexer. 11(1978), S.14-22
  5. Wormell, I.: Factual data retrieval according to SAP-technique (1983) 0.01
    0.0075786044 = product of:
      0.053050227 = sum of:
        0.013365558 = weight(_text_:information in 845) [ClassicSimilarity], result of:
          0.013365558 = score(doc=845,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.3103276 = fieldWeight in 845, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=845)
        0.03968467 = weight(_text_:retrieval in 845) [ClassicSimilarity], result of:
          0.03968467 = score(doc=845,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.5347345 = fieldWeight in 845, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=845)
      0.14285715 = coord(2/14)
    
    Source
    International forum on information and documentation. 8(1983), S.13-15
  6. Wormell, I.: Factual data retrieval according to SAP technique (1983) 0.01
    0.0075786044 = product of:
      0.053050227 = sum of:
        0.013365558 = weight(_text_:information in 1399) [ClassicSimilarity], result of:
          0.013365558 = score(doc=1399,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.3103276 = fieldWeight in 1399, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=1399)
        0.03968467 = weight(_text_:retrieval in 1399) [ClassicSimilarity], result of:
          0.03968467 = score(doc=1399,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.5347345 = fieldWeight in 1399, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=1399)
      0.14285715 = coord(2/14)
    
    Source
    International forum on information and documentation. 8(1983) no.3, S.13-15
  7. Ingwersen, P.; Wormell, I.: Means to improve subject access and representation in modern information retrieval (1988) 0.01
    0.0075786044 = product of:
      0.053050227 = sum of:
        0.013365558 = weight(_text_:information in 4430) [ClassicSimilarity], result of:
          0.013365558 = score(doc=4430,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.3103276 = fieldWeight in 4430, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.125 = fieldNorm(doc=4430)
        0.03968467 = weight(_text_:retrieval in 4430) [ClassicSimilarity], result of:
          0.03968467 = score(doc=4430,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.5347345 = fieldWeight in 4430, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.125 = fieldNorm(doc=4430)
      0.14285715 = coord(2/14)
    
  8. Wormell, I.: Subject Access Project : the use of book indexes for subject retrieval systems in libraries (1981) 0.01
    0.0066312784 = product of:
      0.046418946 = sum of:
        0.011694863 = weight(_text_:information in 3116) [ClassicSimilarity], result of:
          0.011694863 = score(doc=3116,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.27153665 = fieldWeight in 3116, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.109375 = fieldNorm(doc=3116)
        0.034724083 = weight(_text_:retrieval in 3116) [ClassicSimilarity], result of:
          0.034724083 = score(doc=3116,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.46789268 = fieldWeight in 3116, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.109375 = fieldNorm(doc=3116)
      0.14285715 = coord(2/14)
    
    Source
    International forum on information and documentation. 6(1981), S.24-28
  9. Rädler, K.: Kataloganreicherung mit digitalen Inhaltsverzeichnissen eröffnet neue Geschäftsfelder : Erfahrungen aus der Vorarlberger Landesbibliothek (2008) 0.01
    0.0057039345 = product of:
      0.02661836 = sum of:
        0.005906798 = weight(_text_:information in 1942) [ClassicSimilarity], result of:
          0.005906798 = score(doc=1942,freq=4.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.13714671 = fieldWeight in 1942, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1942)
        0.012401459 = weight(_text_:retrieval in 1942) [ClassicSimilarity], result of:
          0.012401459 = score(doc=1942,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.16710453 = fieldWeight in 1942, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1942)
        0.008310104 = product of:
          0.016620208 = sum of:
            0.016620208 = weight(_text_:22 in 1942) [ClassicSimilarity], result of:
              0.016620208 = score(doc=1942,freq=2.0), product of:
                0.085914485 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02453417 = queryNorm
                0.19345059 = fieldWeight in 1942, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1942)
          0.5 = coord(1/2)
      0.21428572 = coord(3/14)
    
    Abstract
    Vom verflixten siebten Jahr kann man in Bregenz bisher nicht sprechen, im Gegenteil: Die Vorarlberger Landesbibliothek (VLB) hat kurz vor Ostern 2008 ihr 100.000. Inhaltsverzeichnis mit intelligentCAPTURE gescannt. Damit ist der Katalogeintrag von etwa 70 Prozent des betroffenen Freihandbestandes und sämtlicher Neuzugänge ab 2002 zusätzlich mit dem Inhaltsverzeichnis und zum Teil mit der Abbildung des Buchdeckels versehen. Die derzeitige Durchsatzrate beträgt pro Jahr 30.000 bibliographische Einheiten bei einem Personaleinsatz von 40 Stunden pro Woche. Die vollständige Erschließung des Buchbestandes in der Freihandaufstellung ist in Reichweite; ganz ohne Fördermittel im laufenden Betrieb; eine schlanke Lösung sowohl organisatorisch als auch finanziell. Da im Rahmen der Zusammenarbeit mit dandelon.com inzwischen weitere Bibliotheken in einem elegant gelösten Arbeitsablauf kooperativ erschließen, können wir bei Neuerscheinungen bereits die Hälfte als Fremddaten übernehmen. Die VLB hat mit der Digitalisierung, OCR-Konvertierung und maschinellen Indexierung von Inhaltsverzeichnissen und dem mehrsprachigen, semantischen Retrieval in der Suchmaschine dandelon.com zusammen mit der Firma AGI. Information Management Consultants Impulse im Bibliothekswesen gesetzt. Wer 2002 noch bezweifelte, dass dieser Ansatz erfolgreich sein wurde, wurde inzwischen eines Besseren belehrt. Zu offensichtlich ist der Mehrwert, den allein die Moglichkeit der entscheidend verbesserten Relevanzbeurteilung uber den OPAC bietet (http://vlb-katalog.vorarlberg.de). Er erfordert von einer zeitgemasen Bibliothek geradezu das Angebot solcher Zusatzinformationen. Aber die Anreicherung mit den Angaben aus den Inhaltsverzeichnissen bieten weit mehr, denn es entsteht durch die maschinelle Indexierung eine vollig neue Dimension an Retrievalmoglichkeiten. Die vollständige Erfassung unseres Bestandes war immer erklärtes Ziel unseres Projektes, weil wir gesehen haben, dass wir den Nutzern dadurch einen erheblichen Zusatznutzen bieten können, der in einem unbeschreiblich günstigen Verhältnis zum personellen und finanziellen Einsatz steht. Damit, dass wir dieses Ziel so schnell erreichen könnten, hatten wir nicht gerechnet.
    Date
    22. 6.2008 17:14:24
    Source
    Information - Wissenschaft und Praxis. 59(2008) H.4, S.241-242
  10. Hauer, M.: Collaborative Catalog Enrichment : Digitalisierung und Information Retrieval (2011) 0.01
    0.005333207 = product of:
      0.0248883 = sum of:
        0.004176737 = weight(_text_:information in 160) [ClassicSimilarity], result of:
          0.004176737 = score(doc=160,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.09697737 = fieldWeight in 160, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=160)
        0.012401459 = weight(_text_:retrieval in 160) [ClassicSimilarity], result of:
          0.012401459 = score(doc=160,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.16710453 = fieldWeight in 160, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.0390625 = fieldNorm(doc=160)
        0.008310104 = product of:
          0.016620208 = sum of:
            0.016620208 = weight(_text_:22 in 160) [ClassicSimilarity], result of:
              0.016620208 = score(doc=160,freq=2.0), product of:
                0.085914485 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02453417 = queryNorm
                0.19345059 = fieldWeight in 160, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=160)
          0.5 = coord(1/2)
      0.21428572 = coord(3/14)
    
    Source
    ¬Die Kraft der digitalen Unordnung: 32. Arbeits- und Fortbildungstagung der ASpB e. V., Sektion 5 im Deutschen Bibliotheksverband, 22.-25. September 2009 in der Universität Karlsruhe. Hrsg: Jadwiga Warmbrunn u.a
  11. Hauer, M.; Diedrichs, R.: Kataloganreicherung in Europa : Bibliotheken als Information-Retrieval-Systeme in einer digitalen Welt (2010) 0.01
    0.005114303 = product of:
      0.035800118 = sum of:
        0.0100241685 = weight(_text_:information in 3571) [ClassicSimilarity], result of:
          0.0100241685 = score(doc=3571,freq=8.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.23274569 = fieldWeight in 3571, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=3571)
        0.025775949 = weight(_text_:retrieval in 3571) [ClassicSimilarity], result of:
          0.025775949 = score(doc=3571,freq=6.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.34732026 = fieldWeight in 3571, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=3571)
      0.14285715 = coord(2/14)
    
    Abstract
    Bibliotheken sind nicht chaotische Sammlungen von Medien, sondern werden stets als Information-Retrieval-Systeme implementiert, denn das Wiederfinden bei exakten und vor allem bei vagen Anfragen ist von Anfang an das Sammlungsziel. Niemand kennt zum Zeitpunkt der Sammlung die Fragestellung, die einen zukünftigen Benutzer zu diesem Medium führen wird. Die Art der Sammlungsorganisation bestimmt in hohem Maße die zukünftig mögliche Antwortmenge und deren informatorische Qualität. Der Siegeszug der digitalen Information-Retrieval-Systeme im Internet hat in den letzten zehn Jahren Erwartungen und Märkte geschaffen, die weder Karteikarten, noch deren digitales Ebenbild, die heutigen Bibliotheksmanagement-Systeme befriedigen können. Es genügt nicht mehr, dass Google und andere kostenlos das aufsammeln und nachweisen, was im Internet frei zugänglich ist. Alte, noch »marktfähige« Inhalte werden neu aufbereitet, mit neuen Inhalten kombiniert, um neue Geschäftsfelder zu schaffen. Es geht um Verdrängung und Neuverteilung, und die älteren Inhalte der Bibliotheken sind meist noch hinreichend »marktfähig«.
    Series
    Lesesaal: Information digital
  12. Tseng, Y.-H.: Automatic cataloguing and searching for retrospective data by use of OCR text (2001) 0.00
    0.0049224477 = product of:
      0.034457132 = sum of:
        0.008681185 = weight(_text_:information in 5421) [ClassicSimilarity], result of:
          0.008681185 = score(doc=5421,freq=6.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.20156369 = fieldWeight in 5421, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=5421)
        0.025775949 = weight(_text_:retrieval in 5421) [ClassicSimilarity], result of:
          0.025775949 = score(doc=5421,freq=6.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.34732026 = fieldWeight in 5421, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=5421)
      0.14285715 = coord(2/14)
    
    Abstract
    This article describes our efforts in supporting information retrieval from OCR degraded text. In particular, we report our approach to an automatic cataloging and searching contest for books in multiple languages. In this contest, 500 books in English, German, French, and Italian published during the 1770s to 1970s are scanned into images and OCRed to digital text. The goal is to use only automatic ways to extract information for sophisticated searching. We adopted the vector space retrieval model, an n-gram indexing method, and a special weighting scheme to tackle this problem. Although the performance by this approach is slightly inferior to the best approach, which is mainly based on regular expression match, one advantage of our approach is that it is less language dependent and less layout sensitive, thus is readily applicable to other languages and document collections. Problems of OCR text retrieval for some Asian languages are also discussed in this article, and solutions are suggested
    Source
    Journal of the American Society for Information Science and technology. 52(2001) no.5, S.378-390
  13. Diodato, V.P.: Author indexing (1981) 0.00
    0.0047366275 = product of:
      0.03315639 = sum of:
        0.008353474 = weight(_text_:information in 6983) [ClassicSimilarity], result of:
          0.008353474 = score(doc=6983,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.19395474 = fieldWeight in 6983, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.078125 = fieldNorm(doc=6983)
        0.024802918 = weight(_text_:retrieval in 6983) [ClassicSimilarity], result of:
          0.024802918 = score(doc=6983,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.33420905 = fieldWeight in 6983, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.078125 = fieldNorm(doc=6983)
      0.14285715 = coord(2/14)
    
    Abstract
    Indexing terms supplied by authors can increase subject control of their documents. The terms can be used in the creation of indexes, abstracts, and other devices for information retrieval in the special library. An examination of the American Mathematical Society author indexing program suggests that contributions of authors enhance indexing efforts of editors
  14. Cousins, S.A.: Enhancing subject access to OPACs : controlled vocabulary vs. natural language (1992) 0.00
    0.0043982905 = product of:
      0.030788032 = sum of:
        0.0050120843 = weight(_text_:information in 2230) [ClassicSimilarity], result of:
          0.0050120843 = score(doc=2230,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.116372846 = fieldWeight in 2230, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2230)
        0.025775949 = weight(_text_:retrieval in 2230) [ClassicSimilarity], result of:
          0.025775949 = score(doc=2230,freq=6.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.34732026 = fieldWeight in 2230, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=2230)
      0.14285715 = coord(2/14)
    
    Abstract
    Experimental evidence suggests that enhancing the subject content of OPAC records can improve retrieval performance. This is based on the use of natural language index terms derived from the table of contents and back-of-the-book index of documents. The research reported here investigates the alternative approach of translating these natural language terms into controlled vocabulary. Subject queries were collected by interview at the catalogue, and indexing of the queries demonstrated the impressive ability of PRECIS, and to a lesser extent LCSH, to represent users' information needs. DDC performed poorly in this respect. The assumption was made that an index language adequately specific to represent users' queries should be adequate to represent document contents. Searches were carried out on three test databases, and both natural language and PRECIS enhancement of MARC records increased the number of relevant documents found, with PRECIS showing the better performance. However, with weak stemming the advantage of PRECIS was lost. Consideration must also be given to the potential advantages of controlled vocabulary, over and above basic retrieval performance measures
    Theme
    Verbale Doksprachen im Online-Retrieval
  15. Peis, E.; Fernandez-Molina, J.C.: Enrichment of bibliographic records of online catalogs through ORC and SGML technology (1998) 0.00
    0.0041357614 = product of:
      0.028950328 = sum of:
        0.018822279 = weight(_text_:system in 3385) [ClassicSimilarity], result of:
          0.018822279 = score(doc=3385,freq=2.0), product of:
            0.07727166 = queryWeight, product of:
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.02453417 = queryNorm
            0.2435858 = fieldWeight in 3385, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3385)
        0.010128049 = weight(_text_:information in 3385) [ClassicSimilarity], result of:
          0.010128049 = score(doc=3385,freq=6.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.23515764 = fieldWeight in 3385, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3385)
      0.14285715 = coord(2/14)
    
    Abstract
    Reports results of research into the feasibility of using OCR scanner technology to capture contents pages of collective monographs and to extract the bibliographic information of each individual work and process this using a standardized language, such as SGML, for tagging electronic documents. By this means, data can be used as electronic information or stored in OPACs, thus providing additional access points. Outlines a pilot system to test the initial hypotheses, show the feasibility of achieving the suggested goals and develop the tasks required for them to be carried out as automatically as possible
    Source
    Information technology and libraries. 17(1998) no.3, S.161-172
  16. Hauer, M.; Diedrichs, R.: Zwischenbilanz Collaborative Catalog Enrichment (2009) 0.00
    0.004060812 = product of:
      0.028425682 = sum of:
        0.007161612 = weight(_text_:information in 3148) [ClassicSimilarity], result of:
          0.007161612 = score(doc=3148,freq=12.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.16628155 = fieldWeight in 3148, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3148)
        0.02126407 = weight(_text_:retrieval in 3148) [ClassicSimilarity], result of:
          0.02126407 = score(doc=3148,freq=12.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.28652456 = fieldWeight in 3148, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3148)
      0.14285715 = coord(2/14)
    
    Content
    Leseprobe: "Bibliothek sind Information Retrieval Systeme Bibliotheken sind nicht chaotische Sammlungen von Medien, sondern sie werden stets als Information Retrieval Systeme implementiert, denn das Wiederfinden bei exakten und bei vagen Anfragen ist von Anfang an das Sammlungsziel. Niemand kennt zum Zeitpunkt der Sammlung die Fragestellung, welche einen zukünftigen Benutzerzu diesem Medium führen soll. Die Art der Sammlungsorganisation bestimmt in hohem Maße die zukünftig mögliche Antwortmenge und deren informatorische Qualität. Viele Sammler kennen die einzelnen Medien recht gut durch eigene Lektüre oder Nutzung, durch Verarbeitung der Inhalte in eigenen Schriften, durch Kenntnis der Autoren oder durch Einschätzungen, Empfehlungen, Meinungen von anderen. Diese Sammler sind "Antwortmaschinen" - können oft mit hoher Präzision und angepasst auf den Kenntnisstand des Fragenden Erklärungen geben, welche aus einer Summe von Medieninhalten gelernt wurden. Genau diesen Typ wünschen sich die meisten Benutzer, führt er doch schnell, fachlich ausgewogen und verständlich ans Ziel. Der nicht ganz so begabte Sammler oder Sammler deutlich größerer Medienmengen kann nur mehr oder weniger gut, auf einzelne Medien oder Mediengruppen hinführen, gibt aber keine fachliche Auskunft mehr - sondern liefert nur Hinweise auf mögliche "Antwort-Container". Zumindest seit der berühmten Bibliothek von Alexandria hilft dieser Sammler seinem eigenen Gedächtnis mit kurzen Notizen nach: Listen von Titeln, von Autoren, von Themen und Referenzen auf den Standort. Praktischer als geschriebene Listen sind wegen der leichteren Sortierbarkeit Karteikarten. Der digitale Record in den Datenbanksystemen der Bibliotheken ist logisch nichts anderes, nur deutlich schneller sortierbar. Als sich in den Siebziger Jahren die heutigen Bibliothekssysteme entwickelten, entschieden sich fast alle Anbieter für zumeist relationale Datenbank-Management-Systeme als Basis-Technologie - gut bewährt in Lagerverwaltung und Buchhaltung in Wirtschaft und Verwaltung. In Welten mit sehr wenig Textinformation. Vage Suche war im Ansatz nicht vorgesehen, ging es doch zunächst nur um digitale "Karteikarten".
    Information Retrieval, Digitalisierung, Speichersysteme, Virtualisierung und Weiterentwicklungen der Datenbank-Management-Systeme erlauben heute aber, dass Katalog und Medium technisch zusammenfallen und eine ganz neue Qualität erreichen können. Kataloganreicherung, wie seit 2002 von der Gruppe um dandelon.com betrieben - angefangen hat es in Bibliotheken in Vorarlberg, Liechtenstein und Schweiz, heute zählt Deutschland, Norwegen, Schweden und Italien dazu - versucht Information Retrieval-Technologie mit den relationalen Datenbanksystemen der Bibliotheken zusammen zubringen. Dazu werden mehr Daten benötigt, als die bisherigen bibliothekarischen Titelbeschreibungen hergeben. Sprachverarbeitungskonzepte sind notwendig, um die Vielfalt der Sprache wieder einzufangen und neue Konzepte für die Anzeige die-serTexte oder anderen Datentypen. Kataloganreicherung ist die derzeit technisch sinnvolle Voraussetzung zum Einsatz von moderner Information Retrieval-Technologien in Bibliotheken. Deren Einsatz ist bei Bibliotheksverbünden auf dem Vormarsch. Die maschinelle Indexierung - mit linguistischen und/oder statistischen Methoden - ist ein mögliches Verfahren in Information Retrieval-Systemen, dessen Resultate auch direkt in "klassischen" OPAC-Systemen nachgenutzt werden kann und damit auch dort eine Recherche auf breiterer terminologischer Basis gestattet. Diese Erweiterung des OPACs um die maschinellen Indexierungsergebnisse stand bei der Vorarlberger Landesbibliothek, dem Pionier unter den "Kataloganreichern" und wohl noch immer größten Einzel-Produzenten, schon 2002 vor der Anzeige der Inhaltsverzeichnisse im Vordergrund. Die maschinelle Indexierung aus intelligentCAPTURE kann leicht in die Kataloge übernommen werden. Schon bald zeigte sich aber im jeweiligen Bibliothekssystem das fehlende Ranking. 2004 startete deshalb "dandelon.com". Die maschinelle Indexierung wird in homöopathischer Dosis in den HEBIS-Katalog übernommen und kommt derzeit beim GBV. Für die Deutschen Nationalbibliothek hat Frau Direktor Dr. Niggemmann auf dem Deutschen Bibliothekartag 2009 in Erfurt die maschinelle Indexierung Klassifizierung, Extraktion weiterer Metadaten und die Ergänzung um zusätzliche Daten - auch abweichend von bisherigen Normdateien - als Projekt angekündigt und deren Übernahme in den Katalog. Ein Meilenstein! Die DNB kündigt damit den Perspektivenwechsel der bibliothekarischen Sicht hin zur Sicht des Endbenutzers an."
  17. Leissing, U.; Rädler, K.; Hauer, M.: Query-Expansion durch Fachthesauri : Erfahrungsbericht zu dandelon.com, Vorarlberger Parlamentsinformationssystem und vorarlberg.at (2010) 0.00
    0.0035505537 = product of:
      0.024853874 = sum of:
        0.014881751 = weight(_text_:retrieval in 3728) [ClassicSimilarity], result of:
          0.014881751 = score(doc=3728,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.20052543 = fieldWeight in 3728, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=3728)
        0.009972124 = product of:
          0.019944249 = sum of:
            0.019944249 = weight(_text_:22 in 3728) [ClassicSimilarity], result of:
              0.019944249 = score(doc=3728,freq=2.0), product of:
                0.085914485 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.02453417 = queryNorm
                0.23214069 = fieldWeight in 3728, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3728)
          0.5 = coord(1/2)
      0.14285715 = coord(2/14)
    
    Source
    Wissensspeicher in digitalen Räumen: Nachhaltigkeit - Verfügbarkeit - semantische Interoperabilität. Proceedings der 11. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Konstanz, 20. bis 22. Februar 2008. Hrsg.: J. Sieglerschmidt u. H.P.Ohly
    Theme
    Verbale Doksprachen im Online-Retrieval
  18. Bett, C.E.: ¬The Subject Access Project : a comparison with PRECIS (1979) 0.00
    0.0030730253 = product of:
      0.043022353 = sum of:
        0.043022353 = weight(_text_:system in 829) [ClassicSimilarity], result of:
          0.043022353 = score(doc=829,freq=2.0), product of:
            0.07727166 = queryWeight, product of:
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.02453417 = queryNorm
            0.5567675 = fieldWeight in 829, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1495528 = idf(docFreq=5152, maxDocs=44218)
              0.125 = fieldNorm(doc=829)
      0.071428575 = coord(1/14)
    
    Theme
    Preserved Context Index System (PRECIS)
  19. Lohmann, H.: KASCADE: Dokumentanreicherung und automatische Inhaltserschließung : Projektbericht und Ergebnisse des Retrievaltests (2000) 0.00
    0.0030709715 = product of:
      0.021496799 = sum of:
        0.0041347584 = weight(_text_:information in 494) [ClassicSimilarity], result of:
          0.0041347584 = score(doc=494,freq=4.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.0960027 = fieldWeight in 494, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02734375 = fieldNorm(doc=494)
        0.017362041 = weight(_text_:retrieval in 494) [ClassicSimilarity], result of:
          0.017362041 = score(doc=494,freq=8.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.23394634 = fieldWeight in 494, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02734375 = fieldNorm(doc=494)
      0.14285715 = coord(2/14)
    
    Abstract
    Verbesserungen des Gewichtungsverfahrens sollten schließlich unterstützt werden durch Maßnahmen zur Optimierung der Retrievalumgebung, etwa durch - den Einsatz von Hypertextwerkzeugen; - die Integration der THEAS-Komponente, mit der die automatische Mehrwortgruppengewinnung mit Hilfe eines Mehrwortgruppen-Parsers möglich ist; dies könnte im Rahmen der Dialogführung mit dem Nutzer eingesetzt werden, wenn dieser Teilkomponenten solcher Mehrwortgruppen im Retrieval verwendet. Mit THEAS wäre daneben der Einstieg in das Retrieval über das Register mit kanonischen Formen möglich; - die Integration von Wörterbuchfunktionen zur Benutzerunterstützung. Eine Weiterentwicklung des SELIX-Verfahrens könnte daneben in diese Richtungen erfolgen: - Bildung von Dokument-Clustern. Dabei werden Dokumente in einem Dokumenten-Raum einander in dem Maße zugeordnet, in dem ihre selektierten Grundformen übereinstimmen. - Errichtung von statistisch basierten semantischen Netzen, in denen Grundformen einander in einem Begriffs-Raum in dem Maße zugeordnet werden, in dem sie "gemeinsam" in Dokumenten auftreten.
    RSWK
    Online-Katalog / Automatische Indexierung / Inhaltsverzeichnis / Scanning / Information Retrieval / Projekt
    Subject
    Online-Katalog / Automatische Indexierung / Inhaltsverzeichnis / Scanning / Information Retrieval / Projekt
  20. Pienaar, R.E.: Enhancement of subject access in online public access catalogues (OPACs) (1989) 0.00
    0.0028419765 = product of:
      0.019893834 = sum of:
        0.0050120843 = weight(_text_:information in 2863) [ClassicSimilarity], result of:
          0.0050120843 = score(doc=2863,freq=2.0), product of:
            0.04306919 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.02453417 = queryNorm
            0.116372846 = fieldWeight in 2863, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2863)
        0.014881751 = weight(_text_:retrieval in 2863) [ClassicSimilarity], result of:
          0.014881751 = score(doc=2863,freq=2.0), product of:
            0.07421378 = queryWeight, product of:
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.02453417 = queryNorm
            0.20052543 = fieldWeight in 2863, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.024915 = idf(docFreq=5836, maxDocs=44218)
              0.046875 = fieldNorm(doc=2863)
      0.14285715 = coord(2/14)
    
    Abstract
    Subject access plays an important part when plans are made for an on-line public access catalogue (OPAC). Therefore, current subject searching facilities in OPACs should be accessed carefully. These facilities include, amongst other things, phrase searching, keyword searching and class number retrieval. Users reaction have been instrumental in identifying certain shortcomings in subject access to OPACs. Points out certain shortcomings in subject access, as shown by a number of research projects undertaken during the past decade. Highlights aspects which could, in future, improve subject access in OPACs. This could be achieved by either enhancing content representation in the records within the OPAC data base or by enhancing the searchability and browsability of the OPAC. Considers whether the on-line catalogue will ever be a finished, perfect product, or achieve the universal, familiar uniformity experienced by users of the 20th century card catalogue.
    Source
    South African journal of library and information science. 57(1989) no.4, S.378-382