Document (#17510)

Author
Anzai, H.
Yamamoto, T.
Ishizuka, H.
Title
Experimental service of cataloguing database through WWW
Source
ULIS. 15(1996) no.2, S.1-16
Year
1996
Abstract
An information retrieval system for a cataloguing database through the WWW is developed, and experimentally served to Japan MARC and ULIS (Univeristy of Library and Information Science) OPAC data. Since Japanese words are not separated by obvious delimiters, ensuring the same segmentation between the query and the database is a problem. The present system solves the problem by using the multiple hash screening technique for processing both book titles and query strings, based on the same dictionary and using similar algorithms. Database management is handled by ADABAS, reducing management chores and and response time. The effectiveness of the multiple hash screening technique for a Japanese text based information system is examined, and the limitation of the Web's hypertext environment for a bibliographic information retrieval service is discussed
Footnote
[In Japanisch]

Similar documents (content)

  1. Rorvig, M.; Smith, M.M.; Uemura, A.: ¬The N-gram hypothesis applied to matched sets of visualized Japanese-English technical documents (1999) 0.12
    0.11910326 = sum of:
      0.11910326 = product of:
        0.7443954 = sum of:
          0.13756384 = weight(abstract_txt:japan in 6675) [ClassicSimilarity], result of:
            0.13756384 = score(doc=6675,freq=1.0), product of:
              0.16543499 = queryWeight, product of:
                1.0226433 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.02127866 = queryNorm
              0.8315281 = fieldWeight in 6675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.109375 = fieldNorm(doc=6675)
          0.01776819 = weight(abstract_txt:information in 6675) [ClassicSimilarity], result of:
            0.01776819 = score(doc=6675,freq=1.0), product of:
              0.06710269 = queryWeight, product of:
                1.3025982 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02127866 = queryNorm
              0.264791 = fieldWeight in 6675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.109375 = fieldNorm(doc=6675)
          0.109360754 = weight(abstract_txt:technique in 6675) [ClassicSimilarity], result of:
            0.109360754 = score(doc=6675,freq=1.0), product of:
              0.17887191 = queryWeight, product of:
                1.5038226 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.02127866 = queryNorm
              0.6113914 = fieldWeight in 6675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.109375 = fieldNorm(doc=6675)
          0.4797026 = weight(abstract_txt:japanese in 6675) [ClassicSimilarity], result of:
            0.4797026 = score(doc=6675,freq=3.0), product of:
              0.33233452 = queryWeight, product of:
                2.0498083 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.02127866 = queryNorm
              1.4434329 = fieldWeight in 6675, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.109375 = fieldNorm(doc=6675)
        0.16 = coord(4/25)
    
  2. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.11
    0.11180878 = sum of:
      0.11180878 = product of:
        0.5590439 = sum of:
          0.2682035 = weight(abstract_txt:segmentation in 4580) [ClassicSimilarity], result of:
            0.2682035 = score(doc=4580,freq=9.0), product of:
              0.1802514 = queryWeight, product of:
                1.0674556 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.02127866 = queryNorm
              1.4879413 = fieldWeight in 4580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.044905473 = weight(abstract_txt:problem in 4580) [ClassicSimilarity], result of:
            0.044905473 = score(doc=4580,freq=2.0), product of:
              0.11389799 = queryWeight, product of:
                1.2000064 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.02127866 = queryNorm
              0.39426047 = fieldWeight in 4580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.017585946 = weight(abstract_txt:information in 4580) [ClassicSimilarity], result of:
            0.017585946 = score(doc=4580,freq=3.0), product of:
              0.06710269 = queryWeight, product of:
                1.3025982 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02127866 = queryNorm
              0.26207513 = fieldWeight in 4580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.1658571 = weight(abstract_txt:delimiters in 4580) [ClassicSimilarity], result of:
            0.1658571 = score(doc=4580,freq=1.0), product of:
              0.2721485 = queryWeight, product of:
                1.3116364 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.02127866 = queryNorm
              0.6094361 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.062491857 = weight(abstract_txt:technique in 4580) [ClassicSimilarity], result of:
            0.062491857 = score(doc=4580,freq=1.0), product of:
              0.17887191 = queryWeight, product of:
                1.5038226 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.02127866 = queryNorm
              0.34936652 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
        0.2 = coord(5/25)
    
  3. Sormunen, E.; Kekäläinen, J.; Koivisto, J.; Järvelin, K.: Document text characteristics affect the ranking of the most relevant documents by expanded structured queries (2001) 0.11
    0.11083594 = sum of:
      0.11083594 = product of:
        0.46181643 = sum of:
          0.02309072 = weight(abstract_txt:through in 4487) [ClassicSimilarity], result of:
            0.02309072 = score(doc=4487,freq=1.0), product of:
              0.09210535 = queryWeight, product of:
                1.0791155 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.02127866 = queryNorm
              0.250699 = fieldWeight in 4487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
          0.054356024 = weight(abstract_txt:query in 4487) [ClassicSimilarity], result of:
            0.054356024 = score(doc=4487,freq=2.0), product of:
              0.12936446 = queryWeight, product of:
                1.2788894 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.02127866 = queryNorm
              0.4201774 = fieldWeight in 4487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
          0.020306503 = weight(abstract_txt:information in 4487) [ClassicSimilarity], result of:
            0.020306503 = score(doc=4487,freq=4.0), product of:
              0.06710269 = queryWeight, product of:
                1.3025982 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02127866 = queryNorm
              0.3026183 = fieldWeight in 4487, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
          0.029107692 = weight(abstract_txt:system in 4487) [ClassicSimilarity], result of:
            0.029107692 = score(doc=4487,freq=2.0), product of:
              0.097652964 = queryWeight, product of:
                1.3608613 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.02127866 = queryNorm
              0.2980728 = fieldWeight in 4487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
          0.062491857 = weight(abstract_txt:technique in 4487) [ClassicSimilarity], result of:
            0.062491857 = score(doc=4487,freq=1.0), product of:
              0.17887191 = queryWeight, product of:
                1.5038226 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.02127866 = queryNorm
              0.34936652 = fieldWeight in 4487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
          0.27246362 = weight(abstract_txt:screening in 4487) [ClassicSimilarity], result of:
            0.27246362 = score(doc=4487,freq=1.0), product of:
              0.47738147 = queryWeight, product of:
                2.456735 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.02127866 = queryNorm
              0.5707461 = fieldWeight in 4487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.0625 = fieldNorm(doc=4487)
        0.24 = coord(6/25)
    
  4. Cathro, W.S.: ¬The development of national bibliographic and document access services in Australia (1996) 0.09
    0.09154784 = sum of:
      0.09154784 = product of:
        0.45773917 = sum of:
          0.048982814 = weight(abstract_txt:through in 2579) [ClassicSimilarity], result of:
            0.048982814 = score(doc=2579,freq=2.0), product of:
              0.09210535 = queryWeight, product of:
                1.0791155 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.02127866 = queryNorm
              0.5318129 = fieldWeight in 2579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.09375 = fieldNorm(doc=2579)
          0.10266103 = weight(abstract_txt:service in 2579) [ClassicSimilarity], result of:
            0.10266103 = score(doc=2579,freq=4.0), product of:
              0.11972443 = queryWeight, product of:
                1.2303166 = boost
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.02127866 = queryNorm
              0.8574777 = fieldWeight in 2579, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5732145 = idf(docFreq=1240, maxDocs=44218)
                0.09375 = fieldNorm(doc=2579)
          0.015229877 = weight(abstract_txt:information in 2579) [ClassicSimilarity], result of:
            0.015229877 = score(doc=2579,freq=1.0), product of:
              0.06710269 = queryWeight, product of:
                1.3025982 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02127866 = queryNorm
              0.22696373 = fieldWeight in 2579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=2579)
          0.053474244 = weight(abstract_txt:system in 2579) [ClassicSimilarity], result of:
            0.053474244 = score(doc=2579,freq=3.0), product of:
              0.097652964 = queryWeight, product of:
                1.3608613 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.02127866 = queryNorm
              0.54759467 = fieldWeight in 2579, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=2579)
          0.23739122 = weight(abstract_txt:japanese in 2579) [ClassicSimilarity], result of:
            0.23739122 = score(doc=2579,freq=1.0), product of:
              0.33233452 = queryWeight, product of:
                2.0498083 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.02127866 = queryNorm
              0.71431404 = fieldWeight in 2579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.09375 = fieldNorm(doc=2579)
        0.2 = coord(5/25)
    
  5. Peng, F.; Huang, X.: Machine learning for Asian language text classification (2007) 0.09
    0.09010267 = sum of:
      0.09010267 = product of:
        0.5631417 = sum of:
          0.23653325 = weight(abstract_txt:segmentation in 831) [ClassicSimilarity], result of:
            0.23653325 = score(doc=831,freq=7.0), product of:
              0.1802514 = queryWeight, product of:
                1.0674556 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.02127866 = queryNorm
              1.3122408 = fieldWeight in 831, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.038133826 = weight(abstract_txt:same in 831) [ClassicSimilarity], result of:
            0.038133826 = score(doc=831,freq=1.0), product of:
              0.12868664 = queryWeight, product of:
                1.2755346 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.02127866 = queryNorm
              0.2963309 = fieldWeight in 831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.014358865 = weight(abstract_txt:information in 831) [ClassicSimilarity], result of:
            0.014358865 = score(doc=831,freq=2.0), product of:
              0.06710269 = queryWeight, product of:
                1.3025982 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02127866 = queryNorm
              0.21398345 = fieldWeight in 831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
          0.27411574 = weight(abstract_txt:japanese in 831) [ClassicSimilarity], result of:
            0.27411574 = score(doc=831,freq=3.0), product of:
              0.33233452 = queryWeight, product of:
                2.0498083 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.02127866 = queryNorm
              0.8248188 = fieldWeight in 831, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=831)
        0.16 = coord(4/25)