Document (#27227)

Author
Melucci, M.
Title
Making digital libraries effective : automatic generation of links for similarity search across hyper-textbooks
Source
Journal of the American Society for Information Science and technology. 55(2004) no.5, S.414-430
Year
2004
Abstract
Textbooks are more available in electronic format now than in the past. Because textbooks are typically large, the end user needs effective tools to rapidly access information encapsulated in textbooks stored in digital libraries. Statistical similarity-based links among hypertextbooks are a means to provide those tools. In this paper, the design and the implementation of a tool that generates networks of links within and across hypertextbooks through a completely automatic and unsupervised procedure is described. The design is based an statistical techniques. The overall methodology is presented together with the results of a case study reached through a working prototype that shows that connecting hyper-textbooks is an efficient way to provide an effective retrieval capability.
Theme
Computer Based Training
Hypertext

Similar documents (author)

  1. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:melucci in 1150) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 1150, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=1150)
    
  2. Melucci, M.: Contextual search : a computational framework (2012) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:melucci in 4913) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 4913, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=4913)
    
  3. Agosti, M.; Melucci, M.: Information retrieval techniques for the automatic construction of hypertext (2000) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:melucci in 4671) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 4671, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=4671)
    
  4. Melucci, M.; Orio, N.: Combining melody processing and information retrieval techniques : methodology, evaluation, and system implementation (2004) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:melucci in 3087) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 3087, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=3087)
    
  5. Melucci, M.; Orio, N.: Design, implementation, and evaluation of a methodology for automatic stemmer generation (2007) 4.65
    4.649496 = sum of:
      4.649496 = weight(author_txt:melucci in 268) [ClassicSimilarity], result of:
        4.649496 = fieldWeight in 268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.5 = fieldNorm(doc=268)
    

Similar documents (content)

  1. Zhang, Y.; Sun, Y.; Xie, B.: Quality of health information for consumers on the web : a systematic review of indicators, criteria, tools, and evaluation results (2015) 0.10
    0.09795635 = sum of:
      0.09795635 = product of:
        0.48978177 = sum of:
          0.041721135 = weight(abstract_txt:typically in 2218) [ClassicSimilarity], result of:
            0.041721135 = score(doc=2218,freq=1.0), product of:
              0.10219348 = queryWeight, product of:
                1.0060406 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.015550873 = queryNorm
              0.40825632 = fieldWeight in 2218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0625 = fieldNorm(doc=2218)
          0.013358575 = weight(abstract_txt:that in 2218) [ClassicSimilarity], result of:
            0.013358575 = score(doc=2218,freq=5.0), product of:
              0.04034066 = queryWeight, product of:
                1.0948031 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015550873 = queryNorm
              0.3311442 = fieldWeight in 2218, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2218)
          0.025518613 = weight(abstract_txt:design in 2218) [ClassicSimilarity], result of:
            0.025518613 = score(doc=2218,freq=2.0), product of:
              0.073636055 = queryWeight, product of:
                1.2077142 = boost
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.015550873 = queryNorm
              0.34655052 = fieldWeight in 2218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.0625 = fieldNorm(doc=2218)
          0.056064837 = weight(abstract_txt:across in 2218) [ClassicSimilarity], result of:
            0.056064837 = score(doc=2218,freq=2.0), product of:
              0.12444559 = queryWeight, product of:
                1.5700326 = boost
                5.097017 = idf(docFreq=734, maxDocs=44218)
                0.015550873 = queryNorm
              0.45051688 = fieldWeight in 2218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.097017 = idf(docFreq=734, maxDocs=44218)
                0.0625 = fieldNorm(doc=2218)
          0.3531186 = weight(abstract_txt:textbooks in 2218) [ClassicSimilarity], result of:
            0.3531186 = score(doc=2218,freq=1.0), product of:
              0.72575414 = queryWeight, product of:
                5.994924 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.015550873 = queryNorm
              0.48655403 = fieldWeight in 2218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=2218)
        0.2 = coord(5/25)
    
  2. Wilson, R.; Landoni, M.; Gibb, F.: ¬The WEB Book experiments in electronic textbook design (2003) 0.10
    0.09555801 = sum of:
      0.09555801 = product of:
        0.79631674 = sum of:
          0.008961204 = weight(abstract_txt:that in 4449) [ClassicSimilarity], result of:
            0.008961204 = score(doc=4449,freq=1.0), product of:
              0.04034066 = queryWeight, product of:
                1.0948031 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015550873 = queryNorm
              0.22213829 = fieldWeight in 4449, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=4449)
          0.038277924 = weight(abstract_txt:design in 4449) [ClassicSimilarity], result of:
            0.038277924 = score(doc=4449,freq=2.0), product of:
              0.073636055 = queryWeight, product of:
                1.2077142 = boost
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.015550873 = queryNorm
              0.5198258 = fieldWeight in 4449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9207718 = idf(docFreq=2382, maxDocs=44218)
                0.09375 = fieldNorm(doc=4449)
          0.7490776 = weight(abstract_txt:textbooks in 4449) [ClassicSimilarity], result of:
            0.7490776 = score(doc=4449,freq=2.0), product of:
              0.72575414 = queryWeight, product of:
                5.994924 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.015550873 = queryNorm
              1.0321369 = fieldWeight in 4449, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.09375 = fieldNorm(doc=4449)
        0.12 = coord(3/25)
    
  3. Kousha, K.; Thelwall, M.: ¬An automatic method for assessing the teaching impact of books from online academic syllabi (2016) 0.09
    0.09288409 = sum of:
      0.09288409 = product of:
        0.46442047 = sum of:
          0.010347508 = weight(abstract_txt:that in 3226) [ClassicSimilarity], result of:
            0.010347508 = score(doc=3226,freq=3.0), product of:
              0.04034066 = queryWeight, product of:
                1.0948031 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015550873 = queryNorm
              0.2565032 = fieldWeight in 3226, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3226)
          0.019321693 = weight(abstract_txt:through in 3226) [ClassicSimilarity], result of:
            0.019321693 = score(doc=3226,freq=1.0), product of:
              0.07707128 = queryWeight, product of:
                1.2355639 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.015550873 = queryNorm
              0.250699 = fieldWeight in 3226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.0625 = fieldNorm(doc=3226)
          0.039643828 = weight(abstract_txt:across in 3226) [ClassicSimilarity], result of:
            0.039643828 = score(doc=3226,freq=1.0), product of:
              0.12444559 = queryWeight, product of:
                1.5700326 = boost
                5.097017 = idf(docFreq=734, maxDocs=44218)
                0.015550873 = queryNorm
              0.31856355 = fieldWeight in 3226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.097017 = idf(docFreq=734, maxDocs=44218)
                0.0625 = fieldNorm(doc=3226)
          0.041988842 = weight(abstract_txt:automatic in 3226) [ClassicSimilarity], result of:
            0.041988842 = score(doc=3226,freq=1.0), product of:
              0.12930591 = queryWeight, product of:
                1.6003984 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015550873 = queryNorm
              0.32472485 = fieldWeight in 3226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=3226)
          0.3531186 = weight(abstract_txt:textbooks in 3226) [ClassicSimilarity], result of:
            0.3531186 = score(doc=3226,freq=1.0), product of:
              0.72575414 = queryWeight, product of:
                5.994924 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.015550873 = queryNorm
              0.48655403 = fieldWeight in 3226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=3226)
        0.2 = coord(5/25)
    
  4. Johnson, K.G.: ¬A select survey of AACR2 tools for serials cataloging (1997) 0.09
    0.092519194 = sum of:
      0.092519194 = product of:
        0.7709933 = sum of:
          0.0119482735 = weight(abstract_txt:that in 909) [ClassicSimilarity], result of:
            0.0119482735 = score(doc=909,freq=1.0), product of:
              0.04034066 = queryWeight, product of:
                1.0948031 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015550873 = queryNorm
              0.2961844 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.125 = fieldNorm(doc=909)
          0.0528078 = weight(abstract_txt:tools in 909) [ClassicSimilarity], result of:
            0.0528078 = score(doc=909,freq=1.0), product of:
              0.094909094 = queryWeight, product of:
                1.3711116 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.015550873 = queryNorm
              0.556404 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.125 = fieldNorm(doc=909)
          0.7062372 = weight(abstract_txt:textbooks in 909) [ClassicSimilarity], result of:
            0.7062372 = score(doc=909,freq=1.0), product of:
              0.72575414 = queryWeight, product of:
                5.994924 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.015550873 = queryNorm
              0.97310805 = fieldWeight in 909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.125 = fieldNorm(doc=909)
        0.12 = coord(3/25)
    
  5. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.09
    0.08915868 = sum of:
      0.08915868 = product of:
        0.3714945 = sum of:
          0.015521261 = weight(abstract_txt:that in 1345) [ClassicSimilarity], result of:
            0.015521261 = score(doc=1345,freq=3.0), product of:
              0.04034066 = queryWeight, product of:
                1.0948031 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015550873 = queryNorm
              0.38475478 = fieldWeight in 1345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.0993213 = weight(abstract_txt:unsupervised in 1345) [ClassicSimilarity], result of:
            0.0993213 = score(doc=1345,freq=1.0), product of:
              0.1390443 = queryWeight, product of:
                1.173493 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.015550873 = queryNorm
              0.71431404 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.028982539 = weight(abstract_txt:through in 1345) [ClassicSimilarity], result of:
            0.028982539 = score(doc=1345,freq=1.0), product of:
              0.07707128 = queryWeight, product of:
                1.2355639 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.015550873 = queryNorm
              0.3760485 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.06298327 = weight(abstract_txt:automatic in 1345) [ClassicSimilarity], result of:
            0.06298327 = score(doc=1345,freq=1.0), product of:
              0.12930591 = queryWeight, product of:
                1.6003984 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.015550873 = queryNorm
              0.48708728 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.0884907 = weight(abstract_txt:similarity in 1345) [ClassicSimilarity], result of:
            0.0884907 = score(doc=1345,freq=1.0), product of:
              0.16220592 = queryWeight, product of:
                1.7924715 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.015550873 = queryNorm
              0.54554546 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
          0.07619544 = weight(abstract_txt:effective in 1345) [ClassicSimilarity], result of:
            0.07619544 = score(doc=1345,freq=1.0), product of:
              0.16805497 = queryWeight, product of:
                2.2345507 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.015550873 = queryNorm
              0.45339596 = fieldWeight in 1345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.09375 = fieldNorm(doc=1345)
        0.24 = coord(6/25)