Document (#2301)

Author
Al-Hawamdeh, S.
Smith, G.
Willett, P.
Vere, R. de
Title
Using nearest-neighbour searching techniques to access full-text documents
Source
Online review. 15(1991) nos.3/4, pp.173-190
Year
1991
Abstract
Summarises the results to date of a continuing programme of research at Sheffield University investigating the use of nearest-neighbour retrieval algorithms for full-text searching. Given a natural-language query statement, the methods rank the paragraphs of a full-text document in order of decreasing similarity with the query, where the similarity of each paragraph is the number of keyword stems that it has in common with the query.
Theme
Retrieval algorithms
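
The ranking described in the abstract (paragraphs ordered by the number of keyword stems shared with the query) can be illustrated with a minimal Python sketch. The stop-word list, the suffix-stripping `crude_stem` helper and the function names are assumptions made for this illustration; the record does not say which stemmer or stop list the Sheffield work used.

```python
import re

# Hypothetical stop-word list, for illustration only.
STOPWORDS = {"the", "of", "a", "an", "in", "to", "and", "for", "is", "on", "with"}


def crude_stem(word):
    """Strip a few common suffixes; a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def stems(text):
    """Return the set of keyword stems found in a piece of text."""
    words = re.findall(r"[a-z]+", text.lower())
    return {crude_stem(w) for w in words if w not in STOPWORDS}


def rank_paragraphs(query, paragraphs):
    """Rank paragraphs by the number of stems shared with the query.

    Returns (shared_stem_count, paragraph_index) pairs in order of
    decreasing similarity; ties keep the original paragraph order.
    """
    query_stems = stems(query)
    scored = [(len(query_stems & stems(p)), i) for i, p in enumerate(paragraphs)]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


paragraphs = [
    "Nearest neighbour searching ranks documents.",
    "Hypertext systems link paragraphs.",
    "Searching full text documents with ranked neighbours.",
]
ranking = rank_paragraphs("nearest neighbour searching of full text documents", paragraphs)
print(ranking)
```

Counting shared stems, rather than applying a weighted score, keeps the sketch close to the similarity measure named in the abstract.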

Similar documents (author)

  1. Al-Hawamdeh, S.; Smith, G.; Willett, P.: Paragraph-based access to full-text documents using a hypertext system (1991) 5.23
    5.230314 = sum of:
      5.230314 = sum of:
        0.8389172 = weight(author_txt:smith in 7504) [ClassicSimilarity], result of:
          0.8389172 = score(doc=7504,freq=1.0), product of:
            0.34656295 = queryWeight, product of:
              6.45514 = idf(docFreq=188, maxDocs=44218)
              0.053687904 = queryNorm
            2.4206777 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.45514 = idf(docFreq=188, maxDocs=44218)
              0.375 = fieldNorm(doc=7504)
        1.6169183 = weight(author_txt:willett in 7504) [ClassicSimilarity], result of:
          1.6169183 = score(doc=7504,freq=1.0), product of:
            0.5367369 = queryWeight, product of:
              1.244485 = boost
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.053687904 = queryNorm
            3.012497 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.033325 = idf(docFreq=38, maxDocs=44218)
              0.375 = fieldNorm(doc=7504)
        2.7744782 = weight(author_txt:hawamdeh in 7504) [ClassicSimilarity], result of:
          2.7744782 = score(doc=7504,freq=1.0), product of:
            0.7692904 = queryWeight, product of:
              1.4898896 = boost
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.053687904 = queryNorm
            3.606542 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.375 = fieldNorm(doc=7504)
    
  2. Hawamdeh, S.: Knowledge management : cultivating knowledge professionals (2003) 1.54
    1.541377 = sum of:
      1.541377 = product of:
        4.6241307 = sum of:
          4.6241307 = weight(author_txt:hawamdeh in 1465) [ClassicSimilarity], result of:
            4.6241307 = score(doc=1465,freq=1.0), product of:
              0.7692904 = queryWeight, product of:
                1.4898896 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.053687904 = queryNorm
              6.010904 = fieldWeight in 1465, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.625 = fieldNorm(doc=1465)
        0.33333334 = coord(1/3)
    
  3. Al-Hawamdeh, S.: Knowledge Management in Asia : introduction to the special topic section (2005) 1.23
    1.2331015 = sum of:
      1.2331015 = product of:
        3.6993043 = sum of:
          3.6993043 = weight(author_txt:hawamdeh in 4231) [ClassicSimilarity], result of:
            3.6993043 = score(doc=4231,freq=1.0), product of:
              0.7692904 = queryWeight, product of:
                1.4898896 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.053687904 = queryNorm
              4.808723 = fieldWeight in 4231, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.5 = fieldNorm(doc=4231)
        0.33333334 = coord(1/3)
    
  4. Al-Hawamdeh, S.: Designing an interdisciplinary graduate program in knowledge management (2005) 1.23
    1.2331015 = sum of:
      1.2331015 = product of:
        3.6993043 = sum of:
          3.6993043 = weight(author_txt:hawamdeh in 4236) [ClassicSimilarity], result of:
            3.6993043 = score(doc=4236,freq=1.0), product of:
              0.7692904 = queryWeight, product of:
                1.4898896 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.053687904 = queryNorm
              4.808723 = fieldWeight in 4236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.5 = fieldNorm(doc=4236)
        0.33333334 = coord(1/3)
    
  5. Teng, S.; Hawamdeh, S.: Knowledge management in public libraries (2002) 1.23
    1.2331015 = sum of:
      1.2331015 = product of:
        3.6993043 = sum of:
          3.6993043 = weight(author_txt:hawamdeh in 682) [ClassicSimilarity], result of:
            3.6993043 = score(doc=682,freq=1.0), product of:
              0.7692904 = queryWeight, product of:
                1.4898896 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.053687904 = queryNorm
              4.808723 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.5 = fieldNorm(doc=682)
        0.33333334 = coord(1/3)
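
The per-term numbers in the explain trees above can be recomputed by hand. The sketch below assumes the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed idf values; the function names are illustrative, and fieldNorm and queryNorm are taken from the listing as given rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency, in the form that matches the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """score = queryWeight * fieldWeight, as laid out in the explain tree."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm       # boost * idf * queryNorm
    field_weight = tf(freq) * term_idf * field_norm    # tf * idf * fieldNorm
    return query_weight * field_weight


# Entry 2 (doc=1465): the single matching term author_txt:hawamdeh.
raw = term_score(
    freq=1.0,
    doc_freq=7,
    max_docs=44218,
    field_norm=0.625,
    query_norm=0.053687904,
    boost=1.4898896,
)
# One of three query terms matched, so the sum is scaled by coord(1/3).
final = raw * (1 / 3)
print(raw, final)
```

Under these assumptions the result should agree with the listed 4.6241307 and 1.541377 to roughly single-precision accuracy; small differences in the last digits are expected because the listing was computed with 32-bit floats.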
    

Similar documents (content)

  1. Mohan, K.C.: Boolean and nearest neighbour text searching in a multi-strategy retrieval system (1996) 0.16
    0.16227207 = sum of:
      0.16227207 = product of:
        1.0142004 = sum of:
          0.051101744 = weight(abstract_txt:searching in 7255) [ClassicSimilarity], result of:
            0.051101744 = score(doc=7255,freq=1.0), product of:
              0.10904217 = queryWeight, product of:
                1.5896996 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016008707 = queryNorm
              0.46864203 = fieldWeight in 7255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.109375 = fieldNorm(doc=7255)
          0.10468163 = weight(abstract_txt:query in 7255) [ClassicSimilarity], result of:
            0.10468163 = score(doc=7255,freq=1.0), product of:
              0.2013329 = queryWeight, product of:
                2.6455796 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016008707 = queryNorm
              0.519943 = fieldWeight in 7255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.109375 = fieldNorm(doc=7255)
          0.35057896 = weight(abstract_txt:nearest in 7255) [ClassicSimilarity], result of:
            0.35057896 = score(doc=7255,freq=1.0), product of:
              0.39369622 = queryWeight, product of:
                3.0206363 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.016008707 = queryNorm
              0.8904809 = fieldWeight in 7255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.109375 = fieldNorm(doc=7255)
          0.50783813 = weight(abstract_txt:neighbour in 7255) [ClassicSimilarity], result of:
            0.50783813 = score(doc=7255,freq=1.0), product of:
              0.5040275 = queryWeight, product of:
                3.4177866 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016008707 = queryNorm
              1.0075604 = fieldWeight in 7255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.109375 = fieldNorm(doc=7255)
        0.16 = coord(4/25)
    
  2. Pirkola, A.; Jarvelin, K.: The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.15
    0.14620219 = sum of:
      0.14620219 = product of:
        0.6091758 = sum of:
          0.09133436 = weight(abstract_txt:keyword in 4088) [ClassicSimilarity], result of:
            0.09133436 = score(doc=4088,freq=5.0), product of:
              0.108247735 = queryWeight, product of:
                1.1199851 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.016008707 = queryNorm
              0.84375304 = fieldWeight in 4088, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.11474625 = weight(abstract_txt:paragraphs in 4088) [ClassicSimilarity], result of:
            0.11474625 = score(doc=4088,freq=1.0), product of:
              0.21551543 = queryWeight, product of:
                1.580309 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.016008707 = queryNorm
              0.5324271 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.029200995 = weight(abstract_txt:searching in 4088) [ClassicSimilarity], result of:
            0.029200995 = score(doc=4088,freq=1.0), product of:
              0.10904217 = queryWeight, product of:
                1.5896996 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016008707 = queryNorm
              0.26779544 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.19506784 = weight(abstract_txt:paragraph in 4088) [ClassicSimilarity], result of:
            0.19506784 = score(doc=4088,freq=2.0), product of:
              0.24365005 = queryWeight, product of:
                1.6802971 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.016008707 = queryNorm
              0.8006066 = fieldWeight in 4088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.063777946 = weight(abstract_txt:text in 4088) [ClassicSimilarity], result of:
            0.063777946 = score(doc=4088,freq=3.0), product of:
              0.14569111 = queryWeight, product of:
                2.250505 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016008707 = queryNorm
              0.4377614 = fieldWeight in 4088, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.11504835 = weight(abstract_txt:full in 4088) [ClassicSimilarity], result of:
            0.11504835 = score(doc=4088,freq=3.0), product of:
              0.21589354 = queryWeight, product of:
                2.7395756 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.016008707 = queryNorm
              0.5328939 = fieldWeight in 4088, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
        0.24 = coord(6/25)
    
  3. Savoy, J.: An extended vector-processing scheme for searching information in hypertext systems (1996) 0.13
    0.13053839 = sum of:
      0.13053839 = product of:
        0.65269196 = sum of:
          0.029200995 = weight(abstract_txt:searching in 4036) [ClassicSimilarity], result of:
            0.029200995 = score(doc=4036,freq=1.0), product of:
              0.10904217 = queryWeight, product of:
                1.5896996 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016008707 = queryNorm
              0.26779544 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.07314879 = weight(abstract_txt:similarity in 4036) [ClassicSimilarity], result of:
            0.07314879 = score(doc=4036,freq=1.0), product of:
              0.20112565 = queryWeight, product of:
                2.1589947 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.016008707 = queryNorm
              0.36369696 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.059818074 = weight(abstract_txt:query in 4036) [ClassicSimilarity], result of:
            0.059818074 = score(doc=4036,freq=1.0), product of:
              0.2013329 = queryWeight, product of:
                2.6455796 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.016008707 = queryNorm
              0.2971103 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.20033084 = weight(abstract_txt:nearest in 4036) [ClassicSimilarity], result of:
            0.20033084 = score(doc=4036,freq=1.0), product of:
              0.39369622 = queryWeight, product of:
                3.0206363 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.016008707 = queryNorm
              0.5088462 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
          0.29019323 = weight(abstract_txt:neighbour in 4036) [ClassicSimilarity], result of:
            0.29019323 = score(doc=4036,freq=1.0), product of:
              0.5040275 = queryWeight, product of:
                3.4177866 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016008707 = queryNorm
              0.5757488 = fieldWeight in 4036, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=4036)
        0.2 = coord(5/25)
    
  4. Loughran, H.: A review of nearest neighbour information retrieval (1994) 0.12
    0.124734014 = sum of:
      0.124734014 = product of:
        1.0394502 = sum of:
          0.05840199 = weight(abstract_txt:searching in 616) [ClassicSimilarity], result of:
            0.05840199 = score(doc=616,freq=1.0), product of:
              0.10904217 = queryWeight, product of:
                1.5896996 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.016008707 = queryNorm
              0.5355909 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.125 = fieldNorm(doc=616)
          0.40066168 = weight(abstract_txt:nearest in 616) [ClassicSimilarity], result of:
            0.40066168 = score(doc=616,freq=1.0), product of:
              0.39369622 = queryWeight, product of:
                3.0206363 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.016008707 = queryNorm
              1.0176924 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.125 = fieldNorm(doc=616)
          0.58038646 = weight(abstract_txt:neighbour in 616) [ClassicSimilarity], result of:
            0.58038646 = score(doc=616,freq=1.0), product of:
              0.5040275 = queryWeight, product of:
                3.4177866 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016008707 = queryNorm
              1.1514976 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.125 = fieldNorm(doc=616)
        0.12 = coord(3/25)
    
  5. Cribbin, T.: Discovering latent topical structure by second-order similarity analysis (2011) 0.12
    0.11528207 = sum of:
      0.11528207 = product of:
        0.720513 = sum of:
          0.16356567 = weight(abstract_txt:similarity in 4470) [ClassicSimilarity], result of:
            0.16356567 = score(doc=4470,freq=5.0), product of:
              0.20112565 = queryWeight, product of:
                2.1589947 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.016008707 = queryNorm
              0.81325114 = fieldWeight in 4470, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=4470)
          0.0664232 = weight(abstract_txt:full in 4470) [ClassicSimilarity], result of:
            0.0664232 = score(doc=4470,freq=1.0), product of:
              0.21589354 = queryWeight, product of:
                2.7395756 = boost
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.016008707 = queryNorm
              0.30766645 = fieldWeight in 4470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922663 = idf(docFreq=874, maxDocs=44218)
                0.0625 = fieldNorm(doc=4470)
          0.20033084 = weight(abstract_txt:nearest in 4470) [ClassicSimilarity], result of:
            0.20033084 = score(doc=4470,freq=1.0), product of:
              0.39369622 = queryWeight, product of:
                3.0206363 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.016008707 = queryNorm
              0.5088462 = fieldWeight in 4470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=4470)
          0.29019323 = weight(abstract_txt:neighbour in 4470) [ClassicSimilarity], result of:
            0.29019323 = score(doc=4470,freq=1.0), product of:
              0.5040275 = queryWeight, product of:
                3.4177866 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016008707 = queryNorm
              0.5757488 = fieldWeight in 4470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=4470)
        0.16 = coord(4/25)
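
The content-based scores above combine the matching terms in two steps: the per-term scores are summed, and the sum is multiplied by coord, the fraction of the 25 query terms that matched. The sketch below copies the per-term scores from entries 1 and 4; the helper name `combined_score` is an assumption for illustration.

```python
def combined_score(term_scores, query_terms):
    """Sum the per-term scores and scale by coord = matched / total query terms."""
    coord = len(term_scores) / query_terms
    return sum(term_scores) * coord


QUERY_TERMS = 25  # denominator of coord(n/25) in the listing

# Per-term scores copied from the explain trees above.
mohan = [0.051101744, 0.10468163, 0.35057896, 0.50783813]   # entry 1, doc=7255
loughran = [0.05840199, 0.40066168, 0.58038646]             # entry 4, doc=616

mohan_total = combined_score(mohan, QUERY_TERMS)
loughran_total = combined_score(loughran, QUERY_TERMS)
print(mohan_total, loughran_total)
```

Entry 4 has the larger raw sum (about 1.039 against 1.014) but matches only three query terms, so coord(3/25) places it below entry 1, which matches four. This is how the coord factor rewards documents that cover more of the query.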