Document (#2301)

Author
Al-Hawamdeh, S.
Smith, G.
Willett, P.
Vere, R. de
Title
Using nearest-neighbour searching techniques to access full-text documents
Source
Online review. 15(1991) nos.3/4, S.173-190
Year
1991
Abstract
Summarises the results to date of a continuing programme of research at Sheffield Univ. to investigate the use of nearest-neighbour retrieval algorithms for full text searching. Given a natural language query statement, the research methods result in a ranking of the paragraphs comprising a full text document in order of decreasing similarity with the query, where the similarity for each paragraph is determined by the number of keyword stems that it has in common with the query
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Al-Hawamdeh, S.; Smith, G.; Willett, P.: Paragraph-based access to full-text documents using a hypertext system (1991) 5.21
    5.211973 = sum of:
      5.211973 = sum of:
        0.8446561 = weight(author_txt:smith in 7504) [ClassicSimilarity], result of:
          0.8446561 = score(doc=7504,freq=1.0), product of:
            0.34912723 = queryWeight, product of:
              6.451563 = idf(docFreq=179, maxDocs=41962)
              0.05411514 = queryNorm
            2.419336 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.451563 = idf(docFreq=179, maxDocs=41962)
              0.375 = fieldNorm(doc=7504)
        1.6146698 = weight(author_txt:willett in 7504) [ClassicSimilarity], result of:
          1.6146698 = score(doc=7504,freq=1.0), product of:
            0.5377572 = queryWeight, product of:
              1.2410842 = boost
              8.006933 = idf(docFreq=37, maxDocs=41962)
              0.05411514 = queryNorm
            3.0026 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.006933 = idf(docFreq=37, maxDocs=41962)
              0.375 = fieldNorm(doc=7504)
        2.7526476 = weight(author_txt:hawamdeh in 7504) [ClassicSimilarity], result of:
          2.7526476 = score(doc=7504,freq=1.0), product of:
            0.767416 = queryWeight, product of:
              1.4825985 = boost
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.05411514 = queryNorm
            3.586904 = fieldWeight in 7504, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.565078 = idf(docFreq=7, maxDocs=41962)
              0.375 = fieldNorm(doc=7504)
    
  2. Hawamdeh, S.: Knowledge management : cultivating knowledge professionals (2003) 1.53
    1.5292487 = sum of:
      1.5292487 = product of:
        4.587746 = sum of:
          4.587746 = weight(author_txt:hawamdeh in 3466) [ClassicSimilarity], result of:
            4.587746 = score(doc=3466,freq=1.0), product of:
              0.767416 = queryWeight, product of:
                1.4825985 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.05411514 = queryNorm
              5.9781737 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.625 = fieldNorm(doc=3466)
        0.33333334 = coord(1/3)
    
  3. AI-Hawamdeh, S.: Knowledge Management in Asia : introduction to the special topic section (2005) 1.22
    1.2233989 = sum of:
      1.2233989 = product of:
        3.6701968 = sum of:
          3.6701968 = weight(author_txt:hawamdeh in 232) [ClassicSimilarity], result of:
            3.6701968 = score(doc=232,freq=1.0), product of:
              0.767416 = queryWeight, product of:
                1.4825985 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.05411514 = queryNorm
              4.782539 = fieldWeight in 232, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.5 = fieldNorm(doc=232)
        0.33333334 = coord(1/3)
    
  4. AI-Hawamdeh, S.: Designing an interdisciplinary graduate program in knowledge management (2005) 1.22
    1.2233989 = sum of:
      1.2233989 = product of:
        3.6701968 = sum of:
          3.6701968 = weight(author_txt:hawamdeh in 237) [ClassicSimilarity], result of:
            3.6701968 = score(doc=237,freq=1.0), product of:
              0.767416 = queryWeight, product of:
                1.4825985 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.05411514 = queryNorm
              4.782539 = fieldWeight in 237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.5 = fieldNorm(doc=237)
        0.33333334 = coord(1/3)
    
  5. Teng, S.; Hawamdeh, S.: Knowledge management in public libraries (2002) 1.22
    1.2233989 = sum of:
      1.2233989 = product of:
        3.6701968 = sum of:
          3.6701968 = weight(author_txt:hawamdeh in 1808) [ClassicSimilarity], result of:
            3.6701968 = score(doc=1808,freq=1.0), product of:
              0.767416 = queryWeight, product of:
                1.4825985 = boost
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.05411514 = queryNorm
              4.782539 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.565078 = idf(docFreq=7, maxDocs=41962)
                0.5 = fieldNorm(doc=1808)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Mohan, K.C.: Boolean and nearest neighbour text searching in a multi-strategy retrieval system (1996) 0.16
    0.16122676 = sum of:
      0.16122676 = product of:
        1.0076673 = sum of:
          0.05023421 = weight(abstract_txt:searching in 325) [ClassicSimilarity], result of:
            0.05023421 = score(doc=325,freq=1.0), product of:
              0.107768334 = queryWeight, product of:
                1.5520533 = boost
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.016292743 = queryNorm
              0.46613145 = fieldWeight in 325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.109375 = fieldNorm(doc=325)
          0.10365527 = weight(abstract_txt:query in 325) [ClassicSimilarity], result of:
            0.10365527 = score(doc=325,freq=1.0), product of:
              0.19994758 = queryWeight, product of:
                2.589195 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.016292743 = queryNorm
              0.51841223 = fieldWeight in 325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.109375 = fieldNorm(doc=325)
          0.35505426 = weight(abstract_txt:nearest in 325) [ClassicSimilarity], result of:
            0.35505426 = score(doc=325,freq=1.0), product of:
              0.39690626 = queryWeight, product of:
                2.978551 = boost
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.016292743 = queryNorm
              0.89455444 = fieldWeight in 325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.109375 = fieldNorm(doc=325)
          0.49872357 = weight(abstract_txt:neighbour in 325) [ClassicSimilarity], result of:
            0.49872357 = score(doc=325,freq=1.0), product of:
              0.4978113 = queryWeight, product of:
                3.3357494 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.016292743 = queryNorm
              1.0018326 = fieldWeight in 325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.109375 = fieldNorm(doc=325)
        0.16 = coord(4/25)
    
  2. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.15
    0.14796996 = sum of:
      0.14796996 = product of:
        0.6165415 = sum of:
          0.091637045 = weight(abstract_txt:keyword in 4157) [ClassicSimilarity], result of:
            0.091637045 = score(doc=4157,freq=5.0), product of:
              0.10845033 = queryWeight, product of:
                1.1009345 = boost
                6.0460978 = idf(docFreq=269, maxDocs=41962)
                0.016292743 = queryNorm
              0.84496784 = fieldWeight in 4157, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0460978 = idf(docFreq=269, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
          0.028705262 = weight(abstract_txt:searching in 4157) [ClassicSimilarity], result of:
            0.028705262 = score(doc=4157,freq=1.0), product of:
              0.107768334 = queryWeight, product of:
                1.5520533 = boost
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.016292743 = queryNorm
              0.26636082 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
          0.11603491 = weight(abstract_txt:paragraphs in 4157) [ClassicSimilarity], result of:
            0.11603491 = score(doc=4157,freq=1.0), product of:
              0.21705307 = queryWeight, product of:
                1.5575035 = boost
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.016292743 = queryNorm
              0.53459233 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.553477 = idf(docFreq=21, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
          0.20151477 = weight(abstract_txt:paragraph in 4157) [ClassicSimilarity], result of:
            0.20151477 = score(doc=4157,freq=2.0), product of:
              0.24890564 = queryWeight, product of:
                1.6678747 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.016292743 = queryNorm
              0.80960304 = fieldWeight in 4157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
          0.064274184 = weight(abstract_txt:text in 4157) [ClassicSimilarity], result of:
            0.064274184 = score(doc=4157,freq=3.0), product of:
              0.14639668 = queryWeight, product of:
                2.2155027 = boost
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.016292743 = queryNorm
              0.4390413 = fieldWeight in 4157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.05569 = idf(docFreq=1975, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
          0.11437532 = weight(abstract_txt:full in 4157) [ClassicSimilarity], result of:
            0.11437532 = score(doc=4157,freq=3.0), product of:
              0.21497852 = queryWeight, product of:
                2.6847522 = boost
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.016292743 = queryNorm
              0.5320314 = fieldWeight in 4157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.0625 = fieldNorm(doc=4157)
        0.24 = coord(6/25)
    
  3. Savoy, J.: ¬An extended vector-processing scheme for searching information in hypertext systems (1996) 0.13
    0.12990718 = sum of:
      0.12990718 = product of:
        0.6495359 = sum of:
          0.028705262 = weight(abstract_txt:searching in 4105) [ClassicSimilarity], result of:
            0.028705262 = score(doc=4105,freq=1.0), product of:
              0.107768334 = queryWeight, product of:
                1.5520533 = boost
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.016292743 = queryNorm
              0.26636082 = fieldWeight in 4105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.0625 = fieldNorm(doc=4105)
          0.07372599 = weight(abstract_txt:similarity in 4105) [ClassicSimilarity], result of:
            0.07372599 = score(doc=4105,freq=1.0), product of:
              0.20211439 = queryWeight, product of:
                2.1254928 = boost
                5.836377 = idf(docFreq=332, maxDocs=41962)
                0.016292743 = queryNorm
              0.36477357 = fieldWeight in 4105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.836377 = idf(docFreq=332, maxDocs=41962)
                0.0625 = fieldNorm(doc=4105)
          0.059231583 = weight(abstract_txt:query in 4105) [ClassicSimilarity], result of:
            0.059231583 = score(doc=4105,freq=1.0), product of:
              0.19994758 = queryWeight, product of:
                2.589195 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.016292743 = queryNorm
              0.29623556 = fieldWeight in 4105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.0625 = fieldNorm(doc=4105)
          0.20288815 = weight(abstract_txt:nearest in 4105) [ClassicSimilarity], result of:
            0.20288815 = score(doc=4105,freq=1.0), product of:
              0.39690626 = queryWeight, product of:
                2.978551 = boost
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.016292743 = queryNorm
              0.51117396 = fieldWeight in 4105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.0625 = fieldNorm(doc=4105)
          0.28498492 = weight(abstract_txt:neighbour in 4105) [ClassicSimilarity], result of:
            0.28498492 = score(doc=4105,freq=1.0), product of:
              0.4978113 = queryWeight, product of:
                3.3357494 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.016292743 = queryNorm
              0.5724758 = fieldWeight in 4105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.0625 = fieldNorm(doc=4105)
        0.2 = coord(5/25)
    
  4. Loughran, H.: ¬A review of nearest neighbour information retrieval (1994) 0.12
    0.12397879 = sum of:
      0.12397879 = product of:
        1.0331566 = sum of:
          0.057410523 = weight(abstract_txt:searching in 685) [ClassicSimilarity], result of:
            0.057410523 = score(doc=685,freq=1.0), product of:
              0.107768334 = queryWeight, product of:
                1.5520533 = boost
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.016292743 = queryNorm
              0.53272164 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.261773 = idf(docFreq=1607, maxDocs=41962)
                0.125 = fieldNorm(doc=685)
          0.4057763 = weight(abstract_txt:nearest in 685) [ClassicSimilarity], result of:
            0.4057763 = score(doc=685,freq=1.0), product of:
              0.39690626 = queryWeight, product of:
                2.978551 = boost
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.016292743 = queryNorm
              1.0223479 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.125 = fieldNorm(doc=685)
          0.56996983 = weight(abstract_txt:neighbour in 685) [ClassicSimilarity], result of:
            0.56996983 = score(doc=685,freq=1.0), product of:
              0.4978113 = queryWeight, product of:
                3.3357494 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.016292743 = queryNorm
              1.1449516 = fieldWeight in 685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.125 = fieldNorm(doc=685)
        0.12 = coord(3/25)
    
  5. Cribbin, T.: Discovering latent topical structure by second-order similarity analysis (2011) 0.12
    0.11500223 = sum of:
      0.11500223 = product of:
        0.71876395 = sum of:
          0.16485631 = weight(abstract_txt:similarity in 1471) [ClassicSimilarity], result of:
            0.16485631 = score(doc=1471,freq=5.0), product of:
              0.20211439 = queryWeight, product of:
                2.1254928 = boost
                5.836377 = idf(docFreq=332, maxDocs=41962)
                0.016292743 = queryNorm
              0.8156585 = fieldWeight in 1471, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.836377 = idf(docFreq=332, maxDocs=41962)
                0.0625 = fieldNorm(doc=1471)
          0.06603462 = weight(abstract_txt:full in 1471) [ClassicSimilarity], result of:
            0.06603462 = score(doc=1471,freq=1.0), product of:
              0.21497852 = queryWeight, product of:
                2.6847522 = boost
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.016292743 = queryNorm
              0.30716848 = fieldWeight in 1471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9146957 = idf(docFreq=836, maxDocs=41962)
                0.0625 = fieldNorm(doc=1471)
          0.20288815 = weight(abstract_txt:nearest in 1471) [ClassicSimilarity], result of:
            0.20288815 = score(doc=1471,freq=1.0), product of:
              0.39690626 = queryWeight, product of:
                2.978551 = boost
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.016292743 = queryNorm
              0.51117396 = fieldWeight in 1471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.178783 = idf(docFreq=31, maxDocs=41962)
                0.0625 = fieldNorm(doc=1471)
          0.28498492 = weight(abstract_txt:neighbour in 1471) [ClassicSimilarity], result of:
            0.28498492 = score(doc=1471,freq=1.0), product of:
              0.4978113 = queryWeight, product of:
                3.3357494 = boost
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.016292743 = queryNorm
              0.5724758 = fieldWeight in 1471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.159613 = idf(docFreq=11, maxDocs=41962)
                0.0625 = fieldNorm(doc=1471)
        0.16 = coord(4/25)