Document (#4032)

Author
Shuman, B.A.
Title
One false drop deserves another : file selection as a means of increasing precision in online searches
Source
13th National Online Meeting. Ed.: M.E. Williams
Imprint
Medford, NJ : Learned Information Inc.
Year
1992
Pages
S.345-350
Abstract
The online fals drop or false coordination, occurs in free text (natural language) searching when a lack of predetermined term relationships or the presence of any of various types of linguistic idiosyncrasies leads to unexpected and unintended consequences. They are due to at least 3 factors: file selection; language structure; and failure to anticipate on the part of the searcher during preparation. Uses actual free text onlines searches as a vehicle to discuss common causes of false drops in online searching and examples are given. Concludes that file selection will overcome many of the logic problems inherent in retrieval of falsely coordinated terms. Remedies are suggested in each case, to be implemented prior to initiating the search rather than as costly midcourse corrections

Similar documents (content)

  1. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.15
    0.1465575 = sum of:
      0.1465575 = product of:
        0.61065626 = sum of:
          0.04006704 = weight(abstract_txt:text in 1025) [ClassicSimilarity], result of:
            0.04006704 = score(doc=1025,freq=4.0), product of:
              0.07915646 = queryWeight, product of:
                1.087424 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.017976146 = queryNorm
              0.5061752 = fieldWeight in 1025, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
          0.033394646 = weight(abstract_txt:searching in 1025) [ClassicSimilarity], result of:
            0.033394646 = score(doc=1025,freq=2.0), product of:
              0.08832618 = queryWeight, product of:
                1.1486837 = boost
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.017976146 = queryNorm
              0.37808323 = fieldWeight in 1025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
          0.15775044 = weight(abstract_txt:drops in 1025) [ClassicSimilarity], result of:
            0.15775044 = score(doc=1025,freq=2.0), product of:
              0.19736733 = queryWeight, product of:
                1.2141668 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.017976146 = queryNorm
              0.7992733 = fieldWeight in 1025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
          0.042883474 = weight(abstract_txt:searches in 1025) [ClassicSimilarity], result of:
            0.042883474 = score(doc=1025,freq=1.0), product of:
              0.1314745 = queryWeight, product of:
                1.4014463 = boost
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.017976146 = queryNorm
              0.32617334 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
          0.053233486 = weight(abstract_txt:free in 1025) [ClassicSimilarity], result of:
            0.053233486 = score(doc=1025,freq=1.0), product of:
              0.15185817 = queryWeight, product of:
                1.5061728 = boost
                5.6087584 = idf(docFreq=433, maxDocs=43556)
                0.017976146 = queryNorm
              0.3505474 = fieldWeight in 1025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6087584 = idf(docFreq=433, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
          0.28332722 = weight(abstract_txt:false in 1025) [ClassicSimilarity], result of:
            0.28332722 = score(doc=1025,freq=2.0), product of:
              0.42059183 = queryWeight, product of:
                3.0699532 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.017976146 = queryNorm
              0.6736394 = fieldWeight in 1025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.0625 = fieldNorm(doc=1025)
        0.24 = coord(6/25)
    
  2. Horn, M.E.: "Garbage" in, "refuse and refuse disposal" out : making the most of the subject authority file in the OPAC (2002) 0.12
    0.12497263 = sum of:
      0.12497263 = product of:
        0.44633082 = sum of:
          0.030050278 = weight(abstract_txt:text in 1279) [ClassicSimilarity], result of:
            0.030050278 = score(doc=1279,freq=1.0), product of:
              0.07915646 = queryWeight, product of:
                1.087424 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.017976146 = queryNorm
              0.3796314 = fieldWeight in 1279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.033294074 = weight(abstract_txt:language in 1279) [ClassicSimilarity], result of:
            0.033294074 = score(doc=1279,freq=1.0), product of:
              0.084754996 = queryWeight, product of:
                1.1252224 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.017976146 = queryNorm
              0.39282727 = fieldWeight in 1279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.061349884 = weight(abstract_txt:searching in 1279) [ClassicSimilarity], result of:
            0.061349884 = score(doc=1279,freq=3.0), product of:
              0.08832618 = queryWeight, product of:
                1.1486837 = boost
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.017976146 = queryNorm
              0.69458324 = fieldWeight in 1279, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2775235 = idf(docFreq=1642, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.090969585 = weight(abstract_txt:searches in 1279) [ClassicSimilarity], result of:
            0.090969585 = score(doc=1279,freq=2.0), product of:
              0.1314745 = queryWeight, product of:
                1.4014463 = boost
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.017976146 = queryNorm
              0.69191813 = fieldWeight in 1279, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2187734 = idf(docFreq=640, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.0334862 = weight(abstract_txt:online in 1279) [ClassicSimilarity], result of:
            0.0334862 = score(doc=1279,freq=1.0), product of:
              0.09739313 = queryWeight, product of:
                1.4772892 = boost
                3.667467 = idf(docFreq=3023, maxDocs=43556)
                0.017976146 = queryNorm
              0.34382504 = fieldWeight in 1279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.667467 = idf(docFreq=3023, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.07985023 = weight(abstract_txt:free in 1279) [ClassicSimilarity], result of:
            0.07985023 = score(doc=1279,freq=1.0), product of:
              0.15185817 = queryWeight, product of:
                1.5061728 = boost
                5.6087584 = idf(docFreq=433, maxDocs=43556)
                0.017976146 = queryNorm
              0.5258211 = fieldWeight in 1279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6087584 = idf(docFreq=433, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
          0.11733057 = weight(abstract_txt:file in 1279) [ClassicSimilarity], result of:
            0.11733057 = score(doc=1279,freq=1.0), product of:
              0.22467698 = queryWeight, product of:
                2.243782 = boost
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.017976146 = queryNorm
              0.52221894 = fieldWeight in 1279, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.09375 = fieldNorm(doc=1279)
        0.28 = coord(7/25)
    
  3. Gillaspie, L.: ¬The role of linguistic phenomena in retrieval performance (1995) 0.11
    0.113317944 = sum of:
      0.113317944 = product of:
        0.7082372 = sum of:
          0.04006704 = weight(abstract_txt:text in 3927) [ClassicSimilarity], result of:
            0.04006704 = score(doc=3927,freq=1.0), product of:
              0.07915646 = queryWeight, product of:
                1.087424 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.017976146 = queryNorm
              0.5061752 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.125 = fieldNorm(doc=3927)
          0.044392098 = weight(abstract_txt:language in 3927) [ClassicSimilarity], result of:
            0.044392098 = score(doc=3927,freq=1.0), product of:
              0.084754996 = queryWeight, product of:
                1.1252224 = boost
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.017976146 = queryNorm
              0.5237697 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1901574 = idf(docFreq=1792, maxDocs=43556)
                0.125 = fieldNorm(doc=3927)
          0.22309281 = weight(abstract_txt:drops in 3927) [ClassicSimilarity], result of:
            0.22309281 = score(doc=3927,freq=1.0), product of:
              0.19736733 = queryWeight, product of:
                1.2141668 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.017976146 = queryNorm
              1.1303432 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.125 = fieldNorm(doc=3927)
          0.40068522 = weight(abstract_txt:false in 3927) [ClassicSimilarity], result of:
            0.40068522 = score(doc=3927,freq=1.0), product of:
              0.42059183 = queryWeight, product of:
                3.0699532 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.017976146 = queryNorm
              0.95267 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.125 = fieldNorm(doc=3927)
        0.16 = coord(4/25)
    
  4. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.11
    0.10755332 = sum of:
      0.10755332 = product of:
        0.89627767 = sum of:
          0.23662566 = weight(abstract_txt:drops in 3415) [ClassicSimilarity], result of:
            0.23662566 = score(doc=3415,freq=2.0), product of:
              0.19736733 = queryWeight, product of:
                1.2141668 = boost
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.017976146 = queryNorm
              1.19891 = fieldWeight in 3415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.042746 = idf(docFreq=13, maxDocs=43556)
                0.09375 = fieldNorm(doc=3415)
          0.23466115 = weight(abstract_txt:file in 3415) [ClassicSimilarity], result of:
            0.23466115 = score(doc=3415,freq=4.0), product of:
              0.22467698 = queryWeight, product of:
                2.243782 = boost
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.017976146 = queryNorm
              1.0444379 = fieldWeight in 3415, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.09375 = fieldNorm(doc=3415)
          0.4249909 = weight(abstract_txt:false in 3415) [ClassicSimilarity], result of:
            0.4249909 = score(doc=3415,freq=2.0), product of:
              0.42059183 = queryWeight, product of:
                3.0699532 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.017976146 = queryNorm
              1.0104592 = fieldWeight in 3415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.09375 = fieldNorm(doc=3415)
        0.12 = coord(3/25)
    
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.10
    0.09738195 = sum of:
      0.09738195 = product of:
        0.6086372 = sum of:
          0.02003352 = weight(abstract_txt:text in 1301) [ClassicSimilarity], result of:
            0.02003352 = score(doc=1301,freq=1.0), product of:
              0.07915646 = queryWeight, product of:
                1.087424 = boost
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.017976146 = queryNorm
              0.2530876 = fieldWeight in 1301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0494018 = idf(docFreq=2063, maxDocs=43556)
                0.0625 = fieldNorm(doc=1301)
          0.17490609 = weight(abstract_txt:file in 1301) [ClassicSimilarity], result of:
            0.17490609 = score(doc=1301,freq=5.0), product of:
              0.22467698 = queryWeight, product of:
                2.243782 = boost
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.017976146 = queryNorm
              0.778478 = fieldWeight in 1301, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5703354 = idf(docFreq=450, maxDocs=43556)
                0.0625 = fieldNorm(doc=1301)
          0.21335499 = weight(abstract_txt:drop in 1301) [ClassicSimilarity], result of:
            0.21335499 = score(doc=1301,freq=1.0), product of:
              0.38316286 = queryWeight, product of:
                2.3924751 = boost
                8.909214 = idf(docFreq=15, maxDocs=43556)
                0.017976146 = queryNorm
              0.5568259 = fieldWeight in 1301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.909214 = idf(docFreq=15, maxDocs=43556)
                0.0625 = fieldNorm(doc=1301)
          0.20034261 = weight(abstract_txt:false in 1301) [ClassicSimilarity], result of:
            0.20034261 = score(doc=1301,freq=1.0), product of:
              0.42059183 = queryWeight, product of:
                3.0699532 = boost
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.017976146 = queryNorm
              0.476335 = fieldWeight in 1301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62136 = idf(docFreq=57, maxDocs=43556)
                0.0625 = fieldNorm(doc=1301)
        0.16 = coord(4/25)