Document (#4032)

Author
Shuman, B.A.
Title
One false drop deserves another : file selection as a means of increasing precision in online searches
Source
13th National Online Meeting. Ed.: M.E. Williams
Imprint
Medford, NJ : Learned Information Inc.
Year
1992
Pages
S.345-350
Abstract
The online fals drop or false coordination, occurs in free text (natural language) searching when a lack of predetermined term relationships or the presence of any of various types of linguistic idiosyncrasies leads to unexpected and unintended consequences. They are due to at least 3 factors: file selection; language structure; and failure to anticipate on the part of the searcher during preparation. Uses actual free text onlines searches as a vehicle to discuss common causes of false drops in online searching and examples are given. Concludes that file selection will overcome many of the logic problems inherent in retrieval of falsely coordinated terms. Remedies are suggested in each case, to be implemented prior to initiating the search rather than as costly midcourse corrections

Similar documents (content)

  1. Leppanen, E.: Homografiongelma tekstihaussa ja homografien disambiguoinnin vaikutukset (1996) 0.15
    0.14726044 = sum of:
      0.14726044 = product of:
        0.6135852 = sum of:
          0.03992373 = weight(abstract_txt:text in 27) [ClassicSimilarity], result of:
            0.03992373 = score(doc=27,freq=4.0), product of:
              0.0789813 = queryWeight, product of:
                1.0858362 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017987184 = queryNorm
              0.5054833 = fieldWeight in 27, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
          0.033581123 = weight(abstract_txt:searching in 27) [ClassicSimilarity], result of:
            0.033581123 = score(doc=27,freq=2.0), product of:
              0.08867006 = queryWeight, product of:
                1.1505107 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.017987184 = queryNorm
              0.37871996 = fieldWeight in 27, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
          0.15862373 = weight(abstract_txt:drops in 27) [ClassicSimilarity], result of:
            0.15862373 = score(doc=27,freq=2.0), product of:
              0.19812943 = queryWeight, product of:
                1.2160785 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.017987184 = queryNorm
              0.8006066 = fieldWeight in 27, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
          0.043124784 = weight(abstract_txt:searches in 27) [ClassicSimilarity], result of:
            0.043124784 = score(doc=27,freq=1.0), product of:
              0.13199015 = queryWeight, product of:
                1.4036955 = boost
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.017987184 = queryNorm
              0.3267273 = fieldWeight in 27, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
          0.053170625 = weight(abstract_txt:free in 27) [ClassicSimilarity], result of:
            0.053170625 = score(doc=27,freq=1.0), product of:
              0.15176491 = queryWeight, product of:
                1.5051779 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.017987184 = queryNorm
              0.3503486 = fieldWeight in 27, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
          0.2851612 = weight(abstract_txt:false in 27) [ClassicSimilarity], result of:
            0.2851612 = score(doc=27,freq=2.0), product of:
              0.42247817 = queryWeight, product of:
                3.075742 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.017987184 = queryNorm
              0.67497265 = fieldWeight in 27, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.0625 = fieldNorm(doc=27)
        0.24 = coord(6/25)
    
  2. Horn, M.E.: "Garbage" in, "refuse and refuse disposal" out : making the most of the subject authority file in the OPAC (2002) 0.13
    0.12526825 = sum of:
      0.12526825 = product of:
        0.44738662 = sum of:
          0.029942797 = weight(abstract_txt:text in 156) [ClassicSimilarity], result of:
            0.029942797 = score(doc=156,freq=1.0), product of:
              0.0789813 = queryWeight, product of:
                1.0858362 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017987184 = queryNorm
              0.37911248 = fieldWeight in 156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.033119306 = weight(abstract_txt:language in 156) [ClassicSimilarity], result of:
            0.033119306 = score(doc=156,freq=1.0), product of:
              0.08447279 = queryWeight, product of:
                1.1229504 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017987184 = queryNorm
              0.3920707 = fieldWeight in 156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.061692458 = weight(abstract_txt:searching in 156) [ClassicSimilarity], result of:
            0.061692458 = score(doc=156,freq=3.0), product of:
              0.08867006 = queryWeight, product of:
                1.1505107 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.017987184 = queryNorm
              0.695753 = fieldWeight in 156, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.09148149 = weight(abstract_txt:searches in 156) [ClassicSimilarity], result of:
            0.09148149 = score(doc=156,freq=2.0), product of:
              0.13199015 = queryWeight, product of:
                1.4036955 = boost
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.017987184 = queryNorm
              0.6930933 = fieldWeight in 156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.03346761 = weight(abstract_txt:online in 156) [ClassicSimilarity], result of:
            0.03346761 = score(doc=156,freq=1.0), product of:
              0.09737398 = queryWeight, product of:
                1.4766216 = boost
                3.6661522 = idf(docFreq=3073, maxDocs=44218)
                0.017987184 = queryNorm
              0.34370178 = fieldWeight in 156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6661522 = idf(docFreq=3073, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.07975594 = weight(abstract_txt:free in 156) [ClassicSimilarity], result of:
            0.07975594 = score(doc=156,freq=1.0), product of:
              0.15176491 = queryWeight, product of:
                1.5051779 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.017987184 = queryNorm
              0.5255229 = fieldWeight in 156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
          0.117927 = weight(abstract_txt:file in 156) [ClassicSimilarity], result of:
            0.117927 = score(doc=156,freq=1.0), product of:
              0.22547685 = queryWeight, product of:
                2.2469776 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.017987184 = queryNorm
              0.52301157 = fieldWeight in 156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.09375 = fieldNorm(doc=156)
        0.28 = coord(7/25)
    
  3. Gillaspie, L.: ¬The role of linguistic phenomena in retrieval performance (1995) 0.11
    0.113870315 = sum of:
      0.113870315 = product of:
        0.7116895 = sum of:
          0.03992373 = weight(abstract_txt:text in 3861) [ClassicSimilarity], result of:
            0.03992373 = score(doc=3861,freq=1.0), product of:
              0.0789813 = queryWeight, product of:
                1.0858362 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017987184 = queryNorm
              0.5054833 = fieldWeight in 3861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=3861)
          0.044159073 = weight(abstract_txt:language in 3861) [ClassicSimilarity], result of:
            0.044159073 = score(doc=3861,freq=1.0), product of:
              0.08447279 = queryWeight, product of:
                1.1229504 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.017987184 = queryNorm
              0.5227609 = fieldWeight in 3861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.125 = fieldNorm(doc=3861)
          0.22432783 = weight(abstract_txt:drops in 3861) [ClassicSimilarity], result of:
            0.22432783 = score(doc=3861,freq=1.0), product of:
              0.19812943 = queryWeight, product of:
                1.2160785 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.017987184 = queryNorm
              1.1322287 = fieldWeight in 3861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.125 = fieldNorm(doc=3861)
          0.40327886 = weight(abstract_txt:false in 3861) [ClassicSimilarity], result of:
            0.40327886 = score(doc=3861,freq=1.0), product of:
              0.42247817 = queryWeight, product of:
                3.075742 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.017987184 = queryNorm
              0.9545555 = fieldWeight in 3861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.125 = fieldNorm(doc=3861)
        0.16 = coord(4/25)
    
  4. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.11
    0.10818377 = sum of:
      0.10818377 = product of:
        0.90153146 = sum of:
          0.23793559 = weight(abstract_txt:drops in 2417) [ClassicSimilarity], result of:
            0.23793559 = score(doc=2417,freq=2.0), product of:
              0.19812943 = queryWeight, product of:
                1.2160785 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.017987184 = queryNorm
              1.2009099 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.235854 = weight(abstract_txt:file in 2417) [ClassicSimilarity], result of:
            0.235854 = score(doc=2417,freq=4.0), product of:
              0.22547685 = queryWeight, product of:
                2.2469776 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.017987184 = queryNorm
              1.0460231 = fieldWeight in 2417, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.42774186 = weight(abstract_txt:false in 2417) [ClassicSimilarity], result of:
            0.42774186 = score(doc=2417,freq=2.0), product of:
              0.42247817 = queryWeight, product of:
                3.075742 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.017987184 = queryNorm
              1.012459 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
        0.12 = coord(3/25)
    
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.10
    0.09721693 = sum of:
      0.09721693 = product of:
        0.6076058 = sum of:
          0.019961866 = weight(abstract_txt:text in 303) [ClassicSimilarity], result of:
            0.019961866 = score(doc=303,freq=1.0), product of:
              0.0789813 = queryWeight, product of:
                1.0858362 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017987184 = queryNorm
              0.25274166 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.1757952 = weight(abstract_txt:file in 303) [ClassicSimilarity], result of:
            0.1757952 = score(doc=303,freq=5.0), product of:
              0.22547685 = queryWeight, product of:
                2.2469776 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.017987184 = queryNorm
              0.7796596 = fieldWeight in 303, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.21020934 = weight(abstract_txt:drop in 303) [ClassicSimilarity], result of:
            0.21020934 = score(doc=303,freq=1.0), product of:
              0.37945318 = queryWeight, product of:
                2.3800235 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.017987184 = queryNorm
              0.55397964 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.20163943 = weight(abstract_txt:false in 303) [ClassicSimilarity], result of:
            0.20163943 = score(doc=303,freq=1.0), product of:
              0.42247817 = queryWeight, product of:
                3.075742 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.017987184 = queryNorm
              0.47727776 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
        0.16 = coord(4/25)