Document (#12931)

Author
Gillaspie, L.
Title
¬The role of linguistic phenomena in retrieval performance
Source
Forging new partnerships in information: converging technologies. Proceedings of the 58th Annual Meeting of the American Society for Information Science, ASIS'95, Chicago, IL, 9-12 October 1995. Ed.: T. Kinney
Imprint
Medford, NJ : Learned Information
Year
1995
Pages
pp. 90-96
Abstract
This progress report presents findings from a failure analysis of two commercial full-text computer-assisted legal research (CALR) systems. Linguistic analyses of unretrieved documents and false drops reveal a number of potential causes of performance problems in these databases, ranging from synonymy and homography to discourse-level cohesive relations. Examines and discusses examples of natural language phenomena that affect Boolean retrieval system performance.
Theme
Computational linguistics

Similar documents (content)

  1. Shuman, B.A.: One false drop deserves another : file selection as a means of increasing precision in online searches (1992) 0.28
    0.28011578 = sum of:
      0.28011578 = product of:
        0.87536186 = sum of:
          0.038821805 = weight(abstract_txt:examples in 4031) [ClassicSimilarity], result of:
            0.038821805 = score(doc=4031,freq=1.0), product of:
              0.09963125 = queryWeight, product of:
                4.9875827 = idf(docFreq=819, maxDocs=44218)
                0.01997586 = queryNorm
              0.3896549 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9875827 = idf(docFreq=819, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.041007593 = weight(abstract_txt:natural in 4031) [ClassicSimilarity], result of:
            0.041007593 = score(doc=4031,freq=1.0), product of:
              0.10333671 = queryWeight, product of:
                1.0184261 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.01997586 = queryNorm
              0.39683473 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.026263468 = weight(abstract_txt:retrieval in 4031) [ClassicSimilarity], result of:
            0.026263468 = score(doc=4031,freq=1.0), product of:
              0.09673638 = queryWeight, product of:
                1.3935165 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01997586 = queryNorm
              0.27149525 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.108004816 = weight(abstract_txt:causes in 4031) [ClassicSimilarity], result of:
            0.108004816 = score(doc=4031,freq=1.0), product of:
              0.19707908 = queryWeight, product of:
                1.4064441 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.01997586 = queryNorm
              0.5480278 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.108004816 = weight(abstract_txt:failure in 4031) [ClassicSimilarity], result of:
            0.108004816 = score(doc=4031,freq=1.0), product of:
              0.19707908 = queryWeight, product of:
                1.4064441 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.01997586 = queryNorm
              0.5480278 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.19705793 = weight(abstract_txt:false in 4031) [ClassicSimilarity], result of:
            0.19705793 = score(doc=4031,freq=2.0), product of:
              0.23355961 = queryWeight, product of:
                1.5310912 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.01997586 = queryNorm
              0.8437158 = fieldWeight in 4031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.2325294 = weight(abstract_txt:drops in 4031) [ClassicSimilarity], result of:
            0.2325294 = score(doc=4031,freq=1.0), product of:
              0.32859707 = queryWeight, product of:
                1.8160762 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01997586 = queryNorm
              0.707643 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
          0.12367207 = weight(abstract_txt:linguistic in 4031) [ClassicSimilarity], result of:
            0.12367207 = score(doc=4031,freq=1.0), product of:
              0.27177083 = queryWeight, product of:
                2.3357084 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01997586 = queryNorm
              0.45506012 = fieldWeight in 4031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=4031)
        0.32 = coord(8/25)
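
The per-term figures in this explanation tree appear to follow the classic Lucene TF-IDF scheme named in the `[ClassicSimilarity]` tag: tf is the square root of the term frequency, idf is `1 + ln(maxDocs / (docFreq + 1))`, and each term's score is queryWeight times fieldWeight. The sketch below recomputes two of the term scores listed for doc 4031 from the values printed in the tree. The function names are illustrative and not part of any library, and a missing boost line is assumed to mean a boost of 1.0.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by the classic TF-IDF similarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one query term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm         # query side
    field_weight = tf(freq) * term_idf * field_norm      # document side
    return query_weight * field_weight


# Values copied from the explanation tree for doc 4031.
MAX_DOCS = 44218
QUERY_NORM = 0.01997586
FIELD_NORM_4031 = 0.078125

# "examples": freq=1.0, docFreq=819, no boost line shown (assumed 1.0).
score_examples = term_score(1.0, 819, MAX_DOCS, QUERY_NORM, FIELD_NORM_4031)

# "false": freq=2.0, docFreq=57, boost=1.5310912.
score_false = term_score(2.0, 57, MAX_DOCS, QUERY_NORM, FIELD_NORM_4031,
                         boost=1.5310912)

print(f"examples: {score_examples:.6f}")
print(f"false:    {score_false:.6f}")
```

The recomputed values should agree with the listed 0.038821805 and 0.19705793 to within single-precision rounding, since the engine works with 32-bit floats and this sketch uses Python's 64-bit floats.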
    
  2. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.12
    0.123909436 = sum of:
      0.123909436 = product of:
        0.774434 = sum of:
          0.03151616 = weight(abstract_txt:retrieval in 2417) [ClassicSimilarity], result of:
            0.03151616 = score(doc=2417,freq=1.0), product of:
              0.09673638 = queryWeight, product of:
                1.3935165 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01997586 = queryNorm
              0.3257943 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.23646954 = weight(abstract_txt:false in 2417) [ClassicSimilarity], result of:
            0.23646954 = score(doc=2417,freq=2.0), product of:
              0.23355961 = queryWeight, product of:
                1.5310912 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.01997586 = queryNorm
              1.012459 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.39461547 = weight(abstract_txt:drops in 2417) [ClassicSimilarity], result of:
            0.39461547 = score(doc=2417,freq=2.0), product of:
              0.32859707 = queryWeight, product of:
                1.8160762 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01997586 = queryNorm
              1.2009099 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
          0.111832775 = weight(abstract_txt:performance in 2417) [ClassicSimilarity], result of:
            0.111832775 = score(doc=2417,freq=1.0), product of:
              0.25761873 = queryWeight, product of:
                2.785169 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01997586 = queryNorm
              0.43410188 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.09375 = fieldNorm(doc=2417)
        0.16 = coord(4/25)
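
The final score of each hit is the sum of its matching term scores multiplied by a coordination factor, `coord(matched/total)`, which rewards documents that match more of the 25 query terms. The following sketch, with a hypothetical helper name, reproduces the totals for the first two hits from the term scores listed in their trees. It suggests why doc 4031 ranks well above doc 2417 even though their raw sums are close.

```python
def total_score(term_scores, query_term_count):
    """Sum the per-term scores and scale by coord = matched / total terms."""
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


QUERY_TERM_COUNT = 25  # denominator shown in coord(8/25) and coord(4/25)

# Per-term scores copied from the explanation trees.
doc_4031 = [0.038821805, 0.041007593, 0.026263468, 0.108004816,
            0.108004816, 0.19705793, 0.2325294, 0.12367207]
doc_2417 = [0.03151616, 0.23646954, 0.39461547, 0.111832775]

score_4031 = total_score(doc_4031, QUERY_TERM_COUNT)  # 8 of 25 terms matched
score_2417 = total_score(doc_2417, QUERY_TERM_COUNT)  # 4 of 25 terms matched

print(f"doc 4031: sum={sum(doc_4031):.6f} total={score_4031:.6f}")
print(f"doc 2417: sum={sum(doc_2417):.6f} total={score_2417:.6f}")
```

The raw sums differ by only about 0.10 (0.875 against 0.774), but halving the number of matched terms halves the coordination factor, so the final scores come out at roughly 0.28 and 0.12.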
    
  3. Dubois, C.P.R.: Text retrieval 92 : summary of papers and trends (1993) 0.11
    0.113479786 = sum of:
      0.113479786 = product of:
        0.5673989 = sum of:
          0.018497892 = weight(abstract_txt:from in 6255) [ClassicSimilarity], result of:
            0.018497892 = score(doc=6255,freq=1.0), product of:
              0.061190575 = queryWeight, product of:
                1.1083055 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01997586 = queryNorm
              0.30229968 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.109375 = fieldNorm(doc=6255)
          0.13924645 = weight(abstract_txt:progress in 6255) [ClassicSimilarity], result of:
            0.13924645 = score(doc=6255,freq=2.0), product of:
              0.14806049 = queryWeight, product of:
                1.2190508 = boost
                6.0801163 = idf(docFreq=274, maxDocs=44218)
                0.01997586 = queryNorm
              0.94047004 = fieldWeight in 6255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0801163 = idf(docFreq=274, maxDocs=44218)
                0.109375 = fieldNorm(doc=6255)
          0.05199901 = weight(abstract_txt:retrieval in 6255) [ClassicSimilarity], result of:
            0.05199901 = score(doc=6255,freq=2.0), product of:
              0.09673638 = queryWeight, product of:
                1.3935165 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01997586 = queryNorm
              0.53753316 = fieldWeight in 6255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=6255)
          0.1731409 = weight(abstract_txt:linguistic in 6255) [ClassicSimilarity], result of:
            0.1731409 = score(doc=6255,freq=1.0), product of:
              0.27177083 = queryWeight, product of:
                2.3357084 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.01997586 = queryNorm
              0.6370842 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.109375 = fieldNorm(doc=6255)
          0.18451467 = weight(abstract_txt:performance in 6255) [ClassicSimilarity], result of:
            0.18451467 = score(doc=6255,freq=2.0), product of:
              0.25761873 = queryWeight, product of:
                2.785169 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01997586 = queryNorm
              0.7162316 = fieldWeight in 6255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.109375 = fieldNorm(doc=6255)
        0.2 = coord(5/25)
    
  4. Turtle, H.; Flood, J.: Query evaluation : strategies and optimizations (1995) 0.11
    0.11203175 = sum of:
      0.11203175 = product of:
        0.5601587 = sum of:
          0.06561215 = weight(abstract_txt:natural in 4087) [ClassicSimilarity], result of:
            0.06561215 = score(doc=4087,freq=1.0), product of:
              0.10333671 = queryWeight, product of:
                1.0184261 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.01997586 = queryNorm
              0.63493556 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.125 = fieldNorm(doc=4087)
          0.11813308 = weight(abstract_txt:analyzes in 4087) [ClassicSimilarity], result of:
            0.11813308 = score(doc=4087,freq=1.0), product of:
              0.15293708 = queryWeight, product of:
                1.2389637 = boost
                6.1794343 = idf(docFreq=248, maxDocs=44218)
                0.01997586 = queryNorm
              0.7724293 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1794343 = idf(docFreq=248, maxDocs=44218)
                0.125 = fieldNorm(doc=4087)
          0.12351808 = weight(abstract_txt:legal in 4087) [ClassicSimilarity], result of:
            0.12351808 = score(doc=4087,freq=1.0), product of:
              0.15755014 = queryWeight, product of:
                1.2575104 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.01997586 = queryNorm
              0.7839922 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.125 = fieldNorm(doc=4087)
          0.04202155 = weight(abstract_txt:retrieval in 4087) [ClassicSimilarity], result of:
            0.04202155 = score(doc=4087,freq=1.0), product of:
              0.09673638 = queryWeight, product of:
                1.3935165 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01997586 = queryNorm
              0.43439242 = fieldWeight in 4087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=4087)
          0.2108739 = weight(abstract_txt:performance in 4087) [ClassicSimilarity], result of:
            0.2108739 = score(doc=4087,freq=2.0), product of:
              0.25761873 = queryWeight, product of:
                2.785169 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01997586 = queryNorm
              0.81855035 = fieldWeight in 4087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.125 = fieldNorm(doc=4087)
        0.2 = coord(5/25)
    
  5. Cavanagh, A.K.: ¬A comparison of the retrieval performance of multi-disciplinary table-of-contents databases with conventional specialised databases (1997) 0.11
    0.10862857 = sum of:
      0.10862857 = product of:
        0.54314286 = sum of:
          0.013212779 = weight(abstract_txt:from in 770) [ClassicSimilarity], result of:
            0.013212779 = score(doc=770,freq=1.0), product of:
              0.061190575 = queryWeight, product of:
                1.1083055 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.01997586 = queryNorm
              0.21592833 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=770)
          0.026263468 = weight(abstract_txt:retrieval in 770) [ClassicSimilarity], result of:
            0.026263468 = score(doc=770,freq=1.0), product of:
              0.09673638 = queryWeight, product of:
                1.3935165 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.01997586 = queryNorm
              0.27149525 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=770)
          0.13934101 = weight(abstract_txt:false in 770) [ClassicSimilarity], result of:
            0.13934101 = score(doc=770,freq=1.0), product of:
              0.23355961 = queryWeight, product of:
                1.5310912 = boost
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.01997586 = queryNorm
              0.5965972 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.636444 = idf(docFreq=57, maxDocs=44218)
                0.078125 = fieldNorm(doc=770)
          0.2325294 = weight(abstract_txt:drops in 770) [ClassicSimilarity], result of:
            0.2325294 = score(doc=770,freq=1.0), product of:
              0.32859707 = queryWeight, product of:
                1.8160762 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01997586 = queryNorm
              0.707643 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=770)
          0.13179618 = weight(abstract_txt:performance in 770) [ClassicSimilarity], result of:
            0.13179618 = score(doc=770,freq=2.0), product of:
              0.25761873 = queryWeight, product of:
                2.785169 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01997586 = queryNorm
              0.51159394 = fieldWeight in 770, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=770)
        0.2 = coord(5/25)