Document (#32988)

Author
Kim, G.
Title
Relationship between index term specificity and relevance judgment
Source
Information processing and management. 42(2006) no.5, p.1218-1229
Year
2006
Abstract
Concurrent concepts of specificity are discussed and differentiated from each other to investigate the relationship between index term specificity and users' relevance judgments. The identified concepts are term-document specificity, hierarchical specificity, statement specificity, and posting specificity. Among them, term-document specificity, which is a relationship between an index term and the document indexed with the term, is regarded as a fruitful research area. In an experiment involving three searches with 175 retrieved documents from 356 matched index terms, the impact of specificity on relevance judgments is analyzed and found to be statistically significant. Implications for indexing practice and for future research are discussed.

Similar documents (content)

  1. Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval (2004) 0.35
    0.34591803 = sum of:
      0.34591803 = product of:
        1.4413252 = sum of:
          0.05039322 = weight(abstract_txt:statistically in 421) [ClassicSimilarity], result of:
            0.05039322 = score(doc=421,freq=1.0), product of:
              0.0786733 = queryWeight, product of:
                1.2004195 = boost
                6.8324027 = idf(docFreq=123, maxDocs=42306)
                0.009592258 = queryNorm
              0.64053774 = fieldWeight in 421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8324027 = idf(docFreq=123, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
          0.054163698 = weight(abstract_txt:regarded in 421) [ClassicSimilarity], result of:
            0.054163698 = score(doc=421,freq=1.0), product of:
              0.082550205 = queryWeight, product of:
                1.2296413 = boost
                6.998724 = idf(docFreq=104, maxDocs=42306)
                0.009592258 = queryNorm
              0.6561304 = fieldWeight in 421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.998724 = idf(docFreq=104, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
          0.037144467 = weight(abstract_txt:document in 421) [ClassicSimilarity], result of:
            0.037144467 = score(doc=421,freq=1.0), product of:
              0.092586815 = queryWeight, product of:
                2.2555609 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.009592258 = queryNorm
              0.40118527 = fieldWeight in 421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
          0.08386405 = weight(abstract_txt:index in 421) [ClassicSimilarity], result of:
            0.08386405 = score(doc=421,freq=1.0), product of:
              0.18892373 = queryWeight, product of:
                4.159562 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.009592258 = queryNorm
              0.44390425 = fieldWeight in 421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
          0.18569857 = weight(abstract_txt:term in 421) [ClassicSimilarity], result of:
            0.18569857 = score(doc=421,freq=3.0), product of:
              0.23648033 = queryWeight, product of:
                5.0979137 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.009592258 = queryNorm
              0.78526014 = fieldWeight in 421, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
          1.0300611 = weight(abstract_txt:specificity in 421) [ClassicSimilarity], result of:
            1.0300611 = score(doc=421,freq=3.0), product of:
              0.84825885 = queryWeight, product of:
                11.8251 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.009592258 = queryNorm
              1.214324 = fieldWeight in 421, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.09375 = fieldNorm(doc=421)
        0.24 = coord(6/25)
    
  2. Chu, H.: Factors affecting relevance judgment : a report from TREC Legal track (2011) 0.27
    0.27049223 = sum of:
      0.27049223 = product of:
        0.9660437 = sum of:
          0.027654275 = weight(abstract_txt:retrieved in 1541) [ClassicSimilarity], result of:
            0.027654275 = score(doc=1541,freq=2.0), product of:
              0.054845165 = queryWeight, product of:
                1.0022788 = boost
                5.704649 = idf(docFreq=382, maxDocs=42306)
                0.009592258 = queryNorm
              0.5042245 = fieldWeight in 1541, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.704649 = idf(docFreq=382, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.015845072 = weight(abstract_txt:research in 1541) [ClassicSimilarity], result of:
            0.015845072 = score(doc=1541,freq=5.0), product of:
              0.035122838 = queryWeight, product of:
                1.1343032 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.009592258 = queryNorm
              0.451133 = fieldWeight in 1541, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.09398805 = weight(abstract_txt:judgment in 1541) [ClassicSimilarity], result of:
            0.09398805 = score(doc=1541,freq=5.0), product of:
              0.0913479 = queryWeight, product of:
                1.2935066 = boost
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.009592258 = queryNorm
              1.0289022 = fieldWeight in 1541, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.06996898 = weight(abstract_txt:judgments in 1541) [ClassicSimilarity], result of:
            0.06996898 = score(doc=1541,freq=1.0), product of:
              0.16165428 = queryWeight, product of:
                2.433481 = boost
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.009592258 = queryNorm
              0.43283102 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.12641768 = weight(abstract_txt:relevance in 1541) [ClassicSimilarity], result of:
            0.12641768 = score(doc=1541,freq=11.0), product of:
              0.12343024 = queryWeight, product of:
                2.6042986 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.009592258 = queryNorm
              1.0242035 = fieldWeight in 1541, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.07147542 = weight(abstract_txt:term in 1541) [ClassicSimilarity], result of:
            0.07147542 = score(doc=1541,freq=1.0), product of:
              0.23648033 = queryWeight, product of:
                5.0979137 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.009592258 = queryNorm
              0.30224678 = fieldWeight in 1541, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
          0.5606943 = weight(abstract_txt:specificity in 1541) [ClassicSimilarity], result of:
            0.5606943 = score(doc=1541,freq=2.0), product of:
              0.84825885 = queryWeight, product of:
                11.8251 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.009592258 = queryNorm
              0.6609943 = fieldWeight in 1541, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.0625 = fieldNorm(doc=1541)
        0.28 = coord(7/25)
    
  3. Savolainen, R.; Kari, J.: User-defined relevance criteria in web searching (2006) 0.16
    0.16492926 = sum of:
      0.16492926 = product of:
        0.82464623 = sum of:
          0.0070861313 = weight(abstract_txt:research in 1740) [ClassicSimilarity], result of:
            0.0070861313 = score(doc=1740,freq=1.0), product of:
              0.035122838 = queryWeight, product of:
                1.1343032 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.009592258 = queryNorm
              0.20175281 = fieldWeight in 1740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.0625 = fieldNorm(doc=1740)
          0.05944326 = weight(abstract_txt:judgment in 1740) [ClassicSimilarity], result of:
            0.05944326 = score(doc=1740,freq=2.0), product of:
              0.0913479 = queryWeight, product of:
                1.2935066 = boost
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.009592258 = queryNorm
              0.65073484 = fieldWeight in 1740, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3622246 = idf(docFreq=72, maxDocs=42306)
                0.0625 = fieldNorm(doc=1740)
          0.12118983 = weight(abstract_txt:judgments in 1740) [ClassicSimilarity], result of:
            0.12118983 = score(doc=1740,freq=3.0), product of:
              0.16165428 = queryWeight, product of:
                2.433481 = boost
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.009592258 = queryNorm
              0.7496853 = fieldWeight in 1740, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9252963 = idf(docFreq=112, maxDocs=42306)
                0.0625 = fieldNorm(doc=1740)
          0.07623273 = weight(abstract_txt:relevance in 1740) [ClassicSimilarity], result of:
            0.07623273 = score(doc=1740,freq=4.0), product of:
              0.12343024 = queryWeight, product of:
                2.6042986 = boost
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.009592258 = queryNorm
              0.61761796 = fieldWeight in 1740, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9409437 = idf(docFreq=821, maxDocs=42306)
                0.0625 = fieldNorm(doc=1740)
          0.5606943 = weight(abstract_txt:specificity in 1740) [ClassicSimilarity], result of:
            0.5606943 = score(doc=1740,freq=2.0), product of:
              0.84825885 = queryWeight, product of:
                11.8251 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.009592258 = queryNorm
              0.6609943 = fieldWeight in 1740, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.0625 = fieldNorm(doc=1740)
        0.2 = coord(5/25)
    
  4. Mooers, C.N.: The indexing language of an information retrieval system (1985) 0.16
    0.16177179 = sum of:
      0.16177179 = product of:
        0.5777564 = sum of:
          0.017860841 = weight(abstract_txt:indexed in 4645) [ClassicSimilarity], result of:
            0.017860841 = score(doc=4645,freq=1.0), product of:
              0.06254615 = queryWeight, product of:
                1.0703348 = boost
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.009592258 = queryNorm
              0.2855626 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0920024 = idf(docFreq=259, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.018921366 = weight(abstract_txt:involving in 4645) [ClassicSimilarity], result of:
            0.018921366 = score(doc=4645,freq=1.0), product of:
              0.06499814 = queryWeight, product of:
                1.0911133 = boost
                6.2102666 = idf(docFreq=230, maxDocs=42306)
                0.009592258 = queryNorm
              0.29110625 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2102666 = idf(docFreq=230, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.007515977 = weight(abstract_txt:research in 4645) [ClassicSimilarity], result of:
            0.007515977 = score(doc=4645,freq=2.0), product of:
              0.035122838 = queryWeight, product of:
                1.1343032 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.009592258 = queryNorm
              0.21399117 = fieldWeight in 4645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.032168053 = weight(abstract_txt:document in 4645) [ClassicSimilarity], result of:
            0.032168053 = score(doc=4645,freq=3.0), product of:
              0.092586815 = queryWeight, product of:
                2.2555609 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.009592258 = queryNorm
              0.34743664 = fieldWeight in 4645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.072628394 = weight(abstract_txt:index in 4645) [ClassicSimilarity], result of:
            0.072628394 = score(doc=4645,freq=3.0), product of:
              0.18892373 = queryWeight, product of:
                4.159562 = boost
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.009592258 = queryNorm
              0.38443235 = fieldWeight in 4645, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7349787 = idf(docFreq=1009, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.13130873 = weight(abstract_txt:term in 4645) [ClassicSimilarity], result of:
            0.13130873 = score(doc=4645,freq=6.0), product of:
              0.23648033 = queryWeight, product of:
                5.0979137 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.009592258 = queryNorm
              0.5552628 = fieldWeight in 4645, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
          0.29735303 = weight(abstract_txt:specificity in 4645) [ClassicSimilarity], result of:
            0.29735303 = score(doc=4645,freq=1.0), product of:
              0.84825885 = queryWeight, product of:
                11.8251 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.009592258 = queryNorm
              0.35054517 = fieldWeight in 4645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.046875 = fieldNorm(doc=4645)
        0.28 = coord(7/25)
    
  5. Tamine, L.; Chouquet, C.; Palmer, T.: Analysis of biomedical and health queries : lessons learned from TREC and CLEF evaluation benchmarks (2015) 0.16
    0.16071184 = sum of:
      0.16071184 = product of:
        0.8035592 = sum of:
          0.0070861313 = weight(abstract_txt:research in 4342) [ClassicSimilarity], result of:
            0.0070861313 = score(doc=4342,freq=1.0), product of:
              0.035122838 = queryWeight, product of:
                1.1343032 = boost
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.009592258 = queryNorm
              0.20175281 = fieldWeight in 4342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.228045 = idf(docFreq=4557, maxDocs=42306)
                0.0625 = fieldNorm(doc=4342)
          0.013527255 = weight(abstract_txt:between in 4342) [ClassicSimilarity], result of:
            0.013527255 = score(doc=4342,freq=1.0), product of:
              0.06187098 = queryWeight, product of:
                1.8438411 = boost
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.009592258 = queryNorm
              0.2186365 = fieldWeight in 4342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.498184 = idf(docFreq=3478, maxDocs=42306)
                0.0625 = fieldNorm(doc=4342)
          0.024762979 = weight(abstract_txt:document in 4342) [ClassicSimilarity], result of:
            0.024762979 = score(doc=4342,freq=1.0), product of:
              0.092586815 = queryWeight, product of:
                2.2555609 = boost
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.009592258 = queryNorm
              0.26745686 = fieldWeight in 4342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2793097 = idf(docFreq=1592, maxDocs=42306)
                0.0625 = fieldNorm(doc=4342)
          0.07147542 = weight(abstract_txt:term in 4342) [ClassicSimilarity], result of:
            0.07147542 = score(doc=4342,freq=1.0), product of:
              0.23648033 = queryWeight, product of:
                5.0979137 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.009592258 = queryNorm
              0.30224678 = fieldWeight in 4342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.0625 = fieldNorm(doc=4342)
          0.6867074 = weight(abstract_txt:specificity in 4342) [ClassicSimilarity], result of:
            0.6867074 = score(doc=4342,freq=3.0), product of:
              0.84825885 = queryWeight, product of:
                11.8251 = boost
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.009592258 = queryNorm
              0.80954933 = fieldWeight in 4342, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4782968 = idf(docFreq=64, maxDocs=42306)
                0.0625 = fieldNorm(doc=4342)
        0.2 = coord(5/25)
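
The score trees above follow the ClassicSimilarity (TF-IDF) pattern: for each matched term, tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm; the term scores are summed and multiplied by coord (matched terms / query terms). A minimal sketch that recomputes the first result (doc 421) from the numbers shown in its tree is given below. The function and variable names are illustrative assumptions, the boost, queryNorm and fieldNorm values are copied from the record rather than derived, and Python doubles will differ from the listed single-precision values in the last digits.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency as it appears in the explain output.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight for one matched term.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Constants copied from the tree for doc 421.
MAX_DOCS = 42306
QUERY_NORM = 0.009592258
FIELD_NORM = 0.09375
QUERY_TERMS = 25  # denominator of coord(6/25)

# (term, freq, docFreq, boost) for each matched term in doc 421.
matched = [
    ("statistically", 1.0, 123, 1.2004195),
    ("regarded", 1.0, 104, 1.2296413),
    ("document", 1.0, 1592, 2.2555609),
    ("index", 1.0, 1009, 4.159562),
    ("term", 3.0, 912, 5.0979137),
    ("specificity", 3.0, 64, 11.8251),
]

total = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in matched
)
coord = len(matched) / QUERY_TERMS
score = total * coord

print(round(total, 4), round(score, 4))  # close to 1.4413 and 0.3459 in the record
```

Read this way, the ranking is dominated by the heavily boosted, rare term "specificity": it contributes about 1.03 of the 1.44 summed for doc 421 before coord is applied.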