Document (#34265)

Author
Arsenault, C.
Ménard, E.
Title
Searching titles with initial articles in library catalogs : a case study and search behavior analysis
Source
Library resources and technical services. 51(2007) no.3, S.190-203
Year
2007
Abstract
This study examines problems caused by initial articles in library catalogs. The problematic records observed are those whose titles begin with a word erroneously considered to be an article at the retrieval stage. Many retrieval algorithms edit queries by removing initial words corresponding to articles found in an exclusion list even whether the initial word is an article or not. Consequently, a certain number of documents remain more difficult to find. The study also examines user behavior during known-item retrieval using the title index in library catalogs, concentrating on the problems caused by the presence of an initial article or of a word homograph to an article. Measures of success and effectiveness are taken to determine if retrieval is affected in such cases.
Theme
Katalogfragen allgemein

Similar documents (author)

  1. Arsenault, C.; Ménard, E.; Leide, J.E.: Tensions in cataloging : observations on standards and implementation (1998) 4.89
    4.8872385 = sum of:
      4.8872385 = sum of:
        2.3771882 = weight(author_txt:ménard in 1034) [ClassicSimilarity], result of:
          2.3771882 = score(doc=1034,freq=1.0), product of:
            0.69417566 = queryWeight, product of:
              9.131938 = idf(docFreq=12, maxDocs=44218)
              0.07601625 = queryNorm
            3.4244766 = fieldWeight in 1034, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.131938 = idf(docFreq=12, maxDocs=44218)
              0.375 = fieldNorm(doc=1034)
        2.51005 = weight(author_txt:arsenault in 1034) [ClassicSimilarity], result of:
          2.51005 = score(doc=1034,freq=1.0), product of:
            0.7198056 = queryWeight, product of:
              1.0182934 = boost
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.07601625 = queryNorm
            3.487122 = fieldWeight in 1034, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.298992 = idf(docFreq=10, maxDocs=44218)
              0.375 = fieldNorm(doc=1034)
    
  2. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 2.09
    2.0917084 = sum of:
      2.0917084 = product of:
        4.183417 = sum of:
          4.183417 = weight(author_txt:arsenault in 87) [ClassicSimilarity], result of:
            4.183417 = score(doc=87,freq=1.0), product of:
              0.7198056 = queryWeight, product of:
                1.0182934 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.07601625 = queryNorm
              5.81187 = fieldWeight in 87, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=87)
        0.5 = coord(1/2)
    
  3. Arsenault, C.: Word division in the transcription of Chinese script in the title fields of bibliographic Records (2001) 2.09
    2.0917084 = sum of:
      2.0917084 = product of:
        4.183417 = sum of:
          4.183417 = weight(author_txt:arsenault in 5434) [ClassicSimilarity], result of:
            4.183417 = score(doc=5434,freq=1.0), product of:
              0.7198056 = queryWeight, product of:
                1.0182934 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.07601625 = queryNorm
              5.81187 = fieldWeight in 5434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=5434)
        0.5 = coord(1/2)
    
  4. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 2.09
    2.0917084 = sum of:
      2.0917084 = product of:
        4.183417 = sum of:
          4.183417 = weight(author_txt:arsenault in 609) [ClassicSimilarity], result of:
            4.183417 = score(doc=609,freq=1.0), product of:
              0.7198056 = queryWeight, product of:
                1.0182934 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.07601625 = queryNorm
              5.81187 = fieldWeight in 609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.625 = fieldNorm(doc=609)
        0.5 = coord(1/2)
    
  5. Ménard, E.: Indexing and retrieving images in a multilingual world (2008) 1.98
    1.9809904 = sum of:
      1.9809904 = product of:
        3.9619808 = sum of:
          3.9619808 = weight(author_txt:ménard in 2239) [ClassicSimilarity], result of:
            3.9619808 = score(doc=2239,freq=1.0), product of:
              0.69417566 = queryWeight, product of:
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07601625 = queryNorm
              5.7074614 = fieldWeight in 2239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.625 = fieldNorm(doc=2239)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Bachir, I.; Buxton, A.: ¬The use of topic sentences for evaluating the representativeness of Arabic article titles (1993) 0.14
    0.143107 = sum of:
      0.143107 = product of:
        0.59627914 = sum of:
          0.07012392 = weight(abstract_txt:corresponding in 6985) [ClassicSimilarity], result of:
            0.07012392 = score(doc=6985,freq=1.0), product of:
              0.11788574 = queryWeight, product of:
                6.345029 = idf(docFreq=210, maxDocs=44218)
                0.018579228 = queryNorm
              0.5948465 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.345029 = idf(docFreq=210, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.05293221 = weight(abstract_txt:examines in 6985) [ClassicSimilarity], result of:
            0.05293221 = score(doc=6985,freq=1.0), product of:
              0.123132825 = queryWeight, product of:
                1.4453442 = boost
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.018579228 = queryNorm
              0.42987895 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.026651261 = weight(abstract_txt:library in 6985) [ClassicSimilarity], result of:
            0.026651261 = score(doc=6985,freq=1.0), product of:
              0.08920779 = queryWeight, product of:
                1.5067159 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.018579228 = queryNorm
              0.29875487 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.23012789 = weight(abstract_txt:titles in 6985) [ClassicSimilarity], result of:
            0.23012789 = score(doc=6985,freq=5.0), product of:
              0.19181535 = queryWeight, product of:
                1.8039564 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.018579228 = queryNorm
              1.1997366 = fieldWeight in 6985, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.15599784 = weight(abstract_txt:articles in 6985) [ClassicSimilarity], result of:
            0.15599784 = score(doc=6985,freq=3.0), product of:
              0.20089212 = queryWeight, product of:
                2.2610567 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.018579228 = queryNorm
              0.77652544 = fieldWeight in 6985, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
          0.060446054 = weight(abstract_txt:article in 6985) [ClassicSimilarity], result of:
            0.060446054 = score(doc=6985,freq=1.0), product of:
              0.16949198 = queryWeight, product of:
                2.3981366 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.018579228 = queryNorm
              0.35663077 = fieldWeight in 6985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.09375 = fieldNorm(doc=6985)
        0.24 = coord(6/25)
    
  2. Beall, J.; Kafadar, K.: Measuring typographical errors' impact on retrieval in bibliographic databases (2007) 0.14
    0.14128625 = sum of:
      0.14128625 = product of:
        0.5886927 = sum of:
          0.061381225 = weight(abstract_txt:presence in 261) [ClassicSimilarity], result of:
            0.061381225 = score(doc=261,freq=1.0), product of:
              0.121813394 = queryWeight, product of:
                1.0165223 = boost
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.018579228 = queryNorm
              0.5038955 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
          0.038953613 = weight(abstract_txt:study in 261) [ClassicSimilarity], result of:
            0.038953613 = score(doc=261,freq=2.0), product of:
              0.10297542 = queryWeight, product of:
                1.6188134 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.018579228 = queryNorm
              0.3782807 = fieldWeight in 261, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
          0.038402613 = weight(abstract_txt:retrieval in 261) [ClassicSimilarity], result of:
            0.038402613 = score(doc=261,freq=1.0), product of:
              0.14144856 = queryWeight, product of:
                2.1907792 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018579228 = queryNorm
              0.27149525 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
          0.05037171 = weight(abstract_txt:article in 261) [ClassicSimilarity], result of:
            0.05037171 = score(doc=261,freq=1.0), product of:
              0.16949198 = queryWeight, product of:
                2.3981366 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.018579228 = queryNorm
              0.2971923 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
          0.2464245 = weight(abstract_txt:word in 261) [ClassicSimilarity], result of:
            0.2464245 = score(doc=261,freq=5.0), product of:
              0.25952408 = queryWeight, product of:
                2.5699153 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018579228 = queryNorm
              0.9495246 = fieldWeight in 261, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
          0.1531591 = weight(abstract_txt:catalogs in 261) [ClassicSimilarity], result of:
            0.1531591 = score(doc=261,freq=1.0), product of:
              0.3232017 = queryWeight, product of:
                2.8679185 = boost
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.018579228 = queryNorm
              0.4738809 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.078125 = fieldNorm(doc=261)
        0.24 = coord(6/25)
    
  3. Milojevic, S.; Sugimoto, C.R.; Yan, E.; Ding, Y.: ¬The cognitive structure of Library and Information Science : analysis of article title words (2011) 0.12
    0.117536366 = sum of:
      0.117536366 = product of:
        0.41977274 = sum of:
          0.02512705 = weight(abstract_txt:library in 4608) [ClassicSimilarity], result of:
            0.02512705 = score(doc=4608,freq=2.0), product of:
              0.08920779 = queryWeight, product of:
                1.5067159 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.018579228 = queryNorm
              0.28166878 = fieldWeight in 4608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.049272858 = weight(abstract_txt:study in 4608) [ClassicSimilarity], result of:
            0.049272858 = score(doc=4608,freq=5.0), product of:
              0.10297542 = queryWeight, product of:
                1.6188134 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.018579228 = queryNorm
              0.47849143 = fieldWeight in 4608, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.051738992 = weight(abstract_txt:behavior in 4608) [ClassicSimilarity], result of:
            0.051738992 = score(doc=4608,freq=1.0), product of:
              0.15891564 = queryWeight, product of:
                1.6419793 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.018579228 = queryNorm
              0.3255752 = fieldWeight in 4608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.068610884 = weight(abstract_txt:titles in 4608) [ClassicSimilarity], result of:
            0.068610884 = score(doc=4608,freq=1.0), product of:
              0.19181535 = queryWeight, product of:
                1.8039564 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.018579228 = queryNorm
              0.35769236 = fieldWeight in 4608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.060043596 = weight(abstract_txt:articles in 4608) [ClassicSimilarity], result of:
            0.060043596 = score(doc=4608,freq=1.0), product of:
              0.20089212 = queryWeight, product of:
                2.2610567 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.018579228 = queryNorm
              0.29888478 = fieldWeight in 4608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.04029737 = weight(abstract_txt:article in 4608) [ClassicSimilarity], result of:
            0.04029737 = score(doc=4608,freq=1.0), product of:
              0.16949198 = queryWeight, product of:
                2.3981366 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.018579228 = queryNorm
              0.23775385 = fieldWeight in 4608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
          0.12468202 = weight(abstract_txt:word in 4608) [ClassicSimilarity], result of:
            0.12468202 = score(doc=4608,freq=2.0), product of:
              0.25952408 = queryWeight, product of:
                2.5699153 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018579228 = queryNorm
              0.48042563 = fieldWeight in 4608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=4608)
        0.28 = coord(7/25)
    
  4. Borgman, C.L.: Why are online catalogs still hard to use? (1996) 0.11
    0.11128338 = sum of:
      0.11128338 = product of:
        0.46368074 = sum of:
          0.025418885 = weight(abstract_txt:problems in 4380) [ClassicSimilarity], result of:
            0.025418885 = score(doc=4380,freq=1.0), product of:
              0.10815675 = queryWeight, product of:
                1.3546003 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.018579228 = queryNorm
              0.23501894 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
          0.030877119 = weight(abstract_txt:examines in 4380) [ClassicSimilarity], result of:
            0.030877119 = score(doc=4380,freq=1.0), product of:
              0.123132825 = queryWeight, product of:
                1.4453442 = boost
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.018579228 = queryNorm
              0.2507627 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5853753 = idf(docFreq=1225, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
          0.06402373 = weight(abstract_txt:behavior in 4380) [ClassicSimilarity], result of:
            0.06402373 = score(doc=4380,freq=2.0), product of:
              0.15891564 = queryWeight, product of:
                1.6419793 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.018579228 = queryNorm
              0.40287876 = fieldWeight in 4380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
          0.053763658 = weight(abstract_txt:retrieval in 4380) [ClassicSimilarity], result of:
            0.053763658 = score(doc=4380,freq=4.0), product of:
              0.14144856 = queryWeight, product of:
                2.1907792 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018579228 = queryNorm
              0.38009337 = fieldWeight in 4380, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
          0.04986545 = weight(abstract_txt:article in 4380) [ClassicSimilarity], result of:
            0.04986545 = score(doc=4380,freq=2.0), product of:
              0.16949198 = queryWeight, product of:
                2.3981366 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.018579228 = queryNorm
              0.29420537 = fieldWeight in 4380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
          0.23973191 = weight(abstract_txt:catalogs in 4380) [ClassicSimilarity], result of:
            0.23973191 = score(doc=4380,freq=5.0), product of:
              0.3232017 = queryWeight, product of:
                2.8679185 = boost
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.018579228 = queryNorm
              0.7417409 = fieldWeight in 4380, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.0656753 = idf(docFreq=278, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4380)
        0.24 = coord(6/25)
    
  5. Keller, B.: Subject content through title : a masters theses matching study at Indiana State University (1992) 0.11
    0.10535985 = sum of:
      0.10535985 = product of:
        0.52679926 = sum of:
          0.022209385 = weight(abstract_txt:library in 534) [ClassicSimilarity], result of:
            0.022209385 = score(doc=534,freq=1.0), product of:
              0.08920779 = queryWeight, product of:
                1.5067159 = boost
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.018579228 = queryNorm
              0.2489624 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1867187 = idf(docFreq=4964, maxDocs=44218)
                0.078125 = fieldNorm(doc=534)
          0.04770824 = weight(abstract_txt:study in 534) [ClassicSimilarity], result of:
            0.04770824 = score(doc=534,freq=3.0), product of:
              0.10297542 = queryWeight, product of:
                1.6188134 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.018579228 = queryNorm
              0.46329734 = fieldWeight in 534, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.078125 = fieldNorm(doc=534)
          0.121288046 = weight(abstract_txt:titles in 534) [ClassicSimilarity], result of:
            0.121288046 = score(doc=534,freq=2.0), product of:
              0.19181535 = queryWeight, product of:
                1.8039564 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.018579228 = queryNorm
              0.6323167 = fieldWeight in 534, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.078125 = fieldNorm(doc=534)
          0.110204384 = weight(abstract_txt:word in 534) [ClassicSimilarity], result of:
            0.110204384 = score(doc=534,freq=1.0), product of:
              0.25952408 = queryWeight, product of:
                2.5699153 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.018579228 = queryNorm
              0.4246403 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=534)
          0.22538918 = weight(abstract_txt:initial in 534) [ClassicSimilarity], result of:
            0.22538918 = score(doc=534,freq=1.0), product of:
              0.49577358 = queryWeight, product of:
                4.585599 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.018579228 = queryNorm
              0.4546212 = fieldWeight in 534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.078125 = fieldNorm(doc=534)
        0.2 = coord(5/25)