Document (#33032)

Author
Zhang, X.
Han, H.
Title
¬An empirical testing of user stereotypes of information retrieval systems
Source
Information processing and management. 41(2005) no.3, S.651-664
Year
2005
Abstract
Stereotyping is a technique used in many information systems to represent user groups and/or to generate initial individual user models. However, there has been a lack of evidence on the accuracy of their use in representing users. We propose a formal evaluation method to test the accuracy or homogeneity of the stereotypes that are based on users' explicit characteristics. Using the method, the results of an empirical testing on 11 common user stereotypes of information retrieval (IR) systems are reported. The participants' memberships in the stereotypes were predicted using discriminant analysis, based on their IR knowledge. The actual membership and the predicted membership of each stereotype were compared. The data show that "librarians/IR professionals" is an accurate stereotype in representing its members, while some others, such as "undergraduate students" and "social sciences/humanities" users, are not accurate stereotypes. The data also demonstrate that based on the user's IR knowledge a stereotype can be made more accurate or homogeneous. The results show the promise that our method can help better detect the differences among stereotype members, and help with better stereotype design and user modeling. We assume that accurate stereotypes have better performance in user modeling and thus the system performance. Limitations and future directions of the study are discussed.

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.54
    4.5423746 = sum of:
      4.5423746 = weight(author_txt:zhang in 5775) [ClassicSimilarity], result of:
        4.5423746 = fieldWeight in 5775, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.5 = fieldNorm(doc=5775)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.54
    4.5423746 = sum of:
      4.5423746 = weight(author_txt:zhang in 237) [ClassicSimilarity], result of:
        4.5423746 = fieldWeight in 237, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.5 = fieldNorm(doc=237)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 7711) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 7711, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=7711)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 3212) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 3212, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=3212)
    
  5. Zhang, J.: ¬A representational analysis of relational information displays (1996) 4.01
    4.01493 = sum of:
      4.01493 = weight(author_txt:zhang in 6403) [ClassicSimilarity], result of:
        4.01493 = fieldWeight in 6403, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.625 = fieldNorm(doc=6403)
    

Similar documents (content)

  1. Shapira, B.; Shoval, P.; Hanani, U.: Stereotypes in information filtering systems (1997) 0.57
    0.5721727 = sum of:
      0.5721727 = product of:
        2.043474 = sum of:
          0.011203766 = weight(abstract_txt:based in 157) [ClassicSimilarity], result of:
            0.011203766 = score(doc=157,freq=1.0), product of:
              0.03748731 = queryWeight, product of:
                1.1787686 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009975788 = queryNorm
              0.29886824 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          0.027469398 = weight(abstract_txt:systems in 157) [ClassicSimilarity], result of:
            0.027469398 = score(doc=157,freq=4.0), product of:
              0.04293924 = queryWeight, product of:
                1.2615765 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.009975788 = queryNorm
              0.6397272 = fieldWeight in 157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          0.031462424 = weight(abstract_txt:users in 157) [ClassicSimilarity], result of:
            0.031462424 = score(doc=157,freq=4.0), product of:
              0.047005612 = queryWeight, product of:
                1.3199615 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.009975788 = queryNorm
              0.66933334 = fieldWeight in 157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          0.010843233 = weight(abstract_txt:that in 157) [ClassicSimilarity], result of:
            0.010843233 = score(doc=157,freq=2.0), product of:
              0.03451599 = queryWeight, product of:
                1.460229 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.009975788 = queryNorm
              0.314151 = fieldWeight in 157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          0.03456736 = weight(abstract_txt:user in 157) [ClassicSimilarity], result of:
            0.03456736 = score(doc=157,freq=1.0), product of:
              0.10009884 = queryWeight, product of:
                2.7240555 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.009975788 = queryNorm
              0.34533226 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          0.4940966 = weight(abstract_txt:stereotype in 157) [ClassicSimilarity], result of:
            0.4940966 = score(doc=157,freq=1.0), product of:
              0.55479485 = queryWeight, product of:
                5.8543277 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.009975788 = queryNorm
              0.89059335 = fieldWeight in 157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
          1.4338312 = weight(abstract_txt:stereotypes in 157) [ClassicSimilarity], result of:
            1.4338312 = score(doc=157,freq=5.0), product of:
              0.701445 = queryWeight, product of:
                7.2110457 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.009975788 = queryNorm
              2.0441108 = fieldWeight in 157, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=157)
        0.28 = coord(7/25)
    
  2. Mooney, G.; John, R.: Intelligent information retrieval from the World Wide Web using fuzzy user modelling (1997) 0.19
    0.1858491 = sum of:
      0.1858491 = product of:
        0.77437127 = sum of:
          0.019717637 = weight(abstract_txt:show in 1175) [ClassicSimilarity], result of:
            0.019717637 = score(doc=1175,freq=1.0), product of:
              0.04773619 = queryWeight, product of:
                1.0860871 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.009975788 = queryNorm
              0.4130543 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
          0.022888036 = weight(abstract_txt:performance in 1175) [ClassicSimilarity], result of:
            0.022888036 = score(doc=1175,freq=1.0), product of:
              0.052725036 = queryWeight, product of:
                1.1414299 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.009975788 = queryNorm
              0.43410188 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
          0.013734699 = weight(abstract_txt:systems in 1175) [ClassicSimilarity], result of:
            0.013734699 = score(doc=1175,freq=1.0), product of:
              0.04293924 = queryWeight, product of:
                1.2615765 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.009975788 = queryNorm
              0.3198636 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
          0.007667323 = weight(abstract_txt:that in 1175) [ClassicSimilarity], result of:
            0.007667323 = score(doc=1175,freq=1.0), product of:
              0.03451599 = queryWeight, product of:
                1.460229 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.009975788 = queryNorm
              0.22213829 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
          0.06913472 = weight(abstract_txt:user in 1175) [ClassicSimilarity], result of:
            0.06913472 = score(doc=1175,freq=4.0), product of:
              0.10009884 = queryWeight, product of:
                2.7240555 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.009975788 = queryNorm
              0.6906645 = fieldWeight in 1175, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
          0.64122885 = weight(abstract_txt:stereotypes in 1175) [ClassicSimilarity], result of:
            0.64122885 = score(doc=1175,freq=1.0), product of:
              0.701445 = queryWeight, product of:
                7.2110457 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.009975788 = queryNorm
              0.9141542 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=1175)
        0.24 = coord(6/25)
    
  3. Singh, V.K.; Chayko, M.; Inamdar, R.; Floegel, D.: Female librarians and male computer programmers? : gender bias in occupational images on digital media platforms (2020) 0.13
    0.12781587 = sum of:
      0.12781587 = product of:
        1.0651323 = sum of:
          0.009156466 = weight(abstract_txt:systems in 6) [ClassicSimilarity], result of:
            0.009156466 = score(doc=6,freq=1.0), product of:
              0.04293924 = queryWeight, product of:
                1.2615765 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.009975788 = queryNorm
              0.2132424 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=6)
          0.0088534625 = weight(abstract_txt:that in 6) [ClassicSimilarity], result of:
            0.0088534625 = score(doc=6,freq=3.0), product of:
              0.03451599 = queryWeight, product of:
                1.460229 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.009975788 = queryNorm
              0.2565032 = fieldWeight in 6, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=6)
          1.0471224 = weight(abstract_txt:stereotypes in 6) [ClassicSimilarity], result of:
            1.0471224 = score(doc=6,freq=6.0), product of:
              0.701445 = queryWeight, product of:
                7.2110457 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.009975788 = queryNorm
              1.4928075 = fieldWeight in 6, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=6)
        0.12 = coord(3/25)
    
  4. Crossan, G.; Burton, P.F.: Teleworking stereotypes : a case study (1993) 0.10
    0.0999594 = sum of:
      0.0999594 = product of:
        0.83299506 = sum of:
          0.07182368 = weight(abstract_txt:homogeneous in 6691) [ClassicSimilarity], result of:
            0.07182368 = score(doc=6691,freq=1.0), product of:
              0.080937244 = queryWeight, product of:
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.009975788 = queryNorm
              0.8873996 = fieldWeight in 6691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.109375 = fieldNorm(doc=6691)
          0.01307106 = weight(abstract_txt:based in 6691) [ClassicSimilarity], result of:
            0.01307106 = score(doc=6691,freq=1.0), product of:
              0.03748731 = queryWeight, product of:
                1.1787686 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009975788 = queryNorm
              0.3486796 = fieldWeight in 6691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.109375 = fieldNorm(doc=6691)
          0.74810034 = weight(abstract_txt:stereotypes in 6691) [ClassicSimilarity], result of:
            0.74810034 = score(doc=6691,freq=1.0), product of:
              0.701445 = queryWeight, product of:
                7.2110457 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.009975788 = queryNorm
              1.0665132 = fieldWeight in 6691, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.109375 = fieldNorm(doc=6691)
        0.12 = coord(3/25)
    
  5. Hong, H.; Ye, Q.: Crowd characteristics and crowd wisdom : evidence from an online investment community (2020) 0.10
    0.095915414 = sum of:
      0.095915414 = product of:
        0.23978853 = sum of:
          0.013145092 = weight(abstract_txt:show in 5763) [ClassicSimilarity], result of:
            0.013145092 = score(doc=5763,freq=1.0), product of:
              0.04773619 = queryWeight, product of:
                1.0860871 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.009975788 = queryNorm
              0.27536952 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.03737601 = weight(abstract_txt:performance in 5763) [ClassicSimilarity], result of:
            0.03737601 = score(doc=5763,freq=6.0), product of:
              0.052725036 = queryWeight, product of:
                1.1414299 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.009975788 = queryNorm
              0.70888543 = fieldWeight in 5763, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.0074691772 = weight(abstract_txt:based in 5763) [ClassicSimilarity], result of:
            0.0074691772 = score(doc=5763,freq=1.0), product of:
              0.03748731 = queryWeight, product of:
                1.1787686 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.009975788 = queryNorm
              0.19924548 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.024255546 = weight(abstract_txt:help in 5763) [ClassicSimilarity], result of:
            0.024255546 = score(doc=5763,freq=2.0), product of:
              0.05699928 = queryWeight, product of:
                1.1867944 = boost
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.009975788 = queryNorm
              0.42554125 = fieldWeight in 5763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.81445 = idf(docFreq=974, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.021744572 = weight(abstract_txt:empirical in 5763) [ClassicSimilarity], result of:
            0.021744572 = score(doc=5763,freq=1.0), product of:
              0.066768646 = queryWeight, product of:
                1.2844793 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.009975788 = queryNorm
              0.3256704 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.007228822 = weight(abstract_txt:that in 5763) [ClassicSimilarity], result of:
            0.007228822 = score(doc=5763,freq=2.0), product of:
              0.03451599 = queryWeight, product of:
                1.460229 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.009975788 = queryNorm
              0.20943399 = fieldWeight in 5763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.032703 = weight(abstract_txt:accuracy in 5763) [ClassicSimilarity], result of:
            0.032703 = score(doc=5763,freq=1.0), product of:
              0.08764566 = queryWeight, product of:
                1.471655 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.009975788 = queryNorm
              0.37312746 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.03760343 = weight(abstract_txt:testing in 5763) [ClassicSimilarity], result of:
            0.03760343 = score(doc=5763,freq=1.0), product of:
              0.09619599 = queryWeight, product of:
                1.5417689 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.009975788 = queryNorm
              0.39090434 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.035217986 = weight(abstract_txt:better in 5763) [ClassicSimilarity], result of:
            0.035217986 = score(doc=5763,freq=2.0), product of:
              0.08366339 = queryWeight, product of:
                1.7609789 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.009975788 = queryNorm
              0.4209486 = fieldWeight in 5763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
          0.023044907 = weight(abstract_txt:user in 5763) [ClassicSimilarity], result of:
            0.023044907 = score(doc=5763,freq=1.0), product of:
              0.10009884 = queryWeight, product of:
                2.7240555 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.009975788 = queryNorm
              0.23022151 = fieldWeight in 5763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=5763)
        0.4 = coord(10/25)