Document (#33864)

Author
Rosso, M.A.
Title
User-based identification of Web genres
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.7, p.1053-1072
Year
2008
Abstract
This research explores the use of genre as a document descriptor in order to improve the effectiveness of Web searching. A major issue to be resolved is the identification of what document categories should be used as genres. As genre is a kind of folk typology, document categories must enjoy widespread recognition by their intended user groups in order to qualify as genres. Three user studies were conducted to develop a genre palette and show that it is recognizable to users. (Palette is a term used to denote a classification, attributable to Karlgren, Bretan, Dewe, Hallberg, and Wolkert, 1998.) To simplify the users' classification task, it was decided to focus on Web pages from the edu domain. The first study was a survey of user terminology for Web pages. Three participants separated 100 Web page printouts into stacks according to genre, assigning names and definitions to each genre. The second study aimed to refine the resulting set of 48 (often conceptually and lexically similar) genre names and definitions into a smaller palette of user-preferred terminology. Ten participants classified the same 100 Web pages. A set of five principles for creating a genre palette from individuals' sortings was developed, and the list of 48 was trimmed to 18 genres. The third study aimed to show that users would agree on the genres of Web pages when choosing from the genre palette. In an online experiment in which 257 participants categorized a new set of 55 pages using the 18 genres, on average, over 70% agreed on the genre of each page. Suggestions for improving the genre palette and future directions for the work are discussed.
Theme
Internet
Content analysis

Similar documents (content)

  1. Santini, M.: Zero, single, or multi? : genre of web pages through the users' perspective (2008) 0.26
    0.25917855 = sum of:
      0.25917855 = product of:
        0.9256377 = sum of:
          0.01350156 = weight(abstract_txt:show in 2059) [ClassicSimilarity], result of:
            0.01350156 = score(doc=2059,freq=1.0), product of:
              0.0490307 = queryWeight, product of:
                1.006784 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.011053401 = queryNorm
              0.27536952 = fieldWeight in 2059, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.013882466 = weight(abstract_txt:order in 2059) [ClassicSimilarity], result of:
            0.013882466 = score(doc=2059,freq=1.0), product of:
              0.049948584 = queryWeight, product of:
                1.0161642 = boost
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.011053401 = queryNorm
              0.27793512 = fieldWeight in 2059, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.009503752 = weight(abstract_txt:study in 2059) [ClassicSimilarity], result of:
            0.009503752 = score(doc=2059,freq=1.0), product of:
              0.044412572 = queryWeight, product of:
                1.173548 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.011053401 = queryNorm
              0.21398787 = fieldWeight in 2059, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.024086641 = weight(abstract_txt:users in 2059) [ClassicSimilarity], result of:
            0.024086641 = score(doc=2059,freq=5.0), product of:
              0.048280306 = queryWeight, product of:
                1.2235814 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.011053401 = queryNorm
              0.49889165 = fieldWeight in 2059, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.06807501 = weight(abstract_txt:page in 2059) [ClassicSimilarity], result of:
            0.06807501 = score(doc=2059,freq=4.0), product of:
              0.090820506 = queryWeight, product of:
                1.3702323 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.011053401 = queryNorm
              0.74955547 = fieldWeight in 2059, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.13902967 = weight(abstract_txt:pages in 2059) [ClassicSimilarity], result of:
            0.13902967 = score(doc=2059,freq=4.0), product of:
              0.19841619 = queryWeight, product of:
                3.2022913 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.011053401 = queryNorm
              0.7006972 = fieldWeight in 2059, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
          0.6575586 = weight(abstract_txt:genre in 2059) [ClassicSimilarity], result of:
            0.6575586 = score(doc=2059,freq=8.0), product of:
              0.5590643 = queryWeight, product of:
                7.60183 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.011053401 = queryNorm
              1.176177 = fieldWeight in 2059, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0625 = fieldNorm(doc=2059)
        0.28 = coord(7/25)
    
  2. Hajibayova, L.; Jacob, E.K.: User-generated genre tags through the lens of genre theories (2014) 0.25
    0.25287804 = sum of:
      0.25287804 = product of:
        1.0536585 = sum of:
          0.010080251 = weight(abstract_txt:study in 1450) [ClassicSimilarity], result of:
            0.010080251 = score(doc=1450,freq=2.0), product of:
              0.044412572 = queryWeight, product of:
                1.173548 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.011053401 = queryNorm
              0.22696841 = fieldWeight in 1450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.016434573 = weight(abstract_txt:categories in 1450) [ClassicSimilarity], result of:
            0.016434573 = score(doc=1450,freq=1.0), product of:
              0.06771375 = queryWeight, product of:
                1.1831524 = boost
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.011053401 = queryNorm
              0.24270657 = fieldWeight in 1450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17774 = idf(docFreq=677, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.011425298 = weight(abstract_txt:users in 1450) [ClassicSimilarity], result of:
            0.011425298 = score(doc=1450,freq=2.0), product of:
              0.048280306 = queryWeight, product of:
                1.2235814 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.011053401 = queryNorm
              0.23664509 = fieldWeight in 1450, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.025623351 = weight(abstract_txt:user in 1450) [ClassicSimilarity], result of:
            0.025623351 = score(doc=1450,freq=3.0), product of:
              0.085677765 = queryWeight, product of:
                2.1042936 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.011053401 = queryNorm
              0.2990665 = fieldWeight in 1450, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.2711839 = weight(abstract_txt:genres in 1450) [ClassicSimilarity], result of:
            0.2711839 = score(doc=1450,freq=4.0), product of:
              0.39875028 = queryWeight, product of:
                4.972942 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.011053401 = queryNorm
              0.6800846 = fieldWeight in 1450, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.7189112 = weight(abstract_txt:genre in 1450) [ClassicSimilarity], result of:
            0.7189112 = score(doc=1450,freq=17.0), product of:
              0.5590643 = queryWeight, product of:
                7.60183 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.011053401 = queryNorm
              1.2859185 = fieldWeight in 1450, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
        0.24 = coord(6/25)
    
  3. Dillon, A.; Gushrowski, B.A.: Genres and the Web : is the personal home page the first uniquely digital genre? (2000) 0.24
    0.24302375 = sum of:
      0.24302375 = product of:
        1.012599 = sum of:
          0.075135514 = weight(abstract_txt:recognizable in 4389) [ClassicSimilarity], result of:
            0.075135514 = score(doc=4389,freq=1.0), product of:
              0.1053155 = queryWeight, product of:
                1.0433581 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.011053401 = queryNorm
              0.71343267 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
          0.04254688 = weight(abstract_txt:page in 4389) [ClassicSimilarity], result of:
            0.04254688 = score(doc=4389,freq=1.0), product of:
              0.090820506 = queryWeight, product of:
                1.3702323 = boost
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.011053401 = queryNorm
              0.46847218 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9964437 = idf(docFreq=298, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
          0.042705584 = weight(abstract_txt:user in 4389) [ClassicSimilarity], result of:
            0.042705584 = score(doc=4389,freq=3.0), product of:
              0.085677765 = queryWeight, product of:
                2.1042936 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.011053401 = queryNorm
              0.49844417 = fieldWeight in 4389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
          0.12288602 = weight(abstract_txt:pages in 4389) [ClassicSimilarity], result of:
            0.12288602 = score(doc=4389,freq=2.0), product of:
              0.19841619 = queryWeight, product of:
                3.2022913 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.011053401 = queryNorm
              0.61933464 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
          0.2259866 = weight(abstract_txt:genres in 4389) [ClassicSimilarity], result of:
            0.2259866 = score(doc=4389,freq=1.0), product of:
              0.39875028 = queryWeight, product of:
                4.972942 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.011053401 = queryNorm
              0.5667372 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
          0.5033384 = weight(abstract_txt:genre in 4389) [ClassicSimilarity], result of:
            0.5033384 = score(doc=4389,freq=3.0), product of:
              0.5590643 = queryWeight, product of:
                7.60183 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.011053401 = queryNorm
              0.9003229 = fieldWeight in 4389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.078125 = fieldNorm(doc=4389)
        0.24 = coord(6/25)
    
  4. Wu, I.-C.; Niu, Y.-F.: Effects of anchoring process under preference stabilities for interactive movie recommendations (2015) 0.22
    0.21747167 = sum of:
      0.21747167 = product of:
        0.7766845 = sum of:
          0.01350156 = weight(abstract_txt:show in 2130) [ClassicSimilarity], result of:
            0.01350156 = score(doc=2130,freq=1.0), product of:
              0.0490307 = queryWeight, product of:
                1.006784 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.011053401 = queryNorm
              0.27536952 = fieldWeight in 2130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.013882466 = weight(abstract_txt:order in 2130) [ClassicSimilarity], result of:
            0.013882466 = score(doc=2130,freq=1.0), product of:
              0.049948584 = queryWeight, product of:
                1.0161642 = boost
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.011053401 = queryNorm
              0.27793512 = fieldWeight in 2130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.446962 = idf(docFreq=1407, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.009503752 = weight(abstract_txt:study in 2130) [ClassicSimilarity], result of:
            0.009503752 = score(doc=2130,freq=1.0), product of:
              0.044412572 = queryWeight, product of:
                1.173548 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.011053401 = queryNorm
              0.21398787 = fieldWeight in 2130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.021543747 = weight(abstract_txt:users in 2130) [ClassicSimilarity], result of:
            0.021543747 = score(doc=2130,freq=4.0), product of:
              0.048280306 = queryWeight, product of:
                1.2235814 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.011053401 = queryNorm
              0.44622225 = fieldWeight in 2130, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.027895171 = weight(abstract_txt:user in 2130) [ClassicSimilarity], result of:
            0.027895171 = score(doc=2130,freq=2.0), product of:
              0.085677765 = queryWeight, product of:
                2.1042936 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.011053401 = queryNorm
              0.32558239 = fieldWeight in 2130, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.36157855 = weight(abstract_txt:genres in 2130) [ClassicSimilarity], result of:
            0.36157855 = score(doc=2130,freq=4.0), product of:
              0.39875028 = queryWeight, product of:
                4.972942 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.011053401 = queryNorm
              0.90677947 = fieldWeight in 2130, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.3287793 = weight(abstract_txt:genre in 2130) [ClassicSimilarity], result of:
            0.3287793 = score(doc=2130,freq=2.0), product of:
              0.5590643 = queryWeight, product of:
                7.60183 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.011053401 = queryNorm
              0.5880885 = fieldWeight in 2130, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
        0.28 = coord(7/25)
    
  5. Crowston, K.; Kwasnik, B.H.: Can document-genre metadata improve information access to large digital collections? (2004) 0.19
    0.19154884 = sum of:
      0.19154884 = product of:
        1.1971803 = sum of:
          0.063036144 = weight(abstract_txt:identification in 824) [ClassicSimilarity], result of:
            0.063036144 = score(doc=824,freq=4.0), product of:
              0.08628167 = queryWeight, product of:
                1.3355542 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.011053401 = queryNorm
              0.7305856 = fieldWeight in 824, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.032440834 = weight(abstract_txt:document in 824) [ClassicSimilarity], result of:
            0.032440834 = score(doc=824,freq=3.0), product of:
              0.06981201 = queryWeight, product of:
                1.4713397 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.011053401 = queryNorm
              0.46468848 = fieldWeight in 824, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.40425712 = weight(abstract_txt:genres in 824) [ClassicSimilarity], result of:
            0.40425712 = score(doc=824,freq=5.0), product of:
              0.39875028 = queryWeight, product of:
                4.972942 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.011053401 = queryNorm
              1.0138103 = fieldWeight in 824, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.6974462 = weight(abstract_txt:genre in 824) [ClassicSimilarity], result of:
            0.6974462 = score(doc=824,freq=9.0), product of:
              0.5590643 = queryWeight, product of:
                7.60183 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.011053401 = queryNorm
              1.2475241 = fieldWeight in 824, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
        0.16 = coord(4/25)
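
The score breakdowns above follow the pattern of a Lucene ClassicSimilarity (TF-IDF) explanation. As a minimal sketch, assuming the usual formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the first entry (doc 2059) can be recomputed from the boost, queryNorm, fieldNorm, and coord values listed in the record. The function and variable names are illustrative only and are not part of the record.

```python
import math

# Values copied from the explanation trees in the record above.
MAX_DOCS = 44218
QUERY_NORM = 0.011053401


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming idf = 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(terms, matched, query_terms):
    """Sum of term scores scaled by coord = matched / query_terms."""
    return sum(term_score(*t) for t in terms) * (matched / query_terms)


# (freq, docFreq, boost, fieldNorm) for the seven matching terms of doc 2059.
terms_2059 = [
    (1.0, 1466, 1.006784, 0.0625),   # show
    (1.0, 1407, 1.0161642, 0.0625),  # order
    (1.0, 3916, 1.173548, 0.0625),   # study
    (5.0, 3384, 1.2235814, 0.0625),  # users
    (4.0, 298, 1.3702323, 0.0625),   # page
    (4.0, 441, 3.2022913, 0.0625),   # pages
    (8.0, 154, 7.60183, 0.0625),     # genre
]

# The record lists 0.25917855 for this document (7 of 25 query terms matched).
total = document_score(terms_2059, matched=7, query_terms=25)
print(f"recomputed score for doc 2059: {total:.6f}")
```

The same functions apply to the other four entries by substituting their freq, docFreq, boost, fieldNorm, and coord values. Small differences in the last digits are to be expected, since the listed values appear to be rounded single-precision numbers.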