Document (#34111)

Author
Morrison, P.J.
Title
Tagging and searching : search retrieval effectiveness of folksonomies on the World Wide Web
Source
Information processing and management. 44(2008) no.4, p.1562-1579
Year
2008
Abstract
Many Web sites have begun allowing users to submit items to a collection and tag them with keywords. The folksonomies built from these tags are an interesting topic that has seen little empirical research. This study compared the information retrieval (IR) performance of folksonomies from social bookmarking Web sites against that of search engines and subject directories. Thirty-four participants created 103 queries for various information needs. Results from each IR system were collected, and participants judged their relevance. Folksonomy search results overlapped with those from the other systems, and documents found by both search engines and folksonomies were significantly more likely to be judged relevant than those returned by any single IR system type. The search engines in the study had the highest precision and recall, but the folksonomies fared surprisingly well. Del.icio.us was statistically indistinguishable from the directories in many cases. Overall, the directories were more precise than the folksonomies, but the two had similar recall scores. Better query handling may enhance folksonomy IR performance further. The folksonomies studied were promising and may be able to improve Web search performance.
Theme
Folksonomies

Similar documents (content)

  1. Munk, T.B.; Mork, K.: Folksonomy, the power law & the significance of the least effort (2007) 0.26
    0.25704184 = sum of:
      0.25704184 = product of:
        0.91800654 = sum of:
          0.012054746 = weight(abstract_txt:than in 2664) [ClassicSimilarity], result of:
            0.012054746 = score(doc=2664,freq=1.0), product of:
              0.05612305 = queryWeight, product of:
                1.1225986 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.012728817 = queryNorm
              0.21479136 = fieldWeight in 2664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.013837828 = weight(abstract_txt:many in 2664) [ClassicSimilarity], result of:
            0.013837828 = score(doc=2664,freq=1.0), product of:
              0.061529186 = queryWeight, product of:
                1.1754237 = boost
                4.1124315 = idf(docFreq=1866, maxDocs=41962)
                0.012728817 = queryNorm
              0.2248986 = fieldWeight in 2664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1124315 = idf(docFreq=1866, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.12897289 = weight(abstract_txt:del.icio.us in 2664) [ClassicSimilarity], result of:
            0.12897289 = score(doc=2664,freq=3.0), product of:
              0.14996311 = queryWeight, product of:
                1.2975708 = boost
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.012728817 = queryNorm
              0.8600308 = fieldWeight in 2664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.079571 = idf(docFreq=12, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.010982233 = weight(abstract_txt:from in 2664) [ClassicSimilarity], result of:
            0.010982233 = score(doc=2664,freq=1.0), product of:
              0.07158296 = queryWeight, product of:
                2.0046043 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.012728817 = queryNorm
              0.15341966 = fieldWeight in 2664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.17690134 = weight(abstract_txt:folksonomy in 2664) [ClassicSimilarity], result of:
            0.17690134 = score(doc=2664,freq=3.0), product of:
              0.23324709 = queryWeight, product of:
                2.2885582 = boost
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.012728817 = queryNorm
              0.75842893 = fieldWeight in 2664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.034095522 = weight(abstract_txt:search in 2664) [ClassicSimilarity], result of:
            0.034095522 = score(doc=2664,freq=1.0), product of:
              0.17042114 = queryWeight, product of:
                3.6597347 = boost
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.012728817 = queryNorm
              0.20006627 = fieldWeight in 2664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
          0.54116195 = weight(abstract_txt:folksonomies in 2664) [ClassicSimilarity], result of:
            0.54116195 = score(doc=2664,freq=3.0), product of:
              0.74628204 = queryWeight, product of:
                7.6584234 = boost
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.012728817 = queryNorm
              0.725144 = fieldWeight in 2664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2664)
        0.28 = coord(7/25)
    
  2. Yi, K.; Chan, L.M.: Linking folksonomy to Library of Congress subject headings : an exploratory study (2009) 0.21
    0.21259993 = sum of:
      0.21259993 = product of:
        0.8858331 = sum of:
          0.008520883 = weight(abstract_txt:study in 617) [ClassicSimilarity], result of:
            0.008520883 = score(doc=617,freq=1.0), product of:
              0.044534057 = queryWeight, product of:
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.012728817 = queryNorm
              0.19133407 = fieldWeight in 617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
          0.012306916 = weight(abstract_txt:results in 617) [ClassicSimilarity], result of:
            0.012306916 = score(doc=617,freq=2.0), product of:
              0.045163963 = queryWeight, product of:
                1.0070473 = boost
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.012728817 = queryNorm
              0.27249414 = fieldWeight in 617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
          0.015531222 = weight(abstract_txt:from in 617) [ClassicSimilarity], result of:
            0.015531222 = score(doc=617,freq=2.0), product of:
              0.07158296 = queryWeight, product of:
                2.0046043 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.012728817 = queryNorm
              0.21696815 = fieldWeight in 617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
          0.020325985 = weight(abstract_txt:were in 617) [ClassicSimilarity], result of:
            0.020325985 = score(doc=617,freq=1.0), product of:
              0.10017214 = queryWeight, product of:
                2.1210082 = boost
                3.7103646 = idf(docFreq=2790, maxDocs=41962)
                0.012728817 = queryNorm
              0.20291056 = fieldWeight in 617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7103646 = idf(docFreq=2790, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
          0.20426807 = weight(abstract_txt:folksonomy in 617) [ClassicSimilarity], result of:
            0.20426807 = score(doc=617,freq=4.0), product of:
              0.23324709 = queryWeight, product of:
                2.2885582 = boost
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.012728817 = queryNorm
              0.8757583 = fieldWeight in 617, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.006933 = idf(docFreq=37, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
          0.62488 = weight(abstract_txt:folksonomies in 617) [ClassicSimilarity], result of:
            0.62488 = score(doc=617,freq=4.0), product of:
              0.74628204 = queryWeight, product of:
                7.6584234 = boost
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.012728817 = queryNorm
              0.8373242 = fieldWeight in 617, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.0546875 = fieldNorm(doc=617)
        0.24 = coord(6/25)
    
  3. Kipp, M.E.I.; Campbell, D.G.: Searching with tags : do tags help users find things? (2010) 0.20
    0.20032695 = sum of:
      0.20032695 = product of:
        0.62602174 = sum of:
          0.0137718255 = weight(abstract_txt:study in 1065) [ClassicSimilarity], result of:
            0.0137718255 = score(doc=1065,freq=2.0), product of:
              0.044534057 = queryWeight, product of:
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.012728817 = queryNorm
              0.30924255 = fieldWeight in 1065, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.0099454895 = weight(abstract_txt:results in 1065) [ClassicSimilarity], result of:
            0.0099454895 = score(doc=1065,freq=1.0), product of:
              0.045163963 = queryWeight, product of:
                1.0070473 = boost
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.012728817 = queryNorm
              0.22020853 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.060828418 = weight(abstract_txt:bookmarking in 1065) [ClassicSimilarity], result of:
            0.060828418 = score(doc=1065,freq=1.0), product of:
              0.11988613 = queryWeight, product of:
                1.1601746 = boost
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.012728817 = queryNorm
              0.50738496 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.118159 = idf(docFreq=33, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.01581466 = weight(abstract_txt:many in 1065) [ClassicSimilarity], result of:
            0.01581466 = score(doc=1065,freq=1.0), product of:
              0.061529186 = queryWeight, product of:
                1.1754237 = boost
                4.1124315 = idf(docFreq=1866, maxDocs=41962)
                0.012728817 = queryNorm
              0.25702697 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1124315 = idf(docFreq=1866, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.0833455 = weight(abstract_txt:participants in 1065) [ClassicSimilarity], result of:
            0.0833455 = score(doc=1065,freq=5.0), product of:
              0.10896977 = queryWeight, product of:
                1.5642526 = boost
                5.4728193 = idf(docFreq=478, maxDocs=41962)
                0.012728817 = queryNorm
              0.7648498 = fieldWeight in 1065, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4728193 = idf(docFreq=478, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.017749969 = weight(abstract_txt:from in 1065) [ClassicSimilarity], result of:
            0.017749969 = score(doc=1065,freq=2.0), product of:
              0.07158296 = queryWeight, product of:
                2.0046043 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.012728817 = queryNorm
              0.2479636 = fieldWeight in 1065, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.06749163 = weight(abstract_txt:search in 1065) [ClassicSimilarity], result of:
            0.06749163 = score(doc=1065,freq=3.0), product of:
              0.17042114 = queryWeight, product of:
                3.6597347 = boost
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.012728817 = queryNorm
              0.39602852 = fieldWeight in 1065, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
          0.3570743 = weight(abstract_txt:folksonomies in 1065) [ClassicSimilarity], result of:
            0.3570743 = score(doc=1065,freq=1.0), product of:
              0.74628204 = queryWeight, product of:
                7.6584234 = boost
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.012728817 = queryNorm
              0.47847098 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6555357 = idf(docFreq=53, maxDocs=41962)
                0.0625 = fieldNorm(doc=1065)
        0.32 = coord(8/25)
    
  4. Spink, A.; Jansen, B.J.; Blakely, C.; Koshman, S.: A study of results overlap and uniqueness among major Web search engines (2006) 0.17
    0.17404751 = sum of:
      0.17404751 = product of:
        0.48346528 = sum of:
          0.019053271 = weight(abstract_txt:study in 2994) [ClassicSimilarity], result of:
            0.019053271 = score(doc=2994,freq=5.0), product of:
              0.044534057 = queryWeight, product of:
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.012728817 = queryNorm
              0.42783597 = fieldWeight in 2994, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.49868 = idf(docFreq=3448, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.02610691 = weight(abstract_txt:results in 2994) [ClassicSimilarity], result of:
            0.02610691 = score(doc=2994,freq=9.0), product of:
              0.045163963 = queryWeight, product of:
                1.0070473 = boost
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.012728817 = queryNorm
              0.5780474 = fieldWeight in 2994, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.5233364 = idf(docFreq=3364, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.041215565 = weight(abstract_txt:returned in 2994) [ClassicSimilarity], result of:
            0.041215565 = score(doc=2994,freq=1.0), product of:
              0.10109586 = queryWeight, product of:
                1.0653825 = boost
                7.454865 = idf(docFreq=65, maxDocs=41962)
                0.012728817 = queryNorm
              0.40768793 = fieldWeight in 2994, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.454865 = idf(docFreq=65, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.012054746 = weight(abstract_txt:than in 2994) [ClassicSimilarity], result of:
            0.012054746 = score(doc=2994,freq=1.0), product of:
              0.05612305 = queryWeight, product of:
                1.1225986 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.012728817 = queryNorm
              0.21479136 = fieldWeight in 2994, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.030215163 = weight(abstract_txt:performance in 2994) [ClassicSimilarity], result of:
            0.030215163 = score(doc=2994,freq=1.0), product of:
              0.118544914 = queryWeight, product of:
                1.9982091 = boost
                4.66073 = idf(docFreq=1078, maxDocs=41962)
                0.012728817 = queryNorm
              0.25488368 = fieldWeight in 2994, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.66073 = idf(docFreq=1078, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.019021787 = weight(abstract_txt:from in 2994) [ClassicSimilarity], result of:
            0.019021787 = score(doc=2994,freq=3.0), product of:
              0.07158296 = queryWeight, product of:
                2.0046043 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.012728817 = queryNorm
              0.26573065 = fieldWeight in 2994, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.028745284 = weight(abstract_txt:were in 2994) [ClassicSimilarity], result of:
            0.028745284 = score(doc=2994,freq=2.0), product of:
              0.10017214 = queryWeight, product of:
                2.1210082 = boost
                3.7103646 = idf(docFreq=2790, maxDocs=41962)
                0.012728817 = queryNorm
              0.28695887 = fieldWeight in 2994, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7103646 = idf(docFreq=2790, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.16647314 = weight(abstract_txt:engines in 2994) [ClassicSimilarity], result of:
            0.16647314 = score(doc=2994,freq=14.0), product of:
              0.15343332 = queryWeight, product of:
                2.2733135 = boost
                5.302398 = idf(docFreq=567, maxDocs=41962)
                0.012728817 = queryNorm
              1.0849868 = fieldWeight in 2994, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.302398 = idf(docFreq=567, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
          0.14057943 = weight(abstract_txt:search in 2994) [ClassicSimilarity], result of:
            0.14057943 = score(doc=2994,freq=17.0), product of:
              0.17042114 = queryWeight, product of:
                3.6597347 = boost
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.012728817 = queryNorm
              0.82489437 = fieldWeight in 2994, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.0546875 = fieldNorm(doc=2994)
        0.36 = coord(9/25)
    
  5. Gandhi, S.: Proliferation and categories of Internet directories : a database of Internet subject directories (1998) 0.17
    0.16866307 = sum of:
      0.16866307 = product of:
        0.7027628 = sum of:
          0.02066528 = weight(abstract_txt:than in 5164) [ClassicSimilarity], result of:
            0.02066528 = score(doc=5164,freq=1.0), product of:
              0.05612305 = queryWeight, product of:
                1.1225986 = boost
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.012728817 = queryNorm
              0.36821377 = fieldWeight in 5164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9276135 = idf(docFreq=2245, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
          0.027804555 = weight(abstract_txt:those in 5164) [ClassicSimilarity], result of:
            0.027804555 = score(doc=5164,freq=1.0), product of:
              0.06840026 = queryWeight, product of:
                1.2393179 = boost
                4.335977 = idf(docFreq=1492, maxDocs=41962)
                0.012728817 = queryNorm
              0.40649784 = fieldWeight in 5164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.335977 = idf(docFreq=1492, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
          0.018826686 = weight(abstract_txt:from in 5164) [ClassicSimilarity], result of:
            0.018826686 = score(doc=5164,freq=1.0), product of:
              0.07158296 = queryWeight, product of:
                2.0046043 = boost
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.012728817 = queryNorm
              0.26300514 = fieldWeight in 5164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.805388 = idf(docFreq=6898, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
          0.076271676 = weight(abstract_txt:engines in 5164) [ClassicSimilarity], result of:
            0.076271676 = score(doc=5164,freq=1.0), product of:
              0.15343332 = queryWeight, product of:
                2.2733135 = boost
                5.302398 = idf(docFreq=567, maxDocs=41962)
                0.012728817 = queryNorm
              0.49709982 = fieldWeight in 5164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.302398 = idf(docFreq=567, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
          0.5007451 = weight(abstract_txt:directories in 5164) [ClassicSimilarity], result of:
            0.5007451 = score(doc=5164,freq=7.0), product of:
              0.28122634 = queryWeight, product of:
                3.0777085 = boost
                7.1786118 = idf(docFreq=86, maxDocs=41962)
                0.012728817 = queryNorm
              1.780577 = fieldWeight in 5164, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1786118 = idf(docFreq=86, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
          0.058449466 = weight(abstract_txt:search in 5164) [ClassicSimilarity], result of:
            0.058449466 = score(doc=5164,freq=1.0), product of:
              0.17042114 = queryWeight, product of:
                3.6597347 = boost
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.012728817 = queryNorm
              0.34297076 = fieldWeight in 5164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6583548 = idf(docFreq=2939, maxDocs=41962)
                0.09375 = fieldNorm(doc=5164)
        0.24 = coord(6/25)
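
The score trees above can be reproduced by hand. The following is a minimal sketch of that arithmetic, not the catalog's actual code. It assumes the formulas that the listed numbers are consistent with: tf = sqrt(termFreq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final score equal to the sum of the term scores times coord. The function names are hypothetical; maxDocs, queryNorm, boost, and fieldNorm are copied from the listing.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 41962
QUERY_NORM = 0.012728817


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor, assuming the square root of the raw count."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, boost=1.0, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    """Sum of the term scores scaled by coord = matched / total query terms."""
    return sum(term_scores) * (matched / total)


# Term "del.icio.us" in doc 2664 (entry 1): freq=3, docFreq=12.
delicious = term_score(freq=3.0, doc_freq=12, field_norm=0.0546875,
                       boost=1.2975708)

# Entry 2 (doc 617): the six term scores listed above, coord(6/25).
entry2_terms = [0.008520883, 0.012306916, 0.015531222,
                0.020325985, 0.20426807, 0.62488]
entry2 = document_score(entry2_terms, matched=6, total=25)

print(delicious, entry2)
```

Under these assumptions the two printed values should land close to the 0.12897289 and 0.21259993 reported in the listing; small differences in the last digits are expected because the listing appears to use single-precision arithmetic. Terms shown without a boost line (for example "study" in entry 2) correspond to the default boost of 1.0.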