Document (#40837)

Author
O'Neill, E.T.
Kammerer, K.A.
Bennett, R.
Title
¬The aboutness of words
Source
Journal of the Association for Information Science and Technology. 68(2017) no.10, S.2471-2483
Year
2017
Abstract
Word aboutness is defined as the relationship between words and subjects associated with them. An aboutness coefficient is developed to estimate the strength of the aboutness relationship. Words that are randomly distributed across subjects are assumed to lack aboutness and the degree to which their usage deviates from a random pattern indicates the strength of the aboutness. To estimate aboutness, title words and their associated subjects are extracted from the titles of non-fiction English language books in the OCLC WorldCat database. The usage patterns of the title words are analyzed and used to compute aboutness coefficients for each of the common title words. Words with low aboutness coefficients (An and In) are commonly found in stop word lists, whereas words with high aboutness coefficients (Carbonate, Autism) are unambiguous and have a strong subject association. The aboutness coefficient potentially can enhance indexing, advance authority control, and improve retrieval.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23856/full.
Theme
Begriffstheorie

Similar documents (author)

  1. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2012) 4.49
    4.490419 = sum of:
      4.490419 = sum of:
        1.8683382 = weight(author_txt:o'neill in 2311) [ClassicSimilarity], result of:
          1.8683382 = score(doc=2311,freq=1.0), product of:
            0.62362736 = queryWeight, product of:
              7.9891224 = idf(docFreq=38, maxDocs=42306)
              0.07805956 = queryNorm
            2.995921 = fieldWeight in 2311, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.9891224 = idf(docFreq=38, maxDocs=42306)
              0.375 = fieldNorm(doc=2311)
        2.6220808 = weight(author_txt:bennett in 2311) [ClassicSimilarity], result of:
          2.6220808 = score(doc=2311,freq=1.0), product of:
            0.7817218 = queryWeight, product of:
              1.1196016 = boost
              8.944634 = idf(docFreq=14, maxDocs=42306)
              0.07805956 = queryNorm
            3.354238 = fieldWeight in 2311, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.944634 = idf(docFreq=14, maxDocs=42306)
              0.375 = fieldNorm(doc=2311)
    
  2. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2014) 4.49
    4.490419 = sum of:
      4.490419 = sum of:
        1.8683382 = weight(author_txt:o'neill in 3971) [ClassicSimilarity], result of:
          1.8683382 = score(doc=3971,freq=1.0), product of:
            0.62362736 = queryWeight, product of:
              7.9891224 = idf(docFreq=38, maxDocs=42306)
              0.07805956 = queryNorm
            2.995921 = fieldWeight in 3971, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.9891224 = idf(docFreq=38, maxDocs=42306)
              0.375 = fieldNorm(doc=3971)
        2.6220808 = weight(author_txt:bennett in 3971) [ClassicSimilarity], result of:
          2.6220808 = score(doc=3971,freq=1.0), product of:
            0.7817218 = queryWeight, product of:
              1.1196016 = boost
              8.944634 = idf(docFreq=14, maxDocs=42306)
              0.07805956 = queryNorm
            3.354238 = fieldWeight in 3971, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.944634 = idf(docFreq=14, maxDocs=42306)
              0.375 = fieldNorm(doc=3971)
    
  3. Bennett, R.: Terminology and the computer : attention shifts to the micro (1994) 2.19
    2.1850672 = sum of:
      2.1850672 = product of:
        4.3701344 = sum of:
          4.3701344 = weight(author_txt:bennett in 677) [ClassicSimilarity], result of:
            4.3701344 = score(doc=677,freq=1.0), product of:
              0.7817218 = queryWeight, product of:
                1.1196016 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07805956 = queryNorm
              5.5903964 = fieldWeight in 677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.625 = fieldNorm(doc=677)
        0.5 = coord(1/2)
    
  4. Bennett, D.C.: ¬The internationalization of scholarship and scholarly societies in the humanities and social sciences (1996) 2.19
    2.1850672 = sum of:
      2.1850672 = product of:
        4.3701344 = sum of:
          4.3701344 = weight(author_txt:bennett in 6106) [ClassicSimilarity], result of:
            4.3701344 = score(doc=6106,freq=1.0), product of:
              0.7817218 = queryWeight, product of:
                1.1196016 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07805956 = queryNorm
              5.5903964 = fieldWeight in 6106, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.625 = fieldNorm(doc=6106)
        0.5 = coord(1/2)
    
  5. Bennett, J.L.: ¬The user interface in interactive systems (1972) 2.19
    2.1850672 = sum of:
      2.1850672 = product of:
        4.3701344 = sum of:
          4.3701344 = weight(author_txt:bennett in 246) [ClassicSimilarity], result of:
            4.3701344 = score(doc=246,freq=1.0), product of:
              0.7817218 = queryWeight, product of:
                1.1196016 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.07805956 = queryNorm
              5.5903964 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.625 = fieldNorm(doc=246)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Weinberg, B.H.: Why indexing fails the researcher (1988) 0.09
    0.09113036 = sum of:
      0.09113036 = product of:
        0.7594197 = sum of:
          0.007152572 = weight(abstract_txt:with in 703) [ClassicSimilarity], result of:
            0.007152572 = score(doc=703,freq=4.0), product of:
              0.022647304 = queryWeight, product of:
                1.1609911 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0077206157 = queryNorm
              0.31582442 = fieldWeight in 703, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=703)
          0.019929213 = weight(abstract_txt:associated in 703) [ClassicSimilarity], result of:
            0.019929213 = score(doc=703,freq=1.0), product of:
              0.06218582 = queryWeight, product of:
                1.5707992 = boost
                5.1276546 = idf(docFreq=681, maxDocs=42306)
                0.0077206157 = queryNorm
              0.3204784 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1276546 = idf(docFreq=681, maxDocs=42306)
                0.0625 = fieldNorm(doc=703)
          0.7323379 = weight(abstract_txt:aboutness in 703) [ClassicSimilarity], result of:
            0.7323379 = score(doc=703,freq=3.0), product of:
              0.84123904 = queryWeight, product of:
                13.549274 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0077206157 = queryNorm
              0.8705467 = fieldWeight in 703, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0625 = fieldNorm(doc=703)
        0.12 = coord(3/25)
    
  2. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.08
    0.081515595 = sum of:
      0.081515595 = product of:
        0.3396483 = sum of:
          0.031300087 = weight(abstract_txt:stop in 227) [ClassicSimilarity], result of:
            0.031300087 = score(doc=227,freq=1.0), product of:
              0.066688605 = queryWeight, product of:
                1.150233 = boost
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.0077206157 = queryNorm
              0.46934685 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
          0.006194309 = weight(abstract_txt:with in 227) [ClassicSimilarity], result of:
            0.006194309 = score(doc=227,freq=3.0), product of:
              0.022647304 = queryWeight, product of:
                1.1609911 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0077206157 = queryNorm
              0.27351198 = fieldWeight in 227, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
          0.028184162 = weight(abstract_txt:associated in 227) [ClassicSimilarity], result of:
            0.028184162 = score(doc=227,freq=2.0), product of:
              0.06218582 = queryWeight, product of:
                1.5707992 = boost
                5.1276546 = idf(docFreq=681, maxDocs=42306)
                0.0077206157 = queryNorm
              0.4532249 = fieldWeight in 227, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1276546 = idf(docFreq=681, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
          0.048130307 = weight(abstract_txt:word in 227) [ClassicSimilarity], result of:
            0.048130307 = score(doc=227,freq=4.0), product of:
              0.07051644 = queryWeight, product of:
                1.6727082 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0077206157 = queryNorm
              0.68254024 = fieldWeight in 227, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
          0.06803473 = weight(abstract_txt:coefficient in 227) [ClassicSimilarity], result of:
            0.06803473 = score(doc=227,freq=1.0), product of:
              0.14098895 = queryWeight, product of:
                2.3651981 = boost
                7.7208586 = idf(docFreq=50, maxDocs=42306)
                0.0077206157 = queryNorm
              0.48255366 = fieldWeight in 227, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7208586 = idf(docFreq=50, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
          0.15780468 = weight(abstract_txt:words in 227) [ClassicSimilarity], result of:
            0.15780468 = score(doc=227,freq=3.0), product of:
              0.27190933 = queryWeight, product of:
                6.5692687 = boost
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.0077206157 = queryNorm
              0.58035773 = fieldWeight in 227, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.0625 = fieldNorm(doc=227)
        0.24 = coord(6/25)
    
  3. Moraes, J.B.E. de: Aboutness in fiction : methodological perspectives for knowledge organization (2012) 0.07
    0.07402709 = sum of:
      0.07402709 = product of:
        0.9253387 = sum of:
          0.07970775 = weight(abstract_txt:fiction in 2857) [ClassicSimilarity], result of:
            0.07970775 = score(doc=2857,freq=3.0), product of:
              0.054320183 = queryWeight, product of:
                1.0381033 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.0077206157 = queryNorm
              1.467369 = fieldWeight in 2857, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.125 = fieldNorm(doc=2857)
          0.84563094 = weight(abstract_txt:aboutness in 2857) [ClassicSimilarity], result of:
            0.84563094 = score(doc=2857,freq=1.0), product of:
              0.84123904 = queryWeight, product of:
                13.549274 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0077206157 = queryNorm
              1.0052208 = fieldWeight in 2857, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.125 = fieldNorm(doc=2857)
        0.08 = coord(2/25)
    
  4. Ward, M.; Saarti, J.: Reviewing, rebutting, and reimagining fiction classification (2018) 0.07
    0.07285696 = sum of:
      0.07285696 = product of:
        0.91071206 = sum of:
          0.0650811 = weight(abstract_txt:fiction in 2082) [ClassicSimilarity], result of:
            0.0650811 = score(doc=2082,freq=2.0), product of:
              0.054320183 = queryWeight, product of:
                1.0381033 = boost
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.0077206157 = queryNorm
              1.1981016 = fieldWeight in 2082, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.777487 = idf(docFreq=130, maxDocs=42306)
                0.125 = fieldNorm(doc=2082)
          0.84563094 = weight(abstract_txt:aboutness in 2082) [ClassicSimilarity], result of:
            0.84563094 = score(doc=2082,freq=1.0), product of:
              0.84123904 = queryWeight, product of:
                13.549274 = boost
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.0077206157 = queryNorm
              1.0052208 = fieldWeight in 2082, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.041766 = idf(docFreq=36, maxDocs=42306)
                0.125 = fieldNorm(doc=2082)
        0.08 = coord(2/25)
    
  5. Wilbur, W.J.; Sirotkin, K.: ¬The automatic identification of stop words (1992) 0.07
    0.07230536 = sum of:
      0.07230536 = product of:
        0.3615268 = sum of:
          0.06639751 = weight(abstract_txt:stop in 4853) [ClassicSimilarity], result of:
            0.06639751 = score(doc=4853,freq=2.0), product of:
              0.066688605 = queryWeight, product of:
                1.150233 = boost
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.0077206157 = queryNorm
              0.99563503 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5095496 = idf(docFreq=62, maxDocs=42306)
                0.09375 = fieldNorm(doc=4853)
          0.005364429 = weight(abstract_txt:with in 4853) [ClassicSimilarity], result of:
            0.005364429 = score(doc=4853,freq=1.0), product of:
              0.022647304 = queryWeight, product of:
                1.1609911 = boost
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.0077206157 = queryNorm
              0.23686832 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5265954 = idf(docFreq=9191, maxDocs=42306)
                0.09375 = fieldNorm(doc=4853)
          0.0510499 = weight(abstract_txt:word in 4853) [ClassicSimilarity], result of:
            0.0510499 = score(doc=4853,freq=2.0), product of:
              0.07051644 = queryWeight, product of:
                1.6727082 = boost
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.0077206157 = queryNorm
              0.72394323 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.460322 = idf(docFreq=488, maxDocs=42306)
                0.09375 = fieldNorm(doc=4853)
          0.10205209 = weight(abstract_txt:coefficient in 4853) [ClassicSimilarity], result of:
            0.10205209 = score(doc=4853,freq=1.0), product of:
              0.14098895 = queryWeight, product of:
                2.3651981 = boost
                7.7208586 = idf(docFreq=50, maxDocs=42306)
                0.0077206157 = queryNorm
              0.72383046 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7208586 = idf(docFreq=50, maxDocs=42306)
                0.09375 = fieldNorm(doc=4853)
          0.13666286 = weight(abstract_txt:words in 4853) [ClassicSimilarity], result of:
            0.13666286 = score(doc=4853,freq=1.0), product of:
              0.27190933 = queryWeight, product of:
                6.5692687 = boost
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.0077206157 = queryNorm
              0.50260454 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.09375 = fieldNorm(doc=4853)
        0.2 = coord(5/25)