Document (#29268)

Author
Qin, J.
Title
Semantic patterns in bibliographically coupled documents
Source
Encyclopedia of library and information science. Vol.72, [=Suppl.35]
Imprint
New York : Dekker
Year
2002
Pages
S.341-365
Abstract
Different research fields have different definitions for semantic patterns. For knowledge discovery and representation, semantic patterns represent the distribution of occurrences of words in documents and/or citations. In the broadest sense, the term semantic patterns may also refer to the distribution of occurrences of subjects or topics as reflected in documents. The semantic pattern in a set of documents or a group of topics therefore implies quantitative indicators that describe the subject characteristics of the documents being examined. These characteristics are often described by frequencies of keyword occurrences, number of co-occurred keywords, occurrences of coword, and number of cocitations. There are many ways to analyze and derive semantic patterns in documents and citations. A typical example is text mining in full-text documents, a research topic that studies how to extract useful associations and patterns through clustering, categorizing, and summarizing words in texts. One unique way in library and information science is to discover semantic patterns through bibliographically coupled citations. The history of bibliographical coupling goes back in the early 1960s when Kassler investigated associations among technical reports and technical information flow patterns. A number of definitions may facilitate our understanding of bibliographic coupling: (1) bibliographic coupling determines meaningful relations between papers by a study of each paper's bibliography; (2) a unit of coupling is the functional bond between papers when they share a single reference item; (3) coupling strength shows the order of combinations of units of coupling into a graded scale between groups of papers; and (4) a coupling criterion is the way by which the coupling units are combined between two or more papers. Kessler's classic paper an bibliographic coupling between scientific papers proposes the following two graded criteria: Criterion A: A number of papers constitute a related group GA if each member of the group has at least one coupling unit to a given test paper P0. The coupling strength between P0 and any member of GA is measured by the number of coupling units n between them. G(subA)(supn) is that portion of GA that is linked to P0 through n coupling units; Criterion B: A number of papers constitute a related group GB if each member of the group has at least one coupling unit to every other member of the group.
Theme
Informetrie

Similar documents (content)

  1. Hjoerland, B.: Citation analysis : a social and dynamic approach to knowledge organization (2013) 0.16
    0.1591592 = sum of:
      0.1591592 = product of:
        0.5684257 = sum of:
          0.0120111955 = weight(abstract_txt:each in 4711) [ClassicSimilarity], result of:
            0.0120111955 = score(doc=4711,freq=1.0), product of:
              0.046499297 = queryWeight, product of:
                1.058327 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.010630818 = queryNorm
              0.2583092 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.012805893 = weight(abstract_txt:bibliographic in 4711) [ClassicSimilarity], result of:
            0.012805893 = score(doc=4711,freq=1.0), product of:
              0.048528343 = queryWeight, product of:
                1.081171 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.010630818 = queryNorm
              0.2638848 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.016849345 = weight(abstract_txt:between in 4711) [ClassicSimilarity], result of:
            0.016849345 = score(doc=4711,freq=1.0), product of:
              0.077286415 = queryWeight, product of:
                2.0841868 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.010630818 = queryNorm
              0.21801172 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.039868373 = weight(abstract_txt:group in 4711) [ClassicSimilarity], result of:
            0.039868373 = score(doc=4711,freq=1.0), product of:
              0.13036206 = queryWeight, product of:
                2.5060358 = boost
                4.8932486 = idf(docFreq=870, maxDocs=42740)
                0.010630818 = queryNorm
              0.30582803 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8932486 = idf(docFreq=870, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.03613193 = weight(abstract_txt:semantic in 4711) [ClassicSimilarity], result of:
            0.03613193 = score(doc=4711,freq=1.0), product of:
              0.12852134 = queryWeight, product of:
                2.6876497 = boost
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.010630818 = queryNorm
              0.28113565 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4981704 = idf(docFreq=1292, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.059040993 = weight(abstract_txt:papers in 4711) [ClassicSimilarity], result of:
            0.059040993 = score(doc=4711,freq=1.0), product of:
              0.17829955 = queryWeight, product of:
                3.16563 = boost
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.010630818 = queryNorm
              0.33113372 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
          0.391718 = weight(abstract_txt:coupling in 4711) [ClassicSimilarity], result of:
            0.391718 = score(doc=4711,freq=1.0), product of:
              0.7931832 = queryWeight, product of:
                9.442495 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.010630818 = queryNorm
              0.49385566 = fieldWeight in 4711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=4711)
        0.28 = coord(7/25)
    
  2. Castanha, R.C.G.; Wolfram, D.: ¬The domain of knowledge organization : a bibliometric analysis of prolific authors and their intellectual space (2018) 0.15
    0.15131257 = sum of:
      0.15131257 = product of:
        0.54040205 = sum of:
          0.01597306 = weight(abstract_txt:through in 151) [ClassicSimilarity], result of:
            0.01597306 = score(doc=151,freq=2.0), product of:
              0.044631105 = queryWeight, product of:
                1.0368489 = boost
                4.049072 = idf(docFreq=2025, maxDocs=42740)
                0.010630818 = queryNorm
              0.35789075 = fieldWeight in 151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049072 = idf(docFreq=2025, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.0120111955 = weight(abstract_txt:each in 151) [ClassicSimilarity], result of:
            0.0120111955 = score(doc=151,freq=1.0), product of:
              0.046499297 = queryWeight, product of:
                1.058327 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.010630818 = queryNorm
              0.2583092 = fieldWeight in 151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.012805893 = weight(abstract_txt:bibliographic in 151) [ClassicSimilarity], result of:
            0.012805893 = score(doc=151,freq=1.0), product of:
              0.048528343 = queryWeight, product of:
                1.081171 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.010630818 = queryNorm
              0.2638848 = fieldWeight in 151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.03858544 = weight(abstract_txt:constitute in 151) [ClassicSimilarity], result of:
            0.03858544 = score(doc=151,freq=1.0), product of:
              0.08843838 = queryWeight, product of:
                1.1917123 = boost
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.010630818 = queryNorm
              0.43629745 = fieldWeight in 151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.980759 = idf(docFreq=107, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.045479883 = weight(abstract_txt:citations in 151) [ClassicSimilarity], result of:
            0.045479883 = score(doc=151,freq=3.0), product of:
              0.07832397 = queryWeight, product of:
                1.3735485 = boost
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.010630818 = queryNorm
              0.5806637 = fieldWeight in 151, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.023828572 = weight(abstract_txt:between in 151) [ClassicSimilarity], result of:
            0.023828572 = score(doc=151,freq=2.0), product of:
              0.077286415 = queryWeight, product of:
                2.0841868 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.010630818 = queryNorm
              0.30831513 = fieldWeight in 151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
          0.391718 = weight(abstract_txt:coupling in 151) [ClassicSimilarity], result of:
            0.391718 = score(doc=151,freq=1.0), product of:
              0.7931832 = queryWeight, product of:
                9.442495 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.010630818 = queryNorm
              0.49385566 = fieldWeight in 151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=151)
        0.28 = coord(7/25)
    
  3. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 0.14
    0.14127569 = sum of:
      0.14127569 = product of:
        0.58864874 = sum of:
          0.022798654 = weight(abstract_txt:least in 4744) [ClassicSimilarity], result of:
            0.022798654 = score(doc=4744,freq=1.0), product of:
              0.062272735 = queryWeight, product of:
                5.8577557 = idf(docFreq=331, maxDocs=42740)
                0.010630818 = queryNorm
              0.36610973 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8577557 = idf(docFreq=331, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
          0.016986396 = weight(abstract_txt:each in 4744) [ClassicSimilarity], result of:
            0.016986396 = score(doc=4744,freq=2.0), product of:
              0.046499297 = queryWeight, product of:
                1.058327 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.010630818 = queryNorm
              0.36530435 = fieldWeight in 4744, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
          0.012805893 = weight(abstract_txt:bibliographic in 4744) [ClassicSimilarity], result of:
            0.012805893 = score(doc=4744,freq=1.0), product of:
              0.048528343 = queryWeight, product of:
                1.081171 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.010630818 = queryNorm
              0.2638848 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
          0.026257822 = weight(abstract_txt:citations in 4744) [ClassicSimilarity], result of:
            0.026257822 = score(doc=4744,freq=1.0), product of:
              0.07832397 = queryWeight, product of:
                1.3735485 = boost
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.010630818 = queryNorm
              0.33524632 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
          0.11808199 = weight(abstract_txt:papers in 4744) [ClassicSimilarity], result of:
            0.11808199 = score(doc=4744,freq=4.0), product of:
              0.17829955 = queryWeight, product of:
                3.16563 = boost
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.010630818 = queryNorm
              0.66226745 = fieldWeight in 4744, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
          0.391718 = weight(abstract_txt:coupling in 4744) [ClassicSimilarity], result of:
            0.391718 = score(doc=4744,freq=1.0), product of:
              0.7931832 = queryWeight, product of:
                9.442495 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.010630818 = queryNorm
              0.49385566 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=4744)
        0.24 = coord(6/25)
    
  4. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 0.14
    0.14039983 = sum of:
      0.14039983 = product of:
        0.5849993 = sum of:
          0.012805893 = weight(abstract_txt:bibliographic in 4532) [ClassicSimilarity], result of:
            0.012805893 = score(doc=4532,freq=1.0), product of:
              0.048528343 = queryWeight, product of:
                1.081171 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.010630818 = queryNorm
              0.2638848 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
          0.045479883 = weight(abstract_txt:citations in 4532) [ClassicSimilarity], result of:
            0.045479883 = score(doc=4532,freq=3.0), product of:
              0.07832397 = queryWeight, product of:
                1.3735485 = boost
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.010630818 = queryNorm
              0.5806637 = fieldWeight in 4532, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.363941 = idf(docFreq=543, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
          0.023828572 = weight(abstract_txt:between in 4532) [ClassicSimilarity], result of:
            0.023828572 = score(doc=4532,freq=2.0), product of:
              0.077286415 = queryWeight, product of:
                2.0841868 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.010630818 = queryNorm
              0.30831513 = fieldWeight in 4532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
          0.027670445 = weight(abstract_txt:documents in 4532) [ClassicSimilarity], result of:
            0.027670445 = score(doc=4532,freq=1.0), product of:
              0.10757844 = queryWeight, product of:
                2.4589386 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.010630818 = queryNorm
              0.2572118 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
          0.08349657 = weight(abstract_txt:papers in 4532) [ClassicSimilarity], result of:
            0.08349657 = score(doc=4532,freq=2.0), product of:
              0.17829955 = queryWeight, product of:
                3.16563 = boost
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.010630818 = queryNorm
              0.4682938 = fieldWeight in 4532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2981396 = idf(docFreq=580, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
          0.391718 = weight(abstract_txt:coupling in 4532) [ClassicSimilarity], result of:
            0.391718 = score(doc=4532,freq=1.0), product of:
              0.7931832 = queryWeight, product of:
                9.442495 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.010630818 = queryNorm
              0.49385566 = fieldWeight in 4532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=4532)
        0.24 = coord(6/25)
    
  5. Boyack, K.W.; Klavans, R.: Co-citation analysis, bibliographic coupling, and direct citation : which citation approach represents the research front most accurately? (2010) 0.14
    0.13581325 = sum of:
      0.13581325 = product of:
        0.84883285 = sum of:
          0.022798654 = weight(abstract_txt:least in 1112) [ClassicSimilarity], result of:
            0.022798654 = score(doc=1112,freq=1.0), product of:
              0.062272735 = queryWeight, product of:
                5.8577557 = idf(docFreq=331, maxDocs=42740)
                0.010630818 = queryNorm
              0.36610973 = fieldWeight in 1112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8577557 = idf(docFreq=331, maxDocs=42740)
                0.0625 = fieldNorm(doc=1112)
          0.016986396 = weight(abstract_txt:each in 1112) [ClassicSimilarity], result of:
            0.016986396 = score(doc=1112,freq=2.0), product of:
              0.046499297 = queryWeight, product of:
                1.058327 = boost
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.010630818 = queryNorm
              0.36530435 = fieldWeight in 1112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.132947 = idf(docFreq=1862, maxDocs=42740)
                0.0625 = fieldNorm(doc=1112)
          0.025611786 = weight(abstract_txt:bibliographic in 1112) [ClassicSimilarity], result of:
            0.025611786 = score(doc=1112,freq=4.0), product of:
              0.048528343 = queryWeight, product of:
                1.081171 = boost
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.010630818 = queryNorm
              0.5277696 = fieldWeight in 1112, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.222157 = idf(docFreq=1703, maxDocs=42740)
                0.0625 = fieldNorm(doc=1112)
          0.783436 = weight(abstract_txt:coupling in 1112) [ClassicSimilarity], result of:
            0.783436 = score(doc=1112,freq=4.0), product of:
              0.7931832 = queryWeight, product of:
                9.442495 = boost
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.010630818 = queryNorm
              0.9877113 = fieldWeight in 1112, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.9016905 = idf(docFreq=42, maxDocs=42740)
                0.0625 = fieldNorm(doc=1112)
        0.16 = coord(4/25)