Document (#43922)

Author
Li, G.
Siddharth, L.
Luo, J.
Title
Embedding knowledge graph of patent metadata to measure knowledge proximity
Source
Journal of the Association for Information Science and Technology. 74(2023) no.4, S.476-490
Year
2023
Abstract
Knowledge proximity refers to the strength of association between any two entities in a structural form that embodies certain aspects of a knowledge base. In this work, we operationalize knowledge proximity within the context of the US Patent Database (knowledge base) using a knowledge graph (structural form) named "PatNet" built using patent metadata, including citations, inventors, assignees, and domain classifications. We train various graph embedding models using PatNet to obtain the embeddings of entities and relations. The cosine similarity between the corresponding (or transformed) embeddings of entities denotes the knowledge proximity between these. We compare the embedding models in terms of their performances in predicting target entities and explaining domain expansion profiles of inventors and assignees. We then apply the embeddings of the best-preferred model to associate homogeneous (e.g., patent-patent) and heterogeneous (e.g., inventor-assignee) pairs of entities.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24736.
Field
Patentinformation

Similar documents (content)

  1. Yan, B.; Luo, J.: Measuring technological distance for patent mapping (2017) 0.30
    0.29819655 = sum of:
      0.29819655 = product of:
        0.93186426 = sum of:
          0.07459619 = weight(abstract_txt:inventor in 3351) [ClassicSimilarity], result of:
            0.07459619 = score(doc=3351,freq=1.0), product of:
              0.13374038 = queryWeight, product of:
                1.1659904 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.012852675 = queryNorm
              0.55776864 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.01849408 = weight(abstract_txt:using in 3351) [ClassicSimilarity], result of:
            0.01849408 = score(doc=3351,freq=2.0), product of:
              0.06041856 = queryWeight, product of:
                1.3574051 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.012852675 = queryNorm
              0.30609933 = fieldWeight in 3351, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.013080298 = weight(abstract_txt:between in 3351) [ClassicSimilarity], result of:
            0.013080298 = score(doc=3351,freq=1.0), product of:
              0.060427826 = queryWeight, product of:
                1.3575091 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.012852675 = queryNorm
              0.21646151 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.03752038 = weight(abstract_txt:base in 3351) [ClassicSimilarity], result of:
            0.03752038 = score(doc=3351,freq=1.0), product of:
              0.10657113 = queryWeight, product of:
                1.4719683 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.012852675 = queryNorm
              0.35206887 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.041909035 = weight(abstract_txt:structural in 3351) [ClassicSimilarity], result of:
            0.041909035 = score(doc=3351,freq=1.0), product of:
              0.114727244 = queryWeight, product of:
                1.5272564 = boost
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.012852675 = queryNorm
              0.3652928 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8446846 = idf(docFreq=347, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.03765239 = weight(abstract_txt:knowledge in 3351) [ClassicSimilarity], result of:
            0.03765239 = score(doc=3351,freq=1.0), product of:
              0.1695677 = queryWeight, product of:
                3.7134726 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.012852675 = queryNorm
              0.2220493 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.15723315 = weight(abstract_txt:proximity in 3351) [ClassicSimilarity], result of:
            0.15723315 = score(doc=3351,freq=1.0), product of:
              0.34900704 = queryWeight, product of:
                3.7671313 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.012852675 = queryNorm
              0.4505157 = fieldWeight in 3351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
          0.5513787 = weight(abstract_txt:patent in 3351) [ClassicSimilarity], result of:
            0.5513787 = score(doc=3351,freq=10.0), product of:
              0.4027864 = queryWeight, product of:
                4.524661 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.012852675 = queryNorm
              1.368911 = fieldWeight in 3351, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=3351)
        0.32 = coord(8/25)
    
  2. Jiang, S.; Gao, Q.; Chen, H.; Roco, M.C.: ¬The roles of sharing, transfer, and public funding in nanotechnology knowledge-diffusion networks (2015) 0.23
    0.2331938 = sum of:
      0.2331938 = product of:
        0.9716408 = sum of:
          0.09324524 = weight(abstract_txt:inventor in 1823) [ClassicSimilarity], result of:
            0.09324524 = score(doc=1823,freq=1.0), product of:
              0.13374038 = queryWeight, product of:
                1.1659904 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.012852675 = queryNorm
              0.6972108 = fieldWeight in 1823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
          0.026237978 = weight(abstract_txt:models in 1823) [ClassicSimilarity], result of:
            0.026237978 = score(doc=1823,freq=1.0), product of:
              0.072356075 = queryWeight, product of:
                1.2128754 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.012852675 = queryNorm
              0.362623 = fieldWeight in 1823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
          0.30764017 = weight(abstract_txt:inventors in 1823) [ClassicSimilarity], result of:
            0.30764017 = score(doc=1823,freq=2.0), product of:
              0.29639676 = queryWeight, product of:
                2.4547958 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.012852675 = queryNorm
              1.0379336 = fieldWeight in 1823, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
          0.11176385 = weight(abstract_txt:graph in 1823) [ClassicSimilarity], result of:
            0.11176385 = score(doc=1823,freq=1.0), product of:
              0.21764703 = queryWeight, product of:
                2.5763252 = boost
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.012852675 = queryNorm
              0.51350963 = fieldWeight in 1823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
          0.12452357 = weight(abstract_txt:knowledge in 1823) [ClassicSimilarity], result of:
            0.12452357 = score(doc=1823,freq=7.0), product of:
              0.1695677 = queryWeight, product of:
                3.7134726 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.012852675 = queryNorm
              0.734359 = fieldWeight in 1823, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
          0.30823007 = weight(abstract_txt:patent in 1823) [ClassicSimilarity], result of:
            0.30823007 = score(doc=1823,freq=2.0), product of:
              0.4027864 = queryWeight, product of:
                4.524661 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.012852675 = queryNorm
              0.7652445 = fieldWeight in 1823, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.078125 = fieldNorm(doc=1823)
        0.24 = coord(6/25)
    
  3. Zhu, Y.; Quan, L.; Chen, P.-Y.; Kim, M.C.; Che, C.: Predicting coauthorship using bibliographic network embedding (2023) 0.18
    0.17601074 = sum of:
      0.17601074 = product of:
        0.8800537 = sum of:
          0.01307729 = weight(abstract_txt:using in 917) [ClassicSimilarity], result of:
            0.01307729 = score(doc=917,freq=1.0), product of:
              0.06041856 = queryWeight, product of:
                1.3574051 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.012852675 = queryNorm
              0.21644491 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=917)
          0.08941107 = weight(abstract_txt:graph in 917) [ClassicSimilarity], result of:
            0.08941107 = score(doc=917,freq=1.0), product of:
              0.21764703 = queryWeight, product of:
                2.5763252 = boost
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.012852675 = queryNorm
              0.4108077 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.572923 = idf(docFreq=167, maxDocs=44218)
                0.0625 = fieldNorm(doc=917)
          0.30423796 = weight(abstract_txt:embedding in 917) [ClassicSimilarity], result of:
            0.30423796 = score(doc=917,freq=4.0), product of:
              0.31018025 = queryWeight, product of:
                3.0756106 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.012852675 = queryNorm
              0.9808425 = fieldWeight in 917, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=917)
          0.36916816 = weight(abstract_txt:embeddings in 917) [ClassicSimilarity], result of:
            0.36916816 = score(doc=917,freq=2.0), product of:
              0.44459513 = queryWeight, product of:
                3.6821938 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.012852675 = queryNorm
              0.8303468 = fieldWeight in 917, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=917)
          0.10415919 = weight(abstract_txt:entities in 917) [ClassicSimilarity], result of:
            0.10415919 = score(doc=917,freq=1.0), product of:
              0.28569755 = queryWeight, product of:
                3.810675 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.012852675 = queryNorm
              0.36457852 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=917)
        0.2 = coord(5/25)
    
  4. Li, R.; Chambers, T.; Ding, Y.; Zhang, G.; Meng, L.: Patent citation analysis : calculating science linkage based on citing motivation (2014) 0.15
    0.1471997 = sum of:
      0.1471997 = product of:
        0.91999817 = sum of:
          0.14919238 = weight(abstract_txt:inventor in 1257) [ClassicSimilarity], result of:
            0.14919238 = score(doc=1257,freq=4.0), product of:
              0.13374038 = queryWeight, product of:
                1.1659904 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.012852675 = queryNorm
              1.1155373 = fieldWeight in 1257, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=1257)
          0.031525504 = weight(abstract_txt:domain in 1257) [ClassicSimilarity], result of:
            0.031525504 = score(doc=1257,freq=2.0), product of:
              0.075316966 = queryWeight, product of:
                1.2374426 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.012852675 = queryNorm
              0.41857108 = fieldWeight in 1257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.0625 = fieldNorm(doc=1257)
          0.24611211 = weight(abstract_txt:inventors in 1257) [ClassicSimilarity], result of:
            0.24611211 = score(doc=1257,freq=2.0), product of:
              0.29639676 = queryWeight, product of:
                2.4547958 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.012852675 = queryNorm
              0.8303468 = fieldWeight in 1257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=1257)
          0.49316815 = weight(abstract_txt:patent in 1257) [ClassicSimilarity], result of:
            0.49316815 = score(doc=1257,freq=8.0), product of:
              0.4027864 = queryWeight, product of:
                4.524661 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.012852675 = queryNorm
              1.2243912 = fieldWeight in 1257, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=1257)
        0.16 = coord(4/25)
    
  5. Yan, B.; Luo, J.: Filtering patent maps for visualization of diversification paths of inventors and organizations (2017) 0.13
    0.13302751 = sum of:
      0.13302751 = product of:
        0.831422 = sum of:
          0.105494946 = weight(abstract_txt:inventor in 3651) [ClassicSimilarity], result of:
            0.105494946 = score(doc=3651,freq=2.0), product of:
              0.13374038 = queryWeight, product of:
                1.1659904 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.012852675 = queryNorm
              0.788804 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.018498335 = weight(abstract_txt:between in 3651) [ClassicSimilarity], result of:
            0.018498335 = score(doc=3651,freq=2.0), product of:
              0.060427826 = queryWeight, product of:
                1.3575091 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.012852675 = queryNorm
              0.3061228 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.24611211 = weight(abstract_txt:inventors in 3651) [ClassicSimilarity], result of:
            0.24611211 = score(doc=3651,freq=2.0), product of:
              0.29639676 = queryWeight, product of:
                2.4547958 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.012852675 = queryNorm
              0.8303468 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.46131656 = weight(abstract_txt:patent in 3651) [ClassicSimilarity], result of:
            0.46131656 = score(doc=3651,freq=7.0), product of:
              0.4027864 = queryWeight, product of:
                4.524661 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.012852675 = queryNorm
              1.1453131 = fieldWeight in 3651, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
        0.16 = coord(4/25)