Document (#28876)

Author
Breitzman, A.
Title
Automated identification of technologically similar organizations
Source
Journal of the American Society for Information Science and Technology. 56(2005) no.10, p.1015-1023
Year
2005
Abstract
This article introduces and validates a method for identifying technologically similar organizations, industries, or regions by applying the techniques from information science for term similarity to international patent classifications. Several applications of the method are explored, including identifying hidden competitive threats, finding potential acquisition targets, locating university expertise within a technology, identifying competitor strategy shifts, and more. One advantage of the method is that it is size invariant, meaning, for example, that it is possible for a huge corporation to identify smaller firms in its space before they become significant competitors. Another advantage is that technologically similar organizations can be identified on a large scale without any particular knowledge of the technology or business of either source organizations or target organizations.

Similar documents (content)

  1. Liu, D.-R.; Shih, M.-J.: Hybrid-patent classification based on patent-network analysis (2011) 0.12
    0.1203492 = sum of:
      0.1203492 = product of:
        0.501455 = sum of:
          0.18888548 = weight(abstract_txt:patent in 4189) [ClassicSimilarity], result of:
            0.18888548 = score(doc=4189,freq=14.0), product of:
              0.11661632 = queryWeight, product of:
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.016836978 = queryNorm
              1.6197174 = fieldWeight in 4189, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.050856467 = weight(abstract_txt:competitive in 4189) [ClassicSimilarity], result of:
            0.050856467 = score(doc=4189,freq=1.0), product of:
              0.11719266 = queryWeight, product of:
                1.002468 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016836978 = queryNorm
              0.43395606 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.012127091 = weight(abstract_txt:that in 4189) [ClassicSimilarity], result of:
            0.012127091 = score(doc=4189,freq=4.0), product of:
              0.040944397 = queryWeight, product of:
                1.0263091 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016836978 = queryNorm
              0.2961844 = fieldWeight in 4189, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.06618463 = weight(abstract_txt:advantage in 4189) [ClassicSimilarity], result of:
            0.06618463 = score(doc=4189,freq=1.0), product of:
              0.17600206 = queryWeight, product of:
                1.7373775 = boost
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.016836978 = queryNorm
              0.37604466 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.05877569 = weight(abstract_txt:method in 4189) [ClassicSimilarity], result of:
            0.05877569 = score(doc=4189,freq=2.0), product of:
              0.14774016 = queryWeight, product of:
                1.9495313 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016836978 = queryNorm
              0.3978315 = fieldWeight in 4189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
          0.12462563 = weight(abstract_txt:organizations in 4189) [ClassicSimilarity], result of:
            0.12462563 = score(doc=4189,freq=1.0), product of:
              0.3642486 = queryWeight, product of:
                3.9518847 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.016836978 = queryNorm
              0.34214443 = fieldWeight in 4189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=4189)
        0.24 = coord(6/25)
    
  2. Kay, L.; Newman, N.; Youtie, J.; Porter, A.L.; Rafols, I.: Patent overlay mapping : visualizing technological distance (2014) 0.11
    0.11244715 = sum of:
      0.11244715 = product of:
        0.4685298 = sum of:
          0.17487398 = weight(abstract_txt:patent in 1543) [ClassicSimilarity], result of:
            0.17487398 = score(doc=1543,freq=12.0), product of:
              0.11661632 = queryWeight, product of:
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.016836978 = queryNorm
              1.4995669 = fieldWeight in 1543, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
          0.050856467 = weight(abstract_txt:competitive in 1543) [ClassicSimilarity], result of:
            0.050856467 = score(doc=1543,freq=1.0), product of:
              0.11719266 = queryWeight, product of:
                1.002468 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016836978 = queryNorm
              0.43395606 = fieldWeight in 1543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
          0.012127091 = weight(abstract_txt:that in 1543) [ClassicSimilarity], result of:
            0.012127091 = score(doc=1543,freq=4.0), product of:
              0.040944397 = queryWeight, product of:
                1.0263091 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016836978 = queryNorm
              0.2961844 = fieldWeight in 1543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
          0.04156069 = weight(abstract_txt:method in 1543) [ClassicSimilarity], result of:
            0.04156069 = score(doc=1543,freq=1.0), product of:
              0.14774016 = queryWeight, product of:
                1.9495313 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016836978 = queryNorm
              0.28130937 = fieldWeight in 1543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
          0.06448594 = weight(abstract_txt:similar in 1543) [ClassicSimilarity], result of:
            0.06448594 = score(doc=1543,freq=1.0), product of:
              0.19800982 = queryWeight, product of:
                2.2569623 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.016836978 = queryNorm
              0.3256704 = fieldWeight in 1543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
          0.12462563 = weight(abstract_txt:organizations in 1543) [ClassicSimilarity], result of:
            0.12462563 = score(doc=1543,freq=1.0), product of:
              0.3642486 = queryWeight, product of:
                3.9518847 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.016836978 = queryNorm
              0.34214443 = fieldWeight in 1543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1543)
        0.24 = coord(6/25)
    
  3. Pan, S.; Pan, G.; Hsieh, M.H.: A dual-level analysis of the capability development process : a case study of TT&T (2006) 0.10
    0.09566418 = sum of:
      0.09566418 = product of:
        0.39860076 = sum of:
          0.050856467 = weight(abstract_txt:competitive in 212) [ClassicSimilarity], result of:
            0.050856467 = score(doc=212,freq=1.0), product of:
              0.11719266 = queryWeight, product of:
                1.002468 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016836978 = queryNorm
              0.43395606 = fieldWeight in 212, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
          0.008575149 = weight(abstract_txt:that in 212) [ClassicSimilarity], result of:
            0.008575149 = score(doc=212,freq=2.0), product of:
              0.040944397 = queryWeight, product of:
                1.0263091 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016836978 = queryNorm
              0.20943399 = fieldWeight in 212, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
          0.07281422 = weight(abstract_txt:firms in 212) [ClassicSimilarity], result of:
            0.07281422 = score(doc=212,freq=1.0), product of:
              0.14887226 = queryWeight, product of:
                1.1298667 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.016836978 = queryNorm
              0.48910537 = fieldWeight in 212, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
          0.02392301 = weight(abstract_txt:technology in 212) [ClassicSimilarity], result of:
            0.02392301 = score(doc=212,freq=1.0), product of:
              0.089307964 = queryWeight, product of:
                1.2376003 = boost
                4.2859354 = idf(docFreq=1653, maxDocs=44218)
                0.016836978 = queryNorm
              0.26787096 = fieldWeight in 212, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2859354 = idf(docFreq=1653, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
          0.06618463 = weight(abstract_txt:advantage in 212) [ClassicSimilarity], result of:
            0.06618463 = score(doc=212,freq=1.0), product of:
              0.17600206 = queryWeight, product of:
                1.7373775 = boost
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.016836978 = queryNorm
              0.37604466 = fieldWeight in 212, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
          0.17624725 = weight(abstract_txt:organizations in 212) [ClassicSimilarity], result of:
            0.17624725 = score(doc=212,freq=2.0), product of:
              0.3642486 = queryWeight, product of:
                3.9518847 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.016836978 = queryNorm
              0.4838653 = fieldWeight in 212, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=212)
        0.24 = coord(6/25)
    
  4. Allen, C.: Information challenges in the global marketplace (1994) 0.09
    0.087686606 = sum of:
      0.087686606 = product of:
        0.5480413 = sum of:
          0.0762847 = weight(abstract_txt:competitive in 537) [ClassicSimilarity], result of:
            0.0762847 = score(doc=537,freq=1.0), product of:
              0.11719266 = queryWeight, product of:
                1.002468 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.016836978 = queryNorm
              0.6509341 = fieldWeight in 537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.09375 = fieldNorm(doc=537)
          0.009095319 = weight(abstract_txt:that in 537) [ClassicSimilarity], result of:
            0.009095319 = score(doc=537,freq=1.0), product of:
              0.040944397 = queryWeight, product of:
                1.0263091 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016836978 = queryNorm
              0.22213829 = fieldWeight in 537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=537)
          0.1388744 = weight(abstract_txt:competitors in 537) [ClassicSimilarity], result of:
            0.1388744 = score(doc=537,freq=1.0), product of:
              0.17472576 = queryWeight, product of:
                1.224049 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.016836978 = queryNorm
              0.7948135 = fieldWeight in 537, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.09375 = fieldNorm(doc=537)
          0.32378688 = weight(abstract_txt:organizations in 537) [ClassicSimilarity], result of:
            0.32378688 = score(doc=537,freq=3.0), product of:
              0.3642486 = queryWeight, product of:
                3.9518847 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.016836978 = queryNorm
              0.8889173 = fieldWeight in 537, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.09375 = fieldNorm(doc=537)
        0.16 = coord(4/25)
    
  5. Yan, B.; Luo, J.: Filtering patent maps for visualization of diversification paths of inventors and organizations (2017) 0.08
    0.08371923 = sum of:
      0.08371923 = product of:
        0.41859615 = sum of:
          0.1335622 = weight(abstract_txt:patent in 3651) [ClassicSimilarity], result of:
            0.1335622 = score(doc=3651,freq=7.0), product of:
              0.11661632 = queryWeight, product of:
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.016836978 = queryNorm
              1.1453131 = fieldWeight in 3651, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.008575149 = weight(abstract_txt:that in 3651) [ClassicSimilarity], result of:
            0.008575149 = score(doc=3651,freq=2.0), product of:
              0.040944397 = queryWeight, product of:
                1.0263091 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.016836978 = queryNorm
              0.20943399 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.041435868 = weight(abstract_txt:technology in 3651) [ClassicSimilarity], result of:
            0.041435868 = score(doc=3651,freq=3.0), product of:
              0.089307964 = queryWeight, product of:
                1.2376003 = boost
                4.2859354 = idf(docFreq=1653, maxDocs=44218)
                0.016836978 = queryNorm
              0.4639661 = fieldWeight in 3651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2859354 = idf(docFreq=1653, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.05877569 = weight(abstract_txt:method in 3651) [ClassicSimilarity], result of:
            0.05877569 = score(doc=3651,freq=2.0), product of:
              0.14774016 = queryWeight, product of:
                1.9495313 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.016836978 = queryNorm
              0.3978315 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
          0.17624725 = weight(abstract_txt:organizations in 3651) [ClassicSimilarity], result of:
            0.17624725 = score(doc=3651,freq=2.0), product of:
              0.3642486 = queryWeight, product of:
                3.9518847 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.016836978 = queryNorm
              0.4838653 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=3651)
        0.2 = coord(5/25)
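
The score trees above can be recomputed by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas that the listing appears to follow: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord (matched terms divided by query terms). The function names are hypothetical and chosen only for illustration; the input numbers are copied from the listing.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as in ClassicSimilarity (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, query_term_count):
    """Sum of the matching term scores, scaled by coord = matched / total."""
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


if __name__ == "__main__":
    # "patent" in doc 4189 (result 1); the listing reports 0.18888548.
    patent = term_score(freq=14.0, doc_freq=117, max_docs=44218,
                        query_norm=0.016836978, field_norm=0.0625)
    print(f"patent in 4189: {patent:.6f}")

    # Result 5 (doc 3651): five term scores from the listing, coord(5/25).
    # The listing reports 0.08371923 for the document.
    total = document_score(
        [0.1335622, 0.008575149, 0.041435868, 0.05877569, 0.17624725], 25)
    print(f"doc 3651: {total:.6f}")
```

Lucene computes these values in single precision, so a recomputation in double precision may differ from the listing in the last digits. The sketch also makes the ranking behavior visible: result 4 has the largest raw sum (0.5480413) but matches only 4 of 25 query terms, so its coord factor of 0.16 pulls it below the results that match 6 terms.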