Document (#21917)

Author
Zhang, J.
Korfhage, R.R.
Title
¬A distance and angle similarity measure method
Source
Journal of the American Society for Information Science. 50(1999) no.9, S.772-778
Year
1999
Abstract
This article presents a distance and angle similarity measure. The integrated similarity measure takes the strenghts of both the distance and direction of measured documents into account. This article analyzes the features of the similarity measure by comparing it with the traditional distance-based similarity measure and the cosine measure, providing the iso-similarity contour, investigating the impacts of the parameters and variables on the new similarity measure. It also gives the further research issues on the topic

Similar documents (author)

  1. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.12
    4.1191316 = sum of:
      4.1191316 = weight(author_txt:zhang in 7711) [ClassicSimilarity], result of:
        4.1191316 = score(doc=7711,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.15173098 = queryNorm
          4.119132 = fieldWeight in 7711, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.625 = fieldNorm(doc=7711)
    
  2. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.12
    4.1191316 = sum of:
      4.1191316 = weight(author_txt:zhang in 3281) [ClassicSimilarity], result of:
        4.1191316 = score(doc=3281,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.15173098 = queryNorm
          4.119132 = fieldWeight in 3281, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.625 = fieldNorm(doc=3281)
    
  3. Zhang, J.: ¬A representational analysis of relational information displays (1996) 4.12
    4.1191316 = sum of:
      4.1191316 = weight(author_txt:zhang in 6472) [ClassicSimilarity], result of:
        4.1191316 = score(doc=6472,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.15173098 = queryNorm
          4.119132 = fieldWeight in 6472, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.625 = fieldNorm(doc=6472)
    
  4. Zhang, Y.: ¬The impact of Internet-based electronic resources on formal scholarly communication in the area of library and information science : a citation analysis (1998) 4.12
    4.1191316 = sum of:
      4.1191316 = weight(author_txt:zhang in 3809) [ClassicSimilarity], result of:
        4.1191316 = score(doc=3809,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.15173098 = queryNorm
          4.119132 = fieldWeight in 3809, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.625 = fieldNorm(doc=3809)
    
  5. Zhang, Y.: Using the Internet for survey research : a case study (2000) 4.12
    4.1191316 = sum of:
      4.1191316 = weight(author_txt:zhang in 5295) [ClassicSimilarity], result of:
        4.1191316 = score(doc=5295,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.15173098 = queryNorm
          4.119132 = fieldWeight in 5295, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5906115 = idf(docFreq=158, maxDocs=42596)
            0.625 = fieldNorm(doc=5295)
    

Similar documents (content)

  1. Zhang, J.; Korfhage, R.R.: DARE: Distance and Angle Retrieval Environment : A tale of the two measures (1999) 0.32
    0.32056153 = sum of:
      0.32056153 = product of:
        1.3356731 = sum of:
          0.008394504 = weight(abstract_txt:this in 4917) [ClassicSimilarity], result of:
            0.008394504 = score(doc=4917,freq=1.0), product of:
              0.02747357 = queryWeight, product of:
                1.0496877 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.010707425 = queryNorm
              0.30554834 = fieldWeight in 4917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
          0.096100256 = weight(abstract_txt:direction in 4917) [ClassicSimilarity], result of:
            0.096100256 = score(doc=4917,freq=1.0), product of:
              0.110762164 = queryWeight, product of:
                1.490333 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.010707425 = queryNorm
              0.8676271 = fieldWeight in 4917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
          0.03257221 = weight(abstract_txt:article in 4917) [ClassicSimilarity], result of:
            0.03257221 = score(doc=4917,freq=1.0), product of:
              0.06783959 = queryWeight, product of:
                1.6494691 = boost
                3.8410854 = idf(docFreq=2485, maxDocs=42596)
                0.010707425 = queryNorm
              0.48013568 = fieldWeight in 4917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8410854 = idf(docFreq=2485, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
          0.36157262 = weight(abstract_txt:angle in 4917) [ClassicSimilarity], result of:
            0.36157262 = score(doc=4917,freq=1.0), product of:
              0.3375842 = queryWeight, product of:
                3.679541 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.010707425 = queryNorm
              1.0710591 = fieldWeight in 4917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
          0.4418748 = weight(abstract_txt:distance in 4917) [ClassicSimilarity], result of:
            0.4418748 = score(doc=4917,freq=2.0), product of:
              0.3858791 = queryWeight, product of:
                5.5634375 = boost
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.010707425 = queryNorm
              1.145112 = fieldWeight in 4917, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
          0.3951587 = weight(abstract_txt:similarity in 4917) [ClassicSimilarity], result of:
            0.3951587 = score(doc=4917,freq=1.0), product of:
              0.5438204 = queryWeight, product of:
                8.7370405 = boost
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.010707425 = queryNorm
              0.7266346 = fieldWeight in 4917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.125 = fieldNorm(doc=4917)
        0.24 = coord(6/25)
    
  2. Zhang, J.; Wolfram, D.: Visualization of term discrimination analysis (2001) 0.30
    0.29951635 = sum of:
      0.29951635 = product of:
        1.2479848 = sum of:
          0.032473393 = weight(abstract_txt:comparing in 211) [ClassicSimilarity], result of:
            0.032473393 = score(doc=211,freq=1.0), product of:
              0.085299574 = queryWeight, product of:
                1.3078593 = boost
                6.0911713 = idf(docFreq=261, maxDocs=42596)
                0.010707425 = queryNorm
              0.3806982 = fieldWeight in 211, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0911713 = idf(docFreq=261, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
          0.06734446 = weight(abstract_txt:cosine in 211) [ClassicSimilarity], result of:
            0.06734446 = score(doc=211,freq=1.0), product of:
              0.13871698 = queryWeight, product of:
                1.6678325 = boost
                7.7676954 = idf(docFreq=48, maxDocs=42596)
                0.010707425 = queryNorm
              0.48548096 = fieldWeight in 211, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7676954 = idf(docFreq=48, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
          0.25567046 = weight(abstract_txt:angle in 211) [ClassicSimilarity], result of:
            0.25567046 = score(doc=211,freq=2.0), product of:
              0.3375842 = queryWeight, product of:
                3.679541 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.010707425 = queryNorm
              0.7573531 = fieldWeight in 211, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
          0.27059194 = weight(abstract_txt:distance in 211) [ClassicSimilarity], result of:
            0.27059194 = score(doc=211,freq=3.0), product of:
              0.3858791 = queryWeight, product of:
                5.5634375 = boost
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.010707425 = queryNorm
              0.70123506 = fieldWeight in 211, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
          0.27968702 = weight(abstract_txt:measure in 211) [ClassicSimilarity], result of:
            0.27968702 = score(doc=211,freq=3.0), product of:
              0.4753741 = queryWeight, product of:
                8.168727 = boost
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.010707425 = queryNorm
              0.58835137 = fieldWeight in 211, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
          0.34221748 = weight(abstract_txt:similarity in 211) [ClassicSimilarity], result of:
            0.34221748 = score(doc=211,freq=3.0), product of:
              0.5438204 = queryWeight, product of:
                8.7370405 = boost
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.010707425 = queryNorm
              0.629284 = fieldWeight in 211, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.0625 = fieldNorm(doc=211)
        0.24 = coord(6/25)
    
  3. Tudhope, D.; Taylor, C.: Navigation via similarity (1997) 0.25
    0.25413656 = sum of:
      0.25413656 = product of:
        1.0589024 = sum of:
          0.028251387 = weight(abstract_txt:integrated in 1156) [ClassicSimilarity], result of:
            0.028251387 = score(doc=1156,freq=1.0), product of:
              0.06699076 = queryWeight, product of:
                1.159031 = boost
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.010707425 = queryNorm
              0.42172062 = fieldWeight in 1156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.398024 = idf(docFreq=523, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
          0.041276935 = weight(abstract_txt:account in 1156) [ClassicSimilarity], result of:
            0.041276935 = score(doc=1156,freq=2.0), product of:
              0.068462074 = queryWeight, product of:
                1.1716897 = boost
                5.45698 = idf(docFreq=493, maxDocs=42596)
                0.010707425 = queryNorm
              0.60291684 = fieldWeight in 1156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.45698 = idf(docFreq=493, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
          0.039993398 = weight(abstract_txt:takes in 1156) [ClassicSimilarity], result of:
            0.039993398 = score(doc=1156,freq=1.0), product of:
              0.08445926 = queryWeight, product of:
                1.3014013 = boost
                6.061094 = idf(docFreq=269, maxDocs=42596)
                0.010707425 = queryNorm
              0.47352296 = fieldWeight in 1156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.061094 = idf(docFreq=269, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
          0.19528292 = weight(abstract_txt:distance in 1156) [ClassicSimilarity], result of:
            0.19528292 = score(doc=1156,freq=1.0), product of:
              0.3858791 = queryWeight, product of:
                5.5634375 = boost
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.010707425 = queryNorm
              0.5060728 = fieldWeight in 1156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
          0.2018467 = weight(abstract_txt:measure in 1156) [ClassicSimilarity], result of:
            0.2018467 = score(doc=1156,freq=1.0), product of:
              0.4753741 = queryWeight, product of:
                8.168727 = boost
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.010707425 = queryNorm
              0.42460603 = fieldWeight in 1156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
          0.55225104 = weight(abstract_txt:similarity in 1156) [ClassicSimilarity], result of:
            0.55225104 = score(doc=1156,freq=5.0), product of:
              0.5438204 = queryWeight, product of:
                8.7370405 = boost
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.010707425 = queryNorm
              1.0155027 = fieldWeight in 1156, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.078125 = fieldNorm(doc=1156)
        0.24 = coord(6/25)
    
  4. Wolfram, D.; Zhang, J.: ¬An investigation of the influence of indexing exhaustivity and term distributions on a document space (2002) 0.23
    0.22559105 = sum of:
      0.22559105 = product of:
        0.93996274 = sum of:
          0.017481672 = weight(abstract_txt:providing in 239) [ClassicSimilarity], result of:
            0.017481672 = score(doc=239,freq=1.0), product of:
              0.056448236 = queryWeight, product of:
                1.0639293 = boost
                4.9551015 = idf(docFreq=815, maxDocs=42596)
                0.010707425 = queryNorm
              0.30969384 = fieldWeight in 239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9551015 = idf(docFreq=815, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
          0.03716199 = weight(abstract_txt:measured in 239) [ClassicSimilarity], result of:
            0.03716199 = score(doc=239,freq=1.0), product of:
              0.09332423 = queryWeight, product of:
                1.3679959 = boost
                6.3712487 = idf(docFreq=197, maxDocs=42596)
                0.010707425 = queryNorm
              0.39820305 = fieldWeight in 239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3712487 = idf(docFreq=197, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
          0.25567046 = weight(abstract_txt:angle in 239) [ClassicSimilarity], result of:
            0.25567046 = score(doc=239,freq=2.0), product of:
              0.3375842 = queryWeight, product of:
                3.679541 = boost
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.010707425 = queryNorm
              0.7573531 = fieldWeight in 239, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.568473 = idf(docFreq=21, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
          0.27059194 = weight(abstract_txt:distance in 239) [ClassicSimilarity], result of:
            0.27059194 = score(doc=239,freq=3.0), product of:
              0.3858791 = queryWeight, product of:
                5.5634375 = boost
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.010707425 = queryNorm
              0.70123506 = fieldWeight in 239, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
          0.16147736 = weight(abstract_txt:measure in 239) [ClassicSimilarity], result of:
            0.16147736 = score(doc=239,freq=1.0), product of:
              0.4753741 = queryWeight, product of:
                8.168727 = boost
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.010707425 = queryNorm
              0.3396848 = fieldWeight in 239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
          0.19757935 = weight(abstract_txt:similarity in 239) [ClassicSimilarity], result of:
            0.19757935 = score(doc=239,freq=1.0), product of:
              0.5438204 = queryWeight, product of:
                8.7370405 = boost
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.010707425 = queryNorm
              0.3633173 = fieldWeight in 239, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.0625 = fieldNorm(doc=239)
        0.24 = coord(6/25)
    
  5. Shibata, N.; Kajikawa, Y.; Sakata, I.: Measuring relatedness between communities in a citation network (2011) 0.23
    0.22504403 = sum of:
      0.22504403 = product of:
        0.80372864 = sum of:
          0.0052465647 = weight(abstract_txt:this in 485) [ClassicSimilarity], result of:
            0.0052465647 = score(doc=485,freq=1.0), product of:
              0.02747357 = queryWeight, product of:
                1.0496877 = boost
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.010707425 = queryNorm
              0.19096771 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4443867 = idf(docFreq=10047, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.02374522 = weight(abstract_txt:topic in 485) [ClassicSimilarity], result of:
            0.02374522 = score(doc=485,freq=1.0), product of:
              0.05966311 = queryWeight, product of:
                1.0938066 = boost
                5.0942507 = idf(docFreq=709, maxDocs=42596)
                0.010707425 = queryNorm
              0.39798832 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942507 = idf(docFreq=709, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.04645249 = weight(abstract_txt:measured in 485) [ClassicSimilarity], result of:
            0.04645249 = score(doc=485,freq=1.0), product of:
              0.09332423 = queryWeight, product of:
                1.3679959 = boost
                6.3712487 = idf(docFreq=197, maxDocs=42596)
                0.010707425 = queryNorm
              0.4977538 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3712487 = idf(docFreq=197, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.08418057 = weight(abstract_txt:cosine in 485) [ClassicSimilarity], result of:
            0.08418057 = score(doc=485,freq=1.0), product of:
              0.13871698 = queryWeight, product of:
                1.6678325 = boost
                7.7676954 = idf(docFreq=48, maxDocs=42596)
                0.010707425 = queryNorm
              0.6068512 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7676954 = idf(docFreq=48, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.19528292 = weight(abstract_txt:distance in 485) [ClassicSimilarity], result of:
            0.19528292 = score(doc=485,freq=1.0), product of:
              0.3858791 = queryWeight, product of:
                5.5634375 = boost
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.010707425 = queryNorm
              0.5060728 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.477732 = idf(docFreq=177, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.2018467 = weight(abstract_txt:measure in 485) [ClassicSimilarity], result of:
            0.2018467 = score(doc=485,freq=1.0), product of:
              0.4753741 = queryWeight, product of:
                8.168727 = boost
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.010707425 = queryNorm
              0.42460603 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.434957 = idf(docFreq=504, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
          0.24697419 = weight(abstract_txt:similarity in 485) [ClassicSimilarity], result of:
            0.24697419 = score(doc=485,freq=1.0), product of:
              0.5438204 = queryWeight, product of:
                8.7370405 = boost
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.010707425 = queryNorm
              0.45414662 = fieldWeight in 485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813077 = idf(docFreq=345, maxDocs=42596)
                0.078125 = fieldNorm(doc=485)
        0.28 = coord(7/25)