Document (#22300)

Author
Kokol, P.
Podgorelec, V.
Zorman, M.
Kokol, T.
Njivar, T.
Title
Computer and natural language texts : a comparison based on long-range correlations
Source
Journal of the American Society for Information Science. 50(1999) no.14, pp.1295-1301
Year
1999
Abstract
'Long-range power law correlation' (LRC) is defined as the maximal propagation distance of the effect of some disturbance within a system, and it is found in many systems that can be represented as strings of symbols. LRC between characters has also been identified in natural language texts. The aim of this article is to show that long-range power law correlations can also be found in computer programs, meaning that some common laws hold for both natural language texts and computer programs. This fact enables one to draw parallels between these 2 different types of human writing, and also enables one to measure the differences between them.
Theme
Computational linguistics

Similar documents (content)

  1. Altmann, E.G.; Cristadoro, G.; Esposti, M.D.: On the origin of long-range correlations in texts (2012) 0.26
    0.25834092 = sum of:
      0.25834092 = product of:
        0.92264616 = sum of:
          0.009783712 = weight(abstract_txt:that in 330) [ClassicSimilarity], result of:
            0.009783712 = score(doc=330,freq=1.0), product of:
              0.052852005 = queryWeight, product of:
                1.0361222 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021527736 = queryNorm
              0.18511525 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.053793095 = weight(abstract_txt:language in 330) [ClassicSimilarity], result of:
            0.053793095 = score(doc=330,freq=1.0), product of:
              0.16464305 = queryWeight, product of:
                1.8287399 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.021527736 = queryNorm
              0.32672557 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.34312102 = weight(abstract_txt:correlations in 330) [ClassicSimilarity], result of:
            0.34312102 = score(doc=330,freq=3.0), product of:
              0.34299392 = queryWeight, product of:
                2.1551516 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.021527736 = queryNorm
              1.0003706 = fieldWeight in 330, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.095406726 = weight(abstract_txt:range in 330) [ClassicSimilarity], result of:
            0.095406726 = score(doc=330,freq=1.0), product of:
              0.24123761 = queryWeight, product of:
                2.2136183 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.021527736 = queryNorm
              0.3954886 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.13630791 = weight(abstract_txt:natural in 330) [ClassicSimilarity], result of:
            0.13630791 = score(doc=330,freq=2.0), product of:
              0.24288261 = queryWeight, product of:
                2.2211528 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.021527736 = queryNorm
              0.561209 = fieldWeight in 330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.15128829 = weight(abstract_txt:long in 330) [ClassicSimilarity], result of:
            0.15128829 = score(doc=330,freq=2.0), product of:
              0.26036698 = queryWeight, product of:
                2.2997105 = boost
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.021527736 = queryNorm
              0.5810579 = fieldWeight in 330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
          0.13294539 = weight(abstract_txt:texts in 330) [ClassicSimilarity], result of:
            0.13294539 = score(doc=330,freq=1.0), product of:
              0.30095938 = queryWeight, product of:
                2.472488 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021527736 = queryNorm
              0.44173864 = fieldWeight in 330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=330)
        0.28 = coord(7/25)
    
  2. Clark, M.; Kim, Y.; Kruschwitz, U.; Song, D.; Albakour, D.; Dignum, S.; Beresi, U.C.; Fasli, M.; Roeck, A De: Automatically structuring domain knowledge from text : an overview of current research (2012) 0.19
    0.1936611 = sum of:
      0.1936611 = product of:
        0.60519093 = sum of:
          0.011740454 = weight(abstract_txt:that in 2738) [ClassicSimilarity], result of:
            0.011740454 = score(doc=2738,freq=1.0), product of:
              0.052852005 = queryWeight, product of:
                1.0361222 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021527736 = queryNorm
              0.22213829 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.04139649 = weight(abstract_txt:some in 2738) [ClassicSimilarity], result of:
            0.04139649 = score(doc=2738,freq=2.0), product of:
              0.08489331 = queryWeight, product of:
                1.072189 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.021527736 = queryNorm
              0.4876296 = fieldWeight in 2738, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.17216928 = weight(abstract_txt:propagation in 2738) [ClassicSimilarity], result of:
            0.17216928 = score(doc=2738,freq=1.0), product of:
              0.21955074 = queryWeight, product of:
                1.219234 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.021527736 = queryNorm
              0.78418905 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.034636326 = weight(abstract_txt:also in 2738) [ClassicSimilarity], result of:
            0.034636326 = score(doc=2738,freq=1.0), product of:
              0.108715825 = queryWeight, product of:
                1.4860268 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.021527736 = queryNorm
              0.31859508 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.0366632 = weight(abstract_txt:between in 2738) [ClassicSimilarity], result of:
            0.0366632 = score(doc=2738,freq=1.0), product of:
              0.112916775 = queryWeight, product of:
                1.5144658 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.021527736 = queryNorm
              0.32469225 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.06455172 = weight(abstract_txt:language in 2738) [ClassicSimilarity], result of:
            0.06455172 = score(doc=2738,freq=1.0), product of:
              0.16464305 = queryWeight, product of:
                1.8287399 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.021527736 = queryNorm
              0.3920707 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.1156611 = weight(abstract_txt:natural in 2738) [ClassicSimilarity], result of:
            0.1156611 = score(doc=2738,freq=1.0), product of:
              0.24288261 = queryWeight, product of:
                2.2211528 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.021527736 = queryNorm
              0.47620165 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
          0.12837237 = weight(abstract_txt:long in 2738) [ClassicSimilarity], result of:
            0.12837237 = score(doc=2738,freq=1.0), product of:
              0.26036698 = queryWeight, product of:
                2.2997105 = boost
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.021527736 = queryNorm
              0.49304396 = fieldWeight in 2738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.09375 = fieldNorm(doc=2738)
        0.32 = coord(8/25)
    
  3. Egghe, L.: The power of power laws and an interpretation of Lotkaian informetric systems as self-similar fractals (2005) 0.16
    0.1630713 = sum of:
      0.1630713 = product of:
        0.50959784 = sum of:
          0.011069006 = weight(abstract_txt:that in 3466) [ClassicSimilarity], result of:
            0.011069006 = score(doc=3466,freq=2.0), product of:
              0.052852005 = queryWeight, product of:
                1.0361222 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021527736 = queryNorm
              0.20943399 = fieldWeight in 3466, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.14358813 = weight(abstract_txt:laws in 3466) [ClassicSimilarity], result of:
            0.14358813 = score(doc=3466,freq=4.0), product of:
              0.16057736 = queryWeight, product of:
                1.0427058 = boost
                7.1535926 = idf(docFreq=93, maxDocs=44218)
                0.021527736 = queryNorm
              0.8941991 = fieldWeight in 3466, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1535926 = idf(docFreq=93, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.019514492 = weight(abstract_txt:some in 3466) [ClassicSimilarity], result of:
            0.019514492 = score(doc=3466,freq=1.0), product of:
              0.08489331 = queryWeight, product of:
                1.072189 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.021527736 = queryNorm
              0.22987078 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.035202913 = weight(abstract_txt:found in 3466) [ClassicSimilarity], result of:
            0.035202913 = score(doc=3466,freq=1.0), product of:
              0.12580204 = queryWeight, product of:
                1.3052042 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.021527736 = queryNorm
              0.27982783 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.039994586 = weight(abstract_txt:also in 3466) [ClassicSimilarity], result of:
            0.039994586 = score(doc=3466,freq=3.0), product of:
              0.108715825 = queryWeight, product of:
                1.4860268 = boost
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.021527736 = queryNorm
              0.36788192 = fieldWeight in 3466, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3983476 = idf(docFreq=4017, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.024442136 = weight(abstract_txt:between in 3466) [ClassicSimilarity], result of:
            0.024442136 = score(doc=3466,freq=1.0), product of:
              0.112916775 = queryWeight, product of:
                1.5144658 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.021527736 = queryNorm
              0.21646151 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.12943032 = weight(abstract_txt:power in 3466) [ClassicSimilarity], result of:
            0.12943032 = score(doc=3466,freq=3.0), product of:
              0.20778725 = queryWeight, product of:
                1.6774286 = boost
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.021527736 = queryNorm
              0.6228983 = fieldWeight in 3466, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.754088 = idf(docFreq=380, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.10635631 = weight(abstract_txt:texts in 3466) [ClassicSimilarity], result of:
            0.10635631 = score(doc=3466,freq=1.0), product of:
              0.30095938 = queryWeight, product of:
                2.472488 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021527736 = queryNorm
              0.3533909 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
        0.32 = coord(8/25)
    
  4. Ucoluk, G.; Toroslu, I.H.: A genetic algorithm approach for verification of the syllable-based text compression technique (1997) 0.15
    0.14956026 = sum of:
      0.14956026 = product of:
        0.62316775 = sum of:
          0.021877045 = weight(abstract_txt:that in 2601) [ClassicSimilarity], result of:
            0.021877045 = score(doc=2601,freq=5.0), product of:
              0.052852005 = queryWeight, product of:
                1.0361222 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021527736 = queryNorm
              0.41393027 = fieldWeight in 2601, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
          0.09594981 = weight(abstract_txt:symbols in 2601) [ClassicSimilarity], result of:
            0.09594981 = score(doc=2601,freq=1.0), product of:
              0.16789898 = queryWeight, product of:
                1.0662122 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.021527736 = queryNorm
              0.5714735 = fieldWeight in 2601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
          0.16704877 = weight(abstract_txt:strings in 2601) [ClassicSimilarity], result of:
            0.16704877 = score(doc=2601,freq=3.0), product of:
              0.16847691 = queryWeight, product of:
                1.0680456 = boost
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.021527736 = queryNorm
              0.9915232 = fieldWeight in 2601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3274393 = idf(docFreq=78, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
          0.15155362 = weight(abstract_txt:maximal in 2601) [ClassicSimilarity], result of:
            0.15155362 = score(doc=2601,freq=1.0), product of:
              0.22771737 = queryWeight, product of:
                1.2417029 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.021527736 = queryNorm
              0.66553384 = fieldWeight in 2601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
          0.053793095 = weight(abstract_txt:language in 2601) [ClassicSimilarity], result of:
            0.053793095 = score(doc=2601,freq=1.0), product of:
              0.16464305 = queryWeight, product of:
                1.8287399 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.021527736 = queryNorm
              0.32672557 = fieldWeight in 2601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
          0.13294539 = weight(abstract_txt:texts in 2601) [ClassicSimilarity], result of:
            0.13294539 = score(doc=2601,freq=1.0), product of:
              0.30095938 = queryWeight, product of:
                2.472488 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021527736 = queryNorm
              0.44173864 = fieldWeight in 2601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=2601)
        0.24 = coord(6/25)
    
  5. Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: A deep network model for paraphrase detection in short text messages (2018) 0.14
    0.13619454 = sum of:
      0.13619454 = product of:
        0.5674773 = sum of:
          0.015653938 = weight(abstract_txt:that in 5043) [ClassicSimilarity], result of:
            0.015653938 = score(doc=5043,freq=4.0), product of:
              0.052852005 = queryWeight, product of:
                1.0361222 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021527736 = queryNorm
              0.2961844 = fieldWeight in 5043, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.09045452 = weight(abstract_txt:enables in 5043) [ClassicSimilarity], result of:
            0.09045452 = score(doc=5043,freq=1.0), product of:
              0.23600551 = queryWeight, product of:
                1.7877042 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.021527736 = queryNorm
              0.38327292 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.06085994 = weight(abstract_txt:language in 5043) [ClassicSimilarity], result of:
            0.06085994 = score(doc=5043,freq=2.0), product of:
              0.16464305 = queryWeight, product of:
                1.8287399 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.021527736 = queryNorm
              0.3696478 = fieldWeight in 5043, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.0771074 = weight(abstract_txt:natural in 5043) [ClassicSimilarity], result of:
            0.0771074 = score(doc=5043,freq=1.0), product of:
              0.24288261 = queryWeight, product of:
                2.2211528 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.021527736 = queryNorm
              0.31746778 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.08558158 = weight(abstract_txt:long in 5043) [ClassicSimilarity], result of:
            0.08558158 = score(doc=5043,freq=1.0), product of:
              0.26036698 = queryWeight, product of:
                2.2997105 = boost
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.021527736 = queryNorm
              0.32869598 = fieldWeight in 5043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2591357 = idf(docFreq=624, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
          0.23781992 = weight(abstract_txt:texts in 5043) [ClassicSimilarity], result of:
            0.23781992 = score(doc=5043,freq=5.0), product of:
              0.30095938 = queryWeight, product of:
                2.472488 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.021527736 = queryNorm
              0.7902061 = fieldWeight in 5043, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0625 = fieldNorm(doc=5043)
        0.24 = coord(6/25)
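
The score trees above follow the usual Lucene ClassicSimilarity pattern: each matching term contributes queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), with tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the sum is then multiplied by coord (matched terms / query terms). The following minimal Python sketch recomputes the score of the first result (doc 330) from the numbers listed in its tree. The function and variable names are illustrative and not part of any Lucene API, and small differences from the listed values are to be expected because Lucene computes in single precision.

```python
import math

# Constants copied from the explain tree of result 1 (doc 330).
QUERY_NORM = 0.021527736
MAX_DOCS = 44218
FIELD_NORM = 0.078125
COORD = 7 / 25  # 7 of the 25 query terms matched

# (term, termFreq in the document, docFreq in the index, query boost)
TERMS_DOC_330 = [
    ("that", 1.0, 11241, 1.0361222),
    ("language", 1.0, 1834, 1.8287399),
    ("correlations", 3.0, 73, 2.1551516),
    ("range", 1.0, 760, 2.2136183),
    ("natural", 2.0, 747, 2.2211528),
    ("long", 2.0, 624, 2.2997105),
    ("texts", 1.0, 420, 2.472488),
]


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_weight(freq: float, doc_freq: int, boost: float) -> float:
    """Weight of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# Sum the per-term weights, then apply the coordination factor.
term_sum = sum(term_weight(f, df, b) for _, f, df, b in TERMS_DOC_330)
score = term_sum * COORD

print(f"sum of term weights: {term_sum:.6f}")
print(f"final score:         {score:.6f}")
```

The printed values should come out close to the 0.92264616 and 0.25834092 shown for the first result. The same calculation applies to the other results once their own fieldNorm, coord and term rows are substituted.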