Document (#35991)

Author
HaCohen-Kerner, Y.
Kass, A.
Peretz, A.
Title
HAADS: a Hebrew Aramaic abbreviation disambiguation system
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.9, S.1923-1932
Year
2010
Abstract
In many languages abbreviations are very common and are widely used in both written and spoken language. However, they are not always explicitly defined and in many cases they are ambiguous. This research presents a process that attempts to solve the problem of abbreviation ambiguity using modern machine learning (ML) techniques. Various baseline features are explored, including context-related methods and statistical methods. The application domain is Jewish Law documents written in Hebrew and Aramaic, which are known to be rich in ambiguous abbreviations. Two research approaches were implemented and tested: general and individual. Our system applied four common ML methods to find a successful integration of the various baseline features. The best result was achieved by the SVM ML method in the individual research, with 98.07% accuracy.

Similar documents (content)

  1. HaCohen-Kerner, Y.; Kass, A.; Peretz, A.: Initialism disambiguation : man versus machine (2013) 0.99
    0.99137264 = sum of:
      0.99137264 = product of:
        1.5490198 = sum of:
          0.06050944 = weight(abstract_txt:accuracy in 1094) [ClassicSimilarity], result of:
            0.06050944 = score(doc=1094,freq=3.0), product of:
              0.09362791 = queryWeight, product of:
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.015682964 = queryNorm
              0.6462757 = fieldWeight in 1094, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.06162573 = weight(abstract_txt:achieved in 1094) [ClassicSimilarity], result of:
            0.06162573 = score(doc=1094,freq=3.0), product of:
              0.09477591 = queryWeight, product of:
                1.006112 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.015682964 = queryNorm
              0.6502257 = fieldWeight in 1094, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.036969796 = weight(abstract_txt:rich in 1094) [ClassicSimilarity], result of:
            0.036969796 = score(doc=1094,freq=1.0), product of:
              0.09722882 = queryWeight, product of:
                1.0190485 = boost
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.015682964 = queryNorm
              0.38023496 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.04552198 = weight(abstract_txt:solve in 1094) [ClassicSimilarity], result of:
            0.04552198 = score(doc=1094,freq=1.0), product of:
              0.11169774 = queryWeight, product of:
                1.0922437 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.015682964 = queryNorm
              0.4075461 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.064930804 = weight(abstract_txt:disambiguation in 1094) [ClassicSimilarity], result of:
            0.064930804 = score(doc=1094,freq=1.0), product of:
              0.1415351 = queryWeight, product of:
                1.2295026 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015682964 = queryNorm
              0.45876116 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.039388757 = weight(abstract_txt:various in 1094) [ClassicSimilarity], result of:
            0.039388757 = score(doc=1094,freq=2.0), product of:
              0.10142503 = queryWeight, product of:
                1.4719224 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.015682964 = queryNorm
              0.3883534 = fieldWeight in 1094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.114333175 = weight(abstract_txt:jewish in 1094) [ClassicSimilarity], result of:
            0.114333175 = score(doc=1094,freq=1.0), product of:
              0.20638517 = queryWeight, product of:
                1.4846927 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.015682964 = queryNorm
              0.55397964 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.075224735 = weight(abstract_txt:features in 1094) [ClassicSimilarity], result of:
            0.075224735 = score(doc=1094,freq=6.0), product of:
              0.1082506 = queryWeight, product of:
                1.5206438 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.015682964 = queryNorm
              0.69491285 = fieldWeight in 1094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.015695274 = weight(abstract_txt:research in 1094) [ClassicSimilarity], result of:
            0.015695274 = score(doc=1094,freq=1.0), product of:
              0.0792106 = queryWeight, product of:
                1.5931242 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.015682964 = queryNorm
              0.19814612 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.036457565 = weight(abstract_txt:common in 1094) [ClassicSimilarity], result of:
            0.036457565 = score(doc=1094,freq=1.0), product of:
              0.12136648 = queryWeight, product of:
                1.6101328 = boost
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.015682964 = queryNorm
              0.3003924 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.06343061 = weight(abstract_txt:written in 1094) [ClassicSimilarity], result of:
            0.06343061 = score(doc=1094,freq=1.0), product of:
              0.17556566 = queryWeight, product of:
                1.9365652 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.015682964 = queryNorm
              0.3612928 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.03512177 = weight(abstract_txt:methods in 1094) [ClassicSimilarity], result of:
            0.03512177 = score(doc=1094,freq=1.0), product of:
              0.13551529 = queryWeight, product of:
                2.0837812 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.015682964 = queryNorm
              0.259172 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.1504789 = weight(abstract_txt:baseline in 1094) [ClassicSimilarity], result of:
            0.1504789 = score(doc=1094,freq=2.0), product of:
              0.24786434 = queryWeight, product of:
                2.3010144 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.015682964 = queryNorm
              0.60710186 = fieldWeight in 1094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.24063633 = weight(abstract_txt:ambiguous in 1094) [ClassicSimilarity], result of:
            0.24063633 = score(doc=1094,freq=3.0), product of:
              0.29610157 = queryWeight, product of:
                2.5149693 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.015682964 = queryNorm
              0.81268173 = fieldWeight in 1094, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.19733383 = weight(abstract_txt:hebrew in 1094) [ClassicSimilarity], result of:
            0.19733383 = score(doc=1094,freq=1.0), product of:
              0.3741462 = queryWeight, product of:
                2.8270469 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.015682964 = queryNorm
              0.5274244 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
          0.31136125 = weight(abstract_txt:abbreviations in 1094) [ClassicSimilarity], result of:
            0.31136125 = score(doc=1094,freq=2.0), product of:
              0.40247604 = queryWeight, product of:
                2.9321241 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.015682964 = queryNorm
              0.7736144 = fieldWeight in 1094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1094)
        0.64 = coord(16/25)
    
  2. Franceschini, F.; Maisano, D.; Mastrogiacomo, L.: ¬A novel approach for estimating the omitted-citation rate of bibliometric databases with an application to the field of bibliometrics (2013) 0.99
    0.99137264 = sum of:
      0.99137264 = product of:
        1.5490198 = sum of:
          0.06050944 = weight(abstract_txt:accuracy in 1097) [ClassicSimilarity], result of:
            0.06050944 = score(doc=1097,freq=3.0), product of:
              0.09362791 = queryWeight, product of:
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.015682964 = queryNorm
              0.6462757 = fieldWeight in 1097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.06162573 = weight(abstract_txt:achieved in 1097) [ClassicSimilarity], result of:
            0.06162573 = score(doc=1097,freq=3.0), product of:
              0.09477591 = queryWeight, product of:
                1.006112 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.015682964 = queryNorm
              0.6502257 = fieldWeight in 1097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.036969796 = weight(abstract_txt:rich in 1097) [ClassicSimilarity], result of:
            0.036969796 = score(doc=1097,freq=1.0), product of:
              0.09722882 = queryWeight, product of:
                1.0190485 = boost
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.015682964 = queryNorm
              0.38023496 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0837593 = idf(docFreq=273, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.04552198 = weight(abstract_txt:solve in 1097) [ClassicSimilarity], result of:
            0.04552198 = score(doc=1097,freq=1.0), product of:
              0.11169774 = queryWeight, product of:
                1.0922437 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.015682964 = queryNorm
              0.4075461 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.064930804 = weight(abstract_txt:disambiguation in 1097) [ClassicSimilarity], result of:
            0.064930804 = score(doc=1097,freq=1.0), product of:
              0.1415351 = queryWeight, product of:
                1.2295026 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015682964 = queryNorm
              0.45876116 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.039388757 = weight(abstract_txt:various in 1097) [ClassicSimilarity], result of:
            0.039388757 = score(doc=1097,freq=2.0), product of:
              0.10142503 = queryWeight, product of:
                1.4719224 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.015682964 = queryNorm
              0.3883534 = fieldWeight in 1097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.114333175 = weight(abstract_txt:jewish in 1097) [ClassicSimilarity], result of:
            0.114333175 = score(doc=1097,freq=1.0), product of:
              0.20638517 = queryWeight, product of:
                1.4846927 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.015682964 = queryNorm
              0.55397964 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.075224735 = weight(abstract_txt:features in 1097) [ClassicSimilarity], result of:
            0.075224735 = score(doc=1097,freq=6.0), product of:
              0.1082506 = queryWeight, product of:
                1.5206438 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.015682964 = queryNorm
              0.69491285 = fieldWeight in 1097, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.015695274 = weight(abstract_txt:research in 1097) [ClassicSimilarity], result of:
            0.015695274 = score(doc=1097,freq=1.0), product of:
              0.0792106 = queryWeight, product of:
                1.5931242 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.015682964 = queryNorm
              0.19814612 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.036457565 = weight(abstract_txt:common in 1097) [ClassicSimilarity], result of:
            0.036457565 = score(doc=1097,freq=1.0), product of:
              0.12136648 = queryWeight, product of:
                1.6101328 = boost
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.015682964 = queryNorm
              0.3003924 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.806278 = idf(docFreq=982, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.06343061 = weight(abstract_txt:written in 1097) [ClassicSimilarity], result of:
            0.06343061 = score(doc=1097,freq=1.0), product of:
              0.17556566 = queryWeight, product of:
                1.9365652 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.015682964 = queryNorm
              0.3612928 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.03512177 = weight(abstract_txt:methods in 1097) [ClassicSimilarity], result of:
            0.03512177 = score(doc=1097,freq=1.0), product of:
              0.13551529 = queryWeight, product of:
                2.0837812 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.015682964 = queryNorm
              0.259172 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.1504789 = weight(abstract_txt:baseline in 1097) [ClassicSimilarity], result of:
            0.1504789 = score(doc=1097,freq=2.0), product of:
              0.24786434 = queryWeight, product of:
                2.3010144 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.015682964 = queryNorm
              0.60710186 = fieldWeight in 1097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.24063633 = weight(abstract_txt:ambiguous in 1097) [ClassicSimilarity], result of:
            0.24063633 = score(doc=1097,freq=3.0), product of:
              0.29610157 = queryWeight, product of:
                2.5149693 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.015682964 = queryNorm
              0.81268173 = fieldWeight in 1097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.19733383 = weight(abstract_txt:hebrew in 1097) [ClassicSimilarity], result of:
            0.19733383 = score(doc=1097,freq=1.0), product of:
              0.3741462 = queryWeight, product of:
                2.8270469 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.015682964 = queryNorm
              0.5274244 = fieldWeight in 1097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
          0.31136125 = weight(abstract_txt:abbreviations in 1097) [ClassicSimilarity], result of:
            0.31136125 = score(doc=1097,freq=2.0), product of:
              0.40247604 = queryWeight, product of:
                2.9321241 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.015682964 = queryNorm
              0.7736144 = fieldWeight in 1097, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1097)
        0.64 = coord(16/25)
    
  3. Terada, A.; Tokunaga, T.; Tanaka, H.: Automatic expansion of abbreviations by using context and character information (2004) 0.25
    0.24711604 = sum of:
      0.24711604 = product of:
        1.2355802 = sum of:
          0.034935143 = weight(abstract_txt:accuracy in 2560) [ClassicSimilarity], result of:
            0.034935143 = score(doc=2560,freq=1.0), product of:
              0.09362791 = queryWeight, product of:
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.015682964 = queryNorm
              0.37312746 = fieldWeight in 2560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=2560)
          0.01734457 = weight(abstract_txt:they in 2560) [ClassicSimilarity], result of:
            0.01734457 = score(doc=2560,freq=1.0), product of:
              0.07396325 = queryWeight, product of:
                1.2569567 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.015682964 = queryNorm
              0.23450254 = fieldWeight in 2560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0625 = fieldNorm(doc=2560)
          0.03512177 = weight(abstract_txt:methods in 2560) [ClassicSimilarity], result of:
            0.03512177 = score(doc=2560,freq=1.0), product of:
              0.13551529 = queryWeight, product of:
                2.0837812 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.015682964 = queryNorm
              0.259172 = fieldWeight in 2560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=2560)
          0.5392936 = weight(abstract_txt:abbreviations in 2560) [ClassicSimilarity], result of:
            0.5392936 = score(doc=2560,freq=6.0), product of:
              0.40247604 = queryWeight, product of:
                2.9321241 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.015682964 = queryNorm
              1.3399396 = fieldWeight in 2560, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=2560)
          0.60888517 = weight(abstract_txt:abbreviation in 2560) [ClassicSimilarity], result of:
            0.60888517 = score(doc=2560,freq=4.0), product of:
              0.49954802 = queryWeight, product of:
                3.2666376 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.015682964 = queryNorm
              1.2188722 = fieldWeight in 2560, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=2560)
        0.2 = coord(5/25)
    
  4. HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.19
    0.18600908 = sum of:
      0.18600908 = product of:
        0.5812784 = sum of:
          0.034935143 = weight(abstract_txt:accuracy in 3706) [ClassicSimilarity], result of:
            0.034935143 = score(doc=3706,freq=1.0), product of:
              0.09362791 = queryWeight, product of:
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.015682964 = queryNorm
              0.37312746 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.012593396 = weight(abstract_txt:system in 3706) [ClassicSimilarity], result of:
            0.012593396 = score(doc=3706,freq=1.0), product of:
              0.059749674 = queryWeight, product of:
                1.1297442 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015682964 = queryNorm
              0.21076928 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.027852057 = weight(abstract_txt:various in 3706) [ClassicSimilarity], result of:
            0.027852057 = score(doc=3706,freq=1.0), product of:
              0.10142503 = queryWeight, product of:
                1.4719224 = boost
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.015682964 = queryNorm
              0.27460733 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3937173 = idf(docFreq=1484, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.114333175 = weight(abstract_txt:jewish in 3706) [ClassicSimilarity], result of:
            0.114333175 = score(doc=3706,freq=1.0), product of:
              0.20638517 = queryWeight, product of:
                1.4846927 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.015682964 = queryNorm
              0.55397964 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.068670474 = weight(abstract_txt:features in 3706) [ClassicSimilarity], result of:
            0.068670474 = score(doc=3706,freq=5.0), product of:
              0.1082506 = queryWeight, product of:
                1.5206438 = boost
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.015682964 = queryNorm
              0.63436574 = fieldWeight in 3706, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5391517 = idf(docFreq=1283, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.015695274 = weight(abstract_txt:research in 3706) [ClassicSimilarity], result of:
            0.015695274 = score(doc=3706,freq=1.0), product of:
              0.0792106 = queryWeight, product of:
                1.5931242 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.015682964 = queryNorm
              0.19814612 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.10986504 = weight(abstract_txt:written in 3706) [ClassicSimilarity], result of:
            0.10986504 = score(doc=3706,freq=3.0), product of:
              0.17556566 = queryWeight, product of:
                1.9365652 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.015682964 = queryNorm
              0.6257775 = fieldWeight in 3706, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
          0.19733383 = weight(abstract_txt:hebrew in 3706) [ClassicSimilarity], result of:
            0.19733383 = score(doc=3706,freq=1.0), product of:
              0.3741462 = queryWeight, product of:
                2.8270469 = boost
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.015682964 = queryNorm
              0.5274244 = fieldWeight in 3706, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.43879 = idf(docFreq=25, maxDocs=44218)
                0.0625 = fieldNorm(doc=3706)
        0.32 = coord(8/25)
    
  5. Humphrey, S.M.; Rogers, W.J.; Kilicoglu, H.; Demner-Fushman, D.; Rindflesch, T.C.: Word sense disambiguation by selecting the best semantic type based on journal descriptor indexing : preliminary experiment (2006) 0.16
    0.15785506 = sum of:
      0.15785506 = product of:
        0.5637681 = sum of:
          0.03983173 = weight(abstract_txt:solve in 4912) [ClassicSimilarity], result of:
            0.03983173 = score(doc=4912,freq=1.0), product of:
              0.11169774 = queryWeight, product of:
                1.0922437 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.015682964 = queryNorm
              0.35660285 = fieldWeight in 4912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.015583533 = weight(abstract_txt:system in 4912) [ClassicSimilarity], result of:
            0.015583533 = score(doc=4912,freq=2.0), product of:
              0.059749674 = queryWeight, product of:
                1.1297442 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.015682964 = queryNorm
              0.26081368 = fieldWeight in 4912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.09109032 = weight(abstract_txt:ambiguity in 4912) [ClassicSimilarity], result of:
            0.09109032 = score(doc=4912,freq=3.0), product of:
              0.13443097 = queryWeight, product of:
                1.1982489 = boost
                7.1535926 = idf(docFreq=93, maxDocs=44218)
                0.015682964 = queryNorm
              0.6775992 = fieldWeight in 4912, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1535926 = idf(docFreq=93, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.08034778 = weight(abstract_txt:disambiguation in 4912) [ClassicSimilarity], result of:
            0.08034778 = score(doc=4912,freq=2.0), product of:
              0.1415351 = queryWeight, product of:
                1.2295026 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.015682964 = queryNorm
              0.567688 = fieldWeight in 4912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.033326734 = weight(abstract_txt:individual in 4912) [ClassicSimilarity], result of:
            0.033326734 = score(doc=4912,freq=1.0), product of:
              0.124957815 = queryWeight, product of:
                1.6337818 = boost
                4.8768706 = idf(docFreq=915, maxDocs=44218)
                0.015682964 = queryNorm
              0.26670387 = fieldWeight in 4912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8768706 = idf(docFreq=915, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.13166903 = weight(abstract_txt:baseline in 4912) [ClassicSimilarity], result of:
            0.13166903 = score(doc=4912,freq=2.0), product of:
              0.24786434 = queryWeight, product of:
                2.3010144 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.015682964 = queryNorm
              0.5312141 = fieldWeight in 4912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
          0.17191891 = weight(abstract_txt:ambiguous in 4912) [ClassicSimilarity], result of:
            0.17191891 = score(doc=4912,freq=2.0), product of:
              0.29610157 = queryWeight, product of:
                2.5149693 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.015682964 = queryNorm
              0.5806079 = fieldWeight in 4912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4912)
        0.28 = coord(7/25)