Document (#29395)

Author
Galvez, C.
Moya-Anegón, F. de
Solana, V.H.
Title
Term conflation methods in information retrieval : non-linguistic and linguistic approaches
Source
Journal of documentation. 61(2005) no.4, S.520-547
Year
2005
Abstract
Purpose - To propose a categorization of the different conflation procedures at the two basic approaches, non-linguistic and linguistic techniques, and to justify the application of normalization methods within the framework of linguistic techniques. Design/methodology/approach - Presents a range of term conflation methods, that can be used in information retrieval. The uniterm and multiterm variants can be considered equivalent units for the purposes of automatic indexing. Stemming algorithms, segmentation rules, association measures and clustering techniques are well evaluated non-linguistic methods, and experiments with these techniques show a wide variety of results. Alternatively, the lemmatisation and the use of syntactic pattern-matching, through equivalence relations represented in finite-state transducers (FST), are emerging methods for the recognition and standardization of terms. Findings - The survey attempts to point out the positive and negative effects of the linguistic approach and its potential as a term conflation method. Originality/value - Outlines the importance of FSTs for the normalization of term variants.
Footnote
Vgl. auch unter: http://www.emeraldinsight.com/10.1108/00220410510607507
Theme
Computerlinguistik

Similar documents (author)

  1. Herrero-Solana, V.; Moya Anegón, F. de: Graphical Table of Contents (GTOC) for library collections : the application of UDC codes for the subject maps (2003) 5.65
    5.6463623 = sum of:
      5.6463623 = sum of:
        1.535351 = weight(author_txt:moya in 2758) [ClassicSimilarity], result of:
          1.535351 = score(doc=2758,freq=1.0), product of:
            0.50110227 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.06133047 = queryNorm
            3.0639474 = fieldWeight in 2758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.375 = fieldNorm(doc=2758)
        1.6069969 = weight(author_txt:anegón in 2758) [ClassicSimilarity], result of:
          1.6069969 = score(doc=2758,freq=1.0), product of:
            0.5165725 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.06133047 = queryNorm
            3.1108837 = fieldWeight in 2758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.375 = fieldNorm(doc=2758)
        2.5040145 = weight(author_txt:solana in 2758) [ClassicSimilarity], result of:
          2.5040145 = score(doc=2758,freq=1.0), product of:
            0.69429785 = queryWeight, product of:
              1.1770902 = boost
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.06133047 = queryNorm
            3.606542 = fieldWeight in 2758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.375 = fieldNorm(doc=2758)
    
  2. Guerrero-Bote, V.P.; Moya Anegón, F. de; Herrero Solana, V.: Document organization using Kohonen's algorithm (2002) 4.71
    4.7053022 = sum of:
      4.7053022 = sum of:
        1.2794591 = weight(author_txt:moya in 2564) [ClassicSimilarity], result of:
          1.2794591 = score(doc=2564,freq=1.0), product of:
            0.50110227 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.06133047 = queryNorm
            2.5532894 = fieldWeight in 2564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.3125 = fieldNorm(doc=2564)
        1.339164 = weight(author_txt:anegón in 2564) [ClassicSimilarity], result of:
          1.339164 = score(doc=2564,freq=1.0), product of:
            0.5165725 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.06133047 = queryNorm
            2.592403 = fieldWeight in 2564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.3125 = fieldNorm(doc=2564)
        2.0866787 = weight(author_txt:solana in 2564) [ClassicSimilarity], result of:
          2.0866787 = score(doc=2564,freq=1.0), product of:
            0.69429785 = queryWeight, product of:
              1.1770902 = boost
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.06133047 = queryNorm
            3.005452 = fieldWeight in 2564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.3125 = fieldNorm(doc=2564)
    
  3. Moya-Anegón, F. de; Vargas-Quesada, B.; Chinchilla-Rodríguez, Z.; Corera-Álvarez, E.; Munoz-Fernández, F.J.; Herrero-Solana, V.; SCImago Group: Visualizing the marrow of science (2007) 2.82
    2.8231812 = sum of:
      2.8231812 = sum of:
        0.7676755 = weight(author_txt:moya in 1313) [ClassicSimilarity], result of:
          0.7676755 = score(doc=1313,freq=1.0), product of:
            0.50110227 = queryWeight, product of:
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.06133047 = queryNorm
            1.5319737 = fieldWeight in 1313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.1705265 = idf(docFreq=33, maxDocs=44218)
              0.1875 = fieldNorm(doc=1313)
        0.80349845 = weight(author_txt:anegón in 1313) [ClassicSimilarity], result of:
          0.80349845 = score(doc=1313,freq=1.0), product of:
            0.5165725 = queryWeight, product of:
              1.0153189 = boost
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.06133047 = queryNorm
            1.5554419 = fieldWeight in 1313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.29569 = idf(docFreq=29, maxDocs=44218)
              0.1875 = fieldNorm(doc=1313)
        1.2520072 = weight(author_txt:solana in 1313) [ClassicSimilarity], result of:
          1.2520072 = score(doc=1313,freq=1.0), product of:
            0.69429785 = queryWeight, product of:
              1.1770902 = boost
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.06133047 = queryNorm
            1.803271 = fieldWeight in 1313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.617446 = idf(docFreq=7, maxDocs=44218)
              0.1875 = fieldNorm(doc=1313)
    
  4. Herrero Solana, V.; Moya Anegon, F. de: Bibliographic displays of Web-based OPACs : multivariate analysis applied to Latin-American catalogues (2001) 2.69
    2.6929107 = sum of:
      2.6929107 = product of:
        4.039366 = sum of:
          1.535351 = weight(author_txt:moya in 6143) [ClassicSimilarity], result of:
            1.535351 = score(doc=6143,freq=1.0), product of:
              0.50110227 = queryWeight, product of:
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.06133047 = queryNorm
              3.0639474 = fieldWeight in 6143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.375 = fieldNorm(doc=6143)
          2.5040145 = weight(author_txt:solana in 6143) [ClassicSimilarity], result of:
            2.5040145 = score(doc=6143,freq=1.0), product of:
              0.69429785 = queryWeight, product of:
                1.1770902 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.06133047 = queryNorm
              3.606542 = fieldWeight in 6143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=6143)
        0.6666667 = coord(2/3)
    
  5. Anegón, F. de Moya -> Moya Anegón, F. de: 2.47
    2.4688616 = sum of:
      2.4688616 = product of:
        3.7032924 = sum of:
          1.8094286 = weight(author_txt:moya in 3455) [ClassicSimilarity], result of:
            1.8094286 = score(doc=3455,freq=2.0), product of:
              0.50110227 = queryWeight, product of:
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.06133047 = queryNorm
              3.6108968 = fieldWeight in 3455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.3125 = fieldNorm(doc=3455)
          1.8938639 = weight(author_txt:anegón in 3455) [ClassicSimilarity], result of:
            1.8938639 = score(doc=3455,freq=2.0), product of:
              0.5165725 = queryWeight, product of:
                1.0153189 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.06133047 = queryNorm
              3.6662114 = fieldWeight in 3455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.3125 = fieldNorm(doc=3455)
        0.6666667 = coord(2/3)
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de: ¬An evaluation of conflation accuracy using finite-state transducers (2006) 0.58
    0.5846599 = sum of:
      0.5846599 = product of:
        1.6240551 = sum of:
          0.021119162 = weight(abstract_txt:retrieval in 5599) [ClassicSimilarity], result of:
            0.021119162 = score(doc=5599,freq=3.0), product of:
              0.044911113 = queryWeight, product of:
                1.1180825 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011558667 = queryNorm
              0.47024357 = fieldWeight in 5599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.01526402 = weight(abstract_txt:approach in 5599) [ClassicSimilarity], result of:
            0.01526402 = score(doc=5599,freq=1.0), product of:
              0.052166186 = queryWeight, product of:
                1.2050123 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011558667 = queryNorm
              0.29260373 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.10550668 = weight(abstract_txt:finite in 5599) [ClassicSimilarity], result of:
            0.10550668 = score(doc=5599,freq=1.0), product of:
              0.15024029 = queryWeight, product of:
                1.4460229 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.011558667 = queryNorm
              0.7022529 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.12292466 = weight(abstract_txt:variants in 5599) [ClassicSimilarity], result of:
            0.12292466 = score(doc=5599,freq=1.0), product of:
              0.2095893 = queryWeight, product of:
                2.415358 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.011558667 = queryNorm
              0.58650255 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.1793722 = weight(abstract_txt:normalization in 5599) [ClassicSimilarity], result of:
            0.1793722 = score(doc=5599,freq=2.0), product of:
              0.21401122 = queryWeight, product of:
                2.4407046 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.011558667 = queryNorm
              0.83814394 = fieldWeight in 5599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.06431023 = weight(abstract_txt:term in 5599) [ClassicSimilarity], result of:
            0.06431023 = score(doc=5599,freq=1.0), product of:
              0.17145091 = queryWeight, product of:
                3.089455 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.011558667 = queryNorm
              0.37509412 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.051792104 = weight(abstract_txt:methods in 5599) [ClassicSimilarity], result of:
            0.051792104 = score(doc=5599,freq=1.0), product of:
              0.15986945 = queryWeight, product of:
                3.3354135 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.011558667 = queryNorm
              0.32396498 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.8628087 = weight(abstract_txt:conflation in 5599) [ClassicSimilarity], result of:
            0.8628087 = score(doc=5599,freq=3.0), product of:
              0.6712058 = queryWeight, product of:
                6.1127944 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.011558667 = queryNorm
              1.2854607 = fieldWeight in 5599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
          0.20095725 = weight(abstract_txt:linguistic in 5599) [ClassicSimilarity], result of:
            0.20095725 = score(doc=5599,freq=1.0), product of:
              0.44160596 = queryWeight, product of:
                6.559163 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.011558667 = queryNorm
              0.45506012 = fieldWeight in 5599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=5599)
        0.36 = coord(9/25)
    
  2. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.30
    0.30427006 = sum of:
      0.30427006 = product of:
        0.95084393 = sum of:
          0.009754524 = weight(abstract_txt:retrieval in 614) [ClassicSimilarity], result of:
            0.009754524 = score(doc=614,freq=1.0), product of:
              0.044911113 = queryWeight, product of:
                1.1180825 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011558667 = queryNorm
              0.21719621 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.012211218 = weight(abstract_txt:approach in 614) [ClassicSimilarity], result of:
            0.012211218 = score(doc=614,freq=1.0), product of:
              0.052166186 = queryWeight, product of:
                1.2050123 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011558667 = queryNorm
              0.234083 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.08568535 = weight(abstract_txt:equivalence in 614) [ClassicSimilarity], result of:
            0.08568535 = score(doc=614,freq=3.0), product of:
              0.105221316 = queryWeight, product of:
                1.210135 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.011558667 = queryNorm
              0.8143345 = fieldWeight in 614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.14619434 = weight(abstract_txt:finite in 614) [ClassicSimilarity], result of:
            0.14619434 = score(doc=614,freq=3.0), product of:
              0.15024029 = queryWeight, product of:
                1.4460229 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.011558667 = queryNorm
              0.97307014 = fieldWeight in 614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.19667946 = weight(abstract_txt:variants in 614) [ClassicSimilarity], result of:
            0.19667946 = score(doc=614,freq=4.0), product of:
              0.2095893 = queryWeight, product of:
                2.415358 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.011558667 = queryNorm
              0.9384041 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.04320865 = weight(abstract_txt:techniques in 614) [ClassicSimilarity], result of:
            0.04320865 = score(doc=614,freq=1.0), product of:
              0.15261841 = queryWeight, product of:
                2.9148445 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.011558667 = queryNorm
              0.2831156 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.05859608 = weight(abstract_txt:methods in 614) [ClassicSimilarity], result of:
            0.05859608 = score(doc=614,freq=2.0), product of:
              0.15986945 = queryWeight, product of:
                3.3354135 = boost
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.011558667 = queryNorm
              0.36652455 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.146752 = idf(docFreq=1900, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.3985143 = weight(abstract_txt:conflation in 614) [ClassicSimilarity], result of:
            0.3985143 = score(doc=614,freq=1.0), product of:
              0.6712058 = queryWeight, product of:
                6.1127944 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.011558667 = queryNorm
              0.5937289 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
        0.32 = coord(8/25)
    
  3. Mustafa, S.H.; AI-Radaideh, Q.A.: Using n-grams for Arabic text searching (2004) 0.23
    0.23165773 = sum of:
      0.23165773 = product of:
        1.1582886 = sum of:
          0.017243722 = weight(abstract_txt:retrieval in 2888) [ClassicSimilarity], result of:
            0.017243722 = score(doc=2888,freq=2.0), product of:
              0.044911113 = queryWeight, product of:
                1.1180825 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011558667 = queryNorm
              0.38395226 = fieldWeight in 2888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2888)
          0.02643806 = weight(abstract_txt:approach in 2888) [ClassicSimilarity], result of:
            0.02643806 = score(doc=2888,freq=3.0), product of:
              0.052166186 = queryWeight, product of:
                1.2050123 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011558667 = queryNorm
              0.5068045 = fieldWeight in 2888, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=2888)
          0.054010816 = weight(abstract_txt:techniques in 2888) [ClassicSimilarity], result of:
            0.054010816 = score(doc=2888,freq=1.0), product of:
              0.15261841 = queryWeight, product of:
                2.9148445 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.011558667 = queryNorm
              0.3538945 = fieldWeight in 2888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=2888)
          0.06431023 = weight(abstract_txt:term in 2888) [ClassicSimilarity], result of:
            0.06431023 = score(doc=2888,freq=1.0), product of:
              0.17145091 = queryWeight, product of:
                3.089455 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.011558667 = queryNorm
              0.37509412 = fieldWeight in 2888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=2888)
          0.9962858 = weight(abstract_txt:conflation in 2888) [ClassicSimilarity], result of:
            0.9962858 = score(doc=2888,freq=4.0), product of:
              0.6712058 = queryWeight, product of:
                6.1127944 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.011558667 = queryNorm
              1.4843223 = fieldWeight in 2888, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.078125 = fieldNorm(doc=2888)
        0.2 = coord(5/25)
    
  4. Willett, P.: Best-match text retrieval (1993) 0.16
    0.1609362 = sum of:
      0.1609362 = product of:
        1.0058513 = sum of:
          0.019509047 = weight(abstract_txt:retrieval in 7818) [ClassicSimilarity], result of:
            0.019509047 = score(doc=7818,freq=1.0), product of:
              0.044911113 = queryWeight, product of:
                1.1180825 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011558667 = queryNorm
              0.43439242 = fieldWeight in 7818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=7818)
          0.0864173 = weight(abstract_txt:techniques in 7818) [ClassicSimilarity], result of:
            0.0864173 = score(doc=7818,freq=1.0), product of:
              0.15261841 = queryWeight, product of:
                2.9148445 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.011558667 = queryNorm
              0.5662312 = fieldWeight in 7818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.125 = fieldNorm(doc=7818)
          0.10289636 = weight(abstract_txt:term in 7818) [ClassicSimilarity], result of:
            0.10289636 = score(doc=7818,freq=1.0), product of:
              0.17145091 = queryWeight, product of:
                3.089455 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.011558667 = queryNorm
              0.6001506 = fieldWeight in 7818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.125 = fieldNorm(doc=7818)
          0.7970286 = weight(abstract_txt:conflation in 7818) [ClassicSimilarity], result of:
            0.7970286 = score(doc=7818,freq=1.0), product of:
              0.6712058 = queryWeight, product of:
                6.1127944 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.011558667 = queryNorm
              1.1874578 = fieldWeight in 7818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.125 = fieldNorm(doc=7818)
        0.16 = coord(4/25)
    
  5. Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation (1996) 0.15
    0.15105239 = sum of:
      0.15105239 = product of:
        0.62938493 = sum of:
          0.04845877 = weight(abstract_txt:syntactic in 5578) [ClassicSimilarity], result of:
            0.04845877 = score(doc=5578,freq=1.0), product of:
              0.079200365 = queryWeight, product of:
                1.0498942 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.011558667 = queryNorm
              0.6118503 = fieldWeight in 5578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
          0.018316826 = weight(abstract_txt:approach in 5578) [ClassicSimilarity], result of:
            0.018316826 = score(doc=5578,freq=1.0), product of:
              0.052166186 = queryWeight, product of:
                1.2050123 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.011558667 = queryNorm
              0.3511245 = fieldWeight in 5578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
          0.14750959 = weight(abstract_txt:variants in 5578) [ClassicSimilarity], result of:
            0.14750959 = score(doc=5578,freq=1.0), product of:
              0.2095893 = queryWeight, product of:
                2.415358 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.011558667 = queryNorm
              0.70380306 = fieldWeight in 5578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
          0.06481297 = weight(abstract_txt:techniques in 5578) [ClassicSimilarity], result of:
            0.06481297 = score(doc=5578,freq=1.0), product of:
              0.15261841 = queryWeight, product of:
                2.9148445 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.011558667 = queryNorm
              0.42467338 = fieldWeight in 5578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
          0.10913807 = weight(abstract_txt:term in 5578) [ClassicSimilarity], result of:
            0.10913807 = score(doc=5578,freq=2.0), product of:
              0.17145091 = queryWeight, product of:
                3.089455 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.011558667 = queryNorm
              0.6365558 = fieldWeight in 5578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
          0.2411487 = weight(abstract_txt:linguistic in 5578) [ClassicSimilarity], result of:
            0.2411487 = score(doc=5578,freq=1.0), product of:
              0.44160596 = queryWeight, product of:
                6.559163 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.011558667 = queryNorm
              0.5460721 = fieldWeight in 5578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.09375 = fieldNorm(doc=5578)
        0.24 = coord(6/25)