Document (#40477)

Author
Layfield, C.
Azzopardi, J,
Staff, C.
Title
Experiments with document retrieval from small text collections using Latent Semantic Analysis or term similarity with query coordination and automatic relevance feedback
Source
Semantic keyword-based search on structured data sources: COST Action IC1302. Second International KEYSTONE Conference, IKC 2016, Cluj-Napoca, Romania, September 8-9, 2016, Revised Selected Papers. Eds.: A. Calì, A. et al
Imprint
Springer International Publishing
Year
2017
Pages
S.25-36
Series
Information Systems and Applications, incl. Internet/Web, and HCI; 10151
Abstract
One of the problems faced by users of databases containing textual documents is the difficulty in retrieving relevant results due to the diverse vocabulary used in queries and contained in relevant documents, especially when there are only a small number of relevant documents. This problem is known as the Vocabulary Gap. The PIKES team have constructed a small test collection of 331 articles extracted from a blog and a Gold Standard for 35 queries selected from the blog's search log so the results of different approaches to semantic search can be compared. So far, prior approaches include recognising Named Entities in documents and queries, and relations including temporal relations, and represent them as `semantic layers' in a retrieval system index. In this work, we take two different approaches that do not involve Named Entity Recognition. In the first approach, we process an unannotated version of the PIKES document collection using Latent Semantic Analysis and use a combination of query coordination and automatic relevance feedback with which we outperform prior work. However, this approach is highly dependent on the underlying collection, and is not necessarily scalable to massive collections. In our second approach, we use an LSA Model generated by SEMILAR from a Wikipedia dump to generate a Term Similarity Matrix (TSM). We automatically expand the queries in the PIKES test collection with related terms from the TSM and submit them to a term-by-document matrix derived by indexing the PIKES collection using the Vector Space Model. Coupled with a combination of query coordination and automatic relevance feedback we also outperform prior work with this approach. The advantage of the second approach is that it is independent of the underlying document collection.
Content
Vgl. auch: http://www.keystone-cost.eu/ikc2016/program.php.
Theme
Semantisches Umfeld in Indexierung u. Retrieval
Object
Latent Semantic Analysis

Similar documents (author)

  1. Baillie, M.; Azzopardi, L.; Ruthven, I.: Evaluating epistemic uncertainty under incomplete assessments (2008) 3.65
    3.6509595 = sum of:
      3.6509595 = weight(author_txt:azzopardi in 4063) [ClassicSimilarity], result of:
        3.6509595 = fieldWeight in 4063, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.375 = fieldNorm(doc=4063)
    
  2. Balog, K.; Azzopardi, L.; Rijke, M. de: ¬A language modeling framework for expert finding (2009) 3.65
    3.6509595 = sum of:
      3.6509595 = weight(author_txt:azzopardi in 4445) [ClassicSimilarity], result of:
        3.6509595 = fieldWeight in 4445, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.375 = fieldNorm(doc=4445)
    
  3. Russell-Rose, T.; Chamberlain, J.; Azzopardi, L.: Information retrieval in the workplace : a comparison of professional search practices (2018) 3.65
    3.6509595 = sum of:
      3.6509595 = weight(author_txt:azzopardi in 1334) [ClassicSimilarity], result of:
        3.6509595 = fieldWeight in 1334, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.375 = fieldNorm(doc=1334)
    
  4. Azzopardi, J.; Benedetti, F.; Guerra, F.; Lupu, M.: Back to the sketch-board : integrating keyword search, semantics, and information retrieval (2017) 3.04
    3.0424664 = sum of:
      3.0424664 = weight(author_txt:azzopardi in 482) [ClassicSimilarity], result of:
        3.0424664 = fieldWeight in 482, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.3125 = fieldNorm(doc=482)
    
  5. Ruthven, I.; Baillie, M.; Azzopardi, L.; Bierig, R.; Nicol, E.; Sweeney, S.; Yaciki, M.: Contextual factors affecting the utility of surrogates within exploratory search (2008) 2.43
    2.433973 = sum of:
      2.433973 = weight(author_txt:azzopardi in 4040) [ClassicSimilarity], result of:
        2.433973 = fieldWeight in 4040, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.735892 = idf(docFreq=6, maxDocs=43556)
          0.25 = fieldNorm(doc=4040)
    

Similar documents (content)

  1. Deerwester, S.C.; Dumais, S.T.; Landauer, T.K.; Furnas, G.W.; Harshman, R.A.: Indexing by latent semantic analysis (1990) 0.44
    0.44312936 = sum of:
      0.44312936 = product of:
        0.9231862 = sum of:
          0.06887847 = weight(abstract_txt:combination in 4397) [ClassicSimilarity], result of:
            0.06887847 = score(doc=4397,freq=1.0), product of:
              0.15233861 = queryWeight, product of:
                1.0125377 = boost
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.025996525 = queryNorm
              0.4521406 = fieldWeight in 4397, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.16176449 = weight(abstract_txt:matrix in 4397) [ClassicSimilarity], result of:
            0.16176449 = score(doc=4397,freq=2.0), product of:
              0.21363206 = queryWeight, product of:
                1.1990559 = boost
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.025996525 = queryNorm
              0.75721073 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.026920792 = weight(abstract_txt:from in 4397) [ClassicSimilarity], result of:
            0.026920792 = score(doc=4397,freq=2.0), product of:
              0.08772355 = queryWeight, product of:
                1.2148826 = boost
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.025996525 = queryNorm
              0.30688214 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.053470183 = weight(abstract_txt:relevant in 4397) [ClassicSimilarity], result of:
            0.053470183 = score(doc=4397,freq=1.0), product of:
              0.14729653 = queryWeight, product of:
                1.2194054 = boost
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.025996525 = queryNorm
              0.36301047 = fieldWeight in 4397, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.05957199 = weight(abstract_txt:term in 4397) [ClassicSimilarity], result of:
            0.05957199 = score(doc=4397,freq=1.0), product of:
              0.15829948 = queryWeight, product of:
                1.2641295 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.025996525 = queryNorm
              0.37632462 = fieldWeight in 4397, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.023800546 = weight(abstract_txt:with in 4397) [ClassicSimilarity], result of:
            0.023800546 = score(doc=4397,freq=2.0), product of:
              0.08587024 = queryWeight, product of:
                1.3167042 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.025996525 = queryNorm
              0.27716872 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.10571996 = weight(abstract_txt:automatic in 4397) [ClassicSimilarity], result of:
            0.10571996 = score(doc=4397,freq=2.0), product of:
              0.1841674 = queryWeight, product of:
                1.3635097 = boost
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.025996525 = queryNorm
              0.57404274 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.099448934 = weight(abstract_txt:documents in 4397) [ClassicSimilarity], result of:
            0.099448934 = score(doc=4397,freq=4.0), product of:
              0.15445824 = queryWeight, product of:
                1.4418721 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.025996525 = queryNorm
              0.64385647 = fieldWeight in 4397, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.07926955 = weight(abstract_txt:document in 4397) [ClassicSimilarity], result of:
            0.07926955 = score(doc=4397,freq=2.0), product of:
              0.16729845 = queryWeight, product of:
                1.5006077 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.025996525 = queryNorm
              0.4738212 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.06405444 = weight(abstract_txt:semantic in 4397) [ClassicSimilarity], result of:
            0.06405444 = score(doc=4397,freq=1.0), product of:
              0.18286495 = queryWeight, product of:
                1.5688682 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.025996525 = queryNorm
              0.35028276 = fieldWeight in 4397, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.047094874 = weight(abstract_txt:approach in 4397) [ClassicSimilarity], result of:
            0.047094874 = score(doc=4397,freq=1.0), product of:
              0.16046615 = queryWeight, product of:
                1.6431149 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.025996525 = queryNorm
              0.2934879 = fieldWeight in 4397, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
          0.13319199 = weight(abstract_txt:queries in 4397) [ClassicSimilarity], result of:
            0.13319199 = score(doc=4397,freq=2.0), product of:
              0.23645023 = queryWeight, product of:
                1.7839845 = boost
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.025996525 = queryNorm
              0.5632982 = fieldWeight in 4397, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.078125 = fieldNorm(doc=4397)
        0.48 = coord(12/25)
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.40
    0.39741072 = sum of:
      0.39741072 = product of:
        0.58442754 = sum of:
          0.011795797 = weight(abstract_txt:work in 4460) [ClassicSimilarity], result of:
            0.011795797 = score(doc=4460,freq=1.0), product of:
              0.09905954 = queryWeight, product of:
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.025996525 = queryNorm
              0.11907786 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.027551388 = weight(abstract_txt:combination in 4460) [ClassicSimilarity], result of:
            0.027551388 = score(doc=4460,freq=1.0), product of:
              0.15233861 = queryWeight, product of:
                1.0125377 = boost
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.025996525 = queryNorm
              0.18085624 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.027993204 = weight(abstract_txt:similarity in 4460) [ClassicSimilarity], result of:
            0.027993204 = score(doc=4460,freq=1.0), product of:
              0.1539629 = queryWeight, product of:
                1.0179214 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.025996525 = queryNorm
              0.18181786 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.045753904 = weight(abstract_txt:matrix in 4460) [ClassicSimilarity], result of:
            0.045753904 = score(doc=4460,freq=1.0), product of:
              0.21363206 = queryWeight, product of:
                1.1990559 = boost
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.025996525 = queryNorm
              0.21417153 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.036588352 = weight(abstract_txt:approaches in 4460) [ClassicSimilarity], result of:
            0.036588352 = score(doc=4460,freq=3.0), product of:
              0.14608297 = queryWeight, product of:
                1.2143717 = boost
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.025996525 = queryNorm
              0.25046283 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.0076143495 = weight(abstract_txt:from in 4460) [ClassicSimilarity], result of:
            0.0076143495 = score(doc=4460,freq=1.0), product of:
              0.08772355 = queryWeight, product of:
                1.2148826 = boost
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.025996525 = queryNorm
              0.086799376 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.03704523 = weight(abstract_txt:relevant in 4460) [ClassicSimilarity], result of:
            0.03704523 = score(doc=4460,freq=3.0), product of:
              0.14729653 = queryWeight, product of:
                1.2194054 = boost
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.025996525 = queryNorm
              0.25150102 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.049136095 = weight(abstract_txt:latent in 4460) [ClassicSimilarity], result of:
            0.049136095 = score(doc=4460,freq=1.0), product of:
              0.22403443 = queryWeight, product of:
                1.2279017 = boost
                7.0183635 = idf(docFreq=105, maxDocs=43556)
                0.025996525 = queryNorm
              0.21932386 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0183635 = idf(docFreq=105, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.05085162 = weight(abstract_txt:query in 4460) [ClassicSimilarity], result of:
            0.05085162 = score(doc=4460,freq=5.0), product of:
              0.15344684 = queryWeight, product of:
                1.2446029 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.025996525 = queryNorm
              0.3313957 = fieldWeight in 4460, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.023828795 = weight(abstract_txt:term in 4460) [ClassicSimilarity], result of:
            0.023828795 = score(doc=4460,freq=1.0), product of:
              0.15829948 = queryWeight, product of:
                1.2641295 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.025996525 = queryNorm
              0.15052985 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.009520219 = weight(abstract_txt:with in 4460) [ClassicSimilarity], result of:
            0.009520219 = score(doc=4460,freq=2.0), product of:
              0.08587024 = queryWeight, product of:
                1.3167042 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.025996525 = queryNorm
              0.11086749 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.042287987 = weight(abstract_txt:automatic in 4460) [ClassicSimilarity], result of:
            0.042287987 = score(doc=4460,freq=2.0), product of:
              0.1841674 = queryWeight, product of:
                1.3635097 = boost
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.025996525 = queryNorm
              0.2296171 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.062897034 = weight(abstract_txt:documents in 4460) [ClassicSimilarity], result of:
            0.062897034 = score(doc=4460,freq=10.0), product of:
              0.15445824 = queryWeight, product of:
                1.4418721 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.025996525 = queryNorm
              0.4072106 = fieldWeight in 4460, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.03170782 = weight(abstract_txt:document in 4460) [ClassicSimilarity], result of:
            0.03170782 = score(doc=4460,freq=2.0), product of:
              0.16729845 = queryWeight, product of:
                1.5006077 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.025996525 = queryNorm
              0.18952848 = fieldWeight in 4460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.044378217 = weight(abstract_txt:semantic in 4460) [ClassicSimilarity], result of:
            0.044378217 = score(doc=4460,freq=3.0), product of:
              0.18286495 = queryWeight, product of:
                1.5688682 = boost
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.025996525 = queryNorm
              0.24268301 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.483619 = idf(docFreq=1336, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.032628283 = weight(abstract_txt:approach in 4460) [ClassicSimilarity], result of:
            0.032628283 = score(doc=4460,freq=3.0), product of:
              0.16046615 = queryWeight, product of:
                1.6431149 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.025996525 = queryNorm
              0.20333438 = fieldWeight in 4460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
          0.042849224 = weight(abstract_txt:collection in 4460) [ClassicSimilarity], result of:
            0.042849224 = score(doc=4460,freq=1.0), product of:
              0.2949285 = queryWeight, product of:
                2.440199 = boost
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.025996525 = queryNorm
              0.14528683 = fieldWeight in 4460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.03125 = fieldNorm(doc=4460)
        0.68 = coord(17/25)
    
  3. Crouch, C.J.; Crouch, D.B.; Chen, Q.; Holtz, S.J.: Improving the retrieval effectiveness of very short queries (2002) 0.38
    0.37900314 = sum of:
      0.37900314 = product of:
        0.7895899 = sum of:
          0.023591595 = weight(abstract_txt:work in 3570) [ClassicSimilarity], result of:
            0.023591595 = score(doc=3570,freq=1.0), product of:
              0.09905954 = queryWeight, product of:
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.025996525 = queryNorm
              0.23815572 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.021536633 = weight(abstract_txt:from in 3570) [ClassicSimilarity], result of:
            0.021536633 = score(doc=3570,freq=2.0), product of:
              0.08772355 = queryWeight, product of:
                1.2148826 = boost
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.025996525 = queryNorm
              0.2455057 = fieldWeight in 3570, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.07409046 = weight(abstract_txt:relevant in 3570) [ClassicSimilarity], result of:
            0.07409046 = score(doc=3570,freq=3.0), product of:
              0.14729653 = queryWeight, product of:
                1.2194054 = boost
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.025996525 = queryNorm
              0.50300205 = fieldWeight in 3570, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.06432278 = weight(abstract_txt:query in 3570) [ClassicSimilarity], result of:
            0.06432278 = score(doc=3570,freq=2.0), product of:
              0.15344684 = queryWeight, product of:
                1.2446029 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.025996525 = queryNorm
              0.41918606 = fieldWeight in 3570, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.013463623 = weight(abstract_txt:with in 3570) [ClassicSimilarity], result of:
            0.013463623 = score(doc=3570,freq=1.0), product of:
              0.08587024 = queryWeight, product of:
                1.3167042 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.025996525 = queryNorm
              0.15679032 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.059804242 = weight(abstract_txt:automatic in 3570) [ClassicSimilarity], result of:
            0.059804242 = score(doc=3570,freq=1.0), product of:
              0.1841674 = queryWeight, product of:
                1.3635097 = boost
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.025996525 = queryNorm
              0.32472762 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.195642 = idf(docFreq=655, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.07955915 = weight(abstract_txt:documents in 3570) [ClassicSimilarity], result of:
            0.07955915 = score(doc=3570,freq=4.0), product of:
              0.15445824 = queryWeight, product of:
                1.4418721 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.025996525 = queryNorm
              0.51508516 = fieldWeight in 3570, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.04484163 = weight(abstract_txt:document in 3570) [ClassicSimilarity], result of:
            0.04484163 = score(doc=3570,freq=1.0), product of:
              0.16729845 = queryWeight, product of:
                1.5006077 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.025996525 = queryNorm
              0.26803374 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.12692386 = weight(abstract_txt:feedback in 3570) [ClassicSimilarity], result of:
            0.12692386 = score(doc=3570,freq=2.0), product of:
              0.24140354 = queryWeight, product of:
                1.5610746 = boost
                5.9484615 = idf(docFreq=308, maxDocs=43556)
                0.025996525 = queryNorm
              0.52577466 = fieldWeight in 3570, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9484615 = idf(docFreq=308, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.065256566 = weight(abstract_txt:approach in 3570) [ClassicSimilarity], result of:
            0.065256566 = score(doc=3570,freq=3.0), product of:
              0.16046615 = queryWeight, product of:
                1.6431149 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.025996525 = queryNorm
              0.40666875 = fieldWeight in 3570, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.13050096 = weight(abstract_txt:queries in 3570) [ClassicSimilarity], result of:
            0.13050096 = score(doc=3570,freq=3.0), product of:
              0.23645023 = queryWeight, product of:
                1.7839845 = boost
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.025996525 = queryNorm
              0.55191725 = fieldWeight in 3570, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
          0.08569845 = weight(abstract_txt:collection in 3570) [ClassicSimilarity], result of:
            0.08569845 = score(doc=3570,freq=1.0), product of:
              0.2949285 = queryWeight, product of:
                2.440199 = boost
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.025996525 = queryNorm
              0.29057366 = fieldWeight in 3570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.0625 = fieldNorm(doc=3570)
        0.48 = coord(12/25)
    
  4. Cai, F.; Wang, S.; Rijke, M.de: Behavior-based personalization in web search (2017) 0.37
    0.36698732 = sum of:
      0.36698732 = product of:
        0.76455694 = sum of:
          0.023591595 = weight(abstract_txt:work in 525) [ClassicSimilarity], result of:
            0.023591595 = score(doc=525,freq=1.0), product of:
              0.09905954 = queryWeight, product of:
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.025996525 = queryNorm
              0.23815572 = fieldWeight in 525, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8104916 = idf(docFreq=2620, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.055102777 = weight(abstract_txt:combination in 525) [ClassicSimilarity], result of:
            0.055102777 = score(doc=525,freq=1.0), product of:
              0.15233861 = queryWeight, product of:
                1.0125377 = boost
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.025996525 = queryNorm
              0.3617125 = fieldWeight in 525, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7874 = idf(docFreq=362, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.09150781 = weight(abstract_txt:matrix in 525) [ClassicSimilarity], result of:
            0.09150781 = score(doc=525,freq=1.0), product of:
              0.21363206 = queryWeight, product of:
                1.1990559 = boost
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.025996525 = queryNorm
              0.42834306 = fieldWeight in 525, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.853489 = idf(docFreq=124, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.04224859 = weight(abstract_txt:approaches in 525) [ClassicSimilarity], result of:
            0.04224859 = score(doc=525,freq=1.0), product of:
              0.14608297 = queryWeight, product of:
                1.2143717 = boost
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.025996525 = queryNorm
              0.28920957 = fieldWeight in 525, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.627353 = idf(docFreq=1157, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.02637688 = weight(abstract_txt:from in 525) [ClassicSimilarity], result of:
            0.02637688 = score(doc=525,freq=3.0), product of:
              0.08772355 = queryWeight, product of:
                1.2148826 = boost
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.025996525 = queryNorm
              0.30068186 = fieldWeight in 525, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.042776145 = weight(abstract_txt:relevant in 525) [ClassicSimilarity], result of:
            0.042776145 = score(doc=525,freq=1.0), product of:
              0.14729653 = queryWeight, product of:
                1.2194054 = boost
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.025996525 = queryNorm
              0.29040837 = fieldWeight in 525, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.646534 = idf(docFreq=1135, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.11141032 = weight(abstract_txt:query in 525) [ClassicSimilarity], result of:
            0.11141032 = score(doc=525,freq=6.0), product of:
              0.15344684 = queryWeight, product of:
                1.2446029 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.025996525 = queryNorm
              0.72605157 = fieldWeight in 525, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.08872161 = weight(abstract_txt:relevance in 525) [ClassicSimilarity], result of:
            0.08872161 = score(doc=525,freq=3.0), product of:
              0.16610038 = queryWeight, product of:
                1.2949028 = boost
                4.934216 = idf(docFreq=851, maxDocs=43556)
                0.025996525 = queryNorm
              0.5341445 = fieldWeight in 525, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.934216 = idf(docFreq=851, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.019040437 = weight(abstract_txt:with in 525) [ClassicSimilarity], result of:
            0.019040437 = score(doc=525,freq=2.0), product of:
              0.08587024 = queryWeight, product of:
                1.3167042 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.025996525 = queryNorm
              0.22173499 = fieldWeight in 525, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.07955915 = weight(abstract_txt:documents in 525) [ClassicSimilarity], result of:
            0.07955915 = score(doc=525,freq=4.0), product of:
              0.15445824 = queryWeight, product of:
                1.4418721 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.025996525 = queryNorm
              0.51508516 = fieldWeight in 525, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.07766798 = weight(abstract_txt:document in 525) [ClassicSimilarity], result of:
            0.07766798 = score(doc=525,freq=3.0), product of:
              0.16729845 = queryWeight, product of:
                1.5006077 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.025996525 = queryNorm
              0.46424806 = fieldWeight in 525, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
          0.10655359 = weight(abstract_txt:queries in 525) [ClassicSimilarity], result of:
            0.10655359 = score(doc=525,freq=2.0), product of:
              0.23645023 = queryWeight, product of:
                1.7839845 = boost
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.025996525 = queryNorm
              0.45063856 = fieldWeight in 525, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.0625 = fieldNorm(doc=525)
        0.48 = coord(12/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.36
    0.3555096 = sum of:
      0.3555096 = product of:
        0.74064505 = sum of:
          0.05598641 = weight(abstract_txt:similarity in 599) [ClassicSimilarity], result of:
            0.05598641 = score(doc=599,freq=1.0), product of:
              0.1539629 = queryWeight, product of:
                1.0179214 = boost
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.025996525 = queryNorm
              0.36363572 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181715 = idf(docFreq=351, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.021536633 = weight(abstract_txt:from in 599) [ClassicSimilarity], result of:
            0.021536633 = score(doc=599,freq=2.0), product of:
              0.08772355 = queryWeight, product of:
                1.2148826 = boost
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.025996525 = queryNorm
              0.2455057 = fieldWeight in 599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.77758 = idf(docFreq=7362, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.06432278 = weight(abstract_txt:query in 599) [ClassicSimilarity], result of:
            0.06432278 = score(doc=599,freq=2.0), product of:
              0.15344684 = queryWeight, product of:
                1.2446029 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.025996525 = queryNorm
              0.41918606 = fieldWeight in 599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.04765759 = weight(abstract_txt:term in 599) [ClassicSimilarity], result of:
            0.04765759 = score(doc=599,freq=1.0), product of:
              0.15829948 = queryWeight, product of:
                1.2641295 = boost
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.025996525 = queryNorm
              0.3010597 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.816955 = idf(docFreq=957, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.05122345 = weight(abstract_txt:relevance in 599) [ClassicSimilarity], result of:
            0.05122345 = score(doc=599,freq=1.0), product of:
              0.16610038 = queryWeight, product of:
                1.2949028 = boost
                4.934216 = idf(docFreq=851, maxDocs=43556)
                0.025996525 = queryNorm
              0.3083885 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.934216 = idf(docFreq=851, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.032979008 = weight(abstract_txt:with in 599) [ClassicSimilarity], result of:
            0.032979008 = score(doc=599,freq=6.0), product of:
              0.08587024 = queryWeight, product of:
                1.3167042 = boost
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.025996525 = queryNorm
              0.3840563 = fieldWeight in 599, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.508645 = idf(docFreq=9634, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.06543601 = weight(abstract_txt:small in 599) [ClassicSimilarity], result of:
            0.06543601 = score(doc=599,freq=1.0), product of:
              0.19555517 = queryWeight, product of:
                1.4050329 = boost
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.025996525 = queryNorm
              0.33461663 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353866 = idf(docFreq=559, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.039779574 = weight(abstract_txt:documents in 599) [ClassicSimilarity], result of:
            0.039779574 = score(doc=599,freq=1.0), product of:
              0.15445824 = queryWeight, product of:
                1.4418721 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.025996525 = queryNorm
              0.25754258 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.10026894 = weight(abstract_txt:document in 599) [ClassicSimilarity], result of:
            0.10026894 = score(doc=599,freq=5.0), product of:
              0.16729845 = queryWeight, product of:
                1.5006077 = boost
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.025996525 = queryNorm
              0.5993417 = fieldWeight in 599, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.28854 = idf(docFreq=1624, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.0376759 = weight(abstract_txt:approach in 599) [ClassicSimilarity], result of:
            0.0376759 = score(doc=599,freq=1.0), product of:
              0.16046615 = queryWeight, product of:
                1.6431149 = boost
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.025996525 = queryNorm
              0.23479033 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7566452 = idf(docFreq=2765, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.07534476 = weight(abstract_txt:queries in 599) [ClassicSimilarity], result of:
            0.07534476 = score(doc=599,freq=1.0), product of:
              0.23645023 = queryWeight, product of:
                1.7839845 = boost
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.025996525 = queryNorm
              0.3186496 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0983934 = idf(docFreq=722, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
          0.14843407 = weight(abstract_txt:collection in 599) [ClassicSimilarity], result of:
            0.14843407 = score(doc=599,freq=3.0), product of:
              0.2949285 = queryWeight, product of:
                2.440199 = boost
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.025996525 = queryNorm
              0.5032883 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6491785 = idf(docFreq=1132, maxDocs=43556)
                0.0625 = fieldNorm(doc=599)
        0.48 = coord(12/25)