Document (#37736)

Author
Aerts, D.
Broekaert, J.
Sozzo, S.
Veloz, T.
Title
Meaning-focused and quantum-inspired information retrieval
Source
http://arXiv:1304.0104
Year
2013
Abstract
In recent years, quantum-based methods have promisingly integrated the traditional procedures in information retrieval (IR) and natural language processing (NLP). Inspired by our research on the identification and application of quantum structures in cognition, more specifically our work on the representation of concepts and their combinations, we put forward a 'quantum meaning based' framework for structured query retrieval in text corpora and standardized testing corpora. This scheme for IR rests on considering as basic notions, (i) 'entities of meaning', e.g., concepts and their combinations and (ii) traces of such entities of meaning, which is how documents are considered in this approach. The meaning content of these 'entities of meaning' is reconstructed by solving an 'inverse problem' in the quantum formalism, consisting of reconstructing the full states of the entities of meaning from their collapsed states identified as traces in relevant documents. The advantages with respect to traditional approaches, such as Latent Semantic Analysis (LSA), are discussed by means of concrete examples.

Similar documents (content)

  1. Bawden, D.; Robinson, L.; Siddiqui, T.: "Potentialities or possibilities" : towards quantum information science? (2015) 0.33
    0.33403176 = sum of:
      0.33403176 = product of:
        1.6701587 = sum of:
          0.010991449 = weight(abstract_txt:such in 1659) [ClassicSimilarity], result of:
            0.010991449 = score(doc=1659,freq=1.0), product of:
              0.041070398 = queryWeight, product of:
                1.019499 = boost
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0117599685 = queryNorm
              0.2676246 = fieldWeight in 1659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.078125 = fieldNorm(doc=1659)
          0.06322555 = weight(abstract_txt:concepts in 1659) [ClassicSimilarity], result of:
            0.06322555 = score(doc=1659,freq=6.0), product of:
              0.07256106 = queryWeight, product of:
                1.3551087 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0117599685 = queryNorm
              0.8713427 = fieldWeight in 1659, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.078125 = fieldNorm(doc=1659)
          0.024342764 = weight(abstract_txt:retrieval in 1659) [ClassicSimilarity], result of:
            0.024342764 = score(doc=1659,freq=2.0), product of:
              0.0634005 = queryWeight, product of:
                1.5513661 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0117599685 = queryNorm
              0.38395226 = fieldWeight in 1659, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1659)
          0.16735491 = weight(abstract_txt:meaning in 1659) [ClassicSimilarity], result of:
            0.16735491 = score(doc=1659,freq=1.0), product of:
              0.38306633 = queryWeight, product of:
                5.824965 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0117599685 = queryNorm
              0.43688235 = fieldWeight in 1659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.078125 = fieldNorm(doc=1659)
          1.4042441 = weight(abstract_txt:quantum in 1659) [ClassicSimilarity], result of:
            1.4042441 = score(doc=1659,freq=8.0), product of:
              0.706975 = queryWeight, product of:
                6.6879706 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0117599685 = queryNorm
              1.9862711 = fieldWeight in 1659, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=1659)
        0.2 = coord(5/25)
    
  2. Piwowarski, B.; Amini, M.R.; Lalmas, M.: On using a quantum physics formalism for multidocument summarization (2012) 0.18
    0.17865084 = sum of:
      0.17865084 = product of:
        0.89325416 = sum of:
          0.03745693 = weight(abstract_txt:latent in 236) [ClassicSimilarity], result of:
            0.03745693 = score(doc=236,freq=1.0), product of:
              0.08565981 = queryWeight, product of:
                1.0411083 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0117599685 = queryNorm
              0.43727544 = fieldWeight in 236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0625 = fieldNorm(doc=236)
          0.05782334 = weight(abstract_txt:formalism in 236) [ClassicSimilarity], result of:
            0.05782334 = score(doc=236,freq=1.0), product of:
              0.11441714 = queryWeight, product of:
                1.2032417 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0117599685 = queryNorm
              0.50537306 = fieldWeight in 236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0625 = fieldNorm(doc=236)
          0.021654867 = weight(abstract_txt:documents in 236) [ClassicSimilarity], result of:
            0.021654867 = score(doc=236,freq=2.0), product of:
              0.059446458 = queryWeight, product of:
                1.2265502 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0117599685 = queryNorm
              0.36427513 = fieldWeight in 236, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=236)
          0.08838281 = weight(abstract_txt:inspired in 236) [ClassicSimilarity], result of:
            0.08838281 = score(doc=236,freq=1.0), product of:
              0.19128351 = queryWeight, product of:
                2.2001946 = boost
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0117599685 = queryNorm
              0.4620514 = fieldWeight in 236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3928223 = idf(docFreq=73, maxDocs=44218)
                0.0625 = fieldNorm(doc=236)
          0.68793625 = weight(abstract_txt:quantum in 236) [ClassicSimilarity], result of:
            0.68793625 = score(doc=236,freq=3.0), product of:
              0.706975 = queryWeight, product of:
                6.6879706 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0117599685 = queryNorm
              0.97307014 = fieldWeight in 236, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=236)
        0.2 = coord(5/25)
    
  3. Ungvary, R.: Intensional splitting : an empirical examination of conceptual duality (1986) 0.11
    0.11004118 = sum of:
      0.11004118 = product of:
        0.6877574 = sum of:
          0.010991449 = weight(abstract_txt:such in 5219) [ClassicSimilarity], result of:
            0.010991449 = score(doc=5219,freq=1.0), product of:
              0.041070398 = queryWeight, product of:
                1.019499 = boost
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0117599685 = queryNorm
              0.2676246 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.078125 = fieldNorm(doc=5219)
          0.012935794 = weight(abstract_txt:their in 5219) [ClassicSimilarity], result of:
            0.012935794 = score(doc=5219,freq=1.0), product of:
              0.052406456 = queryWeight, product of:
                1.4104587 = boost
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.0117599685 = queryNorm
              0.24683589 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.078125 = fieldNorm(doc=5219)
          0.16735491 = weight(abstract_txt:meaning in 5219) [ClassicSimilarity], result of:
            0.16735491 = score(doc=5219,freq=1.0), product of:
              0.38306633 = queryWeight, product of:
                5.824965 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0117599685 = queryNorm
              0.43688235 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.078125 = fieldNorm(doc=5219)
          0.49647525 = weight(abstract_txt:quantum in 5219) [ClassicSimilarity], result of:
            0.49647525 = score(doc=5219,freq=1.0), product of:
              0.706975 = queryWeight, product of:
                6.6879706 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0117599685 = queryNorm
              0.7022529 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=5219)
        0.16 = coord(4/25)
    
  4. Andersen, J.; Christensen, F.S.: Wittgenstein and indexing theory (2001) 0.11
    0.1061394 = sum of:
      0.1061394 = product of:
        0.530697 = sum of:
          0.015312303 = weight(abstract_txt:documents in 1590) [ClassicSimilarity], result of:
            0.015312303 = score(doc=1590,freq=1.0), product of:
              0.059446458 = queryWeight, product of:
                1.2265502 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0117599685 = queryNorm
              0.2575814 = fieldWeight in 1590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1590)
          0.07943603 = weight(abstract_txt:rests in 1590) [ClassicSimilarity], result of:
            0.07943603 = score(doc=1590,freq=1.0), product of:
              0.14139497 = queryWeight, product of:
                1.337594 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0117599685 = queryNorm
              0.5618023 = fieldWeight in 1590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1590)
          0.013770348 = weight(abstract_txt:retrieval in 1590) [ClassicSimilarity], result of:
            0.013770348 = score(doc=1590,freq=1.0), product of:
              0.0634005 = queryWeight, product of:
                1.5513661 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0117599685 = queryNorm
              0.21719621 = fieldWeight in 1590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=1590)
          0.12280474 = weight(abstract_txt:entities in 1590) [ClassicSimilarity], result of:
            0.12280474 = score(doc=1590,freq=2.0), product of:
              0.23818207 = queryWeight, product of:
                3.4720972 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0117599685 = queryNorm
              0.51559186 = fieldWeight in 1590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=1590)
          0.29937357 = weight(abstract_txt:meaning in 1590) [ClassicSimilarity], result of:
            0.29937357 = score(doc=1590,freq=5.0), product of:
              0.38306633 = queryWeight, product of:
                5.824965 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0117599685 = queryNorm
              0.7815189 = fieldWeight in 1590, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0625 = fieldNorm(doc=1590)
        0.2 = coord(5/25)
    
  5. Floridi, L.: Information: a very short introduction (2010) 0.10
    0.09855411 = sum of:
      0.09855411 = product of:
        0.6159632 = sum of:
          0.008793158 = weight(abstract_txt:such in 3270) [ClassicSimilarity], result of:
            0.008793158 = score(doc=3270,freq=1.0), product of:
              0.041070398 = queryWeight, product of:
                1.019499 = boost
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0117599685 = queryNorm
              0.21409966 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4255946 = idf(docFreq=3909, maxDocs=44218)
                0.0625 = fieldNorm(doc=3270)
          0.02064938 = weight(abstract_txt:concepts in 3270) [ClassicSimilarity], result of:
            0.02064938 = score(doc=3270,freq=1.0), product of:
              0.07256106 = queryWeight, product of:
                1.3551087 = boost
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0117599685 = queryNorm
              0.28457934 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5532694 = idf(docFreq=1265, maxDocs=44218)
                0.0625 = fieldNorm(doc=3270)
          0.18934046 = weight(abstract_txt:meaning in 3270) [ClassicSimilarity], result of:
            0.18934046 = score(doc=3270,freq=2.0), product of:
              0.38306633 = queryWeight, product of:
                5.824965 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0117599685 = queryNorm
              0.49427593 = fieldWeight in 3270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.0625 = fieldNorm(doc=3270)
          0.3971802 = weight(abstract_txt:quantum in 3270) [ClassicSimilarity], result of:
            0.3971802 = score(doc=3270,freq=1.0), product of:
              0.706975 = queryWeight, product of:
                6.6879706 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0117599685 = queryNorm
              0.5618023 = fieldWeight in 3270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=3270)
        0.16 = coord(4/25)