Document (#37737)

Author
Aerts, D.
Broekaert, J.
Sozzo, S.
Veloz, T.
Title
Meaning-focused and quantum-inspired information retrieval
Source
http://arXiv:1304.0104
Year
2013
Abstract
In recent years, quantum-based methods have promisingly integrated the traditional procedures in information retrieval (IR) and natural language processing (NLP). Inspired by our research on the identification and application of quantum structures in cognition, more specifically our work on the representation of concepts and their combinations, we put forward a 'quantum meaning based' framework for structured query retrieval in text corpora and standardized testing corpora. This scheme for IR rests on considering as basic notions, (i) 'entities of meaning', e.g., concepts and their combinations and (ii) traces of such entities of meaning, which is how documents are considered in this approach. The meaning content of these 'entities of meaning' is reconstructed by solving an 'inverse problem' in the quantum formalism, consisting of reconstructing the full states of the entities of meaning from their collapsed states identified as traces in relevant documents. The advantages with respect to traditional approaches, such as Latent Semantic Analysis (LSA), are discussed by means of concrete examples.

Similar documents (content)

  1. Bawden, D.; Robinson, L.; Siddiqui, T.: "Potentialities or possibilities" : towards quantum information science? (2015) 0.33
    0.33375722 = sum of:
      0.33375722 = product of:
        1.668786 = sum of:
          0.01140187 = weight(abstract_txt:such in 3660) [ClassicSimilarity], result of:
            0.01140187 = score(doc=3660,freq=1.0), product of:
              0.042205915 = queryWeight, product of:
                1.0271959 = boost
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.011882485 = queryNorm
              0.27014863 = fieldWeight in 3660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.078125 = fieldNorm(doc=3660)
          0.06526875 = weight(abstract_txt:concepts in 3660) [ClassicSimilarity], result of:
            0.06526875 = score(doc=3660,freq=6.0), product of:
              0.07432627 = queryWeight, product of:
                1.3631316 = boost
                4.5887804 = idf(docFreq=1168, maxDocs=42306)
                0.011882485 = queryNorm
              0.8781383 = fieldWeight in 3660, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.5887804 = idf(docFreq=1168, maxDocs=42306)
                0.078125 = fieldNorm(doc=3660)
          0.024285872 = weight(abstract_txt:retrieval in 3660) [ClassicSimilarity], result of:
            0.024285872 = score(doc=3660,freq=2.0), product of:
              0.06348125 = queryWeight, product of:
                1.5428901 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.011882485 = queryNorm
              0.38256764 = fieldWeight in 3660, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=3660)
          0.1723913 = weight(abstract_txt:meaning in 3660) [ClassicSimilarity], result of:
            0.1723913 = score(doc=3660,freq=1.0), product of:
              0.391822 = queryWeight, product of:
                5.855245 = boost
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.011882485 = queryNorm
              0.4399735 = fieldWeight in 3660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.078125 = fieldNorm(doc=3660)
          1.3954382 = weight(abstract_txt:quantum in 3660) [ClassicSimilarity], result of:
            1.3954382 = score(doc=3660,freq=8.0), product of:
              0.7060135 = queryWeight, product of:
                6.642677 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.011882485 = queryNorm
              1.9765036 = fieldWeight in 3660, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=3660)
        0.2 = coord(5/25)
    
  2. Piwowarski, B.; Amini, M.R.; Lalmas, M.: On using a quantum physics formalism for multidocument summarization (2012) 0.18
    0.17952882 = sum of:
      0.17952882 = product of:
        0.89764404 = sum of:
          0.038447212 = weight(abstract_txt:latent in 2237) [ClassicSimilarity], result of:
            0.038447212 = score(doc=2237,freq=1.0), product of:
              0.08741028 = queryWeight, product of:
                1.0452806 = boost
                7.037564 = idf(docFreq=100, maxDocs=42306)
                0.011882485 = queryNorm
              0.43984774 = fieldWeight in 2237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.037564 = idf(docFreq=100, maxDocs=42306)
                0.0625 = fieldNorm(doc=2237)
          0.058563128 = weight(abstract_txt:formalism in 2237) [ClassicSimilarity], result of:
            0.058563128 = score(doc=2237,freq=1.0), product of:
              0.11571831 = queryWeight, product of:
                1.2026871 = boost
                8.097336 = idf(docFreq=34, maxDocs=42306)
                0.011882485 = queryNorm
              0.5060835 = fieldWeight in 2237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.097336 = idf(docFreq=34, maxDocs=42306)
                0.0625 = fieldNorm(doc=2237)
          0.021752115 = weight(abstract_txt:documents in 2237) [ClassicSimilarity], result of:
            0.021752115 = score(doc=2237,freq=2.0), product of:
              0.059793446 = queryWeight, product of:
                1.2226254 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.011882485 = queryNorm
              0.36378762 = fieldWeight in 2237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=2237)
          0.09525921 = weight(abstract_txt:inspired in 2237) [ClassicSimilarity], result of:
            0.09525921 = score(doc=2237,freq=1.0), product of:
              0.20165108 = queryWeight, product of:
                2.2452614 = boost
                7.5583396 = idf(docFreq=59, maxDocs=42306)
                0.011882485 = queryNorm
              0.47239622 = fieldWeight in 2237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5583396 = idf(docFreq=59, maxDocs=42306)
                0.0625 = fieldNorm(doc=2237)
          0.68362236 = weight(abstract_txt:quantum in 2237) [ClassicSimilarity], result of:
            0.68362236 = score(doc=2237,freq=3.0), product of:
              0.7060135 = queryWeight, product of:
                6.642677 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.011882485 = queryNorm
              0.9682851 = fieldWeight in 2237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=2237)
        0.2 = coord(5/25)
    
  3. Ungvary, R.: Intensional splitting : an empirical examination of conceptual duality (1986) 0.11
    0.11051509 = sum of:
      0.11051509 = product of:
        0.6907193 = sum of:
          0.01140187 = weight(abstract_txt:such in 5219) [ClassicSimilarity], result of:
            0.01140187 = score(doc=5219,freq=1.0), product of:
              0.042205915 = queryWeight, product of:
                1.0271959 = boost
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.011882485 = queryNorm
              0.27014863 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.078125 = fieldNorm(doc=5219)
          0.01356423 = weight(abstract_txt:their in 5219) [ClassicSimilarity], result of:
            0.01356423 = score(doc=5219,freq=1.0), product of:
              0.054243755 = queryWeight, product of:
                1.426222 = boost
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.011882485 = queryNorm
              0.25006068 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2007766 = idf(docFreq=4683, maxDocs=42306)
                0.078125 = fieldNorm(doc=5219)
          0.1723913 = weight(abstract_txt:meaning in 5219) [ClassicSimilarity], result of:
            0.1723913 = score(doc=5219,freq=1.0), product of:
              0.391822 = queryWeight, product of:
                5.855245 = boost
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.011882485 = queryNorm
              0.4399735 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.078125 = fieldNorm(doc=5219)
          0.49336192 = weight(abstract_txt:quantum in 5219) [ClassicSimilarity], result of:
            0.49336192 = score(doc=5219,freq=1.0), product of:
              0.7060135 = queryWeight, product of:
                6.642677 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.011882485 = queryNorm
              0.69879955 = fieldWeight in 5219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.078125 = fieldNorm(doc=5219)
        0.16 = coord(4/25)
    
  4. Andersen, J.; Christensen, F.S.: Wittgenstein and indexing theory (2001) 0.11
    0.10879674 = sum of:
      0.10879674 = product of:
        0.5439837 = sum of:
          0.015381068 = weight(abstract_txt:documents in 2591) [ClassicSimilarity], result of:
            0.015381068 = score(doc=2591,freq=1.0), product of:
              0.059793446 = queryWeight, product of:
                1.2226254 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.011882485 = queryNorm
              0.2572367 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=2591)
          0.07893791 = weight(abstract_txt:rests in 2591) [ClassicSimilarity], result of:
            0.07893791 = score(doc=2591,freq=1.0), product of:
              0.1412027 = queryWeight, product of:
                1.3285354 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.011882485 = queryNorm
              0.55903965 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=2591)
          0.013738164 = weight(abstract_txt:retrieval in 2591) [ClassicSimilarity], result of:
            0.013738164 = score(doc=2591,freq=1.0), product of:
              0.06348125 = queryWeight, product of:
                1.5428901 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.011882485 = queryNorm
              0.21641295 = fieldWeight in 2591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=2591)
          0.12754358 = weight(abstract_txt:entities in 2591) [ClassicSimilarity], result of:
            0.12754358 = score(doc=2591,freq=2.0), product of:
              0.24496366 = queryWeight, product of:
                3.4997132 = boost
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.011882485 = queryNorm
              0.52066326 = fieldWeight in 2591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8906326 = idf(docFreq=317, maxDocs=42306)
                0.0625 = fieldNorm(doc=2591)
          0.30838296 = weight(abstract_txt:meaning in 2591) [ClassicSimilarity], result of:
            0.30838296 = score(doc=2591,freq=5.0), product of:
              0.391822 = queryWeight, product of:
                5.855245 = boost
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.011882485 = queryNorm
              0.7870486 = fieldWeight in 2591, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.0625 = fieldNorm(doc=2591)
        0.2 = coord(5/25)
    
  5. Floridi, L.: Information: a very short introduction (2010) 0.10
    0.09922659 = sum of:
      0.09922659 = product of:
        0.6201662 = sum of:
          0.009121496 = weight(abstract_txt:such in 189) [ClassicSimilarity], result of:
            0.009121496 = score(doc=189,freq=1.0), product of:
              0.042205915 = queryWeight, product of:
                1.0271959 = boost
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.011882485 = queryNorm
              0.21611892 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4579027 = idf(docFreq=3621, maxDocs=42306)
                0.0625 = fieldNorm(doc=189)
          0.021316683 = weight(abstract_txt:concepts in 189) [ClassicSimilarity], result of:
            0.021316683 = score(doc=189,freq=1.0), product of:
              0.07432627 = queryWeight, product of:
                1.3631316 = boost
                4.5887804 = idf(docFreq=1168, maxDocs=42306)
                0.011882485 = queryNorm
              0.28679878 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5887804 = idf(docFreq=1168, maxDocs=42306)
                0.0625 = fieldNorm(doc=189)
          0.1950385 = weight(abstract_txt:meaning in 189) [ClassicSimilarity], result of:
            0.1950385 = score(doc=189,freq=2.0), product of:
              0.391822 = queryWeight, product of:
                5.855245 = boost
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.011882485 = queryNorm
              0.4977732 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.631661 = idf(docFreq=411, maxDocs=42306)
                0.0625 = fieldNorm(doc=189)
          0.39468953 = weight(abstract_txt:quantum in 189) [ClassicSimilarity], result of:
            0.39468953 = score(doc=189,freq=1.0), product of:
              0.7060135 = queryWeight, product of:
                6.642677 = boost
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.011882485 = queryNorm
              0.55903965 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.944634 = idf(docFreq=14, maxDocs=42306)
                0.0625 = fieldNorm(doc=189)
        0.16 = coord(4/25)