Document (#5450)

Author
Soergel, D.
Title
Mathematical analysis of documentation systems : an attempt to a theory of classification and search request formulation
Source
Information storage and retrieval. 3(1967), S.129-173
Year
1967
Abstract
As an attempt to make a general structural theory of information retrieval, a documentation system (DS) is defined as a formal system consisting of (a) a set o of objects (documents); (b) a set A++ of elementary attributes (key-words), from which further attributes may be constructed: A++ generates A; (c) a set of axioms of the form X++(x)=m (m ∈ M, M a set of constants) connecting attributes with objects; from the axioms further theorems (= true statements) may be constructed. By use of the theorems, different mappings O -> P(o) (P(o) the set of all subsets of o) (search question -> set of documents retrieved) are defined. The type of a DS depends on two basic decisions: (1) choice of the rules for the construction of attributes and theorems, e.g., logical product in coordinate indexing; links; (2) choice of M; M may consist of the two constants 'applicable' and 'not applicable', or some positive integers, ... Further practical decisions: A++ hierarchical or not; kind of mapping; introduction of roles (= further attributes). The simplest case, ordinary two-valued Coordinate Indexing, is discussed in detail; o is a free distributive (but not Boolean) lattice, the homomorphic image a ring of subsets of o; instead of negation, which is not useful, a useful retrieval operation 'praeternagation' is introduced. Furthermore these are discussed: a generalized definition of superimposed coding; some functions for the distance of objects or attributes; optimization and automatic derivation of classifications. The model takes into account term-term relations and document-document relations. It may serve as a structural framework in terms of which the functional problems of retrieval theory may be expressed more clearly.
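
The two-valued coordinate indexing case described in the abstract can be illustrated with a small sketch. It is not Soergel's formalism itself, only a toy reading of it: each object carries the set of elementary attributes that are 'applicable' to it, and a search request built by logical product (AND) or logical sum (OR) is mapped to a subset of the objects. The index contents and the names INDEX, extent, search_and and search_or are hypothetical.

```python
from typing import Dict, FrozenSet, Set

# Hypothetical toy index: each object (document) is mapped to the
# elementary attributes (keywords) that are 'applicable' to it.
INDEX: Dict[str, FrozenSet[str]] = {
    "d1": frozenset({"classification", "retrieval"}),
    "d2": frozenset({"retrieval", "coding"}),
    "d3": frozenset({"classification", "coding", "retrieval"}),
}


def extent(attribute: str) -> Set[str]:
    """Return the set of objects to which the attribute applies."""
    return {doc for doc, attrs in INDEX.items() if attribute in attrs}


def search_and(*attributes: str) -> Set[str]:
    """Logical product: objects carrying every requested attribute."""
    result = set(INDEX)  # an empty request retrieves every object
    for attribute in attributes:
        result &= extent(attribute)
    return result


def search_or(*attributes: str) -> Set[str]:
    """Logical sum: objects carrying at least one requested attribute."""
    result: Set[str] = set()
    for attribute in attributes:
        result |= extent(attribute)
    return result


both = search_and("classification", "coding")
either = search_or("classification", "coding")
```

Because every request is answered with a subset of the objects, and requests are combined only by intersection and union, the retrieved sets stay within a family of subsets closed under those two operations, which is the sense in which the abstract speaks of a ring of subsets.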

Similar documents (author)

  1. Soergel, D.E.: Organizing information : principles of database and retrieval systems (1985) 5.01
    5.007052 = sum of:
      5.007052 = weight(author_txt:soergel in 868) [ClassicSimilarity], result of:
        5.007052 = fieldWeight in 868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.011283 = idf(docFreq=38, maxDocs=43254)
          0.625 = fieldNorm(doc=868)
    
  2. Soergel, D.: The Broad System of Ordering : a critique (1979) 5.01
    5.007052 = sum of:
      5.007052 = weight(author_txt:soergel in 1864) [ClassicSimilarity], result of:
        5.007052 = fieldWeight in 1864, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.011283 = idf(docFreq=38, maxDocs=43254)
          0.625 = fieldNorm(doc=1864)
    
  3. Soergel, D.: Software support for thesaurus construction and display (1994) 5.01
    5.007052 = sum of:
      5.007052 = weight(author_txt:soergel in 890) [ClassicSimilarity], result of:
        5.007052 = fieldWeight in 890, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.011283 = idf(docFreq=38, maxDocs=43254)
          0.625 = fieldNorm(doc=890)
    
  4. Soergel, D.: Information structure management : a unified framework for indexing and searching in database, expert, information-retrieval, and hypermedia systems (1994) 5.01
    5.007052 = sum of:
      5.007052 = weight(author_txt:soergel in 4053) [ClassicSimilarity], result of:
        5.007052 = fieldWeight in 4053, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.011283 = idf(docFreq=38, maxDocs=43254)
          0.625 = fieldNorm(doc=4053)
    
  5. Soergel, D.: Framework for data element standardization (1995) 5.01
    5.007052 = sum of:
      5.007052 = weight(author_txt:soergel in 5643) [ClassicSimilarity], result of:
        5.007052 = fieldWeight in 5643, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.011283 = idf(docFreq=38, maxDocs=43254)
          0.625 = fieldNorm(doc=5643)
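
The author-match scores above all follow the same pattern: fieldWeight is the product of tf, idf and fieldNorm. The sketch below reproduces the displayed numbers. The tf and idf formulas are assumptions chosen because they match the values shown (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); the function names are hypothetical.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed form matching the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int,
                 field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:soergel entries above.
idf = classic_idf(doc_freq=38, max_docs=43254)
score = field_weight(freq=1.0, doc_freq=38, max_docs=43254, field_norm=0.625)
print(round(idf, 4), round(score, 4))
```

Since all five records contain the author term once and share the same fieldNorm, they receive the same score, which is why every entry in the list shows 5.01.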
    

Similar documents (content)

  1. Srinivasan, P.: Intelligent information retrieval using rough set approximations (1989) 0.15
    0.14902472 = sum of:
      0.14902472 = product of:
        0.7451236 = sum of:
          0.044792496 = weight(abstract_txt:term in 3595) [ClassicSimilarity], result of:
            0.044792496 = score(doc=3595,freq=1.0), product of:
              0.099137455 = queryWeight, product of:
                1.0975116 = boost
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.01874271 = queryNorm
              0.45182213 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.819436 = idf(docFreq=948, maxDocs=43254)
                0.09375 = fieldNorm(doc=3595)
          0.025076022 = weight(abstract_txt:retrieval in 3595) [ClassicSimilarity], result of:
            0.025076022 = score(doc=3595,freq=1.0), product of:
              0.0770851 = queryWeight, product of:
                1.1852804 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01874271 = queryNorm
              0.3253031 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.09375 = fieldNorm(doc=3595)
          0.099056095 = weight(abstract_txt:theory in 3595) [ClassicSimilarity], result of:
            0.099056095 = score(doc=3595,freq=3.0), product of:
              0.13356061 = queryWeight, product of:
                1.5601814 = boost
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.01874271 = queryNorm
              0.7416565 = fieldWeight in 3595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.09375 = fieldNorm(doc=3595)
          0.09825128 = weight(abstract_txt:objects in 3595) [ClassicSimilarity], result of:
            0.09825128 = score(doc=3595,freq=1.0), product of:
              0.19158293 = queryWeight, product of:
                1.8685912 = boost
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.01874271 = queryNorm
              0.51283944 = fieldWeight in 3595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.09375 = fieldNorm(doc=3595)
          0.47794768 = weight(abstract_txt:attributes in 3595) [ClassicSimilarity], result of:
            0.47794768 = score(doc=3595,freq=3.0), product of:
              0.48049384 = queryWeight, product of:
                4.1849937 = boost
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.01874271 = queryNorm
              0.99470097 = fieldWeight in 3595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.09375 = fieldNorm(doc=3595)
        0.2 = coord(5/25)
    
  2. Rorissa, A.: Relationships between perceived features and similarity of images : a test of Tversky's contrast model (2007) 0.13
    0.12743683 = sum of:
      0.12743683 = product of:
        0.45513153 = sum of:
          0.010039315 = weight(abstract_txt:which in 2521) [ClassicSimilarity], result of:
            0.010039315 = score(doc=2521,freq=1.0), product of:
              0.05486915 = queryWeight, product of:
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01874271 = queryNorm
              0.1829683 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.028955296 = weight(abstract_txt:retrieval in 2521) [ClassicSimilarity], result of:
            0.028955296 = score(doc=2521,freq=3.0), product of:
              0.0770851 = queryWeight, product of:
                1.1852804 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01874271 = queryNorm
              0.37562767 = fieldWeight in 2521, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.054282892 = weight(abstract_txt:structural in 2521) [ClassicSimilarity], result of:
            0.054282892 = score(doc=2521,freq=1.0), product of:
              0.14766257 = queryWeight, product of:
                1.3394468 = boost
                5.881831 = idf(docFreq=327, maxDocs=43254)
                0.01874271 = queryNorm
              0.36761445 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.881831 = idf(docFreq=327, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.056696184 = weight(abstract_txt:attempt in 2521) [ClassicSimilarity], result of:
            0.056696184 = score(doc=2521,freq=1.0), product of:
              0.15200725 = queryWeight, product of:
                1.3590093 = boost
                5.967735 = idf(docFreq=300, maxDocs=43254)
                0.01874271 = queryNorm
              0.37298343 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.967735 = idf(docFreq=300, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.065500855 = weight(abstract_txt:objects in 2521) [ClassicSimilarity], result of:
            0.065500855 = score(doc=2521,freq=1.0), product of:
              0.19158293 = queryWeight, product of:
                1.8685912 = boost
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.01874271 = queryNorm
              0.34189296 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.18438374 = weight(abstract_txt:axioms in 2521) [ClassicSimilarity], result of:
            0.18438374 = score(doc=2521,freq=1.0), product of:
              0.33366463 = queryWeight, product of:
                2.013471 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01874271 = queryNorm
              0.552602 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
          0.05527322 = weight(abstract_txt:further in 2521) [ClassicSimilarity], result of:
            0.05527322 = score(doc=2521,freq=1.0), product of:
              0.1882991 = queryWeight, product of:
                2.1390917 = boost
                4.6966314 = idf(docFreq=1072, maxDocs=43254)
                0.01874271 = queryNorm
              0.29353946 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6966314 = idf(docFreq=1072, maxDocs=43254)
                0.0625 = fieldNorm(doc=2521)
        0.28 = coord(7/25)
    
  3. Khoo, S.G.; Na, J.-C.: Semantic relations in information science (2006) 0.12
    0.11843866 = sum of:
      0.11843866 = product of:
        0.4229952 = sum of:
          0.0070988676 = weight(abstract_txt:which in 3979) [ClassicSimilarity], result of:
            0.0070988676 = score(doc=3979,freq=2.0), product of:
              0.05486915 = queryWeight, product of:
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01874271 = queryNorm
              0.12937813 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.014477648 = weight(abstract_txt:retrieval in 3979) [ClassicSimilarity], result of:
            0.014477648 = score(doc=3979,freq=3.0), product of:
              0.0770851 = queryWeight, product of:
                1.1852804 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01874271 = queryNorm
              0.18781383 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.026995614 = weight(abstract_txt:defined in 3979) [ClassicSimilarity], result of:
            0.026995614 = score(doc=3979,freq=2.0), product of:
              0.11677966 = queryWeight, product of:
                1.1911703 = boost
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.01874271 = queryNorm
              0.23116708 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.121969715 = weight(abstract_txt:relations in 3979) [ClassicSimilarity], result of:
            0.121969715 = score(doc=3979,freq=28.0), product of:
              0.1324231 = queryWeight, product of:
                1.2684464 = boost
                5.5700517 = idf(docFreq=447, maxDocs=43254)
                0.01874271 = queryNorm
              0.9210607 = fieldWeight in 3979, product of:
                5.2915025 = tf(freq=28.0), with freq of:
                  28.0 = termFreq=28.0
                5.5700517 = idf(docFreq=447, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.065500855 = weight(abstract_txt:objects in 3979) [ClassicSimilarity], result of:
            0.065500855 = score(doc=3979,freq=4.0), product of:
              0.19158293 = queryWeight, product of:
                1.8685912 = boost
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.01874271 = queryNorm
              0.34189296 = fieldWeight in 3979, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.02763661 = weight(abstract_txt:further in 3979) [ClassicSimilarity], result of:
            0.02763661 = score(doc=3979,freq=1.0), product of:
              0.1882991 = queryWeight, product of:
                2.1390917 = boost
                4.6966314 = idf(docFreq=1072, maxDocs=43254)
                0.01874271 = queryNorm
              0.14676973 = fieldWeight in 3979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6966314 = idf(docFreq=1072, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
          0.1593159 = weight(abstract_txt:attributes in 3979) [ClassicSimilarity], result of:
            0.1593159 = score(doc=3979,freq=3.0), product of:
              0.48049384 = queryWeight, product of:
                4.1849937 = boost
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.01874271 = queryNorm
              0.331567 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.03125 = fieldNorm(doc=3979)
        0.28 = coord(7/25)
    
  4. Huibers, T.W.C.; Bruza, P.D.: Situations, a general framework for studying information retrieval (1996) 0.11
    0.11310364 = sum of:
      0.11310364 = product of:
        0.4712652 = sum of:
          0.021735754 = weight(abstract_txt:which in 33) [ClassicSimilarity], result of:
            0.021735754 = score(doc=33,freq=3.0), product of:
              0.05486915 = queryWeight, product of:
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01874271 = queryNorm
              0.39613798 = fieldWeight in 33, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
          0.04179337 = weight(abstract_txt:retrieval in 33) [ClassicSimilarity], result of:
            0.04179337 = score(doc=33,freq=4.0), product of:
              0.0770851 = queryWeight, product of:
                1.1852804 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01874271 = queryNorm
              0.54217184 = fieldWeight in 33, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
          0.047721952 = weight(abstract_txt:defined in 33) [ClassicSimilarity], result of:
            0.047721952 = score(doc=33,freq=1.0), product of:
              0.11677966 = queryWeight, product of:
                1.1911703 = boost
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.01874271 = queryNorm
              0.4086495 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
          0.047658388 = weight(abstract_txt:theory in 33) [ClassicSimilarity], result of:
            0.047658388 = score(doc=33,freq=1.0), product of:
              0.13356061 = queryWeight, product of:
                1.5601814 = boost
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.01874271 = queryNorm
              0.35682964 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
          0.08187607 = weight(abstract_txt:objects in 33) [ClassicSimilarity], result of:
            0.08187607 = score(doc=33,freq=1.0), product of:
              0.19158293 = queryWeight, product of:
                1.8685912 = boost
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.01874271 = queryNorm
              0.4273662 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
          0.23047967 = weight(abstract_txt:axioms in 33) [ClassicSimilarity], result of:
            0.23047967 = score(doc=33,freq=1.0), product of:
              0.33366463 = queryWeight, product of:
                2.013471 = boost
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.01874271 = queryNorm
              0.6907525 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.841632 = idf(docFreq=16, maxDocs=43254)
                0.078125 = fieldNorm(doc=33)
        0.24 = coord(6/25)
    
  5. Rijsbergen, C.J. van; Lalmas, M.: Information calculus for information retrieval (1996) 0.11
    0.10845846 = sum of:
      0.10845846 = product of:
        0.45191026 = sum of:
          0.014197735 = weight(abstract_txt:which in 5270) [ClassicSimilarity], result of:
            0.014197735 = score(doc=5270,freq=2.0), product of:
              0.05486915 = queryWeight, product of:
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.01874271 = queryNorm
              0.25875625 = fieldWeight in 5270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9274929 = idf(docFreq=6293, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
          0.028955296 = weight(abstract_txt:retrieval in 5270) [ClassicSimilarity], result of:
            0.028955296 = score(doc=5270,freq=3.0), product of:
              0.0770851 = queryWeight, product of:
                1.1852804 = boost
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.01874271 = queryNorm
              0.37562767 = fieldWeight in 5270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4699 = idf(docFreq=3658, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
          0.066125475 = weight(abstract_txt:defined in 5270) [ClassicSimilarity], result of:
            0.066125475 = score(doc=5270,freq=3.0), product of:
              0.11677966 = queryWeight, product of:
                1.1911703 = boost
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.01874271 = queryNorm
              0.5662414 = fieldWeight in 5270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.230714 = idf(docFreq=628, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
          0.066037394 = weight(abstract_txt:theory in 5270) [ClassicSimilarity], result of:
            0.066037394 = score(doc=5270,freq=3.0), product of:
              0.13356061 = queryWeight, product of:
                1.5601814 = boost
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.01874271 = queryNorm
              0.49443766 = fieldWeight in 5270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5674195 = idf(docFreq=1220, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
          0.0926322 = weight(abstract_txt:objects in 5270) [ClassicSimilarity], result of:
            0.0926322 = score(doc=5270,freq=2.0), product of:
              0.19158293 = queryWeight, product of:
                1.8685912 = boost
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.01874271 = queryNorm
              0.48350966 = fieldWeight in 5270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4702873 = idf(docFreq=494, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
          0.18396215 = weight(abstract_txt:attributes in 5270) [ClassicSimilarity], result of:
            0.18396215 = score(doc=5270,freq=1.0), product of:
              0.48049384 = queryWeight, product of:
                4.1849937 = boost
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.01874271 = queryNorm
              0.38286057 = fieldWeight in 5270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.125769 = idf(docFreq=256, maxDocs=43254)
                0.0625 = fieldNorm(doc=5270)
        0.24 = coord(6/25)