Document (#5450)

Author
Soergel, D.
Title
Mathematical analysis of documentation systems : an attempt to a theory of classification and search request formulation
Source
Information storage and retrieval. 3(1967), p.129-173
Year
1967
Abstract
As an attempt to make a general structural theory of information retrieval, a documentation system (DS) is defined as a formal system consisting of (a) a set o of objects (documents); (b) a set A++ of elementary attributes (key-words), from which further attributes may be constructed: A++ generates A; (c) a set of axioms of the form X++(x)=m (m ∈ M, M a set of constants) connecting attributes with objects; from the axioms further theorems (= true statements) may be constructed. By use of the theorems, different mappings O -> P(o) (P(o) set of all subsets of o) (search question -> set of documents retrieved) are defined. The type of a DS depends on two basic decisions: (1) choice of the rules for the construction of attributes and theorems, e.g., logical product in coordinate indexing; links. (2) choice of M; M may consist of the two constants 'applicable' and 'not applicable', or some positive integers, ... Further practical decisions: A++ hierarchical or not; kind of mapping; introduction of roles (= further attributes). The simplest case - ordinary two-valued Coordinate Indexing - is discussed in detail; o is a free distributive (but not Boolean) lattice, the homomorphic image a ring of subsets of o; instead of negation, which is not useful, a useful retrieval operation 'praeternegation' is introduced. Furthermore these are discussed: a generalized definition of superimposed coding; some functions for the distance of objects or attributes; optimization and automatic derivation of classifications. The model takes into account term-term relations and document-document relations. It may serve as a structural framework in terms of which the functional problems of retrieval theory may be expressed more clearly.
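
The two-valued coordinate-indexing case described in the abstract can be illustrated with a small sketch. This is only one possible reading, not the paper's own formalism: the axioms are modelled as keyword-to-document postings (each keyword is 'applicable' to the listed documents and 'not applicable' to the rest), a search question is a combination of keywords, and its image is a subset of the object set. The collection, the keyword names and the helper functions are all hypothetical.

```python
# Hypothetical toy collection; none of these names come from the paper.
OBJECTS = frozenset({"d1", "d2", "d3", "d4"})

# Axioms of the two-valued case: keyword k is 'applicable' to document x
# exactly when x is listed under k, and 'not applicable' otherwise.
POSTINGS = {
    "classification": frozenset({"d1", "d2"}),
    "retrieval": frozenset({"d2", "d3"}),
    "lattice": frozenset({"d3", "d4"}),
}


def keyword(name):
    """Image of an elementary attribute: the documents it applies to."""
    return POSTINGS.get(name, frozenset())


def logical_product(*images):
    """Conjunction of attributes maps to the intersection of their images."""
    return OBJECTS.intersection(*images)


def logical_sum(*images):
    """Disjunction of attributes maps to the union of their images."""
    return frozenset().union(*images)


# Search question -> set of documents retrieved.
both = logical_product(keyword("classification"), keyword("retrieval"))
either = logical_sum(keyword("classification"), keyword("retrieval"))
print(sorted(both))    # ['d2']
print(sorted(either))  # ['d1', 'd2', 'd3']
```

Because only intersection and union are used, the images form a family of subsets closed under both operations, and the usual distributive laws hold for them. No complement operation is defined here, and the sketch does not attempt to model the retrieval operation the abstract proposes in place of negation.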

Similar documents (author)

  1. Soergel, D.E.: Organizing information : principles of database and retrieval systems (1985) 5.03
    5.026104 = sum of:
      5.026104 = weight(author_txt:soergel in 868) [ClassicSimilarity], result of:
        5.026104 = fieldWeight in 868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.041766 = idf(docFreq=36, maxDocs=42306)
          0.625 = fieldNorm(doc=868)
    
  2. Soergel, D.: The Broad System of Ordering : a critique (1979) 5.03
    5.026104 = sum of:
      5.026104 = weight(author_txt:soergel in 1864) [ClassicSimilarity], result of:
        5.026104 = fieldWeight in 1864, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.041766 = idf(docFreq=36, maxDocs=42306)
          0.625 = fieldNorm(doc=1864)
    
  3. Soergel, D.: Software support for thesaurus construction and display (1994) 5.03
    5.026104 = sum of:
      5.026104 = weight(author_txt:soergel in 890) [ClassicSimilarity], result of:
        5.026104 = fieldWeight in 890, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.041766 = idf(docFreq=36, maxDocs=42306)
          0.625 = fieldNorm(doc=890)
    
  4. Soergel, D.: Information structure management : a unified framework for indexing and searching in database, expert, information-retrieval, and hypermedia systems (1994) 5.03
    5.026104 = sum of:
      5.026104 = weight(author_txt:soergel in 3053) [ClassicSimilarity], result of:
        5.026104 = fieldWeight in 3053, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.041766 = idf(docFreq=36, maxDocs=42306)
          0.625 = fieldNorm(doc=3053)
    
  5. Soergel, D.: Framework for data element standardization (1995) 5.03
    5.026104 = sum of:
      5.026104 = weight(author_txt:soergel in 4643) [ClassicSimilarity], result of:
        5.026104 = fieldWeight in 4643, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.041766 = idf(docFreq=36, maxDocs=42306)
          0.625 = fieldNorm(doc=4643)
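
The author scores above can be reproduced from the three factors each tree lists. The following minimal sketch assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the figures shown here; the exact idf formula differs between Lucene versions. fieldNorm is taken as reported and not recomputed. The function names are illustrative only.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency in the form that matches the values above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:soergel entries above.
author_score = field_weight(freq=1.0, doc_freq=36, max_docs=42306, field_norm=0.625)
print(round(author_score, 4))  # 5.0261
```

All five author matches share the same freq, docFreq and fieldNorm, which is why they all receive the same score of 5.03.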
    

Similar documents (content)

  1. Srinivasan, P.: Intelligent information retrieval using rough set approximations (1989) 0.15
    0.14908275 = sum of:
      0.14908275 = product of:
        0.7454137 = sum of:
          0.045072854 = weight(abstract_txt:term in 2595) [ClassicSimilarity], result of:
            0.045072854 = score(doc=2595,freq=1.0), product of:
              0.09941734 = queryWeight, product of:
                1.0969833 = boost
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.018740468 = queryNorm
              0.45337015 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8359485 = idf(docFreq=912, maxDocs=42306)
                0.09375 = fieldNorm(doc=2595)
          0.02481824 = weight(abstract_txt:retrieval in 2595) [ClassicSimilarity], result of:
            0.02481824 = score(doc=2595,freq=1.0), product of:
              0.07645334 = queryWeight, product of:
                1.1781832 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.018740468 = queryNorm
              0.3246194 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.09375 = fieldNorm(doc=2595)
          0.100273594 = weight(abstract_txt:theory in 2595) [ClassicSimilarity], result of:
            0.100273594 = score(doc=2595,freq=3.0), product of:
              0.1344724 = queryWeight, product of:
                1.56254 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.018740468 = queryNorm
              0.7456816 = fieldWeight in 2595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.09375 = fieldNorm(doc=2595)
          0.09980536 = weight(abstract_txt:objects in 2595) [ClassicSimilarity], result of:
            0.09980536 = score(doc=2595,freq=1.0), product of:
              0.19333853 = queryWeight, product of:
                1.8735867 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018740468 = queryNorm
              0.51622075 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.09375 = fieldNorm(doc=2595)
          0.47544366 = weight(abstract_txt:attributes in 2595) [ClassicSimilarity], result of:
            0.47544366 = score(doc=2595,freq=3.0), product of:
              0.47817272 = queryWeight, product of:
                4.1669855 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.018740468 = queryNorm
              0.99429274 = fieldWeight in 2595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.09375 = fieldNorm(doc=2595)
        0.2 = coord(5/25)
    
  2. Rorissa, A.: Relationships between perceived features and similarity of images : a test of Tversky's contrast model (2007) 0.13
    0.12856776 = sum of:
      0.12856776 = product of:
        0.45917055 = sum of:
          0.010116756 = weight(abstract_txt:which in 2521) [ClassicSimilarity], result of:
            0.010116756 = score(doc=2521,freq=1.0), product of:
              0.055077072 = queryWeight, product of:
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.018740468 = queryNorm
              0.18368362 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.028657634 = weight(abstract_txt:retrieval in 2521) [ClassicSimilarity], result of:
            0.028657634 = score(doc=2521,freq=3.0), product of:
              0.07645334 = queryWeight, product of:
                1.1781832 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.018740468 = queryNorm
              0.3748382 = fieldWeight in 2521, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.054570697 = weight(abstract_txt:structural in 2521) [ClassicSimilarity], result of:
            0.054570697 = score(doc=2521,freq=1.0), product of:
              0.14798553 = queryWeight, product of:
                1.3383774 = boost
                5.9001117 = idf(docFreq=314, maxDocs=42306)
                0.018740468 = queryNorm
              0.36875698 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9001117 = idf(docFreq=314, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.056897476 = weight(abstract_txt:attempt in 2521) [ClassicSimilarity], result of:
            0.056897476 = score(doc=2521,freq=1.0), product of:
              0.15216272 = queryWeight, product of:
                1.357135 = boost
                5.9828033 = idf(docFreq=289, maxDocs=42306)
                0.018740468 = queryNorm
              0.3739252 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9828033 = idf(docFreq=289, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.06653691 = weight(abstract_txt:objects in 2521) [ClassicSimilarity], result of:
            0.06653691 = score(doc=2521,freq=1.0), product of:
              0.19333853 = queryWeight, product of:
                1.8735867 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018740468 = queryNorm
              0.34414718 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.18605088 = weight(abstract_txt:axioms in 2521) [ClassicSimilarity], result of:
            0.18605088 = score(doc=2521,freq=1.0), product of:
              0.3352232 = queryWeight, product of:
                2.014355 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.018740468 = queryNorm
              0.55500597 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
          0.0563402 = weight(abstract_txt:further in 2521) [ClassicSimilarity], result of:
            0.0563402 = score(doc=2521,freq=1.0), product of:
              0.19045915 = queryWeight, product of:
                2.1472611 = boost
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.018740468 = queryNorm
              0.29581252 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.0625 = fieldNorm(doc=2521)
        0.28 = coord(7/25)
    
  3. Khoo, S.G.; Na, J.-C.: Semantic relations in information science (2006) 0.12
    0.11882682 = sum of:
      0.11882682 = product of:
        0.4243815 = sum of:
          0.0071536265 = weight(abstract_txt:which in 3979) [ClassicSimilarity], result of:
            0.0071536265 = score(doc=3979,freq=2.0), product of:
              0.055077072 = queryWeight, product of:
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.018740468 = queryNorm
              0.12988393 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.014328817 = weight(abstract_txt:retrieval in 3979) [ClassicSimilarity], result of:
            0.014328817 = score(doc=3979,freq=3.0), product of:
              0.07645334 = queryWeight, product of:
                1.1781832 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.018740468 = queryNorm
              0.1874191 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.027146267 = weight(abstract_txt:defined in 3979) [ClassicSimilarity], result of:
            0.027146267 = score(doc=3979,freq=2.0), product of:
              0.11705671 = queryWeight, product of:
                1.1903293 = boost
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.018740468 = queryNorm
              0.23190697 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.12256455 = weight(abstract_txt:relations in 3979) [ClassicSimilarity], result of:
            0.12256455 = score(doc=3979,freq=28.0), product of:
              0.13267532 = queryWeight, product of:
                1.2672551 = boost
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.018740468 = queryNorm
              0.92379314 = fieldWeight in 3979, product of:
                5.2915025 = tf(freq=28.0), with freq of:
                  28.0 = termFreq=28.0
                5.586576 = idf(docFreq=430, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.06653691 = weight(abstract_txt:objects in 3979) [ClassicSimilarity], result of:
            0.06653691 = score(doc=3979,freq=4.0), product of:
              0.19333853 = queryWeight, product of:
                1.8735867 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018740468 = queryNorm
              0.34414718 = fieldWeight in 3979, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.0281701 = weight(abstract_txt:further in 3979) [ClassicSimilarity], result of:
            0.0281701 = score(doc=3979,freq=1.0), product of:
              0.19045915 = queryWeight, product of:
                2.1472611 = boost
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.018740468 = queryNorm
              0.14790626 = fieldWeight in 3979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
          0.15848123 = weight(abstract_txt:attributes in 3979) [ClassicSimilarity], result of:
            0.15848123 = score(doc=3979,freq=3.0), product of:
              0.47817272 = queryWeight, product of:
                4.1669855 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.018740468 = queryNorm
              0.3314309 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.03125 = fieldNorm(doc=3979)
        0.28 = coord(7/25)
    
  4. Huibers, T.W.C.; Bruza, P.D.: Situations, a general framework for studying information retrieval (1996) 0.11
    0.11405624 = sum of:
      0.11405624 = product of:
        0.47523433 = sum of:
          0.021903418 = weight(abstract_txt:which in 33) [ClassicSimilarity], result of:
            0.021903418 = score(doc=33,freq=3.0), product of:
              0.055077072 = queryWeight, product of:
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.018740468 = queryNorm
              0.3976867 = fieldWeight in 33, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
          0.041363735 = weight(abstract_txt:retrieval in 33) [ClassicSimilarity], result of:
            0.041363735 = score(doc=33,freq=4.0), product of:
              0.07645334 = queryWeight, product of:
                1.1781832 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.018740468 = queryNorm
              0.5410324 = fieldWeight in 33, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
          0.047988273 = weight(abstract_txt:defined in 33) [ClassicSimilarity], result of:
            0.047988273 = score(doc=33,freq=1.0), product of:
              0.11705671 = queryWeight, product of:
                1.1903293 = boost
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.018740468 = queryNorm
              0.40995747 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
          0.04824416 = weight(abstract_txt:theory in 33) [ClassicSimilarity], result of:
            0.04824416 = score(doc=33,freq=1.0), product of:
              0.1344724 = queryWeight, product of:
                1.56254 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.018740468 = queryNorm
              0.35876626 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
          0.08317114 = weight(abstract_txt:objects in 33) [ClassicSimilarity], result of:
            0.08317114 = score(doc=33,freq=1.0), product of:
              0.19333853 = queryWeight, product of:
                1.8735867 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018740468 = queryNorm
              0.43018398 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
          0.2325636 = weight(abstract_txt:axioms in 33) [ClassicSimilarity], result of:
            0.2325636 = score(doc=33,freq=1.0), product of:
              0.3352232 = queryWeight, product of:
                2.014355 = boost
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.018740468 = queryNorm
              0.6937575 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.8800955 = idf(docFreq=15, maxDocs=42306)
                0.078125 = fieldNorm(doc=33)
        0.24 = coord(6/25)
    
  5. Rijsbergen, C.J. van; Lalmas, M.: Information calculus for information retrieval (1996) 0.11
    0.108817 = sum of:
      0.108817 = product of:
        0.4534042 = sum of:
          0.014307253 = weight(abstract_txt:which in 4270) [ClassicSimilarity], result of:
            0.014307253 = score(doc=4270,freq=2.0), product of:
              0.055077072 = queryWeight, product of:
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.018740468 = queryNorm
              0.25976786 = fieldWeight in 4270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.938938 = idf(docFreq=6085, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
          0.028657634 = weight(abstract_txt:retrieval in 4270) [ClassicSimilarity], result of:
            0.028657634 = score(doc=4270,freq=3.0), product of:
              0.07645334 = queryWeight, product of:
                1.1781832 = boost
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.018740468 = queryNorm
              0.3748382 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4626071 = idf(docFreq=3604, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
          0.0664945 = weight(abstract_txt:defined in 4270) [ClassicSimilarity], result of:
            0.0664945 = score(doc=4270,freq=3.0), product of:
              0.11705671 = queryWeight, product of:
                1.1903293 = boost
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.018740468 = queryNorm
              0.5680537 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2474556 = idf(docFreq=604, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
          0.06684906 = weight(abstract_txt:theory in 4270) [ClassicSimilarity], result of:
            0.06684906 = score(doc=4270,freq=3.0), product of:
              0.1344724 = queryWeight, product of:
                1.56254 = boost
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.018740468 = queryNorm
              0.49712107 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.592208 = idf(docFreq=1164, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
          0.0940974 = weight(abstract_txt:objects in 4270) [ClassicSimilarity], result of:
            0.0940974 = score(doc=4270,freq=2.0), product of:
              0.19333853 = queryWeight, product of:
                1.8735867 = boost
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.018740468 = queryNorm
              0.48669758 = fieldWeight in 4270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.506355 = idf(docFreq=466, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
          0.18299834 = weight(abstract_txt:attributes in 4270) [ClassicSimilarity], result of:
            0.18299834 = score(doc=4270,freq=1.0), product of:
              0.47817272 = queryWeight, product of:
                4.1669855 = boost
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.018740468 = queryNorm
              0.38270345 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1232553 = idf(docFreq=251, maxDocs=42306)
                0.0625 = fieldNorm(doc=4270)
        0.24 = coord(6/25)
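
The content-based scores combine one more layer: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matching terms divided by the number of query terms). As a rough check, the sketch below recomputes the first content result (Srinivasan, doc 2595) from the values printed in its tree. The boost, queryNorm and fieldNorm figures are copied as reported rather than derived, and the idf formula is the one that fits these numbers, so treat this as an illustration and not a reference implementation.

```python
import math

MAX_DOCS = 42306
QUERY_NORM = 0.018740468   # as reported in the explain output
FIELD_NORM = 0.09375       # fieldNorm(doc=2595), as reported
MAX_OVERLAP = 25           # denominator of coord(5/25)

# (term, boost, docFreq, freq) copied from the tree for doc 2595.
TERMS = [
    ("term", 1.0969833, 912, 1.0),
    ("retrieval", 1.1781832, 3604, 1.0),
    ("theory", 1.56254, 1164, 3.0),
    ("objects", 1.8735867, 466, 1.0),
    ("attributes", 4.1669855, 251, 3.0),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """idf in the form that matches the printed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq):
    """Per-term score = queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(boost, df, freq) for _, boost, df, freq in TERMS)
coord = len(TERMS) / MAX_OVERLAP
total = raw_sum * coord
print(f"{raw_sum:.4f} * {coord:.2f} = {total:.4f}")
```

The result agrees with the listed 0.7454137 and 0.14908275 to about five decimal places; small differences are expected because Lucene computes in single precision. The coord factor explains why a document matching only a few of the 25 query terms ends up with a low final score even when individual term weights are high.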