Document (#5450)

Author
Soergel, D.
Title
Mathematical analysis of documentation systems : an attempt to a theory of classification and search request formulation
Source
Information storage and retrieval. 3(1967), S.129-173
Year
1967
Abstract
As an attempt to make a general structural theory of information retrieval, a documentation system (DS) is defined as a formal system consisting of (a) a set o of objects (documents); (b) a set A++ of elementary attributes (key-words), from which further attributes may be constructed: A++ generates A; (c) a set of axioms of the form X++(x)=m (m¯M, M a set of constant connecting attributes with objects: from the axioms further theorems (=true statements) may be constructed. By use of the theorems, different mappings O -> P(o) (P(o) set of all subsets of o) (search question -> set of documents retrieved) are defined. The type of a DS depends on two basic decisions: (1) choice of the rules for the construction of attributes and theorems, e.g., logical product in coordinate indexing; links. (2) choice of M; M may consist of the two constants 'applicable' and 'not applicable', or some positive integers, ...; Further practical decisions: A++ hierarchical or not; kind of mapping; introduction of roles (=further attributes). The most simple case - ordinary two-valued Coordinate Indexing - is discusssed in detail; o is a free distributive (but not Boolean) lattice, the homographic image a ring of subsets of o; instead of negation which is not useful, a useful retrieval operation 'praeternagation' is introduced. Furthermore these are discussed: a generalized definition of superimposed coding, some functions for the distance of objects or attributes; optimization and automatic derivation of classifications. The model takes into account term-term relations and document-document relations. It may serve as a structural framework in terms of which the functional problems of retrieval theory may be expressed more clearly

Similar documents (author)

  1. Soergel, D.E.: Organizing information : principles of database and retrieval systems (1985) 5.02
    5.0158157 = sum of:
      5.0158157 = weight(author_txt:soergel in 868) [ClassicSimilarity], result of:
        5.0158157 = fieldWeight in 868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.025305 = idf(docFreq=37, maxDocs=42740)
          0.625 = fieldNorm(doc=868)
    
  2. Soergel, D.: ¬The Broad System of Ordering : a critique (1979) 5.02
    5.0158157 = sum of:
      5.0158157 = weight(author_txt:soergel in 1864) [ClassicSimilarity], result of:
        5.0158157 = fieldWeight in 1864, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.025305 = idf(docFreq=37, maxDocs=42740)
          0.625 = fieldNorm(doc=1864)
    
  3. Soergel, D.: Software support for thesaurus construction and display (1994) 5.02
    5.0158157 = sum of:
      5.0158157 = weight(author_txt:soergel in 890) [ClassicSimilarity], result of:
        5.0158157 = fieldWeight in 890, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.025305 = idf(docFreq=37, maxDocs=42740)
          0.625 = fieldNorm(doc=890)
    
  4. Soergel, D.: Information structure management : a unified framework for indexing and searching in database, expert, information-retrieval, and hypermedia systems (1994) 5.02
    5.0158157 = sum of:
      5.0158157 = weight(author_txt:soergel in 3053) [ClassicSimilarity], result of:
        5.0158157 = fieldWeight in 3053, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.025305 = idf(docFreq=37, maxDocs=42740)
          0.625 = fieldNorm(doc=3053)
    
  5. Soergel, D.: Framework for data element standardization (1995) 5.02
    5.0158157 = sum of:
      5.0158157 = weight(author_txt:soergel in 4643) [ClassicSimilarity], result of:
        5.0158157 = fieldWeight in 4643, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.025305 = idf(docFreq=37, maxDocs=42740)
          0.625 = fieldNorm(doc=4643)
    

Similar documents (content)

  1. Srinivasan, P.: Intelligent information retrieval using rough set approximations (1989) 0.15
    0.14868481 = sum of:
      0.14868481 = product of:
        0.74342406 = sum of:
          0.044914793 = weight(abstract_txt:term in 2595) [ClassicSimilarity], result of:
            0.044914793 = score(doc=2595,freq=1.0), product of:
              0.09921573 = queryWeight, product of:
                1.0973905 = boost
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.018723272 = queryNorm
              0.4526983 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8287816 = idf(docFreq=928, maxDocs=42740)
                0.09375 = fieldNorm(doc=2595)
          0.024888737 = weight(abstract_txt:retrieval in 2595) [ClassicSimilarity], result of:
            0.024888737 = score(doc=2595,freq=1.0), product of:
              0.076621965 = queryWeight, product of:
                1.1811178 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.018723272 = queryNorm
              0.3248251 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.09375 = fieldNorm(doc=2595)
          0.09937295 = weight(abstract_txt:theory in 2595) [ClassicSimilarity], result of:
            0.09937295 = score(doc=2595,freq=3.0), product of:
              0.1337077 = queryWeight, product of:
                1.5602522 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.018723272 = queryNorm
              0.7432104 = fieldWeight in 2595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.09375 = fieldNorm(doc=2595)
          0.098738156 = weight(abstract_txt:objects in 2595) [ClassicSimilarity], result of:
            0.098738156 = score(doc=2595,freq=1.0), product of:
              0.19201775 = queryWeight, product of:
                1.869766 = boost
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.018723272 = queryNorm
              0.5142137 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.09375 = fieldNorm(doc=2595)
          0.47550938 = weight(abstract_txt:attributes in 2595) [ClassicSimilarity], result of:
            0.47550938 = score(doc=2595,freq=3.0), product of:
              0.47836605 = queryWeight, product of:
                4.173609 = boost
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.018723272 = queryNorm
              0.9940283 = fieldWeight in 2595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.09375 = fieldNorm(doc=2595)
        0.2 = coord(5/25)
    
  2. Rorissa, A.: Relationships between perceived features and similarity of images : a test of Tversky's contrast model (2007) 0.13
    0.1283635 = sum of:
      0.1283635 = product of:
        0.45844108 = sum of:
          0.010070058 = weight(abstract_txt:which in 2521) [ClassicSimilarity], result of:
            0.010070058 = score(doc=2521,freq=1.0), product of:
              0.054924592 = queryWeight, product of:
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.018723272 = queryNorm
              0.18334334 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.028739039 = weight(abstract_txt:retrieval in 2521) [ClassicSimilarity], result of:
            0.028739039 = score(doc=2521,freq=3.0), product of:
              0.076621965 = queryWeight, product of:
                1.1811178 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.018723272 = queryNorm
              0.37507573 = fieldWeight in 2521, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.054468025 = weight(abstract_txt:structural in 2521) [ClassicSimilarity], result of:
            0.054468025 = score(doc=2521,freq=1.0), product of:
              0.14784598 = queryWeight, product of:
                1.3396018 = boost
                5.8945694 = idf(docFreq=319, maxDocs=42740)
                0.018723272 = queryNorm
              0.3684106 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8945694 = idf(docFreq=319, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.056754284 = weight(abstract_txt:attempt in 2521) [ClassicSimilarity], result of:
            0.056754284 = score(doc=2521,freq=1.0), product of:
              0.15195473 = queryWeight, product of:
                1.3580884 = boost
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.018723272 = queryNorm
              0.37349468 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.975915 = idf(docFreq=294, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.06582544 = weight(abstract_txt:objects in 2521) [ClassicSimilarity], result of:
            0.06582544 = score(doc=2521,freq=1.0), product of:
              0.19201775 = queryWeight, product of:
                1.869766 = boost
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.018723272 = queryNorm
              0.34280914 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.18686797 = weight(abstract_txt:axioms in 2521) [ClassicSimilarity], result of:
            0.18686797 = score(doc=2521,freq=1.0), product of:
              0.3363089 = queryWeight, product of:
                2.020413 = boost
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.018723272 = queryNorm
              0.55564386 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
          0.05571629 = weight(abstract_txt:further in 2521) [ClassicSimilarity], result of:
            0.05571629 = score(doc=2521,freq=1.0), product of:
              0.18910946 = queryWeight, product of:
                2.1426072 = boost
                4.713993 = idf(docFreq=1041, maxDocs=42740)
                0.018723272 = queryNorm
              0.29462457 = fieldWeight in 2521, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.713993 = idf(docFreq=1041, maxDocs=42740)
                0.0625 = fieldNorm(doc=2521)
        0.28 = coord(7/25)
    
  3. Khoo, S.G.; Na, J.-C.: Semantic relations in information science (2006) 0.12
    0.11847312 = sum of:
      0.11847312 = product of:
        0.4231183 = sum of:
          0.0071206065 = weight(abstract_txt:which in 3979) [ClassicSimilarity], result of:
            0.0071206065 = score(doc=3979,freq=2.0), product of:
              0.054924592 = queryWeight, product of:
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.018723272 = queryNorm
              0.12964332 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.014369519 = weight(abstract_txt:retrieval in 3979) [ClassicSimilarity], result of:
            0.014369519 = score(doc=3979,freq=3.0), product of:
              0.076621965 = queryWeight, product of:
                1.1811178 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.018723272 = queryNorm
              0.18753786 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.027000349 = weight(abstract_txt:defined in 3979) [ClassicSimilarity], result of:
            0.027000349 = score(doc=3979,freq=2.0), product of:
              0.11667327 = queryWeight, product of:
                1.1900265 = boost
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.018723272 = queryNorm
              0.23141846 = fieldWeight in 3979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.12244113 = weight(abstract_txt:relations in 3979) [ClassicSimilarity], result of:
            0.12244113 = score(doc=3979,freq=28.0), product of:
              0.13262762 = queryWeight, product of:
                1.2687848 = boost
                5.5829573 = idf(docFreq=436, maxDocs=42740)
                0.018723272 = queryNorm
              0.92319477 = fieldWeight in 3979, product of:
                5.2915025 = tf(freq=28.0), with freq of:
                  28.0 = termFreq=28.0
                5.5829573 = idf(docFreq=436, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.06582544 = weight(abstract_txt:objects in 3979) [ClassicSimilarity], result of:
            0.06582544 = score(doc=3979,freq=4.0), product of:
              0.19201775 = queryWeight, product of:
                1.869766 = boost
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.018723272 = queryNorm
              0.34280914 = fieldWeight in 3979, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.027858146 = weight(abstract_txt:further in 3979) [ClassicSimilarity], result of:
            0.027858146 = score(doc=3979,freq=1.0), product of:
              0.18910946 = queryWeight, product of:
                2.1426072 = boost
                4.713993 = idf(docFreq=1041, maxDocs=42740)
                0.018723272 = queryNorm
              0.14731228 = fieldWeight in 3979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.713993 = idf(docFreq=1041, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
          0.15850313 = weight(abstract_txt:attributes in 3979) [ClassicSimilarity], result of:
            0.15850313 = score(doc=3979,freq=3.0), product of:
              0.47836605 = queryWeight, product of:
                4.173609 = boost
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.018723272 = queryNorm
              0.33134276 = fieldWeight in 3979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.03125 = fieldNorm(doc=3979)
        0.28 = coord(7/25)
    
  4. Huibers, T.W.C.; Bruza, P.D.: Situations, a general framework for studying information retrieval (1996) 0.11
    0.11392595 = sum of:
      0.11392595 = product of:
        0.47469145 = sum of:
          0.021802314 = weight(abstract_txt:which in 33) [ClassicSimilarity], result of:
            0.021802314 = score(doc=33,freq=3.0), product of:
              0.054924592 = queryWeight, product of:
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.018723272 = queryNorm
              0.39694995 = fieldWeight in 33, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
          0.041481234 = weight(abstract_txt:retrieval in 33) [ClassicSimilarity], result of:
            0.041481234 = score(doc=33,freq=4.0), product of:
              0.076621965 = queryWeight, product of:
                1.1811178 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.018723272 = queryNorm
              0.5413752 = fieldWeight in 33, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
          0.047730323 = weight(abstract_txt:defined in 33) [ClassicSimilarity], result of:
            0.047730323 = score(doc=33,freq=1.0), product of:
              0.11667327 = queryWeight, product of:
                1.1900265 = boost
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.018723272 = queryNorm
              0.40909392 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
          0.047810834 = weight(abstract_txt:theory in 33) [ClassicSimilarity], result of:
            0.047810834 = score(doc=33,freq=1.0), product of:
              0.1337077 = queryWeight, product of:
                1.5602522 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.018723272 = queryNorm
              0.35757726 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
          0.082281806 = weight(abstract_txt:objects in 33) [ClassicSimilarity], result of:
            0.082281806 = score(doc=33,freq=1.0), product of:
              0.19201775 = queryWeight, product of:
                1.869766 = boost
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.018723272 = queryNorm
              0.42851144 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
          0.23358496 = weight(abstract_txt:axioms in 33) [ClassicSimilarity], result of:
            0.23358496 = score(doc=33,freq=1.0), product of:
              0.3363089 = queryWeight, product of:
                2.020413 = boost
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.018723272 = queryNorm
              0.6945548 = fieldWeight in 33, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.890302 = idf(docFreq=15, maxDocs=42740)
                0.078125 = fieldNorm(doc=33)
        0.24 = coord(6/25)
    
  5. Rijsbergen, C.J. van; Lalmas, M.: Information calculus for information retrieval (1996) 0.11
    0.10835539 = sum of:
      0.10835539 = product of:
        0.4514808 = sum of:
          0.014241213 = weight(abstract_txt:which in 4270) [ClassicSimilarity], result of:
            0.014241213 = score(doc=4270,freq=2.0), product of:
              0.054924592 = queryWeight, product of:
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.018723272 = queryNorm
              0.25928664 = fieldWeight in 4270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9334934 = idf(docFreq=6181, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
          0.028739039 = weight(abstract_txt:retrieval in 4270) [ClassicSimilarity], result of:
            0.028739039 = score(doc=4270,freq=3.0), product of:
              0.076621965 = queryWeight, product of:
                1.1811178 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.018723272 = queryNorm
              0.37507573 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
          0.066137075 = weight(abstract_txt:defined in 4270) [ClassicSimilarity], result of:
            0.066137075 = score(doc=4270,freq=3.0), product of:
              0.11667327 = queryWeight, product of:
                1.1900265 = boost
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.018723272 = queryNorm
              0.56685716 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.236402 = idf(docFreq=617, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
          0.06624863 = weight(abstract_txt:theory in 4270) [ClassicSimilarity], result of:
            0.06624863 = score(doc=4270,freq=3.0), product of:
              0.1337077 = queryWeight, product of:
                1.5602522 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.018723272 = queryNorm
              0.4954736 = fieldWeight in 4270, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
          0.09309123 = weight(abstract_txt:objects in 4270) [ClassicSimilarity], result of:
            0.09309123 = score(doc=4270,freq=2.0), product of:
              0.19201775 = queryWeight, product of:
                1.869766 = boost
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.018723272 = queryNorm
              0.48480532 = fieldWeight in 4270, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4849463 = idf(docFreq=481, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
          0.18302365 = weight(abstract_txt:attributes in 4270) [ClassicSimilarity], result of:
            0.18302365 = score(doc=4270,freq=1.0), product of:
              0.47836605 = queryWeight, product of:
                4.173609 = boost
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.018723272 = queryNorm
              0.38260168 = fieldWeight in 4270, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.121627 = idf(docFreq=254, maxDocs=42740)
                0.0625 = fieldNorm(doc=4270)
        0.24 = coord(6/25)