Document (#10115)

Author
Yannakoudakis, E.J.
Daraki, J.J.
Title
Lexical clustering and retrieval of bibliographic records
Source
Information retrieval: new systems and current research. Proceedings of the 15th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Glasgow 1993. Ed.: Ruben Leon
Imprint
London : Taylor Graham
Year
1994
Pages
pp. 137-149
Abstract
Presents a new system that enables users to retrieve catalogue entries on the basis of their lexical similarities and to cluster records in a dynamic fashion. Describes the information retrieval system developed by the Department of Informatics, Athens University of Economics and Business, Greece. The system also offers the means for cyclic retrieval of records from each cluster while allowing the user to define the field to be used in each case. The approach is based on logical keys which are derived from pertinent bibliographic fields and are used for all clustering and information retrieval functions.
Theme
Computational linguistics

Similar documents (content)

  1. Leazer, G.H.: A conceptual schema for the control of bibliographic works (1994) 0.15
    0.15182011 = sum of:
      0.15182011 = product of:
        0.47443783 = sum of:
          0.049805056 = weight(abstract_txt:retrieve in 3033) [ClassicSimilarity], result of:
            0.049805056 = score(doc=3033,freq=1.0), product of:
              0.13236931 = queryWeight, product of:
                1.0433577 = boost
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.021074047 = queryNorm
              0.37625834 = fieldWeight in 3033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.052642852 = weight(abstract_txt:enables in 3033) [ClassicSimilarity], result of:
            0.052642852 = score(doc=3033,freq=1.0), product of:
              0.13735083 = queryWeight, product of:
                1.062809 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.021074047 = queryNorm
              0.38327292 = fieldWeight in 3033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.053655397 = weight(abstract_txt:logical in 3033) [ClassicSimilarity], result of:
            0.053655397 = score(doc=3033,freq=1.0), product of:
              0.13910645 = queryWeight, product of:
                1.0695798 = boost
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.021074047 = queryNorm
              0.38571465 = fieldWeight in 3033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1714344 = idf(docFreq=250, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.017307373 = weight(abstract_txt:used in 3033) [ClassicSimilarity], result of:
            0.017307373 = score(doc=3033,freq=1.0), product of:
              0.08243325 = queryWeight, product of:
                1.1644096 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021074047 = queryNorm
              0.2099562 = fieldWeight in 3033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.045112178 = weight(abstract_txt:each in 3033) [ClassicSimilarity], result of:
            0.045112178 = score(doc=3033,freq=2.0), product of:
              0.123917945 = queryWeight, product of:
                1.4276497 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.021074047 = queryNorm
              0.36404878 = fieldWeight in 3033, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.13378355 = weight(abstract_txt:bibliographic in 3033) [ClassicSimilarity], result of:
            0.13378355 = score(doc=3033,freq=15.0), product of:
              0.13067316 = queryWeight, product of:
                1.4660466 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.021074047 = queryNorm
              1.0238029 = fieldWeight in 3033, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.045490302 = weight(abstract_txt:system in 3033) [ClassicSimilarity], result of:
            0.045490302 = score(doc=3033,freq=3.0), product of:
              0.124609426 = queryWeight, product of:
                1.7533784 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.021074047 = queryNorm
              0.3650631 = fieldWeight in 3033, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
          0.0766411 = weight(abstract_txt:retrieval in 3033) [ClassicSimilarity], result of:
            0.0766411 = score(doc=3033,freq=4.0), product of:
              0.17643286 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021074047 = queryNorm
              0.43439242 = fieldWeight in 3033, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=3033)
        0.32 = coord(8/25)
    
  2. Dunlavy, D.M.; O'Leary, D.P.; Conroy, J.M.; Schlesinger, J.D.: QCS: A system for querying, clustering and summarizing documents (2007) 0.13
    0.12794328 = sum of:
      0.12794328 = product of:
        0.533097 = sum of:
          0.026230091 = weight(abstract_txt:used in 947) [ClassicSimilarity], result of:
            0.026230091 = score(doc=947,freq=3.0), product of:
              0.08243325 = queryWeight, product of:
                1.1644096 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021074047 = queryNorm
              0.31819794 = fieldWeight in 947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
          0.062412545 = weight(abstract_txt:each in 947) [ClassicSimilarity], result of:
            0.062412545 = score(doc=947,freq=5.0), product of:
              0.123917945 = queryWeight, product of:
                1.4276497 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.021074047 = queryNorm
              0.50366026 = fieldWeight in 947, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
          0.05629138 = weight(abstract_txt:system in 947) [ClassicSimilarity], result of:
            0.05629138 = score(doc=947,freq=6.0), product of:
              0.124609426 = queryWeight, product of:
                1.7533784 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.021074047 = queryNorm
              0.45174253 = fieldWeight in 947, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
          0.13570417 = weight(abstract_txt:clustering in 947) [ClassicSimilarity], result of:
            0.13570417 = score(doc=947,freq=2.0), product of:
              0.2822681 = queryWeight, product of:
                2.1546934 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.021074047 = queryNorm
              0.4807634 = fieldWeight in 947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
          0.19438234 = weight(abstract_txt:cluster in 947) [ClassicSimilarity], result of:
            0.19438234 = score(doc=947,freq=3.0), product of:
              0.3133337 = queryWeight, product of:
                2.2701685 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.021074047 = queryNorm
              0.6203685 = fieldWeight in 947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
          0.058076493 = weight(abstract_txt:retrieval in 947) [ClassicSimilarity], result of:
            0.058076493 = score(doc=947,freq=3.0), product of:
              0.17643286 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021074047 = queryNorm
              0.3291705 = fieldWeight in 947, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=947)
        0.24 = coord(6/25)
    
  3. Pao, M.L.: Retrieval differences between term and citation indexing (1989) 0.12
    0.12392715 = sum of:
      0.12392715 = product of:
        0.51636314 = sum of:
          0.08715885 = weight(abstract_txt:retrieve in 3566) [ClassicSimilarity], result of:
            0.08715885 = score(doc=3566,freq=1.0), product of:
              0.13236931 = queryWeight, product of:
                1.0433577 = boost
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.021074047 = queryNorm
              0.6584521 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0201335 = idf(docFreq=291, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
          0.030287903 = weight(abstract_txt:used in 3566) [ClassicSimilarity], result of:
            0.030287903 = score(doc=3566,freq=1.0), product of:
              0.08243325 = queryWeight, product of:
                1.1644096 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021074047 = queryNorm
              0.36742336 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
          0.2155821 = weight(abstract_txt:keys in 3566) [ClassicSimilarity], result of:
            0.2155821 = score(doc=3566,freq=1.0), product of:
              0.24209628 = queryWeight, product of:
                1.4110215 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.021074047 = queryNorm
              0.8904809 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
          0.055823475 = weight(abstract_txt:each in 3566) [ClassicSimilarity], result of:
            0.055823475 = score(doc=3566,freq=1.0), product of:
              0.123917945 = queryWeight, product of:
                1.4276497 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.021074047 = queryNorm
              0.4504874 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
          0.06044984 = weight(abstract_txt:bibliographic in 3566) [ClassicSimilarity], result of:
            0.06044984 = score(doc=3566,freq=1.0), product of:
              0.13067316 = queryWeight, product of:
                1.4660466 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.021074047 = queryNorm
              0.46260333 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
          0.06706096 = weight(abstract_txt:retrieval in 3566) [ClassicSimilarity], result of:
            0.06706096 = score(doc=3566,freq=1.0), product of:
              0.17643286 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021074047 = queryNorm
              0.38009337 = fieldWeight in 3566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=3566)
        0.24 = coord(6/25)
    
  4. Huang, L.; Milne, D.; Frank, E.; Witten, I.H.: Learning a concept-based document similarity measure (2012) 0.12
    0.12012062 = sum of:
      0.12012062 = product of:
        0.6006031 = sum of:
          0.03987391 = weight(abstract_txt:each in 372) [ClassicSimilarity], result of:
            0.03987391 = score(doc=372,freq=1.0), product of:
              0.123917945 = queryWeight, product of:
                1.4276497 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.021074047 = queryNorm
              0.32177672 = fieldWeight in 372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.19386312 = weight(abstract_txt:clustering in 372) [ClassicSimilarity], result of:
            0.19386312 = score(doc=372,freq=2.0), product of:
              0.2822681 = queryWeight, product of:
                2.1546934 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.021074047 = queryNorm
              0.6868049 = fieldWeight in 372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.15864152 = weight(abstract_txt:lexical in 372) [ClassicSimilarity], result of:
            0.15864152 = score(doc=372,freq=1.0), product of:
              0.31113788 = queryWeight, product of:
                2.2622 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.021074047 = queryNorm
              0.5098753 = fieldWeight in 372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.16032386 = weight(abstract_txt:cluster in 372) [ClassicSimilarity], result of:
            0.16032386 = score(doc=372,freq=1.0), product of:
              0.3133337 = queryWeight, product of:
                2.2701685 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.021074047 = queryNorm
              0.5116713 = fieldWeight in 372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
          0.047900684 = weight(abstract_txt:retrieval in 372) [ClassicSimilarity], result of:
            0.047900684 = score(doc=372,freq=1.0), product of:
              0.17643286 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021074047 = queryNorm
              0.27149525 = fieldWeight in 372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=372)
        0.2 = coord(5/25)
    
  5. Evens, M.: Thesaural relations in information retrieval (2002) 0.11
    0.11333596 = sum of:
      0.11333596 = product of:
        0.5666798 = sum of:
          0.04496587 = weight(abstract_txt:used in 1201) [ClassicSimilarity], result of:
            0.04496587 = score(doc=1201,freq=3.0), product of:
              0.08243325 = queryWeight, product of:
                1.1644096 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.021074047 = queryNorm
              0.54548216 = fieldWeight in 1201, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.039395757 = weight(abstract_txt:system in 1201) [ClassicSimilarity], result of:
            0.039395757 = score(doc=1201,freq=1.0), product of:
              0.124609426 = queryWeight, product of:
                1.7533784 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.021074047 = queryNorm
              0.3161539 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.19036981 = weight(abstract_txt:lexical in 1201) [ClassicSimilarity], result of:
            0.19036981 = score(doc=1201,freq=1.0), product of:
              0.31113788 = queryWeight, product of:
                2.2622 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.021074047 = queryNorm
              0.6118503 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.19238862 = weight(abstract_txt:cluster in 1201) [ClassicSimilarity], result of:
            0.19238862 = score(doc=1201,freq=1.0), product of:
              0.3133337 = queryWeight, product of:
                2.2701685 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.021074047 = queryNorm
              0.61400557 = fieldWeight in 1201, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
          0.09955971 = weight(abstract_txt:retrieval in 1201) [ClassicSimilarity], result of:
            0.09955971 = score(doc=1201,freq=3.0), product of:
              0.17643286 = queryWeight, product of:
                2.4091249 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021074047 = queryNorm
              0.5642923 = fieldWeight in 1201, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1201)
        0.2 = coord(5/25)
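
The score breakdowns above follow the tf-idf scheme of Lucene's ClassicSimilarity: each matching term contributes queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), with tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the sum is scaled by coord (matched terms / query terms). The following is a minimal sketch that recomputes the figure for entry 5 from the numbers in the listing. The function and variable names are illustrative assumptions, not part of the catalogue software, and the boost, queryNorm and fieldNorm values are simply copied from the listing rather than derived.

```python
import math

MAX_DOCS = 44218          # maxDocs as shown in the listing
QUERY_NORM = 0.021074047  # queryNorm as shown in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm,
               query_norm=QUERY_NORM, max_docs=MAX_DOCS):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm             # "queryWeight" line
    field_weight = math.sqrt(freq) * term_idf * field_norm   # "fieldWeight" line
    return query_weight * field_weight


def document_score(terms, field_norm, total_query_terms):
    """Sum the term scores, then scale by coord = matched / query terms."""
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    return total * len(terms) / total_query_terms


# (termFreq, docFreq, boost) for the five matching terms of entry 5 (doc=1201),
# copied from the listing.
DOC_1201_TERMS = [
    (3.0, 4177, 1.1644096),  # used
    (1.0, 4123, 1.7533784),  # system
    (1.0, 175, 2.2622),      # lexical
    (1.0, 171, 2.2701685),   # cluster
    (3.0, 3720, 2.4091249),  # retrieval
]

if __name__ == "__main__":
    # The listing reports 0.11333596 for this entry.
    print(document_score(DOC_1201_TERMS, field_norm=0.09375, total_query_terms=25))
```

Because the listing's values come from single-precision arithmetic, a recomputation in double precision should agree only to about six significant digits.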