Document (#27230)

Author
Eijk, C.C. van der
Mulligen, E.M. van
Kors, J.A.
Mons, B.
Berg, J. van den
Title
Constructing an associative concept space for literature-based discovery
Source
Journal of the American Society for Information Science and technology. 55(2004) no.5, S.436-444
Year
2004
Abstract
Scientific literature is often fragmented, which implies that certain scientific questions can only be answered by combining information from various articles. In this paper, a new algorithm is proposed for finding associations between related concepts present in literature. To this end, concepts are mapped to a multidimensional space by a Hebbian type of learning algorithm using co-occurrence data as input. The resulting concept space allows exploration of the neighborhood of a concept and finding potentially novel relationships between concepts. The obtained information retrieval system is useful for finding literature supporting hypotheses and for discovering previously unknown relationships between concepts. Tests an artificial data show the potential of the proposed methodology. In addition, preliminary tests an a set of Medline abstracts yield promising results.
Theme
Informetrie

Similar documents (author)

  1. Berg, O.: Current problems with MARC/ISBD formats in relation to online public access of bibliographic information (1991) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:berg in 469) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 469, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=469)
    
  2. Berg, S.: Auf dem Weg : Fallbeispiel: Vorbereitungen für einen elektronischen Katalog (1995) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:berg in 717) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 717, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=717)
    
  3. Berg, L.: Wie das Internet die Gesellschaft verändert : Google gründet ein Forschungsinstitut in Berlin (2011) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:berg in 1553) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 1553, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=1553)
    
  4. Berg, L.: Pablo will es wissen : Lernen mit Salman Khan (2012) 5.42
    5.416974 = sum of:
      5.416974 = weight(author_txt:berg in 2229) [ClassicSimilarity], result of:
        5.416974 = fieldWeight in 2229, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.625 = fieldNorm(doc=2229)
    
  5. Berg, J. van den: ¬The ICONCLASS browser user's guide (1992) 4.33
    4.333579 = sum of:
      4.333579 = weight(author_txt:berg in 3270) [ClassicSimilarity], result of:
        4.333579 = fieldWeight in 3270, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.667158 = idf(docFreq=19, maxDocs=42740)
          0.5 = fieldNorm(doc=3270)
    

Similar documents (content)

  1. Koike, A.; Takagi, T.: Knowledge discovery based on an implicit and explicit conceptual network (2007) 0.20
    0.1953636 = sum of:
      0.1953636 = product of:
        0.61051124 = sum of:
          0.016100068 = weight(abstract_txt:data in 2086) [ClassicSimilarity], result of:
            0.016100068 = score(doc=2086,freq=1.0), product of:
              0.07639766 = queryWeight, product of:
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02265752 = queryNorm
              0.21074034 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.065669134 = weight(abstract_txt:medline in 2086) [ClassicSimilarity], result of:
            0.065669134 = score(doc=2086,freq=1.0), product of:
              0.15479577 = queryWeight, product of:
                1.0065249 = boost
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.02265752 = queryNorm
              0.4242308 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.087115705 = weight(abstract_txt:discovering in 2086) [ClassicSimilarity], result of:
            0.087115705 = score(doc=2086,freq=1.0), product of:
              0.18688847 = queryWeight, product of:
                1.1059519 = boost
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.02265752 = queryNorm
              0.46613738 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.07474207 = weight(abstract_txt:scientific in 2086) [ClassicSimilarity], result of:
            0.07474207 = score(doc=2086,freq=3.0), product of:
              0.14741145 = queryWeight, product of:
                1.3890747 = boost
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.02265752 = queryNorm
              0.5070303 = fieldWeight in 2086, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.04709765 = weight(abstract_txt:relationships in 2086) [ClassicSimilarity], result of:
            0.04709765 = score(doc=2086,freq=1.0), product of:
              0.15626475 = queryWeight, product of:
                1.4301794 = boost
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.02265752 = queryNorm
              0.3013965 = fieldWeight in 2086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.03781207 = weight(abstract_txt:between in 2086) [ClassicSimilarity], result of:
            0.03781207 = score(doc=2086,freq=2.0), product of:
              0.122640975 = queryWeight, product of:
                1.5517559 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.02265752 = queryNorm
              0.30831513 = fieldWeight in 2086, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.10188879 = weight(abstract_txt:concept in 2086) [ClassicSimilarity], result of:
            0.10188879 = score(doc=2086,freq=3.0), product of:
              0.2074607 = queryWeight, product of:
                2.0182433 = boost
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.02265752 = queryNorm
              0.49112335 = fieldWeight in 2086, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
          0.18008576 = weight(abstract_txt:concepts in 2086) [ClassicSimilarity], result of:
            0.18008576 = score(doc=2086,freq=5.0), product of:
              0.28153634 = queryWeight, product of:
                2.7148273 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.02265752 = queryNorm
              0.6396537 = fieldWeight in 2086, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.0625 = fieldNorm(doc=2086)
        0.32 = coord(8/25)
    
  2. Leroy, G.; Chen, H.: Genescene: an ontology-enhanced integration of linguistic and co-occurrence based relations in biomedical texts (2005) 0.17
    0.16892552 = sum of:
      0.16892552 = product of:
        0.60330546 = sum of:
          0.016100068 = weight(abstract_txt:data in 260) [ClassicSimilarity], result of:
            0.016100068 = score(doc=260,freq=1.0), product of:
              0.07639766 = queryWeight, product of:
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02265752 = queryNorm
              0.21074034 = fieldWeight in 260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.092870176 = weight(abstract_txt:medline in 260) [ClassicSimilarity], result of:
            0.092870176 = score(doc=260,freq=2.0), product of:
              0.15479577 = queryWeight, product of:
                1.0065249 = boost
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.02265752 = queryNorm
              0.59995294 = fieldWeight in 260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.787693 = idf(docFreq=130, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.06611669 = weight(abstract_txt:occurrence in 260) [ClassicSimilarity], result of:
            0.06611669 = score(doc=260,freq=1.0), product of:
              0.1554983 = queryWeight, product of:
                1.0088063 = boost
                6.803078 = idf(docFreq=128, maxDocs=42740)
                0.02265752 = queryNorm
              0.4251924 = fieldWeight in 260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.803078 = idf(docFreq=128, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.02673717 = weight(abstract_txt:between in 260) [ClassicSimilarity], result of:
            0.02673717 = score(doc=260,freq=1.0), product of:
              0.122640975 = queryWeight, product of:
                1.5517559 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.02265752 = queryNorm
              0.21801172 = fieldWeight in 260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.10188879 = weight(abstract_txt:concept in 260) [ClassicSimilarity], result of:
            0.10188879 = score(doc=260,freq=3.0), product of:
              0.2074607 = queryWeight, product of:
                2.0182433 = boost
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.02265752 = queryNorm
              0.49112335 = fieldWeight in 260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.17140265 = weight(abstract_txt:space in 260) [ClassicSimilarity], result of:
            0.17140265 = score(doc=260,freq=3.0), product of:
              0.29344717 = queryWeight, product of:
                2.400328 = boost
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.02265752 = queryNorm
              0.58410054 = fieldWeight in 260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
          0.12818995 = weight(abstract_txt:literature in 260) [ClassicSimilarity], result of:
            0.12818995 = score(doc=260,freq=3.0), product of:
              0.26611364 = queryWeight, product of:
                2.6394203 = boost
                4.4498587 = idf(docFreq=1356, maxDocs=42740)
                0.02265752 = queryNorm
              0.48171133 = fieldWeight in 260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4498587 = idf(docFreq=1356, maxDocs=42740)
                0.0625 = fieldNorm(doc=260)
        0.28 = coord(7/25)
    
  3. Li, J.; Zhang, P.; Cao, J.: External concept support for group support systems through Web mining (2009) 0.15
    0.14700851 = sum of:
      0.14700851 = product of:
        0.7350425 = sum of:
          0.052474134 = weight(abstract_txt:proposed in 4807) [ClassicSimilarity], result of:
            0.052474134 = score(doc=4807,freq=1.0), product of:
              0.14472772 = queryWeight, product of:
                1.3763721 = boost
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.02265752 = queryNorm
              0.36257142 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.078125 = fieldNorm(doc=4807)
          0.0982525 = weight(abstract_txt:algorithm in 4807) [ClassicSimilarity], result of:
            0.0982525 = score(doc=4807,freq=1.0), product of:
              0.21986222 = queryWeight, product of:
                1.6964275 = boost
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.02265752 = queryNorm
              0.44688213 = fieldWeight in 4807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7200913 = idf(docFreq=380, maxDocs=42740)
                0.078125 = fieldNorm(doc=4807)
          0.19454713 = weight(abstract_txt:concept in 4807) [ClassicSimilarity], result of:
            0.19454713 = score(doc=4807,freq=7.0), product of:
              0.2074607 = queryWeight, product of:
                2.0182433 = boost
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.02265752 = queryNorm
              0.93775415 = fieldWeight in 4807, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.078125 = fieldNorm(doc=4807)
          0.24739844 = weight(abstract_txt:space in 4807) [ClassicSimilarity], result of:
            0.24739844 = score(doc=4807,freq=4.0), product of:
              0.29344717 = queryWeight, product of:
                2.400328 = boost
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.02265752 = queryNorm
              0.8430766 = fieldWeight in 4807, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.078125 = fieldNorm(doc=4807)
          0.14237028 = weight(abstract_txt:concepts in 4807) [ClassicSimilarity], result of:
            0.14237028 = score(doc=4807,freq=2.0), product of:
              0.28153634 = queryWeight, product of:
                2.7148273 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.02265752 = queryNorm
              0.50569063 = fieldWeight in 4807, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.078125 = fieldNorm(doc=4807)
        0.2 = coord(5/25)
    
  4. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.14
    0.13847648 = sum of:
      0.13847648 = product of:
        0.57698536 = sum of:
          0.028461168 = weight(abstract_txt:data in 4698) [ClassicSimilarity], result of:
            0.028461168 = score(doc=4698,freq=2.0), product of:
              0.07639766 = queryWeight, product of:
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02265752 = queryNorm
              0.3725398 = fieldWeight in 4698, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
          0.101969406 = weight(abstract_txt:relationships in 4698) [ClassicSimilarity], result of:
            0.101969406 = score(doc=4698,freq=3.0), product of:
              0.15626475 = queryWeight, product of:
                1.4301794 = boost
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.02265752 = queryNorm
              0.6525426 = fieldWeight in 4698, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
          0.03342146 = weight(abstract_txt:between in 4698) [ClassicSimilarity], result of:
            0.03342146 = score(doc=4698,freq=1.0), product of:
              0.122640975 = queryWeight, product of:
                1.5517559 = boost
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.02265752 = queryNorm
              0.27251464 = fieldWeight in 4698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4881876 = idf(docFreq=3549, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
          0.14706382 = weight(abstract_txt:concept in 4698) [ClassicSimilarity], result of:
            0.14706382 = score(doc=4698,freq=4.0), product of:
              0.2074607 = queryWeight, product of:
                2.0182433 = boost
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.02265752 = queryNorm
              0.70887554 = fieldWeight in 4698, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5368032 = idf(docFreq=1243, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
          0.12369922 = weight(abstract_txt:space in 4698) [ClassicSimilarity], result of:
            0.12369922 = score(doc=4698,freq=1.0), product of:
              0.29344717 = queryWeight, product of:
                2.400328 = boost
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.02265752 = queryNorm
              0.4215383 = fieldWeight in 4698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.39569 = idf(docFreq=526, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
          0.14237028 = weight(abstract_txt:concepts in 4698) [ClassicSimilarity], result of:
            0.14237028 = score(doc=4698,freq=2.0), product of:
              0.28153634 = queryWeight, product of:
                2.7148273 = boost
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.02265752 = queryNorm
              0.50569063 = fieldWeight in 4698, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.576989 = idf(docFreq=1194, maxDocs=42740)
                0.078125 = fieldNorm(doc=4698)
        0.24 = coord(6/25)
    
  5. Gordon, M.D.; Dumais, S.: Using latent semantic indexing for literature based discovery (1998) 0.13
    0.132543 = sum of:
      0.132543 = product of:
        0.66271496 = sum of:
          0.118560694 = weight(abstract_txt:hypotheses in 5893) [ClassicSimilarity], result of:
            0.118560694 = score(doc=5893,freq=1.0), product of:
              0.17515312 = queryWeight, product of:
                1.0706658 = boost
                7.220239 = idf(docFreq=84, maxDocs=42740)
                0.02265752 = queryNorm
              0.6768974 = fieldWeight in 5893, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.220239 = idf(docFreq=84, maxDocs=42740)
                0.09375 = fieldNorm(doc=5893)
          0.13067356 = weight(abstract_txt:discovering in 5893) [ClassicSimilarity], result of:
            0.13067356 = score(doc=5893,freq=1.0), product of:
              0.18688847 = queryWeight, product of:
                1.1059519 = boost
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.02265752 = queryNorm
              0.69920605 = fieldWeight in 5893, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.458198 = idf(docFreq=66, maxDocs=42740)
                0.09375 = fieldNorm(doc=5893)
          0.09153997 = weight(abstract_txt:scientific in 5893) [ClassicSimilarity], result of:
            0.09153997 = score(doc=5893,freq=2.0), product of:
              0.14741145 = queryWeight, product of:
                1.3890747 = boost
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.02265752 = queryNorm
              0.62098277 = fieldWeight in 5893, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6837454 = idf(docFreq=1073, maxDocs=42740)
                0.09375 = fieldNorm(doc=5893)
          0.099909194 = weight(abstract_txt:relationships in 5893) [ClassicSimilarity], result of:
            0.099909194 = score(doc=5893,freq=2.0), product of:
              0.15626475 = queryWeight, product of:
                1.4301794 = boost
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.02265752 = queryNorm
              0.63935846 = fieldWeight in 5893, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.822344 = idf(docFreq=934, maxDocs=42740)
                0.09375 = fieldNorm(doc=5893)
          0.22203152 = weight(abstract_txt:literature in 5893) [ClassicSimilarity], result of:
            0.22203152 = score(doc=5893,freq=4.0), product of:
              0.26611364 = queryWeight, product of:
                2.6394203 = boost
                4.4498587 = idf(docFreq=1356, maxDocs=42740)
                0.02265752 = queryNorm
              0.8343485 = fieldWeight in 5893, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4498587 = idf(docFreq=1356, maxDocs=42740)
                0.09375 = fieldNorm(doc=5893)
        0.2 = coord(5/25)