Search (19 results, page 1 of 1)

Cardie, C.: Empirical methods in information extraction (1997) 0.04

0.039365895 = product of:
  0.07873179 = sum of:
    0.06289996 = weight(_text_:processing in 3246) [ClassicSimilarity], result of:
      0.06289996 = score(doc=3246,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.35780904 = fieldWeight in 3246, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0625 = fieldNorm(doc=3246)
    0.015831826 = product of:
      0.047495477 = sum of:
        0.047495477 = weight(_text_:29 in 3246) [ClassicSimilarity], result of:
          0.047495477 = score(doc=3246,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.31092256 = fieldWeight in 3246, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=3246)
      0.33333334 = coord(1/3)
  0.5 = coord(2/4)
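
The arithmetic in the explain tree above can be retraced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), which agree with the figures shown here. The function names are illustrative and not part of any library. queryNorm is copied from the listing rather than derived, since it depends on the full query.

```python
import math

# Values copied from the explain tree for doc 3246.
MAX_DOCS = 44218
QUERY_NORM = 0.043425296  # taken as given; depends on all query terms


def idf(doc_freq, max_docs):
    """Inverse document frequency as assumed for ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """score = queryWeight * fieldWeight for a single term."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm       # idf * queryNorm
    field_weight = tf * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Term "processing": freq=2.0, docFreq=2097, fieldNorm=0.0625
processing = term_score(2.0, 2097, MAX_DOCS, QUERY_NORM, 0.0625)

# Term "29": same shape, but nested in a clause scaled by coord(1/3)
term_29 = term_score(2.0, 3565, MAX_DOCS, QUERY_NORM, 0.0625)
inner = term_29 * (1 / 3)

# Outer sum scaled by coord(2/4): two of four top-level clauses matched
total = (processing + inner) * (2 / 4)

print(f"processing={processing:.8f} inner={inner:.8f} total={total:.9f}")
```

The result should agree with the listed 0.039365895 up to rounding, since the original values are single-precision floats and Python computes in double precision.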

Date: 6. 3.1999 13:50:29
Footnote: Contribution to a special section reviewing recent research in empirical methods in speech recognition, syntactic parsing, semantic processing, information extraction and machine translation

Galal, G.M.; Cook, D.J.; Holder, L.B.: Exploiting parallelism in a structural scientific discovery system to improve scalability (1999) 0.03
```
0.026916523 = product of:
  0.053833045 = sum of:
    0.04717497 = weight(_text_:processing in 2952) [ClassicSimilarity], result of:
      0.04717497 = score(doc=2952,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 2952, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=2952)
    0.006658075 = product of:
      0.019974224 = sum of:
        0.019974224 = weight(_text_:science in 2952) [ClassicSimilarity], result of:
          0.019974224 = score(doc=2952,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 2952, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2952)
      0.33333334 = coord(1/3)
  0.5 = coord(2/4)
```
Abstract: The large amount of data collected today is quickly overwhelming researchers' abilities to interpret the data and discover interesting patterns. Knowledge discovery and data mining approaches hold the potential to automate the interpretation process, but these approaches frequently utilize computationally expensive algorithms. In particular, scientific discovery systems focus on the utilization of richer data representation, sometimes without regard for scalability. This research investigates approaches for scaling a particular knowledge discovery in databases (KDD) system, SUBDUE, using parallel and distributed resources. SUBDUE has been used to discover interesting and repetitive concepts in graph-based databases from a variety of domains, but requires a substantial amount of processing time. Experiments that demonstrate scalability of parallel versions of the SUBDUE system are performed using CAD circuit databases and artificially-generated databases, and potential achievements and obstacles are discussed
Source: Journal of the American Society for Information Science. 50(1999) no.1, S.65-73

Gaizauskas, R.; Wilks, Y.: Information extraction : beyond document retrieval (1998) 0.02
```
0.016678872 = product of:
  0.06671549 = sum of:
    0.06671549 = weight(_text_:processing in 4716) [ClassicSimilarity], result of:
      0.06671549 = score(doc=4716,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3795138 = fieldWeight in 4716, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=4716)
  0.25 = coord(1/4)
```
Abstract: In this paper we give a synoptic view of the growth of the text processing technology of information extraction (IE), whose function is to extract information about a pre-specified set of entities, relations or events from natural language texts and to record this information in structured representations called templates. Here we describe the nature of the IE task, review the history of the area from its origins in AI work in the 1960s and 70s till the present, discuss the techniques being used to carry out the task, describe application areas where IE systems are or are about to be at work, and conclude with a discussion of the challenges facing the area. What emerges is a picture of an exciting new text processing technology with a host of new applications, both on its own and in conjunction with other technologies, such as information retrieval, machine translation and data mining

Amir, A.; Feldman, R.; Kashi, R.: A new and versatile method for association generation (1997) 0.02

0.01576062 = product of:
  0.06304248 = sum of:
    0.06304248 = product of:
      0.09456371 = sum of:
        0.047495477 = weight(_text_:29 in 1270) [ClassicSimilarity], result of:
          0.047495477 = score(doc=1270,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.31092256 = fieldWeight in 1270, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0625 = fieldNorm(doc=1270)
        0.047068227 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
          0.047068227 = score(doc=1270,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.30952093 = fieldWeight in 1270, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1270)
      0.6666667 = coord(2/3)
  0.25 = coord(1/4)

Date: 5. 4.1996 15:29:15
Source: Information systems. 22(1997) nos.5/6, S.333-347

Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997) 0.01

0.01379054 = product of:
  0.05516216 = sum of:
    0.05516216 = product of:
      0.08274324 = sum of:
        0.04155854 = weight(_text_:29 in 2908) [ClassicSimilarity], result of:
          0.04155854 = score(doc=2908,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.27205724 = fieldWeight in 2908, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2908)
        0.041184697 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
          0.041184697 = score(doc=2908,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.2708308 = fieldWeight in 2908, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2908)
      0.6666667 = coord(2/3)
  0.25 = coord(1/4)

Date: 5. 4.1996 15:29:15
Source: Information systems. 22(1997) nos.5/6, S.349-385

Methodologies for knowledge discovery and data mining : Third Pacific-Asia Conference, PAKDD'99, Beijing, China, April 26-28, 1999, Proceedings (1999) 0.01

0.010810301 = product of:
  0.043241203 = sum of:
    0.043241203 = product of:
      0.064861804 = sum of:
        0.023303263 = weight(_text_:science in 3821) [ClassicSimilarity], result of:
          0.023303263 = score(doc=3821,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 3821, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3821)
        0.04155854 = weight(_text_:29 in 3821) [ClassicSimilarity], result of:
          0.04155854 = score(doc=3821,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.27205724 = fieldWeight in 3821, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3821)
      0.6666667 = coord(2/3)
  0.25 = coord(1/4)

Abstract: The 29 revised full papers presented together with 37 short papers were carefully selected from a total of 158 submissions. The book is divided into sections on emerging KDD technology; association rules; feature selection and generation; mining in semi-unstructured data; interestingness, surprisingness, and exceptions; rough sets, fuzzy logic, and neural networks; induction, classification, and clustering; visualization, causal models and graph-based methods; agent-based and distributed data mining; and advanced topics and new methodologies
Series: Lecture notes in computer science; vol.1574

Chowdhury, G.G.: Template mining for information extraction from digital documents (1999) 0.01

0.0068641165 = product of:
  0.027456466 = sum of:
    0.027456466 = product of:
      0.082369395 = sum of:
        0.082369395 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
          0.082369395 = score(doc=4577,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.5416616 = fieldWeight in 4577, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=4577)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Date: 2. 4.2000 18:01:22

KDD : techniques and applications (1998) 0.01

0.005883528 = product of:
  0.023534112 = sum of:
    0.023534112 = product of:
      0.070602335 = sum of:
        0.070602335 = weight(_text_:22 in 6783) [ClassicSimilarity], result of:
          0.070602335 = score(doc=6783,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.46428138 = fieldWeight in 6783, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=6783)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Footnote: A special issue of selected papers from the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'97), held in Singapore, 22-23 Feb 1997

Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997) 0.00

0.0039223526 = product of:
  0.01568941 = sum of:
    0.01568941 = product of:
      0.047068227 = sum of:
        0.047068227 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
          0.047068227 = score(doc=1737,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.30952093 = fieldWeight in 1737, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=1737)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Date: 22.11.1998 18:57:22

Knowledge discovery and data mining (1998) 0.00

0.0033290375 = product of:
  0.01331615 = sum of:
    0.01331615 = product of:
      0.03994845 = sum of:
        0.03994845 = weight(_text_:science in 2898) [ClassicSimilarity], result of:
          0.03994845 = score(doc=2898,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.34923816 = fieldWeight in 2898, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.09375 = fieldNorm(doc=2898)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.397-470

Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data mining a sky server (1996) 0.00

0.0022193585 = product of:
  0.008877434 = sum of:
    0.008877434 = product of:
      0.0266323 = sum of:
        0.0266323 = weight(_text_:science in 6625) [ClassicSimilarity], result of:
          0.0266323 = score(doc=6625,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.23282544 = fieldWeight in 6625, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0625 = fieldNorm(doc=6625)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Abstract: Offers a data mining approach based on machine learning classification methods to the problem of automated cataloguing of online databases of digital images resulting from sky surveys. The SKICAT system automates the reduction and analysis of 3 terabytes of images expected to contain about 2 billion sky objects. It offers a solution to problems associated with the analysis of large data sets in science

Principles of data mining and knowledge discovery (1998) 0.00

0.0022193585 = product of:
  0.008877434 = sum of:
    0.008877434 = product of:
      0.0266323 = sum of:
        0.0266323 = weight(_text_:science in 3822) [ClassicSimilarity], result of:
          0.0266323 = score(doc=3822,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.23282544 = fieldWeight in 3822, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0625 = fieldNorm(doc=3822)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Series: Lecture notes in computer science; vol.1510

Trybula, W.J.: Data mining and knowledge discovery (1997) 0.00

0.0019419387 = product of:
  0.0077677546 = sum of:
    0.0077677546 = product of:
      0.023303263 = sum of:
        0.023303263 = weight(_text_:science in 2300) [ClassicSimilarity], result of:
          0.023303263 = score(doc=2300,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 2300, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2300)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Annual review of information science and technology. 32(1997), S.197-229

Wong, S.K.M.; Butz, C.J.; Xiang, X.: Automated database schema design using mined data dependencies (1998) 0.00

0.0019419387 = product of:
  0.0077677546 = sum of:
    0.0077677546 = product of:
      0.023303263 = sum of:
        0.023303263 = weight(_text_:science in 2897) [ClassicSimilarity], result of:
          0.023303263 = score(doc=2897,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 2897, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2897)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.455-470

Raghavan, V.V.; Deogun, J.S.; Sever, H.: Knowledge discovery and data mining : introduction (1998) 0.00

0.0019419387 = product of:
  0.0077677546 = sum of:
    0.0077677546 = product of:
      0.023303263 = sum of:
        0.023303263 = weight(_text_:science in 2899) [ClassicSimilarity], result of:
          0.023303263 = score(doc=2899,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 2899, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2899)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.397-402

Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998) 0.00

0.0019419387 = product of:
  0.0077677546 = sum of:
    0.0077677546 = product of:
      0.023303263 = sum of:
        0.023303263 = weight(_text_:science in 2909) [ClassicSimilarity], result of:
          0.023303263 = score(doc=2909,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 2909, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2909)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.403-414

Lingras, P.J.; Yao, Y.Y.: Data mining using extensions of the rough set model (1998) 0.00

0.0019419387 = product of:
  0.0077677546 = sum of:
    0.0077677546 = product of:
      0.023303263 = sum of:
        0.023303263 = weight(_text_:science in 2910) [ClassicSimilarity], result of:
          0.023303263 = score(doc=2910,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 2910, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2910)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.415-422

Deogun, J.S.: Feature selection and effective classifiers (1998) 0.00

0.0016645187 = product of:
  0.006658075 = sum of:
    0.006658075 = product of:
      0.019974224 = sum of:
        0.019974224 = weight(_text_:science in 2911) [ClassicSimilarity], result of:
          0.019974224 = score(doc=2911,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 2911, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2911)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.423-434

Wu, X.: Rule induction with extension matrices (1998) 0.00

0.0016645187 = product of:
  0.006658075 = sum of:
    0.006658075 = product of:
      0.019974224 = sum of:
        0.019974224 = weight(_text_:science in 2912) [ClassicSimilarity], result of:
          0.019974224 = score(doc=2912,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 2912, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2912)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Source: Journal of the American Society for Information Science. 49(1998) no.5, S.435-454
