-
Chowdhury, G.G.: Template mining for information extraction from digital documents (1999)
0.22
0.22446406 = product of:
0.44892812 = sum of:
0.44892812 = sum of:
0.35283166 = weight(_text_:mining in 4577) [ClassicSimilarity], result of:
0.35283166 = score(doc=4577,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.2342855 = fieldWeight in 4577, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.109375 = fieldNorm(doc=4577)
0.09609647 = weight(_text_:22 in 4577) [ClassicSimilarity], result of:
0.09609647 = score(doc=4577,freq=2.0), product of:
0.17741053 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05066224 = queryNorm
0.5416616 = fieldWeight in 4577, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=4577)
0.5 = coord(1/2)
- Date
- 2. 4.2000 18:01:22
- Theme
- Data Mining
-
Matson, L.D.; Bonski, D.J.: Do digital libraries need librarians? (1997)
0.13
0.12826519 = product of:
0.25653037 = sum of:
0.25653037 = sum of:
0.2016181 = weight(_text_:mining in 1737) [ClassicSimilarity], result of:
0.2016181 = score(doc=1737,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.705306 = fieldWeight in 1737, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0625 = fieldNorm(doc=1737)
0.054912273 = weight(_text_:22 in 1737) [ClassicSimilarity], result of:
0.054912273 = score(doc=1737,freq=2.0), product of:
0.17741053 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05066224 = queryNorm
0.30952093 = fieldWeight in 1737, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1737)
0.5 = coord(1/2)
- Abstract
- Defines digital libraries and discusses the effects of new technology on librarians. Examines the different viewpoints of librarians and information technologists on digital libraries. Describes the development of a digital library at the National Drug Intelligence Center, USA, which was carried out in collaboration with information technology experts. The system is based on Web enabled search technology to find information, data visualization and data mining to visualize it and use of SGML as an information standard to store it
- Date
- 22.11.1998 18:57:22
- Theme
- Data Mining
-
Saz, J.T.: Perspectivas en recuperacion y explotacion de informacion electronica : el 'data mining' (1997)
0.11
0.109129 = product of:
0.218258 = sum of:
0.218258 = product of:
0.436516 = sum of:
0.436516 = weight(_text_:mining in 3723) [ClassicSimilarity], result of:
0.436516 = score(doc=3723,freq=12.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.5270323 = fieldWeight in 3723, product of:
3.4641016 = tf(freq=12.0), with freq of:
12.0 = termFreq=12.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.078125 = fieldNorm(doc=3723)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Presents the concept and the techniques identified by the term data mining. Explains the principles and phases of developing a data mining process, and the main types of data mining tools
- Footnote
- Übers. des Titels: Perspectives on the retrieval and exploitation of electronic information: data mining
- Theme
- Data Mining
-
Tunbridge, N.: Semiology put to data mining (1999)
0.10
0.10080905 = product of:
0.2016181 = sum of:
0.2016181 = product of:
0.4032362 = sum of:
0.4032362 = weight(_text_:mining in 6782) [ClassicSimilarity], result of:
0.4032362 = score(doc=6782,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.410612 = fieldWeight in 6782, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.125 = fieldNorm(doc=6782)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Theme
- Data Mining
-
Amir, A.; Feldman, R.; Kashi, R.: ¬A new and versatile method for association generation (1997)
0.10
0.098738894 = product of:
0.19747779 = sum of:
0.19747779 = sum of:
0.14256552 = weight(_text_:mining in 1270) [ClassicSimilarity], result of:
0.14256552 = score(doc=1270,freq=2.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.49872664 = fieldWeight in 1270, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0625 = fieldNorm(doc=1270)
0.054912273 = weight(_text_:22 in 1270) [ClassicSimilarity], result of:
0.054912273 = score(doc=1270,freq=2.0), product of:
0.17741053 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05066224 = queryNorm
0.30952093 = fieldWeight in 1270, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1270)
0.5 = coord(1/2)
- Source
- Information systems. 22(1997) nos.5/6, S.333-347
- Theme
- Data Mining
-
Fayyad, U.; Piatetsky-Shapiro, G.; Smyth, P.: From data mining to knowledge discovery in databases (1996)
0.09
0.08910345 = product of:
0.1782069 = sum of:
0.1782069 = product of:
0.3564138 = sum of:
0.3564138 = weight(_text_:mining in 7458) [ClassicSimilarity], result of:
0.3564138 = score(doc=7458,freq=8.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.2468166 = fieldWeight in 7458, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.078125 = fieldNorm(doc=7458)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Gives an overview of data mining and knowledge discovery in databases. Clarifies how they are related both to each other and to related fields. Mentions real world applications data mining techniques, challenges involved in real world applications of knowledge discovery, and current and future research directions
- Theme
- Data Mining
-
Schmid, J.: Data mining : wie finde ich in Datensammlungen entscheidungsrelevante Muster? (1999)
0.09
0.088207915 = product of:
0.17641583 = sum of:
0.17641583 = product of:
0.35283166 = sum of:
0.35283166 = weight(_text_:mining in 4540) [ClassicSimilarity], result of:
0.35283166 = score(doc=4540,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.2342855 = fieldWeight in 4540, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.109375 = fieldNorm(doc=4540)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Theme
- Data Mining
-
Hofstede, A.H.M. ter; Proper, H.A.; Van der Weide, T.P.: Exploiting fact verbalisation in conceptual information modelling (1997)
0.09
0.08639653 = product of:
0.17279306 = sum of:
0.17279306 = sum of:
0.12474483 = weight(_text_:mining in 2908) [ClassicSimilarity], result of:
0.12474483 = score(doc=2908,freq=2.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.4363858 = fieldWeight in 2908, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2908)
0.048048235 = weight(_text_:22 in 2908) [ClassicSimilarity], result of:
0.048048235 = score(doc=2908,freq=2.0), product of:
0.17741053 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.05066224 = queryNorm
0.2708308 = fieldWeight in 2908, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2908)
0.5 = coord(1/2)
- Source
- Information systems. 22(1997) nos.5/6, S.349-385
- Theme
- Data Mining
-
Raghavan, V.V.; Deogun, J.S.; Sever, H.: Knowledge discovery and data mining : introduction (1998)
0.08
0.082510956 = product of:
0.16502191 = sum of:
0.16502191 = product of:
0.33004382 = sum of:
0.33004382 = weight(_text_:mining in 2899) [ClassicSimilarity], result of:
0.33004382 = score(doc=2899,freq=14.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.1545684 = fieldWeight in 2899, product of:
3.7416575 = tf(freq=14.0), with freq of:
14.0 = termFreq=14.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2899)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Defines knowledge discovery and database mining. The challenge for knowledge discovery in databases (KDD) is to automatically process large quantities of raw data, identifying the most significant and meaningful patterns, and present these as as knowledge appropriate for achieving a user's goals. Data mining is the process of deriving useful knowledge from real world databases through the application of pattern extraction techniques. Explains the goals of, and motivation for, research work on data mining. Discusses the nature of database contents, along with problems within the field of data mining
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining
-
Fayyad, U.M.: Data mining and knowledge dicovery : making sense out of data (1996)
0.08
0.077165864 = product of:
0.15433173 = sum of:
0.15433173 = product of:
0.30866346 = sum of:
0.30866346 = weight(_text_:mining in 7007) [ClassicSimilarity], result of:
0.30866346 = score(doc=7007,freq=6.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
1.079775 = fieldWeight in 7007, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.078125 = fieldNorm(doc=7007)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Defines knowledge discovery and data mining (KDD) as the overall process of extracting high level knowledge from low level data. Outlines the KDD process. Explains how KDD is related to the fields of: statistics, pattern recognition, machine learning, artificial intelligence, databases and data warehouses
- Theme
- Data Mining
-
Howlett, D.: Digging deep for treasure (1998)
0.07
0.07128276 = product of:
0.14256552 = sum of:
0.14256552 = product of:
0.28513104 = sum of:
0.28513104 = weight(_text_:mining in 4544) [ClassicSimilarity], result of:
0.28513104 = score(doc=4544,freq=2.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.9974533 = fieldWeight in 4544, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.125 = fieldNorm(doc=4544)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Theme
- Data Mining
-
Lingras, P.J.; Yao, Y.Y.: Data mining using extensions of the rough set model (1998)
0.07
0.069734484 = product of:
0.13946897 = sum of:
0.13946897 = product of:
0.27893794 = sum of:
0.27893794 = weight(_text_:mining in 2910) [ClassicSimilarity], result of:
0.27893794 = score(doc=2910,freq=10.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.97578835 = fieldWeight in 2910, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2910)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Examines basic issues of data mining using the theory of rough sets, which is a recent proposal for generalizing classical set theory. The Pawlak rough set model is based on the concept of an equivalence relation. A generalized rough set model need not be based on equivalence relation axioms. The Pawlak rough set model has been used for deriving deterministic as well as probabilistic rules froma complete database. Demonstrates that a generalised rough set model can be used for generating rules from incomplete databases. These rules are based on plausability functions proposed by Shafer. Discusses the importance of rule extraction from incomplete databases in data mining
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining
-
Trybula, W.J.: Data mining and knowledge discovery (1997)
0.06
0.062372416 = product of:
0.12474483 = sum of:
0.12474483 = product of:
0.24948967 = sum of:
0.24948967 = weight(_text_:mining in 2300) [ClassicSimilarity], result of:
0.24948967 = score(doc=2300,freq=8.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.8727716 = fieldWeight in 2300, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2300)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- State of the art review of the recently developed concepts of data mining (defined as the automated process of evaluating data and finding relationships) and knowledge discovery (defined as the automated process of extracting information, especially unpredicted relationships or previously unknown patterns among the data) with particular reference to numerical data. Includes: the knowledge acquisition process; data mining; evaluation methods; and knowledge discovery. Concludes that existing work in the field are confusing because the terminology is inconsistent and poorly defined. Although methods are available for analyzing and cleaning databases, better coordinated efforts should be directed toward providing users with improved means of structuring search mechanisms to explore the data for relationships
- Theme
- Data Mining
-
Wu, X.: Rule induction with extension matrices (1998)
0.05
0.053462073 = product of:
0.10692415 = sum of:
0.10692415 = product of:
0.2138483 = sum of:
0.2138483 = weight(_text_:mining in 2912) [ClassicSimilarity], result of:
0.2138483 = score(doc=2912,freq=8.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.74808997 = fieldWeight in 2912, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.046875 = fieldNorm(doc=2912)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Presents a heuristic, attribute-based, noise-tolerant data mining program, HCV (Version 2.0), absed on the newly-developed extension matrix approach. Gives a simple example of attribute-based induction to show the difference between the rules in variable-valued logic produced by HCV, the decision tree generated by C4.5 and the decision tree's decompiled rules by C4.5 rules. Outlines the extension matrix approach for data mining. Describes the HCV algorithm in detail. Outlines techniques developed and implemented in the HCV program for noise handling and discretization of continuous domains respectively. Follows these with a performance comparison of HCV with famous ID3-like algorithms including C4.5 and C4.5 rules on a collection of standard databases including the famous MONK's problems
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining
-
Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data ming a sky server (1996)
0.05
0.050404526 = product of:
0.10080905 = sum of:
0.10080905 = product of:
0.2016181 = sum of:
0.2016181 = weight(_text_:mining in 6625) [ClassicSimilarity], result of:
0.2016181 = score(doc=6625,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.705306 = fieldWeight in 6625, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0625 = fieldNorm(doc=6625)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Offers a data mining approach based on machine learning classification methods to the problem of automated cataloguing of online databases of digital images resulting from sky surveys. The SKICAT system automates the reduction and analysis of 3 terabytes of images expected to contain about 2 billion sky objects. It offers a solution to problems associated with the analysis of large data sets in science
- Theme
- Data Mining
-
Chen, Z.: Knowledge discovery and system-user partnership : on a production 'adversarial partnership' approach (1994)
0.05
0.050404526 = product of:
0.10080905 = sum of:
0.10080905 = product of:
0.2016181 = sum of:
0.2016181 = weight(_text_:mining in 6759) [ClassicSimilarity], result of:
0.2016181 = score(doc=6759,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.705306 = fieldWeight in 6759, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0625 = fieldNorm(doc=6759)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Examines the relationship between systems and users from the knowledge discovery in databases or data mining perspecitives. A comprehensive study on knowledge discovery in human computer symbiosis is needed. Proposes a database-user adversarial partnership, which is general enough to cover knowledge discovery and security of issues related to databases and their users. It can be further generalized into system-user adversarial paertnership. Discusses opportunities provided by knowledge discovery techniques and potential social implications
- Theme
- Data Mining
-
Wong, S.K.M.; Butz, C.J.; Xiang, X.: Automated database schema design using mined data dependencies (1998)
0.04
0.044103958 = product of:
0.088207915 = sum of:
0.088207915 = product of:
0.17641583 = sum of:
0.17641583 = weight(_text_:mining in 2897) [ClassicSimilarity], result of:
0.17641583 = score(doc=2897,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.61714274 = fieldWeight in 2897, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2897)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining
-
Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998)
0.04
0.044103958 = product of:
0.088207915 = sum of:
0.088207915 = product of:
0.17641583 = sum of:
0.17641583 = weight(_text_:mining in 2909) [ClassicSimilarity], result of:
0.17641583 = score(doc=2909,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.61714274 = fieldWeight in 2909, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=2909)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining
-
Search tools (1997)
0.04
0.044103958 = product of:
0.088207915 = sum of:
0.088207915 = product of:
0.17641583 = sum of:
0.17641583 = weight(_text_:mining in 3834) [ClassicSimilarity], result of:
0.17641583 = score(doc=3834,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.61714274 = fieldWeight in 3834, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.0546875 = fieldNorm(doc=3834)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- Offers brief accounts of Internet search tools. Covers the Lycos revamp; the new navigation service produced jointly by Excite and Netscape, delivering a language specific, locally relevant Web guide for Japan, Germany, France, the UK and Australia; InfoWatcher, a combination offline browser, search engine and push product from Carvelle Inc., USA; Alexa by Alexa Internet and WBI from IBM which are free and provide users with information on how others have used the Web sites which they are visiting; and Concept Explorer from Knowledge Discovery Systems, Inc., California which performs data mining from the Web, Usenet groups, MEDLINE and the US Patent and Trademark Office patent abstracts
- Theme
- Data Mining
-
Deogun, J.S.: Feature selection and effective classifiers (1998)
0.04
0.037803393 = product of:
0.075606786 = sum of:
0.075606786 = product of:
0.15121357 = sum of:
0.15121357 = weight(_text_:mining in 2911) [ClassicSimilarity], result of:
0.15121357 = score(doc=2911,freq=4.0), product of:
0.28585905 = queryWeight, product of:
5.642448 = idf(docFreq=425, maxDocs=44218)
0.05066224 = queryNorm
0.5289795 = fieldWeight in 2911, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
5.642448 = idf(docFreq=425, maxDocs=44218)
0.046875 = fieldNorm(doc=2911)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Footnote
- Contribution to a special issue devoted to knowledge discovery and data mining
- Theme
- Data Mining