Search (79 results, page 1 of 4)

  • theme_ss:"Data Mining"
  1. Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.04
    0.039151367 = product of:
      0.12528437 = sum of:
        0.017687442 = weight(_text_:26 in 354) [ClassicSimilarity], result of:
          0.017687442 = score(doc=354,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.15607277 = fieldWeight in 354, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
        0.017986061 = product of:
          0.035972122 = sum of:
            0.035972122 = weight(_text_:rules in 354) [ClassicSimilarity], result of:
              0.035972122 = score(doc=354,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.22257565 = fieldWeight in 354, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.03125 = fieldNorm(doc=354)
          0.5 = coord(1/2)
        0.035972122 = weight(_text_:rules in 354) [ClassicSimilarity], result of:
          0.035972122 = score(doc=354,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.22257565 = fieldWeight in 354, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
        0.044672076 = weight(_text_:2nd in 354) [ClassicSimilarity], result of:
          0.044672076 = score(doc=354,freq=2.0), product of:
            0.18010403 = queryWeight, product of:
              5.6123877 = idf(docFreq=438, maxDocs=44218)
              0.032090448 = queryNorm
            0.24803483 = fieldWeight in 354, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.6123877 = idf(docFreq=438, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
        0.008966677 = product of:
          0.017933354 = sum of:
            0.017933354 = weight(_text_:ed in 354) [ClassicSimilarity], result of:
              0.017933354 = score(doc=354,freq=2.0), product of:
                0.11411327 = queryWeight, product of:
                  3.5559888 = idf(docFreq=3431, maxDocs=44218)
                  0.032090448 = queryNorm
                0.15715398 = fieldWeight in 354, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5559888 = idf(docFreq=3431, maxDocs=44218)
                  0.03125 = fieldNorm(doc=354)
          0.5 = coord(1/2)
      0.3125 = coord(5/16)
    
    Content
    Contents: 1. Introduction 2. Association Rules and Sequential Patterns 3. Supervised Learning 4. Unsupervised Learning 5. Partially Supervised Learning 6. Information Retrieval and Web Search 7. Social Network Analysis 8. Web Crawling 9. Structured Data Extraction: Wrapper Generation 10. Information Integration
    Date
    26. 8.2012 11:53:40
    Issue
    2nd ed.
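    As a worked example, the 0.04 shown for this hit can be re-derived from the explanation tree above with a few lines of Python; the idf, queryNorm and fieldNorm values are copied from the tree, everything else is the standard ClassicSimilarity tf-idf formula (an illustrative sketch, not part of the record):

    from math import sqrt

    query_norm = 0.032090448   # queryNorm, shared by every term clause
    field_norm = 0.03125       # fieldNorm(doc=354)

    def clause(idf, freq, coord=1.0):
        # (idf * queryNorm) * (sqrt(freq) * idf * fieldNorm) * per-clause coord
        return coord * (idf * query_norm) * (sqrt(freq) * idf * field_norm)

    total = sum([
        clause(3.5315237, 2.0),        # "26"            -> 0.017687442
        clause(5.036312,  2.0, 0.5),   # "rules", nested coord(1/2)
        clause(5.036312,  2.0),        # "rules"         -> 0.035972122
        clause(5.6123877, 2.0),        # "2nd"           -> 0.044672076
        clause(3.5559888, 2.0, 0.5),   # "ed", nested coord(1/2)
    ])
    print(total * 5 / 16)              # coord(5/16) -> ~0.0391514
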
  2. Lingras, P.J.; Yao, Y.Y.: Data mining using extensions of the rough set model (1998) 0.04
    0.03607512 = product of:
      0.19240063 = sum of:
        0.054517355 = product of:
          0.10903471 = sum of:
            0.10903471 = weight(_text_:rules in 2910) [ClassicSimilarity], result of:
              0.10903471 = score(doc=2910,freq=6.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.6746466 = fieldWeight in 2910, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2910)
          0.5 = coord(1/2)
        0.028848568 = weight(_text_:american in 2910) [ClassicSimilarity], result of:
          0.028848568 = score(doc=2910,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.26367915 = fieldWeight in 2910, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2910)
        0.10903471 = weight(_text_:rules in 2910) [ClassicSimilarity], result of:
          0.10903471 = score(doc=2910,freq=6.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.6746466 = fieldWeight in 2910, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2910)
      0.1875 = coord(3/16)
    
    Abstract
    Examines basic issues of data mining using the theory of rough sets, which is a recent proposal for generalizing classical set theory. The Pawlak rough set model is based on the concept of an equivalence relation. A generalized rough set model need not be based on equivalence relation axioms. The Pawlak rough set model has been used for deriving deterministic as well as probabilistic rules from a complete database. Demonstrates that a generalized rough set model can be used for generating rules from incomplete databases. These rules are based on plausibility functions proposed by Shafer. Discusses the importance of rule extraction from incomplete databases in data mining.
    Source
    Journal of the American Society for Information Science. 49(1998) no.5, S.415-422
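    A minimal sketch of the Pawlak-style approximations the abstract refers to, using an invented toy decision table (the attribute names and values are made up for illustration):

    from collections import defaultdict

    # decision table: condition attributes -> decision
    table = [
        ({"mileage": "high", "age": "old"}, "reject"),
        ({"mileage": "high", "age": "old"}, "accept"),   # conflicting objects
        ({"mileage": "low",  "age": "new"}, "accept"),
        ({"mileage": "low",  "age": "old"}, "reject"),
    ]

    # equivalence classes induced by the condition attributes
    classes = defaultdict(list)
    for i, (cond, _) in enumerate(table):
        classes[tuple(sorted(cond.items()))].append(i)

    target = {i for i, (_, dec) in enumerate(table) if dec == "accept"}

    lower = {i for c in classes.values() if set(c) <= target for i in c}
    upper = {i for c in classes.values() if set(c) & target for i in c}

    print(lower)   # objects certainly 'accept'  -> deterministic rules
    print(upper)   # objects possibly  'accept'  -> probabilistic/possible rules
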
  3. Wu, X.: Rule induction with extension matrices (1998) 0.03
    0.03498786 = product of:
      0.1866019 = sum of:
        0.053958185 = product of:
          0.10791637 = sum of:
            0.10791637 = weight(_text_:rules in 2912) [ClassicSimilarity], result of:
              0.10791637 = score(doc=2912,freq=8.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.66772693 = fieldWeight in 2912, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2912)
          0.5 = coord(1/2)
        0.024727343 = weight(_text_:american in 2912) [ClassicSimilarity], result of:
          0.024727343 = score(doc=2912,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.22601068 = fieldWeight in 2912, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.046875 = fieldNorm(doc=2912)
        0.10791637 = weight(_text_:rules in 2912) [ClassicSimilarity], result of:
          0.10791637 = score(doc=2912,freq=8.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.66772693 = fieldWeight in 2912, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.046875 = fieldNorm(doc=2912)
      0.1875 = coord(3/16)
    
    Abstract
    Presents a heuristic, attribute-based, noise-tolerant data mining program, HCV (Version 2.0), based on the newly developed extension matrix approach. Gives a simple example of attribute-based induction to show the difference between the rules in variable-valued logic produced by HCV, the decision tree generated by C4.5, and the rules decompiled from that tree by C4.5 rules. Outlines the extension matrix approach for data mining. Describes the HCV algorithm in detail. Outlines techniques developed and implemented in the HCV program for noise handling and discretization of continuous domains, respectively. Follows these with a performance comparison of HCV with well-known ID3-like algorithms, including C4.5 and C4.5 rules, on a collection of standard databases including the famous MONK's problems.
    Source
    Journal of the American Society for Information Science. 49(1998) no.5, S.435-454
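    A short sketch of the tree-to-rules "decompilation" compared in the abstract; C4.5 itself is not used here, so scikit-learn's CART tree and its export_text helper stand in as an assumed substitution. Each printed root-to-leaf path corresponds to one attribute-based rule:

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier, export_text

    data = load_iris()
    tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(data.data, data.target)

    # every indented path in the printout reads as an if-then rule
    print(export_text(tree, feature_names=data.feature_names))
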
  4. Principles of data mining and knowledge discovery (1998) 0.03
    0.02686711 = product of:
      0.14329125 = sum of:
        0.035374884 = weight(_text_:26 in 3822) [ClassicSimilarity], result of:
          0.035374884 = score(doc=3822,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.31214553 = fieldWeight in 3822, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.0625 = fieldNorm(doc=3822)
        0.035972122 = product of:
          0.071944244 = sum of:
            0.071944244 = weight(_text_:rules in 3822) [ClassicSimilarity], result of:
              0.071944244 = score(doc=3822,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.4451513 = fieldWeight in 3822, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3822)
          0.5 = coord(1/2)
        0.071944244 = weight(_text_:rules in 3822) [ClassicSimilarity], result of:
          0.071944244 = score(doc=3822,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.4451513 = fieldWeight in 3822, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0625 = fieldNorm(doc=3822)
      0.1875 = coord(3/16)
    
    Abstract
    The volume presents 26 revised papers corresponding to the oral presentations given at the conference; also included are refereed papers corresponding to the 30 poster presentations. These papers were selected from a total of 73 full draft submissions. The papers are organized in topical sections on rule evaluation, visualization, association rules and text mining, KDD process and software, tree construction, sequential and spatial data mining, and attribute selection.
  5. Benoit, G.: Data mining (2002) 0.03
    0.026436329 = product of:
      0.14099376 = sum of:
        0.026531162 = weight(_text_:26 in 4296) [ClassicSimilarity], result of:
          0.026531162 = score(doc=4296,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.23410915 = fieldWeight in 4296, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.046875 = fieldNorm(doc=4296)
        0.0381542 = product of:
          0.0763084 = sum of:
            0.0763084 = weight(_text_:rules in 4296) [ClassicSimilarity], result of:
              0.0763084 = score(doc=4296,freq=4.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.47215426 = fieldWeight in 4296, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4296)
          0.5 = coord(1/2)
        0.0763084 = weight(_text_:rules in 4296) [ClassicSimilarity], result of:
          0.0763084 = score(doc=4296,freq=4.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.47215426 = fieldWeight in 4296, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.046875 = fieldNorm(doc=4296)
      0.1875 = coord(3/16)
    
    Abstract
    Data mining (DM) is a multistaged process of extracting previously unanticipated knowledge from large databases, and applying the results to decision making. Data mining tools detect patterns from the data and infer associations and rules from them. The extracted information may then be applied to prediction or classification models by identifying relations within the data records or between databases. Those patterns and rules can then guide decision making and forecast the effects of those decisions. However, this definition may be applied equally to "knowledge discovery in databases" (KDD). Indeed, in the recent literature of DM and KDD, a source of confusion has emerged, making it difficult to determine the exact parameters of both. KDD is sometimes viewed as the broader discipline, of which data mining is merely a component, specifically pattern extraction, evaluation, and cleansing methods (Raghavan, Deogun, & Sever, 1998, p. 397). Thurasingham (1999, p. 2) remarked that "knowledge discovery," "pattern discovery," "data dredging," "information extraction," and "knowledge mining" are all employed as synonyms for DM. Trybula, in his ARIST chapter on text mining, observed that the "existing work [in KDD] is confusing because the terminology is inconsistent and poorly defined."
    Date
    24.10.2005 12:38:26
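    A toy illustration of the "associations and rules" step described in the abstract, computing support and confidence of one rule over a handful of invented transactions:

    transactions = [
        {"bread", "milk"},
        {"bread", "butter"},
        {"bread", "milk", "butter"},
        {"milk", "butter"},
    ]

    def support(itemset):
        # fraction of transactions containing the whole itemset
        return sum(itemset <= t for t in transactions) / len(transactions)

    def confidence(antecedent, consequent):
        return support(antecedent | consequent) / support(antecedent)

    # rule {bread} -> {milk}
    print(support({"bread", "milk"}))        # 0.5
    print(confidence({"bread"}, {"milk"}))   # 0.666...
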
  6. Methodologies for knowledge discovery and data mining : Third Pacific-Asia Conference, PAKDD'99, Beijing, China, April 26-28, 1999, Proceedings (1999) 0.03
    0.025912687 = product of:
      0.138201 = sum of:
        0.043774184 = weight(_text_:26 in 3821) [ClassicSimilarity], result of:
          0.043774184 = score(doc=3821,freq=4.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.3862604 = fieldWeight in 3821, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3821)
        0.031475607 = product of:
          0.062951215 = sum of:
            0.062951215 = weight(_text_:rules in 3821) [ClassicSimilarity], result of:
              0.062951215 = score(doc=3821,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.38950738 = fieldWeight in 3821, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3821)
          0.5 = coord(1/2)
        0.062951215 = weight(_text_:rules in 3821) [ClassicSimilarity], result of:
          0.062951215 = score(doc=3821,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.38950738 = fieldWeight in 3821, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3821)
      0.1875 = coord(3/16)
    
    Abstract
    The 29 revised full papers presented together with 37 short papers were carefully selected from a total of 158 submissions. The book is divided into sections on emerging KDD technology; association rules; feature selection and generation; mining in semi-unstructured data; interestingness, surprisingness, and exceptions; rough sets, fuzzy logic, and neural networks; induction, classification, and clustering; visualization, causal models and graph-based methods; agent-based and distributed data mining; and advanced topics and new methodologies
    Date
    26. 5.1996 11:11:10
  7. Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998) 0.02
    0.023114135 = product of:
      0.12327539 = sum of:
        0.031475607 = product of:
          0.062951215 = sum of:
            0.062951215 = weight(_text_:rules in 2909) [ClassicSimilarity], result of:
              0.062951215 = score(doc=2909,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.38950738 = fieldWeight in 2909, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2909)
          0.5 = coord(1/2)
        0.028848568 = weight(_text_:american in 2909) [ClassicSimilarity], result of:
          0.028848568 = score(doc=2909,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.26367915 = fieldWeight in 2909, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2909)
        0.062951215 = weight(_text_:rules in 2909) [ClassicSimilarity], result of:
          0.062951215 = score(doc=2909,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.38950738 = fieldWeight in 2909, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2909)
      0.1875 = coord(3/16)
    
    Abstract
    Rough set theory is a mathematical tool to deal with vagueness and uncertainty. To apply the theory, it needs to be associated with efficient and effective computational methods. A relation can be used to represent a decision table for use in decision making. By using this kind of table, rough set theory can be applied successfully to rough classification and knowledge discovery. Presents computational methods for using rough sets to identify classes in datasets, finding dependencies in relations, and discovering rules which are hidden in databases. Illustrates the methods with a running example from a database of car test results.
    Source
    Journal of the American Society for Information Science. 49(1998) no.5, S.403-414
  8. Biskri, I.; Rompré, L.: Using association rules for query reformulation (2012) 0.02
    0.017523436 = product of:
      0.14018749 = sum of:
        0.046729162 = product of:
          0.093458325 = sum of:
            0.093458325 = weight(_text_:rules in 92) [ClassicSimilarity], result of:
              0.093458325 = score(doc=92,freq=6.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.5782685 = fieldWeight in 92, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.046875 = fieldNorm(doc=92)
          0.5 = coord(1/2)
        0.093458325 = weight(_text_:rules in 92) [ClassicSimilarity], result of:
          0.093458325 = score(doc=92,freq=6.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.5782685 = fieldWeight in 92, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.046875 = fieldNorm(doc=92)
      0.125 = coord(2/16)
    
    Abstract
    In this paper the authors present research on the combination of two methods of data mining: text classification and maximal association rules. Text classification has been the focus of interest of many researchers for a long time. However, the results take the form of lists of words (classes) that people often do not know what to do with. The use of maximal association rules brings a number of advantages: (1) the detection of dependencies and correlations between the relevant units of information (words) of different classes, (2) the extraction of hidden, often relevant, knowledge from a large volume of data. The authors show how this combination can improve the process of information retrieval.
  9. Shi, X.; Yang, C.C.: Mining related queries from Web search engine query logs using an improved association rule mining model (2007) 0.02
    0.016510097 = product of:
      0.08805385 = sum of:
        0.022482576 = product of:
          0.04496515 = sum of:
            0.04496515 = weight(_text_:rules in 597) [ClassicSimilarity], result of:
              0.04496515 = score(doc=597,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.27821955 = fieldWeight in 597, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=597)
          0.5 = coord(1/2)
        0.02060612 = weight(_text_:american in 597) [ClassicSimilarity], result of:
          0.02060612 = score(doc=597,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.18834224 = fieldWeight in 597, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.0390625 = fieldNorm(doc=597)
        0.04496515 = weight(_text_:rules in 597) [ClassicSimilarity], result of:
          0.04496515 = score(doc=597,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.27821955 = fieldWeight in 597, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0390625 = fieldNorm(doc=597)
      0.1875 = coord(3/16)
    
    Abstract
    With the overwhelming volume of information, the task of finding relevant information on a given topic on the Web is becoming increasingly difficult. Web search engines hence become one of the most popular solutions available on the Web. However, it has never been easy for novice users to organize and represent their information needs using simple queries. Users have to keep modifying their input queries until they get the expected results. Therefore, it is often desirable for search engines to give suggestions on related queries to users. In addition, by identifying those related queries, search engines can potentially perform optimizations on their systems, such as query expansion and file indexing. In this work we propose a method that suggests a list of related queries given an initial input query. The related queries are based on the query log of queries previously submitted by human users, and can be identified using an enhanced model of association rules. Users can utilize the suggested related queries to tune or redirect the search process. Our method not only discovers the related queries, but also ranks them according to the degree of their relatedness. Unlike many other rival techniques, it also performs reasonably well on less frequent input queries.
    Source
    Journal of the American Society for Information Science and Technology. 58(2007) no.12, S.1871-1883
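    A rough sketch of the query-suggestion idea from the abstract, using plain co-occurrence counts over invented session data rather than the paper's enhanced association-rule model:

    from collections import Counter
    from itertools import combinations

    sessions = [
        ["data mining", "association rules"],
        ["data mining", "knowledge discovery"],
        ["association rules", "apriori"],
        ["data mining", "association rules", "apriori"],
    ]

    pair_counts, query_counts = Counter(), Counter()
    for s in sessions:
        query_counts.update(set(s))
        pair_counts.update(combinations(sorted(set(s)), 2))

    def related(query, k=3):
        scored = []
        for (a, b), n in pair_counts.items():
            if query in (a, b):
                other = b if a == query else a
                scored.append((n / query_counts[query], other))   # confidence-like score
        return [q for _, q in sorted(scored, reverse=True)[:k]]

    print(related("data mining"))   # ranked related queries, best first
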
  10. Fayyad, U.M.; Djorgovski, S.G.; Weir, N.: From digitized images to online catalogs : data mining a sky survey (1996) 0.01
    0.014018415 = product of:
      0.11214732 = sum of:
        0.05607366 = weight(_text_:cataloguing in 6625) [ClassicSimilarity], result of:
          0.05607366 = score(doc=6625,freq=2.0), product of:
            0.14268221 = queryWeight, product of:
              4.446252 = idf(docFreq=1408, maxDocs=44218)
              0.032090448 = queryNorm
            0.39299685 = fieldWeight in 6625, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.446252 = idf(docFreq=1408, maxDocs=44218)
              0.0625 = fieldNorm(doc=6625)
        0.05607366 = weight(_text_:cataloguing in 6625) [ClassicSimilarity], result of:
          0.05607366 = score(doc=6625,freq=2.0), product of:
            0.14268221 = queryWeight, product of:
              4.446252 = idf(docFreq=1408, maxDocs=44218)
              0.032090448 = queryNorm
            0.39299685 = fieldWeight in 6625, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.446252 = idf(docFreq=1408, maxDocs=44218)
              0.0625 = fieldNorm(doc=6625)
      0.125 = coord(2/16)
    
    Abstract
    Offers a data mining approach, based on machine learning classification methods, to the problem of automated cataloguing of online databases of digital images resulting from sky surveys. The SKICAT system automates the reduction and analysis of 3 terabytes of images expected to contain about 2 billion sky objects. It provides a solution to problems associated with the analysis of large data sets in science.
  11. Sánchez, D.; Chamorro-Martínez, J.; Vila, M.A.: Modelling subjectivity in visual perception of orientation for image retrieval (2003) 0.01
    0.01011716 = product of:
      0.08093728 = sum of:
        0.026979093 = product of:
          0.053958185 = sum of:
            0.053958185 = weight(_text_:rules in 1067) [ClassicSimilarity], result of:
              0.053958185 = score(doc=1067,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.33386347 = fieldWeight in 1067, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1067)
          0.5 = coord(1/2)
        0.053958185 = weight(_text_:rules in 1067) [ClassicSimilarity], result of:
          0.053958185 = score(doc=1067,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.33386347 = fieldWeight in 1067, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.046875 = fieldNorm(doc=1067)
      0.125 = coord(2/16)
    
    Abstract
    In this paper we combine computer vision and data mining techniques to model high-level concepts for image retrieval, on the basis of basic perceptual features of the human visual system. High-level concepts related to these features are learned and represented by means of a set of fuzzy association rules. The concepts so acquired can be used for image retrieval, with the advantage that it is not necessary to provide an image as a query. Instead, a query is formulated by using the labels that identify the learned concepts as search terms, and the retrieval process calculates the relevance of an image to the query by an inference mechanism. An additional feature of our methodology is that it can capture the user's subjectivity. For that purpose, fuzzy set theory is employed to measure the user's assessments about the fulfillment of a concept by an image.
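    A minimal sketch of the fuzzy-set ingredient mentioned in the abstract: a membership function grading how well an edge orientation (in degrees) fits the concept "horizontal" (the particular shape of the function is an assumption made for illustration):

    def horizontal_membership(angle_deg):
        """1.0 at 0 degrees, falling linearly to 0.0 at 30 degrees and beyond."""
        deviation = min(abs(angle_deg) % 180, 180 - abs(angle_deg) % 180)
        return max(0.0, 1.0 - deviation / 30.0)

    for angle in (0, 10, 25, 45, 170):
        print(angle, round(horizontal_membership(angle), 2))
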
  12. Relational data mining (2001) 0.01
    0.009506433 = product of:
      0.076051466 = sum of:
        0.049520306 = weight(_text_:author in 1303) [ClassicSimilarity], result of:
          0.049520306 = score(doc=1303,freq=2.0), product of:
            0.15482868 = queryWeight, product of:
              4.824759 = idf(docFreq=964, maxDocs=44218)
              0.032090448 = queryNorm
            0.31983936 = fieldWeight in 1303, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.824759 = idf(docFreq=964, maxDocs=44218)
              0.046875 = fieldNorm(doc=1303)
        0.026531162 = weight(_text_:26 in 1303) [ClassicSimilarity], result of:
          0.026531162 = score(doc=1303,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.23410915 = fieldWeight in 1303, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.046875 = fieldNorm(doc=1303)
      0.125 = coord(2/16)
    
    Abstract
    As the first book devoted to relational data mining, this coherently written multi-author monograph provides a thorough introduction and systematic overview of the area. The first part introduces the reader to the basics and principles of classical knowledge discovery in databases and inductive logic programming; subsequent chapters by leading experts assess the techniques in relational data mining in a principled and comprehensive way; finally, three chapters deal with advanced applications in various fields and refer the reader to resources for relational data mining. This book will become a valuable source of reference for R&D professionals active in relational data mining. Students as well as IT professionals and ambitious practitioners interested in learning about relational data mining will appreciate the book as a useful text and gentle introduction to this exciting new field.
    Date
    26. 5.1996 11:11:10
  13. Classification, automation, and new media : Proceedings of the 24th Annual Conference of the Gesellschaft für Klassifikation e.V., University of Passau, March 15 - 17, 2000 (2002) 0.01
    0.0090667745 = product of:
      0.072534196 = sum of:
        0.04126692 = weight(_text_:author in 5997) [ClassicSimilarity], result of:
          0.04126692 = score(doc=5997,freq=2.0), product of:
            0.15482868 = queryWeight, product of:
              4.824759 = idf(docFreq=964, maxDocs=44218)
              0.032090448 = queryNorm
            0.26653278 = fieldWeight in 5997, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.824759 = idf(docFreq=964, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5997)
        0.031267278 = weight(_text_:26 in 5997) [ClassicSimilarity], result of:
          0.031267278 = score(doc=5997,freq=4.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.2759003 = fieldWeight in 5997, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5997)
      0.125 = coord(2/16)
    
    Content
    Data Analysis, Statistics, and Classification.- Pattern Recognition and Automation.- Data Mining, Information Processing, and Automation.- New Media, Web Mining, and Automation.- Applications in Management Science, Finance, and Marketing.- Applications in Medicine, Biology, Archaeology, and Others.- Author Index.- Subject Index.
    Date
    26. 9.2006 18:02:28
    26. 9.2006 18:20:10
  14. Ayadi, H.; Torjmen-Khemakhem, M.; Daoud, M.; Huang, J.X.; Jemaa, M.B.: Mining correlations between medically dependent features and image retrieval models for query classification (2017) 0.01
    0.008430966 = product of:
      0.06744773 = sum of:
        0.022482576 = product of:
          0.04496515 = sum of:
            0.04496515 = weight(_text_:rules in 3607) [ClassicSimilarity], result of:
              0.04496515 = score(doc=3607,freq=2.0), product of:
                0.16161752 = queryWeight, product of:
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.032090448 = queryNorm
                0.27821955 = fieldWeight in 3607, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.036312 = idf(docFreq=780, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3607)
          0.5 = coord(1/2)
        0.04496515 = weight(_text_:rules in 3607) [ClassicSimilarity], result of:
          0.04496515 = score(doc=3607,freq=2.0), product of:
            0.16161752 = queryWeight, product of:
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.032090448 = queryNorm
            0.27821955 = fieldWeight in 3607, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.036312 = idf(docFreq=780, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3607)
      0.125 = coord(2/16)
    
    Abstract
    The abundance of medical resources has encouraged the development of systems that allow for efficient searches of information in large medical image data sets. State-of-the-art image retrieval models are classified into three categories: content-based (visual) models, textual models, and combined models. Content-based models use visual features to answer image queries, textual image retrieval models use word matching to answer textual queries, and combined image retrieval models use both textual and visual features to answer queries. Nevertheless, most previous work in this field has used the same image retrieval model independently of the query type. In this article, we define a list of generic and specific medical query features and exploit them in an association rule mining technique to discover correlations between query features and image retrieval models. Based on these rules, we propose to use an associative classifier (NaiveClass) to find the most suitable retrieval model given a new textual query. We also propose a second associative classifier (SmartClass) to select the most appropriate default class for the query. Experiments are performed on Medical ImageCLEF queries from 2008 to 2012 to evaluate the impact of the proposed query features on the classification performance. The results show that combining our proposed specific and generic query features is effective in query classification.
  15. Howlett, D.: Digging deep for treasure (1998) 0.00
    0.0044218604 = product of:
      0.07074977 = sum of:
        0.07074977 = weight(_text_:26 in 4544) [ClassicSimilarity], result of:
          0.07074977 = score(doc=4544,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.62429106 = fieldWeight in 4544, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.125 = fieldNorm(doc=4544)
      0.0625 = coord(1/16)
    
    Date
    26. 3.2000 16:35:23
  16. Hallonsten, O.; Holmberg, D.: Analyzing structural stratification in the Swedish higher education system : data contextualization with policy-history analysis (2013) 0.00
    0.0039344565 = product of:
      0.031475652 = sum of:
        0.02060612 = weight(_text_:american in 668) [ClassicSimilarity], result of:
          0.02060612 = score(doc=668,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.18834224 = fieldWeight in 668, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.0390625 = fieldNorm(doc=668)
        0.010869532 = product of:
          0.021739064 = sum of:
            0.021739064 = weight(_text_:22 in 668) [ClassicSimilarity], result of:
              0.021739064 = score(doc=668,freq=2.0), product of:
                0.11237528 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.032090448 = queryNorm
                0.19345059 = fieldWeight in 668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=668)
          0.5 = coord(1/2)
      0.125 = coord(2/16)
    
    Date
    22. 3.2013 19:43:01
    Source
    Journal of the American Society for Information Science and Technology. 64(2013) no.3, S.574-586
  17. Schmid, J.: Data mining : wie finde ich in Datensammlungen entscheidungsrelevante Muster? (1999) 0.00
    0.003869128 = product of:
      0.061906047 = sum of:
        0.061906047 = weight(_text_:26 in 4540) [ClassicSimilarity], result of:
          0.061906047 = score(doc=4540,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.5462547 = fieldWeight in 4540, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.109375 = fieldNorm(doc=4540)
      0.0625 = coord(1/16)
    
    Date
    26. 3.2000 16:30:30
  18. Data mining, data warehousing and client/server databases : Proceedings of the 8th International Hong Kong Computer Society Database Workshop (Industrial Stream) (1997) 0.00
    0.0033163952 = product of:
      0.053062323 = sum of:
        0.053062323 = weight(_text_:26 in 2301) [ClassicSimilarity], result of:
          0.053062323 = score(doc=2301,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.4682183 = fieldWeight in 2301, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.09375 = fieldNorm(doc=2301)
      0.0625 = coord(1/16)
    
    Date
    22.11.1998 18:56:26
  19. Peters, G.; Gaese, V.: ¬Das DocCat-System in der Textdokumentation von G+J (2003) 0.00
    0.0032978836 = product of:
      0.026383068 = sum of:
        0.017687442 = weight(_text_:26 in 1507) [ClassicSimilarity], result of:
          0.017687442 = score(doc=1507,freq=2.0), product of:
            0.113328174 = queryWeight, product of:
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.032090448 = queryNorm
            0.15607277 = fieldWeight in 1507, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5315237 = idf(docFreq=3516, maxDocs=44218)
              0.03125 = fieldNorm(doc=1507)
        0.008695626 = product of:
          0.017391251 = sum of:
            0.017391251 = weight(_text_:22 in 1507) [ClassicSimilarity], result of:
              0.017391251 = score(doc=1507,freq=2.0), product of:
                0.11237528 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.032090448 = queryNorm
                0.15476047 = fieldWeight in 1507, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1507)
          0.5 = coord(1/2)
      0.125 = coord(2/16)
    
    Date
    26. 2.1996 17:51:49
    22. 4.2003 11:45:36
  20. Knowledge discovery and data mining (1998) 0.00
    0.0030909178 = product of:
      0.049454685 = sum of:
        0.049454685 = weight(_text_:american in 2898) [ClassicSimilarity], result of:
          0.049454685 = score(doc=2898,freq=2.0), product of:
            0.10940785 = queryWeight, product of:
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.032090448 = queryNorm
            0.45202136 = fieldWeight in 2898, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4093587 = idf(docFreq=3973, maxDocs=44218)
              0.09375 = fieldNorm(doc=2898)
      0.0625 = coord(1/16)
    
    Source
    Journal of the American Society for Information Science. 49(1998) no.5, S.397-470

Years

Languages

  • e 66
  • d 13

Types

  • a 64
  • s 12
  • m 10
  • el 4