Search (844 results, page 1 of 43)

Gnoli, C.: Classifying phenomena : part 4: themes and rhemes (2018) 0.13
```
0.1302097 = sum of:
  0.10650964 = product of:
    0.3195289 = sum of:
      0.3195289 = weight(_text_:themes in 4152) [ClassicSimilarity], result of:
        0.3195289 = score(doc=4152,freq=8.0), product of:
          0.3748681 = queryWeight, product of:
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.058308665 = queryNorm
          0.8523769 = fieldWeight in 4152, product of:
            2.828427 = tf(freq=8.0), with freq of:
              8.0 = termFreq=8.0
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.046875 = fieldNorm(doc=4152)
    0.33333334 = coord(1/3)
  0.023700057 = product of:
    0.047400113 = sum of:
      0.047400113 = weight(_text_:22 in 4152) [ClassicSimilarity], result of:
        0.047400113 = score(doc=4152,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.23214069 = fieldWeight in 4152, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=4152)
    0.5 = coord(1/2)
```
Abstract

This is the fourth in a series of papers on classification based on phenomena instead of disciplines. Together with types, levels and facets that have been discussed in the previous parts, themes and rhemes are further structural components of such a classification. In a statement or in a longer document, a base theme and several particular themes can be identified. Base theme should be cited first in a classmark, followed by particular themes, each with its own facets. In some cases, rhemes can also be expressed, that is new information provided about a theme, converting an abstract statement ("wolves, affected by cervids") into a claim that some thing actually occurs ("wolves are affected by cervids"). In the Integrative Levels Classification rhemes can be expressed by special deictic classes, including those for actual specimens, anaphoras, unknown values, conjunctions and spans, whole universe, anthropocentric favoured classes, and favoured host classes. These features, together with rules for pronounciation, make a classification of phenomena a true language, that may be suitable for many uses.

Date

17. 2.2018 18:22:25

Zhang, W.; Yoshida, T.; Tang, X.: ¬A comparative study of TF*IDF, LSI and multi-words for text classification (2011) 0.11

0.114785895 = sum of:
  0.05325482 = product of:
    0.15976445 = sum of:
      0.15976445 = weight(_text_:themes in 1165) [ClassicSimilarity], result of:
        0.15976445 = score(doc=1165,freq=2.0), product of:
          0.3748681 = queryWeight, product of:
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.058308665 = queryNorm
          0.42618844 = fieldWeight in 1165, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.046875 = fieldNorm(doc=1165)
    0.33333334 = coord(1/3)
  0.061531074 = product of:
    0.12306215 = sum of:
      0.12306215 = weight(_text_:mining in 1165) [ClassicSimilarity], result of:
        0.12306215 = score(doc=1165,freq=2.0), product of:
          0.3290036 = queryWeight, product of:
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.058308665 = queryNorm
          0.37404498 = fieldWeight in 1165, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.046875 = fieldNorm(doc=1165)
    0.5 = coord(1/2)

Abstract: One of the main themes in text mining is text representation, which is fundamental and indispensable for text-based intellegent information processing. Generally, text representation inludes two tasks: indexing and weighting. This paper has comparatively studied TF*IDF, LSI and multi-word for text representation. We used a Chinese and an English document collection to respectively evaluate the three methods in information retreival and text categorization. Experimental results have demonstrated that in text categorization, LSI has better performance than other methods in both document collections. Also, LSI has produced the best performance in retrieving English documents. This outcome has shown that LSI has both favorable semantic and statistical quality and is different with the claim that LSI can not produce discriminative power for indexing.

Blake, C.: Text mining (2011) 0.10

0.10152109 = product of:
  0.20304218 = sum of:
    0.20304218 = product of:
      0.40608436 = sum of:
        0.40608436 = weight(_text_:mining in 1599) [ClassicSimilarity], result of:
          0.40608436 = score(doc=1599,freq=4.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            1.2342855 = fieldWeight in 1599, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.109375 = fieldNorm(doc=1599)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Theme: Data Mining

Tonkin, E.L.; Tourte, G.J.L.: Working with text. tools, techniques and approaches for text mining (2016) 0.10
```
0.09929535 = product of:
  0.1985907 = sum of:
    0.1985907 = product of:
      0.3971814 = sum of:
        0.3971814 = weight(_text_:mining in 4019) [ClassicSimilarity], result of:
          0.3971814 = score(doc=4019,freq=30.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            1.2072251 = fieldWeight in 4019, product of:
              5.477226 = tf(freq=30.0), with freq of:
                30.0 = termFreq=30.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4019)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

What is text mining, and how can it be used? What relevance do these methods have to everyday work in information science and the digital humanities? How does one develop competences in text mining? Working with Text provides a series of cross-disciplinary perspectives on text mining and its applications. As text mining raises legal and ethical issues, the legal background of text mining and the responsibilities of the engineer are discussed in this book. Chapters provide an introduction to the use of the popular GATE text mining package with data drawn from social media, the use of text mining to support semantic search, the development of an authority system to support content tagging, and recent techniques in automatic language evaluation. Focused studies describe text mining on historical texts, automated indexing using constrained vocabularies, and the use of natural language processing to explore the climate science literature. Interviews are included that offer a glimpse into the real-life experience of working within commercial and academic text mining.

LCSH

Data mining

RSWK

Text Mining / Aufsatzsammlung

Subject

Text Mining / Aufsatzsammlung
Data mining

Theme

Data Mining
Mining text data (2012) 0.10
```
0.09836422 = product of:
  0.19672844 = sum of:
    0.19672844 = product of:
      0.39345688 = sum of:
        0.39345688 = weight(_text_:mining in 362) [ClassicSimilarity], result of:
          0.39345688 = score(doc=362,freq=46.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            1.1959045 = fieldWeight in 362, product of:
              6.78233 = tf(freq=46.0), with freq of:
                46.0 = termFreq=46.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.03125 = fieldNorm(doc=362)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Text mining applications have experienced tremendous advances because of web 2.0 and social networking applications. Recent advances in hardware and software technology have lead to a number of unique scenarios where text mining algorithms are learned. Mining Text Data introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks & data mining. This book contains a wide swath in topics across social networks & data mining. Each chapter contains a comprehensive survey including the key research content on the topic, and the future directions of research in the field. There is a special focus on Text Embedded with Heterogeneous and Multimedia Data which makes the mining process much more challenging. A number of methods have been designed such as transfer learning and cross-lingual mining for such cases. Mining Text Data simplifies the content, so that advanced-level students, practitioners and researchers in computer science can benefit from this book. Academic and corporate libraries, as well as ACM, IEEE, and Management Science focused on information security, electronic commerce, databases, data mining, machine learning, and statistics are the primary buyers for this reference book.

Content

Inhalt: An Introduction to Text Mining.- Information Extraction from Text.- A Survey of Text Summarization Techniques.- A Survey of Text Clustering Algorithms.- Dimensionality Reduction and Topic Modeling.- A Survey of Text Classification Algorithms.- Transfer Learning for Text Mining.- Probabilistic Models for Text Mining.- Mining Text Streams.- Translingual Mining from Text Data.- Text Mining in Multimedia.- Text Analytics in Social Media.- A Survey of Opinion Mining and Sentiment Analysis.- Biomedical Text Mining: A Survey of Recent Progress.- Index.

LCSH

Data mining

RSWK

Text Mining / Aufsatzsammlung

Subject

Text Mining / Aufsatzsammlung
Data mining

Theme

Data Mining

Verwer, K.: Freiheit und Verantwortung bei Hans Jonas (2011) 0.09

0.092609614 = product of:
  0.18521923 = sum of:
    0.18521923 = product of:
      0.5556577 = sum of:
        0.5556577 = weight(_text_:3a in 973) [ClassicSimilarity], result of:
          0.5556577 = score(doc=973,freq=2.0), product of:
            0.49434152 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.058308665 = queryNorm
            1.1240361 = fieldWeight in 973, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.09375 = fieldNorm(doc=973)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Content: Vgl.: http%3A%2F%2Fcreativechoice.org%2Fdoc%2FHansJonas.pdf&usg=AOvVaw1TM3teaYKgABL5H9yoIifA&opi=89978449.

Vaughan, L.; Chen, Y.: Data mining from web search queries : a comparison of Google trends and Baidu index (2015) 0.09

0.092265114 = product of:
  0.18453023 = sum of:
    0.18453023 = sum of:
      0.14503014 = weight(_text_:mining in 1605) [ClassicSimilarity], result of:
        0.14503014 = score(doc=1605,freq=4.0), product of:
          0.3290036 = queryWeight, product of:
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.058308665 = queryNorm
          0.44081625 = fieldWeight in 1605, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1605)
      0.039500095 = weight(_text_:22 in 1605) [ClassicSimilarity], result of:
        0.039500095 = score(doc=1605,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.19345059 = fieldWeight in 1605, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1605)
  0.5 = coord(1/2)

Source: Journal of the Association for Information Science and Technology. 66(2015) no.1, S.13-22
Theme: Data Mining

Chambers, S.; Myall, C.: Cataloging and classification : review of the literature 2007-8 (2010) 0.09

0.08978069 = sum of:
  0.062130623 = product of:
    0.18639186 = sum of:
      0.18639186 = weight(_text_:themes in 4309) [ClassicSimilarity], result of:
        0.18639186 = score(doc=4309,freq=2.0), product of:
          0.3748681 = queryWeight, product of:
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.058308665 = queryNorm
          0.49721986 = fieldWeight in 4309, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4309)
    0.33333334 = coord(1/3)
  0.027650066 = product of:
    0.05530013 = sum of:
      0.05530013 = weight(_text_:22 in 4309) [ClassicSimilarity], result of:
        0.05530013 = score(doc=4309,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.2708308 = fieldWeight in 4309, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0546875 = fieldNorm(doc=4309)
    0.5 = coord(1/2)

Abstract: This paper surveys library literature on cataloging and classification published in 2007-8, indicating its extent and range in terms of types of literature, major subject areas, and themes. The paper reviews pertinent literature in the following areas: the future of bibliographic control, general cataloging standards and texts, Functional Requirements for Bibliographic Records (FRBR), cataloging varied resources, metadata and cataloging in the Web world, classification and subject access, questions of diversity and diverse perspectives, additional reports of practice and research, catalogers' education and careers, keeping current through columns and blogs, and cataloging history.
Date: 10. 9.2000 17:38:22

Arbelaitz, O.; Martínez-Otzeta. J.M.; Muguerza, J.: User modeling in a social network for cognitively disabled people (2016) 0.09

0.08523113 = product of:
  0.17046227 = sum of:
    0.17046227 = sum of:
      0.12306215 = weight(_text_:mining in 2639) [ClassicSimilarity], result of:
        0.12306215 = score(doc=2639,freq=2.0), product of:
          0.3290036 = queryWeight, product of:
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.058308665 = queryNorm
          0.37404498 = fieldWeight in 2639, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.046875 = fieldNorm(doc=2639)
      0.047400113 = weight(_text_:22 in 2639) [ClassicSimilarity], result of:
        0.047400113 = score(doc=2639,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.23214069 = fieldWeight in 2639, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=2639)
  0.5 = coord(1/2)

Abstract: Online communities are becoming an important tool in the communication and participation processes in our society. However, the most widespread applications are difficult to use for people with disabilities, or may involve some risks if no previous training has been undertaken. This work describes a novel social network for cognitively disabled people along with a clustering-based method for modeling activity and socialization processes of its users in a noninvasive way. This closed social network is specifically designed for people with cognitive disabilities, called Guremintza, that provides the network administrators (e.g., social workers) with two types of reports: summary statistics of the network usage and behavior patterns discovered by a data mining process. Experiments made in an initial stage of the network show that the discovered patterns are meaningful to the social workers and they find them useful in monitoring the progress of the users.
Date: 22. 1.2016 12:02:26

Varathan, K.D.; Giachanou, A.; Crestani, F.: Comparative opinion mining : a review (2017) 0.09
```
0.08503145 = product of:
  0.1700629 = sum of:
    0.1700629 = product of:
      0.3401258 = sum of:
        0.3401258 = weight(_text_:mining in 3540) [ClassicSimilarity], result of:
          0.3401258 = score(doc=3540,freq=22.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            1.0338057 = fieldWeight in 3540, product of:
              4.690416 = tf(freq=22.0), with freq of:
                22.0 = termFreq=22.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3540)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Opinion mining refers to the use of natural language processing, text analysis, and computational linguistics to identify and extract subjective information in textual material. Opinion mining, also known as sentiment analysis, has received a lot of attention in recent times, as it provides a number of tools to analyze public opinion on a number of different topics. Comparative opinion mining is a subfield of opinion mining which deals with identifying and extracting information that is expressed in a comparative form (e.g., "paper X is better than the Y"). Comparative opinion mining plays a very important role when one tries to evaluate something because it provides a reference point for the comparison. This paper provides a review of the area of comparative opinion mining. It is the first review that cover specifically this topic as all previous reviews dealt mostly with general opinion mining. This survey covers comparative opinion mining from two different angles. One from the perspective of techniques and the other from the perspective of comparative opinion elements. It also incorporates preprocessing tools as well as data set that were used by past researchers that can be useful to future researchers in the field of comparative opinion mining.

Theme

Data Mining
Challenges and opportunities for knowledge organization in the digital age : proceedings of the Fifteenth International ISKO Conference, 9-11 July 2018, Porto, Portugal / organized by: International Society for Knowledge Organization (ISKO), ISKO Spain and Portugal Chapter, University of Porto - Faculty of Arts and Humanities, Research Centre in Communication, Information and Digital Culture (CIC.digital) - Porto (2018) 0.08
```
0.082511455 = sum of:
  0.0627614 = product of:
    0.1882842 = sum of:
      0.1882842 = weight(_text_:themes in 4696) [ClassicSimilarity], result of:
        0.1882842 = score(doc=4696,freq=4.0), product of:
          0.3748681 = queryWeight, product of:
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.058308665 = queryNorm
          0.5022679 = fieldWeight in 4696, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.0390625 = fieldNorm(doc=4696)
    0.33333334 = coord(1/3)
  0.019750047 = product of:
    0.039500095 = sum of:
      0.039500095 = weight(_text_:22 in 4696) [ClassicSimilarity], result of:
        0.039500095 = score(doc=4696,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.19345059 = fieldWeight in 4696, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=4696)
    0.5 = coord(1/2)
```
Abstract

The 15th International ISKO Conference has been held in Porto (Portugal) under the topic Challenges and opportunities for KO in the digital age. ISKO has been organizing biennial international conferences since 1990, in order to promote a space for debate among Knowledge Organization (KO) scholars and practitioners all over the world. The topics under discussion in the 15th International ISKO Conference are intended to cover a wide range of issues that, in a very incisive way, constitute challenges, obstacles and questions in the field of KO, but also highlight ways and open innovative perspectives for this area in a world undergoing constant change, due to the digital revolution that unavoidably moulds our society. Accordingly, the three aggregating themes, chosen to fit the proposals for papers and posters to be submitted, are as follows: 1 - Foundations and methods for KO; 2 - Interoperability towards information access; 3 - Societal challenges in KO. In addition to these themes, the inaugural session includes a keynote speech by Prof. David Bawden of City University London, entitled Supporting truth and promoting understanding: knowledge organization and the curation of the infosphere.

Date

17. 1.2019 17:22:18
Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.08
```
0.08204143 = product of:
  0.16408285 = sum of:
    0.16408285 = product of:
      0.3281657 = sum of:
        0.3281657 = weight(_text_:mining in 354) [ClassicSimilarity], result of:
          0.3281657 = score(doc=354,freq=32.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            0.9974533 = fieldWeight in 354, product of:
              5.656854 = tf(freq=32.0), with freq of:
                32.0 = termFreq=32.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.03125 = fieldNorm(doc=354)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Web mining aims to discover useful information and knowledge from the Web hyperlink structure, page contents, and usage data. Although Web mining uses many conventional data mining techniques, it is not purely an application of traditional data mining due to the semistructured and unstructured nature of the Web data and its heterogeneity. It has also developed many of its own algorithms and techniques. Liu has written a comprehensive text on Web data mining. Key topics of structure mining, content mining, and usage mining are covered both in breadth and in depth. His book brings together all the essential concepts and algorithms from related areas such as data mining, machine learning, and text processing to form an authoritative and coherent text. The book offers a rich blend of theory and practice, addressing seminal research ideas, as well as examining the technology from a practical point of view. It is suitable for students, researchers and practitioners interested in Web mining both as a learning text and a reference book. Lecturers can readily use it for classes on data mining, Web mining, and Web search. Additional teaching materials such as lecture slides, datasets, and implemented algorithms are available online.

RSWK

World Wide Web / Data Mining

Subject

World Wide Web / Data Mining

Theme

Data Mining

Kleineberg, M.: Context analysis and context indexing : formal pragmatics in knowledge organization (2014) 0.08

0.077174686 = product of:
  0.15434937 = sum of:
    0.15434937 = product of:
      0.4630481 = sum of:
        0.4630481 = weight(_text_:3a in 1826) [ClassicSimilarity], result of:
          0.4630481 = score(doc=1826,freq=2.0), product of:
            0.49434152 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.058308665 = queryNorm
            0.93669677 = fieldWeight in 1826, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.078125 = fieldNorm(doc=1826)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Source: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=5&ved=0CDQQFjAE&url=http%3A%2F%2Fdigbib.ubka.uni-karlsruhe.de%2Fvolltexte%2Fdocuments%2F3131107&ei=HzFWVYvGMsiNsgGTyoFI&usg=AFQjCNE2FHUeR9oQTQlNC4TPedv4Mo3DaQ&sig2=Rlzpr7a3BLZZkqZCXXN_IA&bvm=bv.93564037,d.bGg&cad=rja

Gnoli, C.: Boundaries and overlaps of disciplines in Bloch's methodology of historical knowledge (2014) 0.08

0.07695488 = sum of:
  0.05325482 = product of:
    0.15976445 = sum of:
      0.15976445 = weight(_text_:themes in 1414) [ClassicSimilarity], result of:
        0.15976445 = score(doc=1414,freq=2.0), product of:
          0.3748681 = queryWeight, product of:
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.058308665 = queryNorm
          0.42618844 = fieldWeight in 1414, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.429029 = idf(docFreq=193, maxDocs=44218)
            0.046875 = fieldNorm(doc=1414)
    0.33333334 = coord(1/3)
  0.023700057 = product of:
    0.047400113 = sum of:
      0.047400113 = weight(_text_:22 in 1414) [ClassicSimilarity], result of:
        0.047400113 = score(doc=1414,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.23214069 = fieldWeight in 1414, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=1414)
    0.5 = coord(1/2)

Abstract: Marc Bloch's famous methodological essay, The Historian's Craft, contains many relevant considerations on knowledge organization. These have been selected and grouped into four main themes: terminology problems in history; principles for the organization of historical knowledge, with special reference to the genetic principle; sources of historical information, to be found not only in archives but also in very different media and contexts; and the nature and boundaries of history as a discipline. Analysis of them shows that knowledge organization is an important part of historians' work, and suggests that it can be especially fruitful when a cross-medial, interdisciplinary approach is adopted.
Source: Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Mandl, T.: Text mining und data minig (2013) 0.07

0.07251507 = product of:
  0.14503014 = sum of:
    0.14503014 = product of:
      0.29006028 = sum of:
        0.29006028 = weight(_text_:mining in 713) [ClassicSimilarity], result of:
          0.29006028 = score(doc=713,freq=4.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            0.8816325 = fieldWeight in 713, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.078125 = fieldNorm(doc=713)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Theme: Data Mining

Miao, Q.; Li, Q.; Zeng, D.: Fine-grained opinion mining by integrating multiple review sources (2010) 0.07
```
0.071786255 = product of:
  0.14357251 = sum of:
    0.14357251 = product of:
      0.28714502 = sum of:
        0.28714502 = weight(_text_:mining in 4104) [ClassicSimilarity], result of:
          0.28714502 = score(doc=4104,freq=8.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            0.8727716 = fieldWeight in 4104, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4104)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

With the rapid development of Web 2.0, online reviews have become extremely valuable sources for mining customers' opinions. Fine-grained opinion mining has attracted more and more attention of both applied and theoretical research. In this article, the authors study how to automatically mine product features and opinions from multiple review sources. Specifically, they propose an integration strategy to solve the issue. Within the integration strategy, the authors mine domain knowledge from semistructured reviews and then exploit the domain knowledge to assist product feature extraction and sentiment orientation identification from unstructured reviews. Finally, feature-opinion tuples are generated. Experimental results on real-world datasets show that the proposed approach is effective.

Theme

Data Mining

Information and communication technologies : international conference; proceedings / ICT 2010, Kochi, Kerala, India, September 7 - 9, 2010 (2010) 0.07

0.071786255 = product of:
  0.14357251 = sum of:
    0.14357251 = product of:
      0.28714502 = sum of:
        0.28714502 = weight(_text_:mining in 4784) [ClassicSimilarity], result of:
          0.28714502 = score(doc=4784,freq=8.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            0.8727716 = fieldWeight in 4784, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4784)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

LCSH: Data mining
RSWK: Data Mining / Kongress / Cochin <Kerala, 2010>
Subject: Data Mining / Kongress / Cochin <Kerala, 2010>
Data mining

Winterhalter, C.: Licence to mine : ein Überblick über Rahmenbedingungen von Text and Data Mining und den aktuellen Stand der Diskussion (2016) 0.07

0.071049966 = product of:
  0.14209993 = sum of:
    0.14209993 = product of:
      0.28419986 = sum of:
        0.28419986 = weight(_text_:mining in 673) [ClassicSimilarity], result of:
          0.28419986 = score(doc=673,freq=6.0), product of:
            0.3290036 = queryWeight, product of:
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.058308665 = queryNorm
            0.86381996 = fieldWeight in 673, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.642448 = idf(docFreq=425, maxDocs=44218)
              0.0625 = fieldNorm(doc=673)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Abstract: Der Artikel gibt einen Überblick über die Möglichkeiten der Anwendung von Text and Data Mining (TDM) und ähnlichen Verfahren auf der Grundlage bestehender Regelungen in Lizenzverträgen zu kostenpflichtigen elektronischen Ressourcen, die Debatte über zusätzliche Lizenzen für TDM am Beispiel von Elseviers TDM Policy und den Stand der Diskussion über die Einführung von Schrankenregelungen im Urheberrecht für TDM zu nichtkommerziellen wissenschaftlichen Zwecken.
Theme: Data Mining

Cui, H.: Competency evaluation of plant character ontologies against domain literature (2010) 0.07
```
0.07102594 = product of:
  0.14205188 = sum of:
    0.14205188 = sum of:
      0.10255179 = weight(_text_:mining in 3466) [ClassicSimilarity], result of:
        0.10255179 = score(doc=3466,freq=2.0), product of:
          0.3290036 = queryWeight, product of:
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.058308665 = queryNorm
          0.31170416 = fieldWeight in 3466, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3466)
      0.039500095 = weight(_text_:22 in 3466) [ClassicSimilarity], result of:
        0.039500095 = score(doc=3466,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.19345059 = fieldWeight in 3466, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=3466)
  0.5 = coord(1/2)
```
Abstract

Specimen identification keys are still the most commonly created tools used by systematic biologists to access biodiversity information. Creating identification keys requires analyzing and synthesizing large amounts of information from specimens and their descriptions and is a very labor-intensive and time-consuming activity. Automating the generation of identification keys from text descriptions becomes a highly attractive text mining application in the biodiversity domain. Fine-grained semantic annotation of morphological descriptions of organisms is a necessary first step in generating keys from text. Machine-readable ontologies are needed in this process because most biological characters are only implied (i.e., not stated) in descriptions. The immediate question to ask is How well do existing ontologies support semantic annotation and automated key generation? With the intention to either select an existing ontology or develop a unified ontology based on existing ones, this paper evaluates the coverage, semantic consistency, and inter-ontology agreement of a biodiversity character ontology and three plant glossaries that may be turned into ontologies. The coverage and semantic consistency of the ontology/glossaries are checked against the authoritative domain literature, namely, Flora of North America and Flora of China. The evaluation results suggest that more work is needed to improve the coverage and interoperability of the ontology/glossaries. More concepts need to be added to the ontology/glossaries and careful work is needed to improve the semantic consistency. The method used in this paper to evaluate the ontology/glossaries can be used to propose new candidate concepts from the domain literature and suggest appropriate definitions.

Date

1. 6.2010 9:55:22
Yi, K.: Harnessing collective intelligence in social tagging using Delicious (2012) 0.07
```
0.07102594 = product of:
  0.14205188 = sum of:
    0.14205188 = sum of:
      0.10255179 = weight(_text_:mining in 515) [ClassicSimilarity], result of:
        0.10255179 = score(doc=515,freq=2.0), product of:
          0.3290036 = queryWeight, product of:
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.058308665 = queryNorm
          0.31170416 = fieldWeight in 515, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            5.642448 = idf(docFreq=425, maxDocs=44218)
            0.0390625 = fieldNorm(doc=515)
      0.039500095 = weight(_text_:22 in 515) [ClassicSimilarity], result of:
        0.039500095 = score(doc=515,freq=2.0), product of:
          0.204187 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.058308665 = queryNorm
          0.19345059 = fieldWeight in 515, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=515)
  0.5 = coord(1/2)
```
Abstract

A new collaborative approach in information organization and sharing has recently arisen, known as collaborative tagging or social indexing. A key element of collaborative tagging is the concept of collective intelligence (CI), which is a shared intelligence among all participants. This research investigates the phenomenon of social tagging in the context of CI with the aim to serve as a stepping-stone towards the mining of truly valuable social tags for web resources. This study focuses on assessing and evaluating the degree of CI embedded in social tagging over time in terms of two-parameter values, number of participants, and top frequency ranking window. Five different metrics were adopted and utilized for assessing the similarity between ranking lists: overlapList, overlapRank, Footrule, Fagin's measure, and the Inverse Rank measure. The result of this study demonstrates that a substantial degree of CI is most likely to be achieved when somewhere between the first 200 and 400 people have participated in tagging, and that a target degree of CI can be projected by controlling the two factors along with the selection of a similarity metric. The study also tests some experimental conditions for detecting social tags with high CI degree. The results of this study can be applicable to the study of filtering social tags based on CI; filtered social tags may be utilized for the metadata creation of tagged resources and possibly for the retrieval of tagged resources.

Date

25.12.2012 15:22:37

Search (844 results, page 1 of 43)

Authors

Languages

Types

Themes

Subjects

Classifications