Search (173 results, page 9 of 9)

Carter, D.; Sholler, D.: Data science on the ground : hype, criticism, and everyday work (2016) 0.00

8.047755E-4 = product of:
  0.004426265 = sum of:
    0.0023430442 = weight(_text_:a in 3111) [ClassicSimilarity], result of:
      0.0023430442 = score(doc=3111,freq=2.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.07643694 = fieldWeight in 3111, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=3111)
    0.0020832212 = weight(_text_:s in 3111) [ClassicSimilarity], result of:
      0.0020832212 = score(doc=3111,freq=2.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.072074346 = fieldWeight in 3111, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.046875 = fieldNorm(doc=3111)
  0.18181819 = coord(2/11)

Source: Journal of the Association for Information Science and Technology. 67(2016) no.10, S.2309-2319
Type: a

Survey of text mining : clustering, classification, and retrieval (2004) 0.00

8.0138847E-4 = product of:
  0.0044076364 = sum of:
    0.0019525366 = weight(_text_:a in 804) [ClassicSimilarity], result of:
      0.0019525366 = score(doc=804,freq=2.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.06369744 = fieldWeight in 804, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0390625 = fieldNorm(doc=804)
    0.0024550997 = weight(_text_:s in 804) [ClassicSimilarity], result of:
      0.0024550997 = score(doc=804,freq=4.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.08494043 = fieldWeight in 804, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.0390625 = fieldNorm(doc=804)
  0.18181819 = coord(2/11)

Abstract: Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs. Knowledge extraction or creation from text requires systematic yet reliable processing that can be codified and adapted for changing needs and environments. This book will draw upon experts in both academia and industry to recommend practical approaches to the purification, indexing, and mining of textual information. It will address document identification, clustering and categorizing documents, cleaning text, and visualizing semantic models of text.
Pages: XVII, 244 S
Type: s

Advances in knowledge discovery and data mining (1996) 0.00

6.5604463E-4 = product of:
  0.007216491 = sum of:
    0.007216491 = weight(_text_:s in 413) [ClassicSimilarity], result of:
      0.007216491 = score(doc=413,freq=6.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.24967289 = fieldWeight in 413, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.09375 = fieldNorm(doc=413)
  0.09090909 = coord(1/11)

Footnote: Rez. in: JASIS 49(1998) no.4, S.386-387 (F. Exner)
Pages: 625 S
Type: s

Data mining, data warehousing and client/server databases : Proceedings of the 8th International Hong Kong Computer Society Database Workshop (Academic Stream) (1997) 0.00

5.356582E-4 = product of:
  0.00589224 = sum of:
    0.00589224 = weight(_text_:s in 977) [ClassicSimilarity], result of:
      0.00589224 = score(doc=977,freq=4.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.20385705 = fieldWeight in 977, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.09375 = fieldNorm(doc=977)
  0.09090909 = coord(1/11)

Pages: 345 S
Type: s

Intelligent information processing and web mining : Proceedings of the International IIS: IIPWM'03 Conference held in Zakopane, Poland, June 2-5, 2003 (2003) 0.00

5.356582E-4 = product of:
  0.00589224 = sum of:
    0.00589224 = weight(_text_:s in 4642) [ClassicSimilarity], result of:
      0.00589224 = score(doc=4642,freq=4.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.20385705 = fieldWeight in 4642, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.09375 = fieldNorm(doc=4642)
  0.09090909 = coord(1/11)

Pages: XIV, 579 S
Type: s

Mohr, J.W.; Bogdanov, P.: Topic models : what they are and why they matter (2013) 0.00
```
4.762915E-4 = product of:
  0.0052392064 = sum of:
    0.0052392064 = weight(_text_:a in 1142) [ClassicSimilarity], result of:
      0.0052392064 = score(doc=1142,freq=10.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.1709182 = fieldWeight in 1142, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=1142)
  0.09090909 = coord(1/11)
```
Abstract

We provide a brief, non-technical introduction to the text mining methodology known as "topic modeling." We summarize the theory and background of the method and discuss what kinds of things are found by topic models. Using a text corpus comprised of the eight articles from the special issue of Poetics on the subject of topic models, we run a topic model on these articles, both as a way to introduce the methodology and also to help summarize some of the ways in which social and cultural scientists are using topic models. We review some of the critiques and debates over the use of the method and finally, we link these developments back to some of the original innovations in the field of content analysis that were pioneered by Harold D. Lasswell and colleagues during and just after World War II.

Type

a

Decker, B.: Data Mining in Öffentlichen Bibliotheken (2000) 0.00

4.4189542E-4 = product of:
  0.0048608496 = sum of:
    0.0048608496 = weight(_text_:s in 4782) [ClassicSimilarity], result of:
      0.0048608496 = score(doc=4782,freq=2.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.16817348 = fieldWeight in 4782, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.109375 = fieldNorm(doc=4782)
  0.09090909 = coord(1/11)

Pages: III,45,V S

Wattenberg, M.; Viégas, F.; Johnson, I.: How to use t-SNE effectively (2016) 0.00

4.0164427E-4 = product of:
  0.0044180867 = sum of:
    0.0044180867 = weight(_text_:a in 3887) [ClassicSimilarity], result of:
      0.0044180867 = score(doc=3887,freq=4.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.14413087 = fieldWeight in 3887, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.0625 = fieldNorm(doc=3887)
  0.09090909 = coord(1/11)

Abstract: Although extremely useful for visualizing high-dimensional data, t-SNE plots can sometimes be mysterious or misleading. By exploring how it behaves in simple cases, we can learn to use it more effectively. We'll walk through a series of simple examples to illustrate what t-SNE diagrams can and cannot show. The t-SNE technique really is useful-but only if you know how to interpret it.
Type: a

Lowe, D.B.; Dollinger, I.; Koster, T.; Herbert, B.E.: Text mining for type of research classification (2021) 0.00
```
3.689338E-4 = product of:
  0.0040582716 = sum of:
    0.0040582716 = weight(_text_:a in 720) [ClassicSimilarity], result of:
      0.0040582716 = score(doc=720,freq=6.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.13239266 = fieldWeight in 720, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=720)
  0.09090909 = coord(1/11)
```
Abstract

This project brought together undergraduate students in Computer Science with librarians to mine abstracts of articles from the Texas A&M University Libraries' institutional repository, OAKTrust, in order to probe the creation of new metadata to improve discovery and use. The mining operation task consisted simply of classifying the articles into two categories of research type: basic research ("for understanding," "curiosity-based," or "knowledge-based") and applied research ("use-based"). These categories are fundamental especially for funders but are also important to researchers. The mining-to-classification steps took several iterations, but ultimately, we achieved good results with the toolkit BERT (Bidirectional Encoder Representations from Transformers). The project and its workflows represent a preview of what may lie ahead in the future of crafting metadata using text mining techniques to enhance discoverability.

Type

a

Data mining : Theoretische Aspekte und Anwendungen (1998) 0.00

3.5710545E-4 = product of:
  0.00392816 = sum of:
    0.00392816 = weight(_text_:s in 966) [ClassicSimilarity], result of:
      0.00392816 = score(doc=966,freq=4.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.1359047 = fieldWeight in 966, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.0625 = fieldNorm(doc=966)
  0.09090909 = coord(1/11)

Pages: XII,363 S
Type: s

Ester, M.; Sander, J.: Knowledge discovery in databases : Techniken und Anwendungen (2000) 0.00

2.5251167E-4 = product of:
  0.0027776284 = sum of:
    0.0027776284 = weight(_text_:s in 1374) [ClassicSimilarity], result of:
      0.0027776284 = score(doc=1374,freq=2.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.09609913 = fieldWeight in 1374, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.0625 = fieldNorm(doc=1374)
  0.09090909 = coord(1/11)

Pages: VIII, 281 S

Loonus, Y.: Einsatzbereiche der KI und ihre Relevanz für Information Professionals (2017) 0.00

2.1300402E-4 = product of:
  0.0023430442 = sum of:
    0.0023430442 = weight(_text_:a in 5668) [ClassicSimilarity], result of:
      0.0023430442 = score(doc=5668,freq=2.0), product of:
        0.030653298 = queryWeight, product of:
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.026584605 = queryNorm
        0.07643694 = fieldWeight in 5668, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.153047 = idf(docFreq=37942, maxDocs=44218)
          0.046875 = fieldNorm(doc=5668)
  0.09090909 = coord(1/11)

Type: a

Witschel, H.F.: Text, Wörter, Morpheme : Möglichkeiten einer automatischen Terminologie-Extraktion (2004) 0.00

1.578198E-4 = product of:
  0.0017360178 = sum of:
    0.0017360178 = weight(_text_:s in 126) [ClassicSimilarity], result of:
      0.0017360178 = score(doc=126,freq=2.0), product of:
        0.028903782 = queryWeight, product of:
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.026584605 = queryNorm
        0.060061958 = fieldWeight in 126, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.0872376 = idf(docFreq=40523, maxDocs=44218)
          0.0390625 = fieldNorm(doc=126)
  0.09090909 = coord(1/11)

Pages: 141 S

Search (173 results, page 9 of 9)

Authors

Years

Languages

Types

Themes

Subjects

Classifications