Search (3524 results, page 1 of 177)

Zeng, Q.; Yu, M.; Yu, W.; Xiong, J.; Shi, Y.; Jiang, M.: Faceted hierarchy : a new graph type to organize scientific concepts and a construction method (2019) 0.09

0.092102945 = product of:
  0.12280393 = sum of:
    0.06897088 = product of:
      0.20691264 = sum of:
        0.20691264 = weight(_text_:3a in 400) [ClassicSimilarity], result of:
          0.20691264 = score(doc=400,freq=2.0), product of:
            0.36816013 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043425296 = queryNorm
            0.56201804 = fieldWeight in 400, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=400)
      0.33333334 = coord(1/3)
    0.04717497 = weight(_text_:processing in 400) [ClassicSimilarity], result of:
      0.04717497 = score(doc=400,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 400, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=400)
    0.006658075 = product of:
      0.019974224 = sum of:
        0.019974224 = weight(_text_:science in 400) [ClassicSimilarity], result of:
          0.019974224 = score(doc=400,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 400, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=400)
      0.33333334 = coord(1/3)
  0.75 = coord(3/4)

Abstract: On a scientific concept hierarchy, a parent concept may have a few attributes, each of which has multiple values being a group of child concepts. We call these attributes facets: classification has a few facets such as application (e.g., face recognition), model (e.g., svm, knn), and metric (e.g., precision). In this work, we aim at building faceted concept hierarchies from scientific literature. Hierarchy construction methods heavily rely on hypernym detection, however, the faceted relations are parent-to-child links but the hypernym relation is a multi-hop, i.e., ancestor-to-descendent link with a specific facet "type-of". We use information extraction techniques to find synonyms, sibling concepts, and ancestor-descendent relations from a data science corpus. And we propose a hierarchy growth algorithm to infer the parent-child links from the three types of relationships. It resolves conflicts by maintaining the acyclic structure of a hierarchy.
Content: Vgl.: https%3A%2F%2Faclanthology.org%2FD19-5317.pdf&usg=AOvVaw0ZZFyq5wWTtNTvNkrvjlGA.
Source: Graph-Based Methods for Natural Language Processing - proceedings of the Thirteenth Workshop (TextGraphs-13): November 4, 2019, Hong Kong : EMNLP-IJCNLP 2019. Ed.: Dmitry Ustalov

Calì, A. et al.: Processing keyword queries under access limitations (2016) 0.07

0.07002103 = product of:
  0.14004207 = sum of:
    0.07862496 = weight(_text_:processing in 4233) [ClassicSimilarity], result of:
      0.07862496 = score(doc=4233,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.4472613 = fieldWeight in 4233, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.078125 = fieldNorm(doc=4233)
    0.061417103 = product of:
      0.092125654 = sum of:
        0.033290375 = weight(_text_:science in 4233) [ClassicSimilarity], result of:
          0.033290375 = score(doc=4233,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.2910318 = fieldWeight in 4233, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.078125 = fieldNorm(doc=4233)
        0.058835283 = weight(_text_:22 in 4233) [ClassicSimilarity], result of:
          0.058835283 = score(doc=4233,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.38690117 = fieldWeight in 4233, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4233)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Date: 1. 2.2016 18:25:22
Series: Lecture notes in computer science ; 9398

Börner, K.: Atlas of knowledge : anyone can map (2015) 0.07

0.069973096 = product of:
  0.13994619 = sum of:
    0.06671549 = weight(_text_:processing in 3355) [ClassicSimilarity], result of:
      0.06671549 = score(doc=3355,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3795138 = fieldWeight in 3355, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=3355)
    0.07323071 = product of:
      0.10984607 = sum of:
        0.059922673 = weight(_text_:science in 3355) [ClassicSimilarity], result of:
          0.059922673 = score(doc=3355,freq=18.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.52385724 = fieldWeight in 3355, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=3355)
        0.049923394 = weight(_text_:22 in 3355) [ClassicSimilarity], result of:
          0.049923394 = score(doc=3355,freq=4.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.32829654 = fieldWeight in 3355, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3355)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Content: One of a series of three publications influenced by the travelling exhibit Places & Spaces: Mapping Science, curated by the Cyberinfrastructure for Network Science Center at Indiana University. - Additional materials can be found at http://http://scimaps.org/atlas2. Erweitert durch: Börner, Katy. Atlas of Science: Visualizing What We Know.
Date: 22. 1.2017 16:54:03
22. 1.2017 17:10:56
LCSH: Science / Atlases
Science / Study and teaching / Graphic methods
Communication in science / Data processing
Subject: Science / Atlases
Science / Study and teaching / Graphic methods
Communication in science / Data processing

Costa Carvalho, A. da; Rossi, C.; Moura, E.S. de; Silva, A.S. da; Fernandes, D.: LePrEF: Learn to precompute evidence fusion for efficient query evaluation (2012) 0.06
```
0.06359105 = product of:
  0.1271821 = sum of:
    0.09629551 = weight(_text_:processing in 278) [ClassicSimilarity], result of:
      0.09629551 = score(doc=278,freq=12.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.547781 = fieldWeight in 278, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0390625 = fieldNorm(doc=278)
    0.030886576 = product of:
      0.046329863 = sum of:
        0.016645188 = weight(_text_:science in 278) [ClassicSimilarity], result of:
          0.016645188 = score(doc=278,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.1455159 = fieldWeight in 278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=278)
        0.029684676 = weight(_text_:29 in 278) [ClassicSimilarity], result of:
          0.029684676 = score(doc=278,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.19432661 = fieldWeight in 278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=278)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)
```
Abstract

State-of-the-art search engine ranking methods combine several distinct sources of relevance evidence to produce a high-quality ranking of results for each query. The fusion of information is currently done at query-processing time, which has a direct effect on the response time of search systems. Previous research also shows that an alternative to improve search efficiency in textual databases is to precompute term impacts at indexing time. In this article, we propose a novel alternative to precompute term impacts, providing a generic framework for combining any distinct set of sources of evidence by using a machine-learning technique. This method retains the advantages of producing high-quality results, but avoids the costs of combining evidence at query-processing time. Our method, called Learn to Precompute Evidence Fusion (LePrEF), uses genetic programming to compute a unified precomputed impact value for each term found in each document prior to query processing, at indexing time. Compared with previous research on precomputing term impacts, our method offers the advantage of providing a generic framework to precompute impact using any set of relevance evidence at any text collection, whereas previous research articles do not. The precomputed impact values are indexed and used later for computing document ranking at query-processing time. By doing so, our method effectively reduces the query processing to simple additions of such impacts. We show that this approach, while leading to results comparable to state-of-the-art ranking methods, also can lead to a significant decrease in computational costs during query processing.

Date

24. 6.2012 14:29:10

Source

Journal of the American Society for Information Science and Technology. 63(2012) no.7, S.1383-1397
Perovsek, M.; Kranjca, J.; Erjaveca, T.; Cestnika, B.; Lavraca, N.: TextFlows : a visual programming platform for text mining and natural language processing (2016) 0.06
```
0.06354337 = product of:
  0.12708674 = sum of:
    0.115554616 = weight(_text_:processing in 2697) [ClassicSimilarity], result of:
      0.115554616 = score(doc=2697,freq=12.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.6573372 = fieldWeight in 2697, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=2697)
    0.011532126 = product of:
      0.034596376 = sum of:
        0.034596376 = weight(_text_:science in 2697) [ClassicSimilarity], result of:
          0.034596376 = score(doc=2697,freq=6.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.30244917 = fieldWeight in 2697, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
      0.33333334 = coord(1/3)
  0.5 = coord(2/4)
```
Abstract

Text mining and natural language processing are fast growing areas of research, with numerous applications in business, science and creative industries. This paper presents TextFlows, a web-based text mining and natural language processing platform supporting workflow construction, sharing and execution. The platform enables visual construction of text mining workflows through a web browser, and the execution of the constructed workflows on a processing cloud. This makes TextFlows an adaptable infrastructure for the construction and sharing of text processing workflows, which can be reused in various applications. The paper presents the implemented text mining and language processing modules, and describes some precomposed workflows. Their features are demonstrated on three use cases: comparison of document classifiers and of different part-of-speech taggers on a text categorization problem, and outlier detection in document corpora.

Content

Vgl.: http://www.sciencedirect.com/science/article/pii/S0167642316000113. Vgl. auch: http://textflows.org.

Source

Science of computer programming. In Press, 2016

Metadata and semantics research : 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings (2014) 0.05

0.052372687 = product of:
  0.10474537 = sum of:
    0.05559624 = weight(_text_:processing in 2192) [ClassicSimilarity], result of:
      0.05559624 = score(doc=2192,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3162615 = fieldWeight in 2192, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2192)
    0.049149137 = product of:
      0.0737237 = sum of:
        0.04403903 = weight(_text_:science in 2192) [ClassicSimilarity], result of:
          0.04403903 = score(doc=2192,freq=14.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.38499892 = fieldWeight in 2192, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2192)
        0.029684676 = weight(_text_:29 in 2192) [ClassicSimilarity], result of:
          0.029684676 = score(doc=2192,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.19432661 = fieldWeight in 2192, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2192)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: This book constitutes the refereed proceedings of the 8th Metadata and Semantics Research Conference, MTSR 2014, held in Karlsruhe, Germany, in November 2014. The 23 full papers and 9 short papers presented were carefully reviewed and selected from 57 submissions. The papers are organized in several sessions and tracks. They cover the following topics: metadata and linked data: tools and models; (meta) data quality assessment and curation; semantic interoperability, ontology-based data access and representation; big data and digital libraries in health, science and technology; metadata and semantics for open repositories, research information systems and data infrastructure; metadata and semantics for cultural collections and applications; semantics for agriculture, food and environment.
Content: Metadata and linked data.- Tools and models.- (Meta)data quality assessment and curation.- Semantic interoperability, ontology-based data access and representation.- Big data and digital libraries in health, science and technology.- Metadata and semantics for open repositories, research information systems and data infrastructure.- Metadata and semantics for cultural collections and applications.- Semantics for agriculture, food and environment.
LCSH: Computer science
Text processing (Computer science)
Series: Communications in computer and information science; 478
Subject: Computer science
Text processing (Computer science)

Schöneberg, U.; Sperber, W.: POS tagging and its applications for mathematics (2014) 0.05

0.051889688 = product of:
  0.103779376 = sum of:
    0.06671549 = weight(_text_:processing in 1748) [ClassicSimilarity], result of:
      0.06671549 = score(doc=1748,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3795138 = fieldWeight in 1748, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=1748)
    0.03706389 = product of:
      0.055595834 = sum of:
        0.019974224 = weight(_text_:science in 1748) [ClassicSimilarity], result of:
          0.019974224 = score(doc=1748,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 1748, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=1748)
        0.03562161 = weight(_text_:29 in 1748) [ClassicSimilarity], result of:
          0.03562161 = score(doc=1748,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 1748, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=1748)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: Content analysis of scientific publications is a nontrivial task, but a useful and important one for scientific information services. In the Gutenberg era it was a domain of human experts; in the digital age many machine-based methods, e.g., graph analysis tools and machine-learning techniques, have been developed for it. Natural Language Processing (NLP) is a powerful machine-learning approach to semiautomatic speech and language processing, which is also applicable to mathematics. The well established methods of NLP have to be adjusted for the special needs of mathematics, in particular for handling mathematical formulae. We demonstrate a mathematics-aware part of speech tagger and give a short overview about our adaptation of NLP methods for mathematical publications. We show the use of the tools developed for key phrase extraction and classification in the database zbMATH.
Date: 29. 3.2015 19:34:37
Series: Lecture notes in computer science; 8543)(Lecture notes in artificial intelligence

Devaul, H.; Diekema, A.R.; Ostwald, J.: Computer-assisted assignment of educational standards using natural language processing (2011) 0.05

0.051782876 = product of:
  0.10356575 = sum of:
    0.06671549 = weight(_text_:processing in 4199) [ClassicSimilarity], result of:
      0.06671549 = score(doc=4199,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3795138 = fieldWeight in 4199, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=4199)
    0.036850262 = product of:
      0.05527539 = sum of:
        0.019974224 = weight(_text_:science in 4199) [ClassicSimilarity], result of:
          0.019974224 = score(doc=4199,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.17461908 = fieldWeight in 4199, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=4199)
        0.035301168 = weight(_text_:22 in 4199) [ClassicSimilarity], result of:
          0.035301168 = score(doc=4199,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.23214069 = fieldWeight in 4199, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4199)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: Educational standards are a central focus of the current educational system in the United States, underpinning educational practice, curriculum design, teacher professional development, and high-stakes testing and assessment. Digital library users have requested that this information be accessible in association with digital learning resources to support teaching and learning as well as accountability requirements. Providing this information is complex because of the variability and number of standards documents in use at the national, state, and local level. This article describes a cataloging tool that aids catalogers in the assignment of standards metadata to digital library resources, using natural language processing techniques. The research explores whether the standards suggestor service would suggest the same standards as a human, whether relevant standards are ranked appropriately in the result set, and whether the relevance of the suggested assignments improve when, in addition to resource content, metadata is included in the query to the cataloging tool. The article also discusses how this service might streamline the cataloging workflow.
Date: 22. 1.2011 14:25:32
Source: Journal of the American Society for Information Science and Technology. 62(2011) no.2, S.395-405

Crestani, F.; Mizzaro, S.; Scagnetto, I,: Mobile information retrieval (2017) 0.05

0.050099604 = product of:
  0.10019921 = sum of:
    0.05559624 = weight(_text_:processing in 4469) [ClassicSimilarity], result of:
      0.05559624 = score(doc=4469,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3162615 = fieldWeight in 4469, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4469)
    0.044602968 = product of:
      0.06690445 = sum of:
        0.037219774 = weight(_text_:science in 4469) [ClassicSimilarity], result of:
          0.037219774 = score(doc=4469,freq=10.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.32538348 = fieldWeight in 4469, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4469)
        0.029684676 = weight(_text_:29 in 4469) [ClassicSimilarity], result of:
          0.029684676 = score(doc=4469,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.19432661 = fieldWeight in 4469, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4469)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Date: 29. 9.2018 13:24:44
LCSH: Computer science
Text processing (Computer science)
Series: Springer briefs in computer science
Subject: Computer science
Text processing (Computer science)

Boerner, K.: Atlas of science : visualizing what we know (2010) 0.05
```
0.049933746 = product of:
  0.09986749 = sum of:
    0.044476993 = weight(_text_:processing in 3359) [ClassicSimilarity], result of:
      0.044476993 = score(doc=3359,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.2530092 = fieldWeight in 3359, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.03125 = fieldNorm(doc=3359)
    0.0553905 = product of:
      0.083085746 = sum of:
        0.059551634 = weight(_text_:science in 3359) [ClassicSimilarity], result of:
          0.059551634 = score(doc=3359,freq=40.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.52061355 = fieldWeight in 3359, product of:
              6.3245554 = tf(freq=40.0), with freq of:
                40.0 = termFreq=40.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.03125 = fieldNorm(doc=3359)
        0.023534114 = weight(_text_:22 in 3359) [ClassicSimilarity], result of:
          0.023534114 = score(doc=3359,freq=2.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.15476047 = fieldWeight in 3359, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=3359)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)
```
Abstract

Cartographic maps have guided our explorations for centuries, allowing us to navigate the world. Science maps have the potential to guide our search for knowledge in the same way, helping us navigate, understand, and communicate the dynamic and changing structure of science and technology. Allowing us to visualize scientific results, science maps help us make sense of the avalanche of data generated by scientific research today. Atlas of Science, features more than thirty full-page science maps, fifty data charts, a timeline of science-mapping milestones, and 500 color images; it serves as a sumptuous visual index to the evolution of modern science and as an introduction to "the science of science"--charting the trajectory from scientific concept to published results. Atlas of Science, based on the popular exhibit "Places & Spaces: Mapping Science," describes and displays successful mapping techniques. The heart of the book is a visual feast: Claudius Ptolemy's Cosmographia World Map from 1482; a guide to a PhD thesis that resembles a subway map; "the structure of science" as revealed in a map of citation relationships in papers published in 2002; a periodic table; a history flow visualization of the Wikipedia article on abortion; a globe showing the worldwide distribution of patents; a forecast of earthquake risk; hands-on science maps for kids; and many more. Each entry includes the story behind the map and biographies of its makers. Not even the most brilliant minds can keep up with today's deluge of scientific results. Science maps show us the landscape of what we know. Exhibition Ongoing National Science Foundation, Washington, D.C. The Institute for Research Information and Quality Assurance, Bonn, Germany Storm Hall, San Diego State College

Date

22. 1.2017 17:12:16

LCSH

Communication in science
Data processing
Science

Subject

Communication in science
Data processing
Science

Engerer, V.: Informationswissenschaft und Linguistik. : kurze Geschichte eines fruchtbaren interdisziplinäaren Verhäaltnisses in drei Akten (2012) 0.05

0.04920737 = product of:
  0.09841474 = sum of:
    0.07862496 = weight(_text_:processing in 3376) [ClassicSimilarity], result of:
      0.07862496 = score(doc=3376,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.4472613 = fieldWeight in 3376, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.078125 = fieldNorm(doc=3376)
    0.019789785 = product of:
      0.05936935 = sum of:
        0.05936935 = weight(_text_:29 in 3376) [ClassicSimilarity], result of:
          0.05936935 = score(doc=3376,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.38865322 = fieldWeight in 3376, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.078125 = fieldNorm(doc=3376)
      0.33333334 = coord(1/3)
  0.5 = coord(2/4)

Date: 19. 2.2017 13:29:08
Source: SDV - Sprache und Datenverarbeitung. International journal for language data processing. 36(2012) H.2, S.71-91 [= E-Books - Fakten, Perspektiven und Szenarien] 36/2 (2012), S. 71-91

Desconnets, J.-C.; Chahdi, H.; Mougenot, I.: Application profile for earth observation images (2014) 0.05

0.049139336 = product of:
  0.09827867 = sum of:
    0.05503747 = weight(_text_:processing in 1573) [ClassicSimilarity], result of:
      0.05503747 = score(doc=1573,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3130829 = fieldWeight in 1573, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1573)
    0.043241203 = product of:
      0.064861804 = sum of:
        0.023303263 = weight(_text_:science in 1573) [ClassicSimilarity], result of:
          0.023303263 = score(doc=1573,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.20372227 = fieldWeight in 1573, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1573)
        0.04155854 = weight(_text_:29 in 1573) [ClassicSimilarity], result of:
          0.04155854 = score(doc=1573,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.27205724 = fieldWeight in 1573, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1573)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: Based on the concept of an application profile as proposed by the Dublin Core initiative, the work presented in this manuscript attempts to propose an application profile for the Earth Observation images. This approach aims to provide an open and extensible model facilitating the sharing and management of distributed images within decentralized architectures. It is intended to eventually cover the needs of discovery, localization, consulting, preservation and processing of data for decision support. We are using the Singapore framework recommendations to build the application profile. A particular focus on the formalization and representation of Description Set Profile (DSP) in RDF is proposed.
Series: Communications in computer and information science; 478
Source: Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al

Semantic keyword-based search on structured data sources : First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers (2016) 0.05
```
0.048255846 = product of:
  0.09651169 = sum of:
    0.054472968 = weight(_text_:processing in 2753) [ClassicSimilarity], result of:
      0.054472968 = score(doc=2753,freq=6.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.30987173 = fieldWeight in 2753, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.03125 = fieldNorm(doc=2753)
    0.04203872 = product of:
      0.06305808 = sum of:
        0.029775817 = weight(_text_:science in 2753) [ClassicSimilarity], result of:
          0.029775817 = score(doc=2753,freq=10.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.26030678 = fieldWeight in 2753, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.03125 = fieldNorm(doc=2753)
        0.03328226 = weight(_text_:22 in 2753) [ClassicSimilarity], result of:
          0.03328226 = score(doc=2753,freq=4.0), product of:
            0.15206799 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043425296 = queryNorm
            0.21886435 = fieldWeight in 2753, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=2753)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)
```
Abstract

This book constitutes the thoroughly refereed post-conference proceedings of the First COST Action IC1302 International KEYSTONE Conference on semantic Keyword-based Search on Structured Data Sources, IKC 2015, held in Coimbra, Portugal, in September 2015. The 13 revised full papers, 3 revised short papers, and 2 invited papers were carefully reviewed and selected from 22 initial submissions. The paper topics cover techniques for keyword search, semantic data management, social Web and social media, information retrieval, benchmarking for search on big data.

Content

Inhalt: Professional Collaborative Information Seeking: On Traceability and Creative Sensemaking / Nürnberger, Andreas (et al.) - Recommending Web Pages Using Item-Based Collaborative Filtering Approaches / Cadegnani, Sara (et al.) - Processing Keyword Queries Under Access Limitations / Calì, Andrea (et al.) - Balanced Large Scale Knowledge Matching Using LSH Forest / Cochez, Michael (et al.) - Improving css-KNN Classification Performance by Shifts in Training Data / Draszawka, Karol (et al.) - Classification Using Various Machine Learning Methods and Combinations of Key-Phrases and Visual Features / HaCohen-Kerner, Yaakov (et al.) - Mining Workflow Repositories for Improving Fragments Reuse / Harmassi, Mariem (et al.) - AgileDBLP: A Search-Based Mobile Application for Structured Digital Libraries / Ifrim, Claudia (et al.) - Support of Part-Whole Relations in Query Answering / Kozikowski, Piotr (et al.) - Key-Phrases as Means to Estimate Birth and Death Years of Jewish Text Authors / Mughaz, Dror (et al.) - Visualization of Uncertainty in Tag Clouds / Platis, Nikos (et al.) - Multimodal Image Retrieval Based on Keywords and Low-Level Image Features / Pobar, Miran (et al.) - Toward Optimized Multimodal Concept Indexing / Rekabsaz, Navid (et al.) - Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives / Souza, Tarcisio (et al.) - Indexing of Textual Databases Based on Lexical Resources: A Case Study for Serbian / Stankovic, Ranka (et al.) - Domain-Specific Modeling: Towards a Food and Drink Gazetteer / Tagarev, Andrey (et al.) - Analysing Entity Context in Multilingual Wikipedia to Support Entity-Centric Retrieval Applications / Zhou, Yiwei (et al.)

Date

1. 2.2016 18:25:22

LCSH

Computer science
Text processing (Computer science)

Series

Lecture notes in computer science ; 9398

Subject

Computer science
Text processing (Computer science)

Jeffery, K.G.; Bailo, D.: EPOS: using metadata in geoscience (2014) 0.05

0.04699348 = product of:
  0.09398696 = sum of:
    0.04717497 = weight(_text_:processing in 1581) [ClassicSimilarity], result of:
      0.04717497 = score(doc=1581,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 1581, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=1581)
    0.04681199 = product of:
      0.07021798 = sum of:
        0.034596376 = weight(_text_:science in 1581) [ClassicSimilarity], result of:
          0.034596376 = score(doc=1581,freq=6.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.30244917 = fieldWeight in 1581, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=1581)
        0.03562161 = weight(_text_:29 in 1581) [ClassicSimilarity], result of:
          0.03562161 = score(doc=1581,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 1581, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=1581)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: One of the key aspects of the approaching data-intensive science era is integration of data through interoperability of systems providing data products or visualisation and processing services. Far from being simple, interoperability requires robust and scalable e-infrastructures capable of supporting it. In this work we present the case of EPOS, a project for data integration in the field of Earth Sciences. We describe the design of its e-infrastructure and show its main characteristics. One of the main elements enabling the system to integrate data, data products and services is the metadata catalog based on the CERIF metadata model. Such a model, modified to fit into the general e-infrastructure design, is part of a three-layer metadata architecture. CERIF guarantees a robust handling of metadata, which is in this case the key to the interoperability and to one of the feature of the EPOS system: the possibility of carrying on data intensive science orchestrating the distributed resources made available by EPOS data providers and stakeholders.
Series: Communications in computer and information science; 478
Source: Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al

Fiala, D.: Bibliometric analysis of CiteSeer data for countries (2012) 0.05

0.04699348 = product of:
  0.09398696 = sum of:
    0.04717497 = weight(_text_:processing in 2742) [ClassicSimilarity], result of:
      0.04717497 = score(doc=2742,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 2742, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=2742)
    0.04681199 = product of:
      0.07021798 = sum of:
        0.034596376 = weight(_text_:science in 2742) [ClassicSimilarity], result of:
          0.034596376 = score(doc=2742,freq=6.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.30244917 = fieldWeight in 2742, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2742)
        0.03562161 = weight(_text_:29 in 2742) [ClassicSimilarity], result of:
          0.03562161 = score(doc=2742,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 2742, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=2742)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: This article describes the results of our analysis of the data from the CiteSeer digital library. First, we examined the data from the point of view of source top-level Internet domains from which the data were collected. Second, we measured country shares in publications indexed by CiteSeer and compared them to those based on mainstream bibliographic data from the Web of Science and Scopus. And third, we concentrated on analyzing publications and their citations aggregated by countries. This way, we generated rankings of the most influential countries in computer science using several non-recursive as well as recursive methods such as citation counts or PageRank. We conclude that even if East Asian countries are underrepresented in CiteSeer, its data may well be used along with other conventional bibliographic databases for comparing the computer science research productivity and performance of countries.
Date: 29. 1.2016 18:36:47
Source: Information processing and management. 48(2012) no.2, S.242-253

Rorissa, A.; Yuan, X.: Visualizing and mapping the intellectual structure of information retrieval (2012) 0.05

0.04699348 = product of:
  0.09398696 = sum of:
    0.04717497 = weight(_text_:processing in 2744) [ClassicSimilarity], result of:
      0.04717497 = score(doc=2744,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 2744, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=2744)
    0.04681199 = product of:
      0.07021798 = sum of:
        0.034596376 = weight(_text_:science in 2744) [ClassicSimilarity], result of:
          0.034596376 = score(doc=2744,freq=6.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.30244917 = fieldWeight in 2744, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2744)
        0.03562161 = weight(_text_:29 in 2744) [ClassicSimilarity], result of:
          0.03562161 = score(doc=2744,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 2744, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=2744)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: Information retrieval is a long established subfield of library and information science. Since its inception in the early- to mid -1950s, it has grown as a result, in part, of well-regarded retrieval system evaluation exercises/campaigns, the proliferation of Web search engines, and the expansion of digital libraries. Although researchers have examined the intellectual structure and nature of the general field of library and information science, the same cannot be said about the subfield of information retrieval. We address that in this work by sketching the information retrieval intellectual landscape through visualizations of citation behaviors. Citation data for 10 years (2000-2009) were retrieved from the Web of Science and analyzed using existing visualization techniques. Our results address information retrieval's co-authorship network, highly productive authors, highly cited journals and papers, author-assigned keywords, active institutions, and the import of ideas from other disciplines.
Date: 29. 1.2016 19:20:01
Source: Information processing and management. 48(2012) no.1, S.120-135

Saint-Dizier, P.; Moens, M.-F.: Knowledge and reasoning for question answering : research perspectives (2011) 0.05

0.04584379 = product of:
  0.09168758 = sum of:
    0.07783473 = weight(_text_:processing in 2746) [ClassicSimilarity], result of:
      0.07783473 = score(doc=2746,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.4427661 = fieldWeight in 2746, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2746)
    0.013852848 = product of:
      0.04155854 = sum of:
        0.04155854 = weight(_text_:29 in 2746) [ClassicSimilarity], result of:
          0.04155854 = score(doc=2746,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.27205724 = fieldWeight in 2746, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2746)
      0.33333334 = coord(1/3)
  0.5 = coord(2/4)

Abstract: This paper presents a roadmap of current promising research tracks in question answering with a focus on knowledge acquisition and reasoning. We show that many current techniques developed in the frame of text mining and natural language processing are ready to be integrated in question answering search systems. Their integration opens new avenues of research for factual answer finding and for advanced question answering. Advanced question answering refers to a situation where an understanding of the meaning of the question and the information source together with techniques for answer fusion and generation are needed.
Date: 29. 1.2016 19:45:11
Source: Information processing and management. 47(2011) no.6, S.899-906

Liu, X.; Yu, S.; Janssens, F.; Glänzel, W.; Moreau, Y.; Moor, B.de: Weighted hybrid clustering by combining text mining and bibliometrics on a large-scale journal database (2010) 0.04

0.0448773 = product of:
  0.0897546 = sum of:
    0.04717497 = weight(_text_:processing in 3464) [ClassicSimilarity], result of:
      0.04717497 = score(doc=3464,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 3464, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=3464)
    0.04257962 = product of:
      0.06386943 = sum of:
        0.02824782 = weight(_text_:science in 3464) [ClassicSimilarity], result of:
          0.02824782 = score(doc=3464,freq=4.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.24694869 = fieldWeight in 3464, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=3464)
        0.03562161 = weight(_text_:29 in 3464) [ClassicSimilarity], result of:
          0.03562161 = score(doc=3464,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 3464, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=3464)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel-fusion clustering. To improve the flexibility and the efficiency of processing large-scale data, we propose an information-based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross-compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields.
Date: 1. 6.2010 9:29:57
Source: Journal of the American Society for Information Science and Technology. 61(2010) no.6, S.1105-1119

Mingers, J.; Macri, F.; Petrovici, D.: Using the h-index to measure the quality of journals in the field of business and management (2012) 0.04

0.0448773 = product of:
  0.0897546 = sum of:
    0.04717497 = weight(_text_:processing in 2741) [ClassicSimilarity], result of:
      0.04717497 = score(doc=2741,freq=2.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.26835677 = fieldWeight in 2741, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.046875 = fieldNorm(doc=2741)
    0.04257962 = product of:
      0.06386943 = sum of:
        0.02824782 = weight(_text_:science in 2741) [ClassicSimilarity], result of:
          0.02824782 = score(doc=2741,freq=4.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.24694869 = fieldWeight in 2741, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=2741)
        0.03562161 = weight(_text_:29 in 2741) [ClassicSimilarity], result of:
          0.03562161 = score(doc=2741,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.23319192 = fieldWeight in 2741, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=2741)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: This paper considers the use of the h-index as a measure of a journal's research quality and contribution. We study a sample of 455 journals in business and management all of which are included in the ISI Web of Science (WoS) and the Association of Business School's peer review journal ranking list. The h-index is compared with both the traditional impact factors, and with the peer review judgements. We also consider two sources of citation data - the WoS itself and Google Scholar. The conclusions are that the h-index is preferable to the impact factor for a variety of reasons, especially the selective coverage of the impact factor and the fact that it disadvantages journals that publish many papers. Google Scholar is also preferred to WoS as a data source. However, the paper notes that it is not sufficient to use any single metric to properly evaluate research achievements.
Date: 29. 1.2016 19:00:16
Object: Web of Science
Source: Information processing and management. 48(2012) no.2, S.234-241

Lee, D.H.; Schleyer, T.: Social tagging is no substitute for controlled indexing : a comparison of Medical Subject Headings and CiteULike tags assigned to 231,388 papers (2012) 0.04

0.043241408 = product of:
  0.086482815 = sum of:
    0.05559624 = weight(_text_:processing in 383) [ClassicSimilarity], result of:
      0.05559624 = score(doc=383,freq=4.0), product of:
        0.175792 = queryWeight, product of:
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.043425296 = queryNorm
        0.3162615 = fieldWeight in 383, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.048147 = idf(docFreq=2097, maxDocs=44218)
          0.0390625 = fieldNorm(doc=383)
    0.030886576 = product of:
      0.046329863 = sum of:
        0.016645188 = weight(_text_:science in 383) [ClassicSimilarity], result of:
          0.016645188 = score(doc=383,freq=2.0), product of:
            0.11438741 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.043425296 = queryNorm
            0.1455159 = fieldWeight in 383, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=383)
        0.029684676 = weight(_text_:29 in 383) [ClassicSimilarity], result of:
          0.029684676 = score(doc=383,freq=2.0), product of:
            0.15275662 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043425296 = queryNorm
            0.19432661 = fieldWeight in 383, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=383)
      0.6666667 = coord(2/3)
  0.5 = coord(2/4)

Abstract: Social tagging and controlled indexing both facilitate access to information resources. Given the increasing popularity of social tagging and the limitations of controlled indexing (primarily cost and scalability), it is reasonable to investigate to what degree social tagging could substitute for controlled indexing. In this study, we compared CiteULike tags to Medical Subject Headings (MeSH) terms for 231,388 citations indexed in MEDLINE. In addition to descriptive analyses of the data sets, we present a paper-by-paper analysis of tags and MeSH terms: the number of common annotations, Jaccard similarity, and coverage ratio. In the analysis, we apply three increasingly progressive levels of text processing, ranging from normalization to stemming, to reduce the impact of lexical differences. Annotations of our corpus consisted of over 76,968 distinct tags and 21,129 distinct MeSH terms. The top 20 tags/MeSH terms showed little direct overlap. On a paper-by-paper basis, the number of common annotations ranged from 0.29 to 0.5 and the Jaccard similarity from 2.12% to 3.3% using increased levels of text processing. At most, 77,834 citations (33.6%) shared at least one annotation. Our results show that CiteULike tags and MeSH terms are quite distinct lexically, reflecting different viewpoints/processes between social tagging and controlled indexing.
Date: 26. 8.2012 14:29:37
Source: Journal of the American Society for Information Science and Technology. 63(2012) no.9, S.1747-1757

Search (3524 results, page 1 of 177)

Authors

Languages

Types

Themes

Subjects

Classifications