Search (36 results, page 2 of 2)

Liu, X.; Zhang, J.; Guo, C.: Full-text citation analysis : a new method to enhance scholarly networks (2013) 0.00
```
7.196534E-4 = product of:
  0.0028786135 = sum of:
    0.0028786135 = product of:
      0.00863584 = sum of:
        0.00863584 = weight(_text_:a in 1044) [ClassicSimilarity], result of:
          0.00863584 = score(doc=1044,freq=12.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.15602624 = fieldWeight in 1044, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1044)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

In this article, we use innovative full-text citation analysis along with supervised topic modeling and network-analysis algorithms to enhance classical bibliometric analysis and publication/author/venue ranking. By utilizing citation contexts extracted from a large number of full-text publications, each citation or publication is represented by a probability distribution over a set of predefined topics, where each topic is labeled by an author-contributed keyword. We then used publication/citation topic distribution to generate a citation graph with vertex prior and edge transitioning probability distributions. The publication importance score for each given topic is calculated by PageRank with edge and vertex prior distributions. To evaluate this work, we sampled 104 topics (labeled with keywords) in review papers. The cited publications of each review paper are assumed to be "important publications" for the target topic (keyword), and we use these cited publications to validate our topic-ranking result and to compare different publication-ranking lists. Evaluation results show that full-text citation and publication content prior topic distribution, along with the classical PageRank algorithm can significantly enhance bibliometric analysis and scientific publication ranking performance, comparing with term frequency-inverted document frequency (tf-idf), language model, BM25, PageRank, and PageRank + language model (p < .001), for academic information retrieval (IR) systems.

Type

a
Zhang, J.; Zhao, Y.: ¬A user term visualization analysis based on a social question and answer log (2013) 0.00
```
7.196534E-4 = product of:
  0.0028786135 = sum of:
    0.0028786135 = product of:
      0.00863584 = sum of:
        0.00863584 = weight(_text_:a in 2715) [ClassicSimilarity], result of:
          0.00863584 = score(doc=2715,freq=12.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.15602624 = fieldWeight in 2715, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2715)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

The authors of this paper investigate terms of consumers' diabetes based on a log from the Yahoo!Answers social question and answers (Q&A) forum, ascertain characteristics and relationships among terms related to diabetes from the consumers' perspective, and reveal users' diabetes information seeking patterns. In this study, the log analysis method, data coding method, and visualization multiple-dimensional scaling analysis method were used for analysis. The visual analyses were conducted at two levels: terms analysis within a category and category analysis among the categories in the schema. The findings show that the average number of words per question was 128.63, the average number of sentences per question was 8.23, the average number of words per response was 254.83, and the average number of sentences per response was 16.01. There were 12 categories (Cause & Pathophysiology, Sign & Symptom, Diagnosis & Test, Organ & Body Part, Complication & Related Disease, Medication, Treatment, Education & Info Resource, Affect, Social & Culture, Lifestyle, and Nutrient) in the diabetes related schema which emerged from the data coding analysis. The analyses at the two levels show that terms and categories were clustered and patterns were revealed. Future research directions are also included.

Type

a
Zhang, J.; Zhai, S.; Liu, H.; Stevenson, J.A.: Social network analysis on a topic-based navigation guidance system in a public health portal (2016) 0.00
```
7.196534E-4 = product of:
  0.0028786135 = sum of:
    0.0028786135 = product of:
      0.00863584 = sum of:
        0.00863584 = weight(_text_:a in 2887) [ClassicSimilarity], result of:
          0.00863584 = score(doc=2887,freq=12.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.15602624 = fieldWeight in 2887, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2887)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

We investigated a topic-based navigation guidance system in the World Health Organization portal, compared the link connection network and the semantic connection network derived from the guidance system, analyzed the characteristics of the 2 networks from the perspective of the node centrality (in_closeness, out_closeness, betweenness, in_degree, and out_degree), and provided the suggestions to optimize and enhance the topic-based navigation guidance system. A mixed research method that combines the social network analysis method, clustering analysis method, and inferential analysis methods was used. The clustering analysis results of the link connection network were quite different from those of the semantic connection network. There were significant differences between the link connection network and the semantic network in terms of density and centrality. Inferential analysis results show that there were no strong correlations between the centrality of a node and its topic information characteristics. Suggestions for enhancing the navigation guidance system are discussed in detail. Future research directions, such as application of the same research method presented in this study to other similar public health portals, are also included.

Type

a
Geng, Q.; Townley, C.; Huang, K.; Zhang, J.: Comparative knowledge management : a pilot study of Chinese and American universities (2005) 0.00
```
7.1242056E-4 = product of:
  0.0028496822 = sum of:
    0.0028496822 = product of:
      0.008549047 = sum of:
        0.008549047 = weight(_text_:a in 3876) [ClassicSimilarity], result of:
          0.008549047 = score(doc=3876,freq=6.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.1544581 = fieldWeight in 3876, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3876)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

Comparative study of knowledge management (KM) promises to lead to more effective knowledge use in all cultural environments. This pilot study compares KM priorities, needs, tools, and administrative structure components in large Chinese and American universities. General KM theory and literature related to KM in higher education are analyzed to develop the four components of the study. Comparative differences in KM practice at large Chinese and American universities are analyzed for each component. A correlation matrix reveals statistically significant co-variation among all but one of the study components. Four conclusions related to comparative KM and suggestions for future research are presented.

Type

a
Zhang, J.; Jastram, I.: ¬A study of the metadata creation behavior of different user groups on the Internet (2006) 0.00
```
7.1242056E-4 = product of:
  0.0028496822 = sum of:
    0.0028496822 = product of:
      0.008549047 = sum of:
        0.008549047 = weight(_text_:a in 982) [ClassicSimilarity], result of:
          0.008549047 = score(doc=982,freq=6.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.1544581 = fieldWeight in 982, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=982)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

Metadata is designed to improve information organization and information retrieval effectiveness and efficiency on the Internet. The way web publishers respond to metadata and the way they use it when publishing their web pages, however, is still a mystery. The authors of this paper aim to solve this mystery by defining different professional publisher groups, examining the behaviors of these user groups, and identifying the characteristics of their metadata use. This study will enhance the current understanding of metadata application behavior and provide evidence useful to researchers, web publishers, and search engine designers.

Type

a
Zhang, J.; Wolfram, D.; Wang, P.: Analysis of query keywords of sports-related queries using visualization and clustering (2009) 0.00
```
6.569507E-4 = product of:
  0.0026278028 = sum of:
    0.0026278028 = product of:
      0.007883408 = sum of:
        0.007883408 = weight(_text_:a in 2947) [ClassicSimilarity], result of:
          0.007883408 = score(doc=2947,freq=10.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.14243183 = fieldWeight in 2947, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2947)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

The authors investigated 11 sports-related query keywords extracted from a public search engine query log to better understand sports-related information seeking on the Internet. After the query log contents were cleaned and query data were parsed, popular sports-related keywords were identified, along with frequently co-occurring query terms associated with the identified keywords. Relationships among each sports-related focus keyword and its related keywords were characterized and grouped using multidimensional scaling (MDS) in combination with traditional hierarchical clustering methods. The two approaches were synthesized in a visual context by highlighting the results of the hierarchical clustering analysis in the visual MDS configuration. Important events, people, subjects, merchandise, and so on related to a sport were illustrated, and relationships among the sports were analyzed. A small-scale comparative study of sports searches with and without term assistance was conducted. Searches that used search term assistance by relying on previous query term relationships outperformed the searches without the search term assistance. The findings of this study provide insights into sports information seeking behavior on the Internet. The developed method also may be applied to other query log subject areas.

Type

a
Zhang, L.; Liu, Q.L.; Zhang, J.; Wang, H.F.; Pan, Y.; Yu, Y.: Semplore: an IR approach to scalable hybrid query of Semantic Web data (2007) 0.00
```
6.569507E-4 = product of:
  0.0026278028 = sum of:
    0.0026278028 = product of:
      0.007883408 = sum of:
        0.007883408 = weight(_text_:a in 231) [ClassicSimilarity], result of:
          0.007883408 = score(doc=231,freq=10.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.14243183 = fieldWeight in 231, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=231)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

As an extension to the current Web, Semantic Web will not only contain structured data with machine understandable semantics but also textual information. While structured queries can be used to find information more precisely on the Semantic Web, keyword searches are still needed to help exploit textual information. It thus becomes very important that we can combine precise structured queries with imprecise keyword searches to have a hybrid query capability. In addition, due to the huge volume of information on the Semantic Web, the hybrid query must be processed in a very scalable way. In this paper, we define such a hybrid query capability that combines unary tree-shaped structured queries with keyword searches. We show how existing information retrieval (IR) index structures and functions can be reused to index semantic web data and its textual information, and how the hybrid query is evaluated on the index structure using IR engines in an efficient and scalable manner. We implemented this IR approach in an engine called Semplore. Comprehensive experiments on its performance show that it is a promising approach. It leads us to believe that it may be possible to evolve current web search engines to query and search the Semantic Web. Finally, we briefy describe how Semplore is used for searching Wikipedia and an IBM customer's product information.

Type

a
Zhang, J.: Archival context, digital content, and the ethics of digital archival representation : the ethics of identification in digital library metadata (2012) 0.00
```
6.569507E-4 = product of:
  0.0026278028 = sum of:
    0.0026278028 = product of:
      0.007883408 = sum of:
        0.007883408 = weight(_text_:a in 419) [ClassicSimilarity], result of:
          0.007883408 = score(doc=419,freq=10.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.14243183 = fieldWeight in 419, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=419)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

The findings of a recent study on digital archival representation raise some ethical concerns about how digital archival materials are organized, described, and made available for use on the Web. Archivists have a fundamental obligation to preserve and protect the authenticity and integrity of records in their holdings and, at the same time, have the responsibility to promote the use of records as a fundamental purpose of the keeping of archives (SAA 2005 Code of Ethics for Archivists V & VI). Is it an ethical practice that digital content in digital archives is deeply embedded in its contextual structure and generally underrepresented in digital archival systems? Similarly, is it ethical for archivists to detach digital items from their archival context in order to make them more "digital friendly" and more accessible to meet needs of some users? Do archivists have an obligation to bring the two representation systems together so that the context and content of digital archives can be better represented and archival materials "can be located and used by anyone, for any purpose, while still remaining authentic evidence of the work and life of the creator"? (Millar 2010, 157) This paper discusses the findings of the study and their ethical implications relating to digital archival description and representation.

Content

Beitrag aus einem Themenheft zu den Proceedings of the 2nd Milwaukee Conference on Ethics in Information Organization, June 15-16, 2012, School of Information Studies, University of Wisconsin-Milwaukee. Hope A. Olson, Conference Chair. Vgl.: http://www.ergon-verlag.de/isko_ko/downloads/ko_39_2012_5_d.pdf.

Type

a
Zhang, J.; Zhai, S.; Stevenson, J.A.; Xia, L.: Optimization of the subject directory in a government agriculture department web portal (2016) 0.00
```
6.569507E-4 = product of:
  0.0026278028 = sum of:
    0.0026278028 = product of:
      0.007883408 = sum of:
        0.007883408 = weight(_text_:a in 3088) [ClassicSimilarity], result of:
          0.007883408 = score(doc=3088,freq=10.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.14243183 = fieldWeight in 3088, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3088)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

We investigated a subject directory in the US Agriculture Department-Economic Research Service portal. Parent-child relationships, related connections among the categories, and related connections among the subcategories in the subject directory were optimized using social network analysis. The optimization results were assessed by both density analysis and edge strength analysis methods. In addition, the results were evaluated by domain experts. From this study, it is recommended that four subcategories be switched from their original four categories into two different categories as a result of the parent-child relationship optimization.?It is also recommended that 132 subcategories be moved to 40 subcategories and that eight categories be moved to two categories as a result of the related connection optimization. The findings show that optimization boosted the densities of the optimized categories, and the recommended connections of both the related categories and subcategories were stronger than the existing connections of the related categories and subcategories. This paper provides visual displays of the optimization analysis as well as suggestions to enhance the subject directory of this portal.

Type

a
Wolfram, D.; Wang, P.; Zhang, J.: Identifying Web search session patterns using cluster analysis : a comparison of three search environments (2009) 0.00
```
6.106462E-4 = product of:
  0.0024425848 = sum of:
    0.0024425848 = product of:
      0.007327754 = sum of:
        0.007327754 = weight(_text_:a in 2796) [ClassicSimilarity], result of:
          0.007327754 = score(doc=2796,freq=6.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.13239266 = fieldWeight in 2796, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2796)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

Session characteristics taken from large transaction logs of three Web search environments (academic Web site, public search engine, consumer health information portal) were modeled using cluster analysis to determine if coherent session groups emerged for each environment and whether the types of session groups are similar across the three environments. The analysis revealed three distinct clusters of session behaviors common to each environment: hit and run sessions on focused topics, relatively brief sessions on popular topics, and sustained sessions using obscure terms with greater query modification. The findings also revealed shifts in session characteristics over time for one of the datasets, away from hit and run sessions toward more popular search topics. A better understanding of session characteristics can help system designers to develop more responsive systems to support search features that cater to identifiable groups of searchers based on their search behaviors. For example, the system may identify struggling searchers based on session behaviors that match those identified in the current study to provide context sensitive help.

Type

a
An, L.; Zhang, J.; Yu, C.: ¬The visual subject analysis of library and information science journals with self-organizing map (2011) 0.00
```
5.875945E-4 = product of:
  0.002350378 = sum of:
    0.002350378 = product of:
      0.007051134 = sum of:
        0.007051134 = weight(_text_:a in 4613) [ClassicSimilarity], result of:
          0.007051134 = score(doc=4613,freq=8.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.12739488 = fieldWeight in 4613, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4613)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

Academic journals play an important role in scientific communication. The effective organization of journals can help reveal the thematic contents of journals and thus make them more user-friendly. In this study, the Self-Organizing Map (SOM) technique was employed to visually analyze the 60 library and information science-related journals published from 2006 to 2008. The U-matrix by Ultsch (2003) was applied to categorize the journals into 19 clusters according to their subjects. Four journals were recommended to supplement library collections although they were not indexed by SCI/SSCI. A novel SOM display named Attribute Accumulation Matrix (AA-matrix) was proposed, and the results from this method show that they correlate significantly with the total occurrences of the subjects in the investigated journals. The AA-matrix was employed to identify the 86 salient subjects, which could be manually classified into 7 meaningful groups. A method of the Salient Attribute Projection was constructed to label the attribute characteristics of different clusters. Finally, the subject characteristics of the journals with high impact factors (IFs) were also addressed. The findings of this study can lead to a better understanding of the subject structure and characteristics of library/information-related journals.

Type

a
Zhang, J.; Chen, Y.; Zhao, Y.; Wolfram, D.; Ma, F.: Public health and social media : a study of Zika virus-related posts on Yahoo! Answers (2020) 0.00
```
5.875945E-4 = product of:
  0.002350378 = sum of:
    0.002350378 = product of:
      0.007051134 = sum of:
        0.007051134 = weight(_text_:a in 5672) [ClassicSimilarity], result of:
          0.007051134 = score(doc=5672,freq=8.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.12739488 = fieldWeight in 5672, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5672)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

This study investigates the content of questions and responses about the Zika virus on Yahoo! Answers as a recent example of how public concerns regarding an international health issue are reflected in social media. We investigate the contents of posts about the Zika virus on Yahoo! Answers, identify and reveal subject patterns about the Zika virus, and analyze the temporal changes of the revealed subject topics over 4 defined periods of the Zika virus outbreak. Multidimensional scaling analysis, temporal analysis, and inferential statistical analysis approaches were used in the study. A resulting 2-layer Zika virus schema, and term connections and relationships are presented. The results indicate that consumers' concerns changed over the 4 defined periods. Consumers paid more attention to the basic information about the Zika virus, and the prevention and protection from the Zika virus at the beginning of the outbreak of the Zika virus. During the later periods, consumers became more interested in the role that the government and health organizations played in the public health emergency.

Type

a
Hansen, D.L.; Khopkar, T.; Zhang, J.: Recommender systems and expert locators (2009) 0.00
```
5.8168895E-4 = product of:
  0.0023267558 = sum of:
    0.0023267558 = product of:
      0.0069802674 = sum of:
        0.0069802674 = weight(_text_:a in 3867) [ClassicSimilarity], result of:
          0.0069802674 = score(doc=3867,freq=4.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.12611452 = fieldWeight in 3867, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3867)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

This entry describes two important classes of systems that facilitate the sharing of recommendations and expertise. Recommender systems suggest items of potential interest to individuals who do not have personal experience with the items. Expert locator systems, an important subset of recommender systems, help find people with the appropriate skills, knowledge, or expertise to meet a particular need. Research related to each of these systems is relatively new and extremely active. The use of these systems is likely to continue increasing as more and more activity is implicitly captured online, making it possible to automatically identify experts, and capture preferences that can be used to recommend items.

Type

a
Gao, J.; Zhang, J.: Clustered SVD strategies in latent semantic indexing (2005) 0.00
```
5.8168895E-4 = product of:
  0.0023267558 = sum of:
    0.0023267558 = product of:
      0.0069802674 = sum of:
        0.0069802674 = weight(_text_:a in 1166) [ClassicSimilarity], result of:
          0.0069802674 = score(doc=1166,freq=4.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.12611452 = fieldWeight in 1166, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1166)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Abstract

The text retrieval method using latent semantic indexing (LSI) technique with truncated singular value decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collections. For large inhomogeneous datasets, the performance of the SVD based text retrieval technique may deteriorate. We propose to partition a large inhomogeneous dataset into several smaller ones with clustered structure, on which we apply the truncated SVD. Our experimental results show that the clustered SVD strategies may enhance the retrieval accuracy and reduce the computing and storage costs.

Type

a
Chen, C.; Ibekwe-SanJuan, F.; Pinho, R.; Zhang, J.: ¬The impact of the sloan digital sky survey on astronomical research : the role of culture, identity, and international collaboration (2008) 0.00
```
4.985905E-4 = product of:
  0.001994362 = sum of:
    0.001994362 = product of:
      0.005983086 = sum of:
        0.005983086 = weight(_text_:a in 2275) [ClassicSimilarity], result of:
          0.005983086 = score(doc=2275,freq=4.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.10809815 = fieldWeight in 2275, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2275)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)
```
Content

We investigate the influence of culture and identity (geographic location) on the constitution of a specific research field. Using as case study the Sloan Digital Sky Survey (SDSS) project in the Astronomy field, we analyzed texts from bibliographic records of publications along three cultural and geographic axes: US only publications, non-US publications and international collaboration. Using three text mining systems (CiteSpace, TermWatch and PEx), we were able to automatically identify the topics specific to each cultural and geographic region as well as isolate the core research topics common to all geographic zones. The results tended to show that US-only and non-US research in this field shared more commonalities with international collaboration than with one another, thus indicating that the former two (US-only and non-US) research focused on rather distinct topics.

Type

a

Zhuge, H.; Zhang, J.: Topological centrality and its e-Science applications (2010) 0.00

4.1131617E-4 = product of:
  0.0016452647 = sum of:
    0.0016452647 = product of:
      0.004935794 = sum of:
        0.004935794 = weight(_text_:a in 3984) [ClassicSimilarity], result of:
          0.004935794 = score(doc=3984,freq=2.0), product of:
            0.055348642 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.04800207 = queryNorm
            0.089176424 = fieldWeight in 3984, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=3984)
      0.33333334 = coord(1/3)
  0.25 = coord(1/4)

Type: a

Search (36 results, page 2 of 2)

Authors

Years

Types

Themes