Search (31 results, page 1 of 2)

Wittig, C.; Wolfram, D.: ¬A survey of networking education in North American library schools (1994) 0.02

0.02384643 = product of:
  0.07153929 = sum of:
    0.01596415 = weight(_text_:in in 750) [ClassicSimilarity], result of:
      0.01596415 = score(doc=750,freq=10.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.26884392 = fieldWeight in 750, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=750)
    0.05557514 = product of:
      0.11115028 = sum of:
        0.11115028 = weight(_text_:ausbildung in 750) [ClassicSimilarity], result of:
          0.11115028 = score(doc=750,freq=2.0), product of:
            0.23429902 = queryWeight, product of:
              5.3671665 = idf(docFreq=560, maxDocs=44218)
              0.043654136 = queryNorm
            0.47439498 = fieldWeight in 750, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.3671665 = idf(docFreq=560, maxDocs=44218)
              0.0625 = fieldNorm(doc=750)
      0.5 = coord(1/2)
  0.33333334 = coord(2/6)
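
The explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity (TF-IDF) relations: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function names are hypothetical, and queryNorm and fieldNorm are copied from the output rather than derived.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming ClassicSimilarity: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """Score of one term in one document: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                       # tf(freq) = sqrt(termFreq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm       # queryWeight = idf * queryNorm
    field_weight = tf * term_idf * field_norm  # fieldWeight = tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the explain output for doc 750 (not derived here).
MAX_DOCS = 44218
QUERY_NORM = 0.043654136
FIELD_NORM = 0.0625

score_in = term_score(10.0, 30841, MAX_DOCS, QUERY_NORM, FIELD_NORM)
score_ausbildung = term_score(2.0, 560, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# The "ausbildung" clause sits in a nested query where 1 of 2 sub-clauses
# matched (coord(1/2)); overall, 2 of 6 top-level clauses matched (coord(2/6)).
total = (score_in + score_ausbildung * (1 / 2)) * (2 / 6)

print(f"in: {score_in:.8f}  ausbildung: {score_ausbildung:.8f}  total: {total:.8f}")
```

Under these assumptions the result agrees with the reported 0.02384643 up to rounding. Small differences in the last digits are expected because Lucene computes these values in single precision.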

Abstract: Reports results of a survey of US library schools to investigate the adoption, impact, and role of networking concepts and resources, such as the Internet, in the library and information science curriculum. Findings indicate that, to a large degree, educators have kept up with recent trends and tools in networking in a variety of courses. There was overwhelming consensus on the importance of networked information resources and access tools but less agreement on their places in the library and information science curriculum.
Theme: Ausbildung (education and training)

Dimitroff, A.; Wolfram, D.: Searcher response in a hypertext-based bibliographic information retrieval system (1995) 0.01

0.011251582 = product of:
  0.033754744 = sum of:
    0.010096614 = weight(_text_:in in 187) [ClassicSimilarity], result of:
      0.010096614 = score(doc=187,freq=4.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.17003182 = fieldWeight in 187, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0625 = fieldNorm(doc=187)
    0.02365813 = product of:
      0.04731626 = sum of:
        0.04731626 = weight(_text_:22 in 187) [ClassicSimilarity], result of:
          0.04731626 = score(doc=187,freq=2.0), product of:
            0.15286934 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043654136 = queryNorm
            0.30952093 = fieldWeight in 187, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=187)
      0.5 = coord(1/2)
  0.33333334 = coord(2/6)

Abstract: This article examines searcher behavior and affective response to a hypertext-based bibliographic information retrieval system called HyperLynx for searchers with different search skills and backgrounds. Search times and number of nodes visited were recorded for five specified search queries, and views of the system were recorded for each searcher. No significant differences were found in search times or user satisfaction with the system, indicating that a hypertext-based approach to bibliographic retrieval could be appropriate for a variety of searcher experience levels.
Source: Journal of the American Society for Information Science. 46(1995) no.1, S.22-29

Ajiferuke, I.; Lu, K.; Wolfram, D.: ¬A comparison of citer and citation-based measure outcomes for multiple disciplines (2010) 0.01

0.0076993788 = product of:
  0.023098135 = sum of:
    0.005354538 = weight(_text_:in in 4000) [ClassicSimilarity], result of:
      0.005354538 = score(doc=4000,freq=2.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.09017298 = fieldWeight in 4000, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=4000)
    0.017743597 = product of:
      0.035487194 = sum of:
        0.035487194 = weight(_text_:22 in 4000) [ClassicSimilarity], result of:
          0.035487194 = score(doc=4000,freq=2.0), product of:
            0.15286934 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043654136 = queryNorm
            0.23214069 = fieldWeight in 4000, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=4000)
      0.5 = coord(1/2)
  0.33333334 = coord(2/6)

Abstract: Author research impact was examined based on citer analysis (the number of citers as opposed to the number of citations) for 90 highly cited authors grouped into three broad subject areas. Citer-based outcome measures were also compared with more traditional citation-based measures for levels of association. The authors found that there are significant differences in citer-based outcomes among the three broad subject areas examined and that there is a high degree of correlation between citer and citation-based measures for all measures compared, except for two outcomes calculated for the social sciences. Citer-based measures do produce slightly different rankings of authors based on citer counts when compared to more traditional citation counts. Examples are provided. Citation measures may not adequately address the influence, or reach, of an author because citations usually do not address the origin of the citation beyond self-citations.
Date: 28. 9.2010 12:54:22

Castanha, R.C.G.; Wolfram, D.: ¬The domain of knowledge organization : a bibliometric analysis of prolific authors and their intellectual space (2018) 0.01
```
0.007032239 = product of:
  0.021096716 = sum of:
    0.006310384 = weight(_text_:in in 4150) [ClassicSimilarity], result of:
      0.006310384 = score(doc=4150,freq=4.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.10626988 = fieldWeight in 4150, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4150)
    0.014786332 = product of:
      0.029572664 = sum of:
        0.029572664 = weight(_text_:22 in 4150) [ClassicSimilarity], result of:
          0.029572664 = score(doc=4150,freq=2.0), product of:
            0.15286934 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043654136 = queryNorm
            0.19345059 = fieldWeight in 4150, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4150)
      0.5 = coord(1/2)
  0.33333334 = coord(2/6)
```
Abstract

The domain of knowledge organization (KO) represents a foundational area of information science. One way to better understand the intellectual structure of the KO domain is to apply bibliometric methods to key contributors to the literature. This study analyzes the most prolific contributing authors to the journal Knowledge Organization, the sources they cite and the citations they receive for the period 1993 to 2016. The analyses were conducted using visualization outcomes of citation, co-citation and author bibliographic coupling analysis to reveal theoretical points of reference among authors and the most prominent research themes that constitute this scientific community. Birger Hjørland was the most cited author, and was situated at or near the middle of each of the maps based on different citation relationships. The proximities between authors resulting from the different citation relationships demonstrate how authors situate themselves intellectually through the citations they give and how other authors situate them through the citations received. There is a consistent core of theoretical references as well among the most productive authors. We observed a close network of scholarly communication between the authors cited in this core, which indicates the actual role of the journal Knowledge Organization as a space for knowledge construction in the area of knowledge organization.

Source

Knowledge organization. 45(2018) no.1, S.13-22
Wolfram, D.: Search characteristics in different types of Web-based IR environments : are they the same? (2008) 0.00
```
0.0029448462 = product of:
  0.017669076 = sum of:
    0.017669076 = weight(_text_:in in 2093) [ClassicSimilarity], result of:
      0.017669076 = score(doc=2093,freq=16.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.29755569 = fieldWeight in 2093, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2093)
  0.16666667 = coord(1/6)
```
Abstract

Transaction logs from four different Web-based information retrieval environments (bibliographic databank, OPAC, search engine, specialized search system) were analyzed for empirical regularities in search characteristics to determine whether users engage in different behaviors in different Web-based search environments. Descriptive statistics and relative frequency distributions related to term usage, query formulation, and session duration were tabulated. The analysis revealed that there are differences in these characteristics. Users were more likely to engage in extensive searching using the OPAC and specialized search system. Surprisingly, the bibliographic databank search environment resulted in the most parsimonious searching, more similar to a general search engine. Although on the surface Web-based search facilities may appear similar, users do engage in different search behaviors.
Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.00
```
0.002145118 = product of:
  0.0128707085 = sum of:
    0.0128707085 = weight(_text_:in in 6980) [ClassicSimilarity], result of:
      0.0128707085 = score(doc=6980,freq=26.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.2167489 = fieldWeight in 6980, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.03125 = fieldNorm(doc=6980)
  0.16666667 = coord(1/6)
```
Abstract

In previous articles, we reported the state of Web searching in 1997 (Jansen, Spink, & Saracevic, 2000) and in 1999 (Spink, Wolfram, Jansen, & Saracevic, 2001). Such snapshot studies and statistics on Web use appear regularly (OCLC, 1999), but provide little information about Web searching trends. In this article, we compare and contrast results from our two previous studies of Excite queries' data sets, each containing over 1 million queries submitted by over 200,000 Excite users collected on 16 September 1997 and 20 December 1999. We examine how public Web searching changed during that 2-year time period. As Table 1 shows, the overall structure of Web queries in some areas did not change, while in others we see change from 1997 to 1999. Our comparison shows how Web searching changed incrementally and also dramatically. We see some moves toward greater simplicity, including shorter queries (i.e., fewer terms) and shorter sessions (i.e., fewer queries per user), with little modification (addition or deletion) of terms in subsequent queries. The trend toward shorter queries suggests that Web information content should target specific terms in order to reach Web users. Another trend was to view fewer pages of results per query. Most Excite users examined only one page of results per query, since an Excite results page contains ten ranked Web sites. Were users satisfied with the results and did not need to view more pages? It appears that the public continues to have a low tolerance of wading through retrieved sites. This decline in interactivity levels is a disturbing finding for the future of Web searching. Queries that included Boolean operators were in the minority, but the percentage increased between the two time periods. Most Boolean use involved the AND operator with many mistakes. The use of relevance feedback almost doubled from 1997 to 1999, but overall use was still small.
An unusually large number of terms were used with low frequency, such as personal names, spelling errors, non-English words, and Web-specific terms, such as URLs. Web query vocabulary contains more words than found in large English texts in general. The public language of Web queries has its own and unique characteristics. How did Web searching topics change from 1997 to 1999? We classified a random sample of 2,414 queries from 1997 and 2,539 queries from 1999 into 11 categories (Table 2). From 1997 to 1999, Web searching shifted from entertainment, recreation and sex, and pornography, preferences to e-commerce-related topics under commerce, travel, employment, and economy. This shift coincided with changes in information distribution on the publicly indexed Web.
Wolfram, D.; Xie, H.I.: Traditional IR for web users : a context for general audience digital libraries (2002) 0.00
```
0.0021034614 = product of:
  0.012620768 = sum of:
    0.012620768 = weight(_text_:in in 2589) [ClassicSimilarity], result of:
      0.012620768 = score(doc=2589,freq=16.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.21253976 = fieldWeight in 2589, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2589)
  0.16666667 = coord(1/6)
```
Abstract

The emergence of general audience digital libraries (GADLs) defines a context that represents a hybrid of both "traditional" IR, using primarily bibliographic resources provided by database vendors, and "popular" IR, exemplified by public search systems available on the World Wide Web. Findings of a study investigating end-user searching and response to a GADL are reported. Data collected from a Web-based end-user survey and data logs of resource usage for a Web-based GADL were analyzed for user characteristics, patterns of access and use, and user feedback. Cross-tabulations using respondent demographics revealed several key differences in how the system was used and valued by users of different age groups. Older users valued the service more than younger users and engaged in different searching and viewing behaviors. The GADL more closely resembles traditional retrieval systems in terms of content and purpose of use, but is more similar to popular IR systems in terms of user behavior and accessibility. A model that defines the dual context of the GADL environment is derived from the data analysis and existing IR models in general and other specific contexts. The authors demonstrate the distinguishing characteristics of this IR context, and discuss implications for the development and evaluation of future GADLs to accommodate a variety of user needs and expectations.

Footnote

Contribution to a special issue: "Issues of context in information retrieval (IR)"

Theme

Semantic context in indexing and retrieval
Park, H.; You, S.; Wolfram, D.: Informal data citation for data sharing and reuse is more common than formal data citation in biomedical fields (2018) 0.00
```
0.0021034614 = product of:
  0.012620768 = sum of:
    0.012620768 = weight(_text_:in in 4544) [ClassicSimilarity], result of:
      0.012620768 = score(doc=4544,freq=16.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.21253976 = fieldWeight in 4544, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4544)
  0.16666667 = coord(1/6)
```
Abstract

Data citation, where products of research such as data sets, software, and tissue cultures are shared and acknowledged, is becoming more common in the era of Open Science. Currently, the practice of formal data citation, where data references are included alongside bibliographic references in the reference section of a publication, is uncommon. We examine the prevalence of data citation, documenting data sharing and reuse, in a sample of full text articles from the biological/biomedical sciences, the fields with the most public data sets available documented by the Data Citation Index (DCI). We develop a method that combines automated text extraction with human assessment for revealing candidate occurrences of data sharing and reuse by using terms that are most likely to indicate their occurrence. The analysis reveals that informal data citation in the main text of articles is far more common than formal data citations in the references of articles. As a result, data sharers do not receive documented credit for their data contributions in a similar way as authors do for their research articles because informal data citations are not recorded in sources such as the DCI. Ongoing challenges for the study of data citation are also outlined.
Zhang, J.; Wolfram, D.: Visualization of term discrimination analysis (2001) 0.00
```
0.001821651 = product of:
  0.010929906 = sum of:
    0.010929906 = weight(_text_:in in 5210) [ClassicSimilarity], result of:
      0.010929906 = score(doc=5210,freq=12.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.18406484 = fieldWeight in 5210, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5210)
  0.16666667 = coord(1/6)
```
Abstract

Zhang and Wolfram compute the discrimination value for terms as the difference between the centroid value of all terms in the corpus and that value without the term in question, and suggest selection be made by comparing density changes with a visualization tool. The Distance Angle Retrieval Environment (DARE) visually projects a document or term space by presenting distance similarity on the X axis and angular similarity on the Y axis. Thus a document icon appearing close to the X axis would be relevant to reference points in terms of a distance similarity measure, while those close to the Y axis are relevant to reference points in terms of an angle based measure. Using 450 Associated Press news reports indexed by 44 distinct terms, the removal of the term "Yeltsin" causes the cluster to fall on the Y axis indicating a good discriminator. For an angular measure, cosine say, movement along the X axis to the left will signal good discrimination, as movement to the right will signal poor discrimination. A term density space could also be used. Most terms are shown to be indifferent discriminators. Different measures result in different choices as good and poor discriminators, as does the use of a term space rather than a document space. The visualization approach is clearly feasible, and provides some additional insights not found in the computation of a discrimination value.
Olson, H.A.; Wolfram, D.: Syntagmatic relationships and indexing consistency on a larger scale (2008) 0.00
```
0.001821651 = product of:
  0.010929906 = sum of:
    0.010929906 = weight(_text_:in in 2214) [ClassicSimilarity], result of:
      0.010929906 = score(doc=2214,freq=12.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.18406484 = fieldWeight in 2214, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2214)
  0.16666667 = coord(1/6)
```
Abstract

Purpose - The purpose of this article is to examine interindexer consistency on a larger scale than other studies have done to determine if group consensus is reached by larger numbers of indexers and what, if any, relationships emerge between assigned terms. Design/methodology/approach - In total, 64 MLIS students were recruited to assign up to five terms to a document. The authors applied basic data modeling and the exploratory statistical techniques of multi-dimensional scaling (MDS) and hierarchical cluster analysis to determine whether relationships exist in indexing consistency and the cooccurrence of assigned terms. Findings - Consistency in the assignment of indexing terms to a document follows an inverse shape, although it is not strictly power law-based unlike many other social phenomena. The exploratory techniques revealed that groups of terms clustered together. The resulting term cooccurrence relationships were largely syntagmatic. Research limitations/implications - The results are based on the indexing of one article by non-expert indexers and are, thus, not generalizable. Based on the study findings, along with the growing popularity of folksonomies and the apparent authority of communally developed information resources, communally developed indexes based on group consensus may have merit. Originality/value - Consistency in the assignment of indexing terms has been studied primarily on a small scale. Few studies have examined indexing on a larger scale with more than a handful of indexers. Recognition of the differences in indexing assignment has implications for the development of public information systems, especially those that do not use a controlled vocabulary and those tagged by end-users. In such cases, multiple access points that accommodate the different ways that users interpret content are needed so that searchers may be guided to relevant content despite using different terminology.
Wolfram, D.; Volz, A.; Dimitroff, A.: ¬The effect of linkage structure on retrieval performance in a hypertext-based bibliographic retrieval system (1996) 0.00
```
0.0018033426 = product of:
  0.010820055 = sum of:
    0.010820055 = weight(_text_:in in 6622) [ClassicSimilarity], result of:
      0.010820055 = score(doc=6622,freq=6.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.1822149 = fieldWeight in 6622, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6622)
  0.16666667 = coord(1/6)
```
Abstract

Investigates how linkage environments in a hypertext-based bibliographic retrieval system affect retrieval performance for novice and experienced searchers. Two systems, one with inter-record linkages to authors and descriptors and one that also included title and abstract keywords, were tested. No significant differences in retrieval performance and system usage were found for most search tests. The enhanced system did provide better performance where title and abstract keywords provided the most direct access to relevant records. The findings have implications for the design of bibliographic information retrieval systems using hypertext linkages.

Wolfram, D.; Dimitroff, A.: Hypertext vs. Boolean-based searching in a bibliographic database environment : a direct comparison of searcher performance (1998) 0.00

0.0017848461 = product of:
  0.010709076 = sum of:
    0.010709076 = weight(_text_:in in 6436) [ClassicSimilarity], result of:
      0.010709076 = score(doc=6436,freq=2.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.18034597 = fieldWeight in 6436, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=6436)
  0.16666667 = coord(1/6)
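
This entry receives exactly the same score as the next one (doc 1963), although the term frequencies differ (2 versus 8). A minimal sketch, assuming the ClassicSimilarity relation fieldWeight = sqrt(freq) * idf * fieldNorm and taking idf and fieldNorm directly from the explain output, shows why: quadrupling the frequency doubles tf, while halving fieldNorm (which typically reflects a longer field) cancels that gain. The function name is hypothetical.

```python
import math

# idf of the term "in", copied from the explain output.
IDF_IN = 1.3602545


def field_weight(freq: float, field_norm: float, idf: float = IDF_IN) -> float:
    """fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)."""
    return math.sqrt(freq) * idf * field_norm


fw_6436 = field_weight(2.0, 0.09375)    # doc 6436: freq=2, fieldNorm=0.09375
fw_1963 = field_weight(8.0, 0.046875)   # doc 1963: freq=8, fieldNorm=0.046875

print(f"doc 6436: {fw_6436:.8f}  doc 1963: {fw_1963:.8f}")
```

Because queryWeight and coord are identical for both documents, equal field weights lead to equal final scores.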

Wolfram, D.; Zhang, J.: ¬The influence of indexing practices and weighting algorithms on document spaces (2008) 0.00
```
0.0017848461 = product of:
  0.010709076 = sum of:
    0.010709076 = weight(_text_:in in 1963) [ClassicSimilarity], result of:
      0.010709076 = score(doc=1963,freq=8.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.18034597 = fieldWeight in 1963, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=1963)
  0.16666667 = coord(1/6)
```
Abstract

Index modeling and computer simulation techniques are used to examine the influence of indexing frequency distributions, indexing exhaustivity distributions, and three weighting methods on hypothetical document spaces in a vector-based information retrieval (IR) system. The way documents are indexed plays an important role in retrieval. The authors demonstrate the influence of different indexing characteristics on document space density (DSD) changes and document space discriminative capacity for IR. Document environments that contain a relatively higher percentage of infrequently occurring terms provide lower density outcomes than do environments where a higher percentage of frequently occurring terms exists. Different indexing exhaustivity levels, however, have little influence on the document space densities. A weighting algorithm that favors higher weights for infrequently occurring terms results in the lowest overall document space densities, which allows documents to be more readily differentiated from one another. This in turn can positively influence IR. The authors also discuss the influence on outcomes using two methods of normalization of term weights (i.e., means and ranges) for the different weighting methods.

Wolfram, D.: ¬The symbiotic relationship between information retrieval and informetrics (2015) 0.00

0.0017848461 = product of:
  0.010709076 = sum of:
    0.010709076 = weight(_text_:in in 1689) [ClassicSimilarity], result of:
      0.010709076 = score(doc=1689,freq=2.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.18034597 = fieldWeight in 1689, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=1689)
  0.16666667 = coord(1/6)

Footnote: Contribution to a special issue: "Combining bibliometrics and information retrieval"

Ajiferuke, I.; Wolfram, D.: Analysis of Web page image tag distribution characteristics (2005) 0.00
```
0.0015457221 = product of:
  0.009274333 = sum of:
    0.009274333 = weight(_text_:in in 1059) [ClassicSimilarity], result of:
      0.009274333 = score(doc=1059,freq=6.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.1561842 = fieldWeight in 1059, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=1059)
  0.16666667 = coord(1/6)
```
Abstract

The authors investigate the frequency distribution of the use of image tags in Web pages. Using data sampled from top level Web pages across five top level domains and from sample pages within individual websites, the authors model observed patterns in the frequency of image tag usage by fitting collected data distributions to different theoretical models used in informetrics. Models tested include the modified power law (MPL), Mandelbrot (MDB), generalized Waring (GW), generalized inverse Gaussian-Poisson (GIGP), and generalized negative binomial (GNB) distributions. The GIGP provided the best fit for data sets for top level pages across the top level domains tested. The poor fits of the models to the observed data distributions from specific websites were due to the multimodal nature of the observed data sets. Mixtures of the tested models for the data sets provided better fits. The ability to effectively model Web page attributes, such as the distribution of the number of image tags used per page, is needed for accurate simulation models of Web page content, and makes it possible to estimate the number of requests needed to display the complete content of Web pages.
Wolfram, D.; Olson, H.A.; Bloom, R.: Measuring consistency for multiple taggers using vector space modeling (2009) 0.00
```
0.0015457221 = product of:
  0.009274333 = sum of:
    0.009274333 = weight(_text_:in in 3113) [ClassicSimilarity], result of:
      0.009274333 = score(doc=3113,freq=6.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.1561842 = fieldWeight in 3113, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=3113)
  0.16666667 = coord(1/6)
```
Abstract

A longstanding area of study in indexing is the identification of factors affecting vocabulary usage and consistency. This topic has seen a recent resurgence with a focus on social tagging. Tagging data for scholarly articles made available by the social bookmarking Website CiteULike (www.citeulike.org) were used to test the use of inter-indexer/tagger consistency density values, based on a method developed by the authors by comparing calculations for highly tagged documents representing three subject areas (Science, Social Science, Social Software). The analysis revealed that the developed method is viable for a large dataset. The findings also indicated that there were no significant differences in tagging consistency among the three topic areas, demonstrating that vocabulary usage in a relatively new subject area like social software is no more inconsistent than the more established subject areas investigated. The implications of the method used and the findings are discussed.
Wolfram, D.; Zhang, J.: ¬An investigation of the influence of indexing exhaustivity and term distributions on a document space (2002) 0.00
```
0.0014873719 = product of:
  0.008924231 = sum of:
    0.008924231 = weight(_text_:in in 5238) [ClassicSimilarity], result of:
      0.008924231 = score(doc=5238,freq=8.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.15028831 = fieldWeight in 5238, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5238)
  0.16666667 = coord(1/6)
```
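The score breakdown above can be reproduced by hand. The sketch below is a minimal reconstruction that assumes the standard Lucene ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function name `classic_term_score` is hypothetical. `queryNorm`, `fieldNorm`, and `coord` are copied from the explain output rather than derived.

```python
import math


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coord):
    """Recompute a single-term ClassicSimilarity score from its explain values.

    Hypothetical helper: the formulas for tf and idf are assumed to be the
    ClassicSimilarity defaults; the remaining factors are taken as given.
    """
    tf = math.sqrt(freq)                               # 2.828427 for freq=8
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))    # about 1.3602545
    query_weight = idf * query_norm                    # about 0.059380736
    field_weight = tf * idf * field_norm               # about 0.15028831
    return query_weight * field_weight * coord         # coord = 1/6 here


# Values copied from the explain tree for doc 5238.
score = classic_term_score(
    freq=8.0,
    doc_freq=30841,
    max_docs=44218,
    query_norm=0.043654136,
    field_norm=0.0390625,
    coord=1 / 6,
)
print(score)
```

Lucene computes these values in 32-bit floats, so the result should agree with the listed 0.0014873719 only to within rounding.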
Abstract

Wolfram and Zhang are interested in the effect of different indexing exhaustivity, by which they mean the number of terms chosen, and of different index term distributions and different term weighting methods on the resulting document cluster organization. The Distance Angle Retrieval Environment, DARE, which provides a two-dimensional display of retrieved documents, was used to represent the document clusters based upon a document's distance from the searcher's main interest, and on the angle formed by the document, a point representing a minor interest, and the point representing the main interest. If the centroid and the origin of the document space are assigned as major and minor points, the average distance between documents and the centroid can be measured, providing an indication of cluster organization in the form of a size-normalized similarity measure. Using 500 records from NTIS and nine models created by intersecting low, observed, and high exhaustivity levels (based upon a negative binomial distribution) with shallow, observed, and steep term distributions (based upon a Zipf distribution), simulation runs were performed using inverse document frequency, inter-document term frequency, and inverse document frequency based upon both inter- and intra-document frequencies. Low exhaustivity and shallow distributions result in a more dense document space and less effective retrieval. High exhaustivity and steeper distributions result in a more diffuse space.
Zhang, J.; Chen, Y.; Zhao, Y.; Wolfram, D.; Ma, F.: Public health and social media : a study of Zika virus-related posts on Yahoo! Answers (2020) 0.00
```
0.0014873719 = product of:
  0.008924231 = sum of:
    0.008924231 = weight(_text_:in in 5672) [ClassicSimilarity], result of:
      0.008924231 = score(doc=5672,freq=8.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.15028831 = fieldWeight in 5672, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=5672)
  0.16666667 = coord(1/6)
```
Abstract

This study investigates the content of questions and responses about the Zika virus on Yahoo! Answers as a recent example of how public concerns regarding an international health issue are reflected in social media. We investigate the contents of posts about the Zika virus on Yahoo! Answers, identify and reveal subject patterns about the Zika virus, and analyze the temporal changes of the revealed subject topics over 4 defined periods of the Zika virus outbreak. Multidimensional scaling analysis, temporal analysis, and inferential statistical analysis approaches were used in the study. A resulting 2-layer Zika virus schema, and term connections and relationships are presented. The results indicate that consumers' concerns changed over the 4 defined periods. Consumers paid more attention to the basic information about the Zika virus, and the prevention and protection from the Zika virus at the beginning of the outbreak of the Zika virus. During the later periods, consumers became more interested in the role that the government and health organizations played in the public health emergency.
Wolfram, D.: Inter-record linkage structure in a hypertext bibliographic retrieval system (1996) 0.00
```
0.0014724231 = product of:
  0.008834538 = sum of:
    0.008834538 = weight(_text_:in in 6761) [ClassicSimilarity], result of:
      0.008834538 = score(doc=6761,freq=4.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.14877784 = fieldWeight in 6761, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6761)
  0.16666667 = coord(1/6)
```
Abstract

Explores inter-record linkage relationships of a bibliographic hypertext system through the use of descriptor term co-occurrences. Using term distribution and term exhaustivity data for an existing system, develops 3 models of term co-occurrence and tests them against the observed data. Judged by chi-square values, the developed models do not adequately model the observed co-occurrence patterns for select parts of the distribution. With knowledge of the structure of such a hypertext system, an appropriate model may be constructed and used as the basis for studying systems design of inter-record linkages and system navigation by users in such a hypertext system
Wolfram, D.; Dimitroff, A.: Preliminary findings on searcher performance and perceptions of performance in a hypertext bibliographic retrieval system (1997) 0.00
```
0.0014724231 = product of:
  0.008834538 = sum of:
    0.008834538 = weight(_text_:in in 1857) [ClassicSimilarity], result of:
      0.008834538 = score(doc=1857,freq=4.0), product of:
        0.059380736 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.043654136 = queryNorm
        0.14877784 = fieldWeight in 1857, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1857)
  0.16666667 = coord(1/6)
```
Abstract

Reports on research examining the relationship of searcher performance and perception of performance, particularly for hypertext-based information retrieval systems for bibliographic data. Employs a prototype hypertext bibliographic retrieval system called HyperLynx. Evaluates its use by 83 subjects at the School of Library and Information Science and the Golda Meir Library at the University of Wisconsin-Milwaukee, USA. Measures of system usage indicate that there is no significant relationship between confidence and the number of record pages visited, although confident searchers searched for shorter time periods. The reality check measures show that both novice and experienced searchers were overconfident in their performance
