-
Na, S.-H.; Kang, I.-S.; Roh, J.-E.; Lee, J.-H.: ¬An empirical study of query expansion and cluster-based retrieval in language modeling approach (2007)
0.03
0.027885964 = product of:
0.05577193 = sum of:
0.05577193 = product of:
0.11154386 = sum of:
0.11154386 = weight(_text_:2007 in 906) [ClassicSimilarity], result of:
0.11154386 = score(doc=906,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.55205977 = fieldWeight in 906, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0546875 = fieldNorm(doc=906)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.2, S.302-314
- Year
- 2007
-
Abdelali, A.; Cowie, J.; Soliman, H.S.: Improving query precision using semantic expansion (2007)
0.03
0.027885964 = product of:
0.05577193 = sum of:
0.05577193 = product of:
0.11154386 = sum of:
0.11154386 = weight(_text_:2007 in 917) [ClassicSimilarity], result of:
0.11154386 = score(doc=917,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.55205977 = fieldWeight in 917, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0546875 = fieldNorm(doc=917)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.3, S.705-716
- Year
- 2007
-
Bhogal, J.; Macfarlane, A.; Smith, P.: ¬A review of ontology based query expansion (2007)
0.03
0.027885964 = product of:
0.05577193 = sum of:
0.05577193 = product of:
0.11154386 = sum of:
0.11154386 = weight(_text_:2007 in 919) [ClassicSimilarity], result of:
0.11154386 = score(doc=919,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.55205977 = fieldWeight in 919, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0546875 = fieldNorm(doc=919)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.4, S.866-886
- Year
- 2007
-
Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986)
0.02
0.024255017 = product of:
0.048510034 = sum of:
0.048510034 = product of:
0.09702007 = sum of:
0.09702007 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
0.09702007 = score(doc=402,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.61904186 = fieldWeight in 402, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.125 = fieldNorm(doc=402)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 22(1986) no.6, S.465-476
-
Yang, L.; Ji, D.; Leong, M.: Document reranking by term distribution and maximal marginal relevance for chinese information retrieval (2007)
0.02
0.023902256 = product of:
0.047804512 = sum of:
0.047804512 = product of:
0.095609024 = sum of:
0.095609024 = weight(_text_:2007 in 907) [ClassicSimilarity], result of:
0.095609024 = score(doc=907,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.47319412 = fieldWeight in 907, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.046875 = fieldNorm(doc=907)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.2, S.315-326
- Year
- 2007
-
Sakai, T.: On the reliability of information retrieval metrics based on graded relevance (2007)
0.02
0.023902256 = product of:
0.047804512 = sum of:
0.047804512 = product of:
0.095609024 = sum of:
0.095609024 = weight(_text_:2007 in 910) [ClassicSimilarity], result of:
0.095609024 = score(doc=910,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.47319412 = fieldWeight in 910, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.046875 = fieldNorm(doc=910)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.2, S.531-548
- Year
- 2007
-
Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983)
0.02
0.021223139 = product of:
0.042446278 = sum of:
0.042446278 = product of:
0.084892556 = sum of:
0.084892556 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
0.084892556 = score(doc=2134,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.5416616 = fieldWeight in 2134, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=2134)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 30. 3.2001 13:32:22
-
Back, J.: ¬An evaluation of relevancy ranking techniques used by Internet search engines (2000)
0.02
0.021223139 = product of:
0.042446278 = sum of:
0.042446278 = product of:
0.084892556 = sum of:
0.084892556 = weight(_text_:22 in 3445) [ClassicSimilarity], result of:
0.084892556 = score(doc=3445,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.5416616 = fieldWeight in 3445, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.109375 = fieldNorm(doc=3445)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 25. 8.2005 17:42:22
-
Dannenberg, R.B.; Birmingham, W.P.; Pardo, B.; Hu, N.; Meek, C.; Tzanetakis, G.: ¬A comparative evaluation of search techniques for query-by-humming using the MUSART testbed (2007)
0.02
0.019918546 = product of:
0.039837092 = sum of:
0.039837092 = product of:
0.079674184 = sum of:
0.079674184 = weight(_text_:2007 in 269) [ClassicSimilarity], result of:
0.079674184 = score(doc=269,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.39432842 = fieldWeight in 269, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0390625 = fieldNorm(doc=269)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Journal of the American Society for Information Science and Technology. 58(2007) no.5, S.687-701
- Year
- 2007
-
Chen, Z.; Fu, B.: On the complexity of Rocchio's similarity-based relevance feedback algorithm (2007)
0.02
0.019918546 = product of:
0.039837092 = sum of:
0.039837092 = product of:
0.079674184 = sum of:
0.079674184 = weight(_text_:2007 in 578) [ClassicSimilarity], result of:
0.079674184 = score(doc=578,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.39432842 = fieldWeight in 578, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0390625 = fieldNorm(doc=578)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Journal of the American Society for Information Science and Technology. 58(2007) no.10, S.1392-1400
- Year
- 2007
-
MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the update of partitioned inverted files (2007)
0.02
0.019918546 = product of:
0.039837092 = sum of:
0.039837092 = product of:
0.079674184 = sum of:
0.079674184 = weight(_text_:2007 in 819) [ClassicSimilarity], result of:
0.079674184 = score(doc=819,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.39432842 = fieldWeight in 819, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0390625 = fieldNorm(doc=819)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Aslib proceedings. 59(2007) no.4/5, S.367-396
- Year
- 2007
-
Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986)
0.02
0.018191261 = product of:
0.036382522 = sum of:
0.036382522 = product of:
0.072765045 = sum of:
0.072765045 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
0.072765045 = score(doc=58,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.46428138 = fieldWeight in 58, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=58)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 14. 6.2015 22:12:44
-
Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986)
0.02
0.018191261 = product of:
0.036382522 = sum of:
0.036382522 = product of:
0.072765045 = sum of:
0.072765045 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
0.072765045 = score(doc=2051,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.46428138 = fieldWeight in 2051, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=2051)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 14. 6.2015 22:12:56
-
Urbain, J.; Goharian, N.; Frieder, O.: Probabilistic passage models for semantic search of genomics literature (2008)
0.02
0.017815689 = product of:
0.035631377 = sum of:
0.035631377 = product of:
0.071262754 = sum of:
0.071262754 = weight(_text_:2007 in 2380) [ClassicSimilarity], result of:
0.071262754 = score(doc=2380,freq=4.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.35269803 = fieldWeight in 2380, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0390625 = fieldNorm(doc=2380)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- We explore unsupervised learning techniques for extracting semantic information about biomedical concepts and topics, and introduce a passage retrieval model for using these semantics in context to improve genomics literature search. Our contributions include a new passage retrieval model based on an undirected graphical model (Markov Random Fields), and new methods for modeling passage-concepts, document-topics, and passage-terms as potential functions within the model. Each potential function includes distributional evidence to disambiguate topics, concepts, and terms in context. The joint distribution across potential functions in the graph represents the probability of a passage being relevant to a biologist's information need. Relevance ranking within each potential function simplifies normalization across potential functions and eliminates the need for tuning of passage retrieval model parameters. Our dimensional indexing model facilitates efficient aggregation of topic, concept, and term distributions. The proposed passage-retrieval model improves search results in the presence of varying levels of semantic evidence, outperforming models of query terms, concepts, or document topics alone. Our results exceed the state-of-the-art for automatic document retrieval by 14.46% (0.3554 vs. 0.3105) and passage retrieval by 15.57% (0.1128 vs. 0.0976) as assessed by the TREC 2007 Genomics Track, and automatic document retrieval by 18.56% (0.3424 vs. 0.2888) as assessed by the TREC 2005 Genomics Track. Automatic document retrieval results for TREC 2007 and TREC 2005 are statistically significant at the 95% confidence level (p = .0359 and .0253, respectively). Passage retrieval is significant at the 90% confidence level (p = 0.0893).
-
White, R.W.; Marchionini, G.: Examining the effectiveness of real-time query expansion (2007)
0.02
0.015934838 = product of:
0.031869676 = sum of:
0.031869676 = product of:
0.06373935 = sum of:
0.06373935 = weight(_text_:2007 in 913) [ClassicSimilarity], result of:
0.06373935 = score(doc=913,freq=5.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.31546274 = fieldWeight in 913, product of:
2.236068 = tf(freq=5.0), with freq of:
5.0 = termFreq=5.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.03125 = fieldNorm(doc=913)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Source
- Information processing and management. 43(2007) no.3, S.685-704
- Year
- 2007
-
Jacso, P.: Testing the calculation of a realistic h-index in Google Scholar, Scopus, and Web of Science for F. W. Lancaster (2008)
0.01
0.0125975935 = product of:
0.025195187 = sum of:
0.025195187 = product of:
0.050390374 = sum of:
0.050390374 = weight(_text_:2007 in 5586) [ClassicSimilarity], result of:
0.050390374 = score(doc=5586,freq=2.0), product of:
0.20205033 = queryWeight, product of:
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.044755515 = queryNorm
0.24939516 = fieldWeight in 5586, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
4.514535 = idf(docFreq=1315, maxDocs=44218)
0.0390625 = fieldNorm(doc=5586)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Abstract
- This paper focuses on the practical limitations in the content and software of the databases that are used to calculate the h-index for assessing the publishing productivity and impact of researchers. To celebrate F. W. Lancaster's biological age of seventy-five, and "scientific age" of forty-five, this paper discusses the related features of Google Scholar, Scopus, and Web of Science (WoS), and demonstrates in the latter how a much more realistic and fair h-index can be computed for F. W. Lancaster than the one produced automatically. Browsing and searching the cited reference index of the 1945-2007 edition of WoS, which in my estimate has over a hundred million "orphan references" that have no counterpart master records to be attached to, and "stray references" that cite papers which do have master records but cannot be identified by the matching algorithm because of errors of omission and commission in the references of the citing works, can bring up hundreds of additional cited references given to works of an accomplished author but are ignored in the automatic process of calculating the h-index. The partially manual process doubled the h-index value for F. W. Lancaster from 13 to 26, which is a much more realistic value for an information scientist and professor of his stature.
-
MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004)
0.01
0.012127508 = product of:
0.024255017 = sum of:
0.024255017 = product of:
0.048510034 = sum of:
0.048510034 = weight(_text_:22 in 5108) [ClassicSimilarity], result of:
0.048510034 = score(doc=5108,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.30952093 = fieldWeight in 5108, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=5108)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 20. 1.2007 18:30:22
-
Faloutsos, C.: Signature files (1992)
0.01
0.012127508 = product of:
0.024255017 = sum of:
0.024255017 = product of:
0.048510034 = sum of:
0.048510034 = weight(_text_:22 in 3499) [ClassicSimilarity], result of:
0.048510034 = score(doc=3499,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.30952093 = fieldWeight in 3499, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=3499)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 7. 5.1999 15:22:48
-
Losada, D.E.; Barreiro, A.: Emebedding term similarity and inverse document frequency into a logical model of information retrieval (2003)
0.01
0.012127508 = product of:
0.024255017 = sum of:
0.024255017 = product of:
0.048510034 = sum of:
0.048510034 = weight(_text_:22 in 1422) [ClassicSimilarity], result of:
0.048510034 = score(doc=1422,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.30952093 = fieldWeight in 1422, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1422)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 3.2003 19:27:23
-
Bornmann, L.; Mutz, R.: From P100 to P100' : a new citation-rank approach (2014)
0.01
0.012127508 = product of:
0.024255017 = sum of:
0.024255017 = product of:
0.048510034 = sum of:
0.048510034 = weight(_text_:22 in 1431) [ClassicSimilarity], result of:
0.048510034 = score(doc=1431,freq=2.0), product of:
0.15672618 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.044755515 = queryNorm
0.30952093 = fieldWeight in 1431, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0625 = fieldNorm(doc=1431)
0.5 = coord(1/2)
0.5 = coord(1/2)
- Date
- 22. 8.2014 17:05:18