-
Kang, I.-S.; Na, S.-H.; Lee, S.; Jung, H.; Kim, P.; Sung, W.-K.; Lee, J.-H.: On co-authorship for author disambiguation (2009)
0.02
0.020735003 = product of:
0.04838167 = sum of:
0.015805235 = product of:
0.06322094 = sum of:
0.06322094 = weight(_text_:authors in 2453) [ClassicSimilarity], result of:
0.06322094 = score(doc=2453,freq=4.0), product of:
0.14792371 = queryWeight, product of:
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.03244785 = queryNorm
0.42738882 = fieldWeight in 2453, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
4.558814 = idf(docFreq=1258, maxDocs=44218)
0.046875 = fieldNorm(doc=2453)
0.25 = coord(1/4)
0.021717625 = weight(_text_:j in 2453) [ClassicSimilarity], result of:
0.021717625 = score(doc=2453,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.21064025 = fieldWeight in 2453, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=2453)
0.010858812 = product of:
0.021717625 = sum of:
0.021717625 = weight(_text_:j in 2453) [ClassicSimilarity], result of:
0.021717625 = score(doc=2453,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.21064025 = fieldWeight in 2453, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=2453)
0.5 = coord(1/2)
0.42857143 = coord(3/7)
- Abstract
- Author name disambiguation deals with clustering the same-name authors into different individuals. To attack the problem, many studies have employed a variety of disambiguation features such as coauthors, titles of papers/publications, topics of articles, emails/affiliations, etc. Among these, co-authorship is the most easily accessible and influential, since inter-person acquaintances represented by co-authorship could discriminate the identities of authors more clearly than other features. This study attempts to explore the net effects of co-authorship on author clustering in bibliographic data. First, to handle the shortage of explicit coauthors listed in known citations, a web-assisted technique of acquiring implicit coauthors of the target author to be disambiguated is proposed. Then, a coauthor disambiguation hypothesis that the identity of an author can be determined by his/her coauthors is examined and confirmed through a variety of author disambiguation experiments.
-
Na, S.-H.; Kang, I.-S.; Roh, J.-E.; Lee, J.-H.: ¬An empirical study of query expansion and cluster-based retrieval in language modeling approach (2007)
0.02
0.015356679 = product of:
0.053748377 = sum of:
0.035832252 = weight(_text_:j in 906) [ClassicSimilarity], result of:
0.035832252 = score(doc=906,freq=4.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.34753868 = fieldWeight in 906, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.0546875 = fieldNorm(doc=906)
0.017916126 = product of:
0.035832252 = sum of:
0.035832252 = weight(_text_:j in 906) [ClassicSimilarity], result of:
0.035832252 = score(doc=906,freq=4.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.34753868 = fieldWeight in 906, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.0546875 = fieldNorm(doc=906)
0.5 = coord(1/2)
0.2857143 = coord(2/7)
-
Kang, I.-S.; Na, S.-H.; Kim, J.; Lee, J.-H.: Cluster-based patent retrieval (2007)
0.01
0.013162869 = product of:
0.04607004 = sum of:
0.03071336 = weight(_text_:j in 930) [ClassicSimilarity], result of:
0.03071336 = score(doc=930,freq=4.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.2978903 = fieldWeight in 930, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=930)
0.01535668 = product of:
0.03071336 = sum of:
0.03071336 = weight(_text_:j in 930) [ClassicSimilarity], result of:
0.03071336 = score(doc=930,freq=4.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.2978903 = fieldWeight in 930, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=930)
0.5 = coord(1/2)
0.2857143 = coord(2/7)
-
Na, S.-H.; Kang, I.-S.; Lee, J.-H.: Adaptive document clustering based on query-based similarity (2007)
0.01
0.009307554 = product of:
0.032576438 = sum of:
0.021717625 = weight(_text_:j in 920) [ClassicSimilarity], result of:
0.021717625 = score(doc=920,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.21064025 = fieldWeight in 920, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=920)
0.010858812 = product of:
0.021717625 = sum of:
0.021717625 = weight(_text_:j in 920) [ClassicSimilarity], result of:
0.021717625 = score(doc=920,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.21064025 = fieldWeight in 920, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.046875 = fieldNorm(doc=920)
0.5 = coord(1/2)
0.2857143 = coord(2/7)
-
Na, S.-H.; Kang, I.-S.; Lee, J.-H.: Parsimonious translation models for information retrieval (2007)
0.01
0.007756295 = product of:
0.027147032 = sum of:
0.01809802 = weight(_text_:j in 898) [ClassicSimilarity], result of:
0.01809802 = score(doc=898,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.17553353 = fieldWeight in 898, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.0390625 = fieldNorm(doc=898)
0.00904901 = product of:
0.01809802 = sum of:
0.01809802 = weight(_text_:j in 898) [ClassicSimilarity], result of:
0.01809802 = score(doc=898,freq=2.0), product of:
0.10310292 = queryWeight, product of:
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.03244785 = queryNorm
0.17553353 = fieldWeight in 898, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.1774964 = idf(docFreq=5010, maxDocs=44218)
0.0390625 = fieldNorm(doc=898)
0.5 = coord(1/2)
0.2857143 = coord(2/7)