Search (4 results, page 1 of 1)

  1. Gachot, D.A.; Lange, E.; Yang, J.: The SYSTRAN NLP browser : an application of machine translation technology in cross-language information retrieval (1998) 0.08
    0.08246493 = product of:
      0.16492987 = sum of:
        0.16492987 = product of:
          0.32985973 = sum of:
            0.32985973 = weight(_text_:d.a in 6213) [ClassicSimilarity], result of:
              0.32985973 = score(doc=6213,freq=2.0), product of:
                0.35180828 = queryWeight, product of:
                  7.071914 = idf(docFreq=101, maxDocs=44218)
                  0.04974725 = queryNorm
                0.9376122 = fieldWeight in 6213, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  7.071914 = idf(docFreq=101, maxDocs=44218)
                  0.09375 = fieldNorm(doc=6213)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
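    The explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function name `classic_term_score` is hypothetical and chosen only for illustration. All numeric inputs are copied from the tree for doc 6213. Lucene computes in 32-bit floats, so the last digits may differ slightly from this double-precision version.

```python
import math

def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """Score of one term in one document, following the explain tree.

    Hypothetical helper; the formulas are the assumed ClassicSimilarity ones.
    """
    tf = math.sqrt(freq)                              # 1.4142135 for freq=2.0
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))   # about 7.0719 here
    query_weight = idf * query_norm                   # "queryWeight" in the tree
    field_weight = tf * idf * field_norm              # "fieldWeight" in the tree
    return query_weight * field_weight

# Values taken from the explain output for doc 6213, term "d.a".
term = classic_term_score(
    freq=2.0, doc_freq=101, max_docs=44218,
    query_norm=0.04974725, field_norm=0.09375,
)

# Two nested coord(1/2) factors scale the term score to the listed total.
total = term * 0.5 * 0.5

print(round(term, 4), round(total, 4))  # approximately 0.3299 0.0825
```

    The rounded total corresponds to the 0.08 shown next to the title.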
    
  2. Zhang, L.; Lu, W.; Yang, J.: LAGOS-AND : a large gold standard dataset for scholarly author name disambiguation (2023) 0.04
    0.03704326 = sum of:
      0.020193094 = product of:
        0.08077238 = sum of:
          0.08077238 = weight(_text_:authors in 883) [ClassicSimilarity], result of:
            0.08077238 = score(doc=883,freq=4.0), product of:
              0.22678846 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.04974725 = queryNorm
              0.35615736 = fieldWeight in 883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
        0.25 = coord(1/4)
      0.016850164 = product of:
        0.03370033 = sum of:
          0.03370033 = weight(_text_:22 in 883) [ClassicSimilarity], result of:
            0.03370033 = score(doc=883,freq=2.0), product of:
              0.17420639 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.04974725 = queryNorm
              0.19345059 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
        0.5 = coord(1/2)
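    When several query clauses match, the document score is the sum of the per-clause scores, each scaled by its own coord factor. The sketch below recomputes the two clauses listed for doc 883 under the same assumed ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The names `term_score`, `clauses`, and `parts` are illustrative only, and the results agree with the listing only up to float rounding.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.04974725
FIELD_NORM = 0.0390625  # fieldNorm(doc=883), shared by both clauses

def term_score(freq, doc_freq):
    """Hypothetical helper: queryWeight * fieldWeight for one term."""
    tf = math.sqrt(freq)
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    return (idf * QUERY_NORM) * (tf * idf * FIELD_NORM)

# (term, freq, docFreq, coord) as shown in the explain tree for doc 883.
clauses = [
    ("authors", 4.0, 1258, 1 / 4),
    ("22", 2.0, 3622, 1 / 2),
]

# Each clause contributes its term score multiplied by its coord factor.
parts = {
    term: coord * term_score(freq, doc_freq)
    for term, freq, doc_freq, coord in clauses
}
doc_score = sum(parts.values())

for term, value in parts.items():
    print(f"{term}: {value:.6f}")
print(f"total: {doc_score:.6f}")
```

    The "authors" clause has the higher raw term score, but its coord(1/4) reduces its contribution to roughly the same size as that of the "22" clause.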
    
    Abstract
    In this article, we present a method to automatically build large labeled datasets for the author ambiguity problem in the academic world by leveraging the authoritative academic resources, ORCID and DOI. Using the method, we built LAGOS-AND, two large, gold-standard sub-datasets for author name disambiguation (AND), of which LAGOS-AND-BLOCK is created for clustering-based AND research and LAGOS-AND-PAIRWISE is created for classification-based AND research. Our LAGOS-AND datasets are substantially different from the existing ones. The initial versions of the datasets (v1.0, released in February 2021) include 7.5 M citations authored by 798 K unique authors (LAGOS-AND-BLOCK) and close to 1 M instances (LAGOS-AND-PAIRWISE). And both datasets show close similarities to the whole Microsoft Academic Graph (MAG) across validations of six facets. In building the datasets, we reveal the variation degrees of last names in three literature databases, PubMed, MAG, and Semantic Scholar, by comparing author names hosted to the authors' official last names shown on the ORCID pages. Furthermore, we evaluate several baseline disambiguation methods as well as the MAG's author IDs system on our datasets, and the evaluation helps identify several interesting findings. We hope the datasets and findings will bring new insights for future studies. The code and datasets are publicly available.
    Date
    22. 1.2023 18:40:36
  3. Wan, X.; Yang, J.; Xiao, J.: Incorporating cross-document relationships between sentences for single document summarizations (2006) 0.01
    0.010110098 = product of:
      0.020220196 = sum of:
        0.020220196 = product of:
          0.04044039 = sum of:
            0.04044039 = weight(_text_:22 in 2421) [ClassicSimilarity], result of:
              0.04044039 = score(doc=2421,freq=2.0), product of:
                0.17420639 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04974725 = queryNorm
                0.23214069 = fieldWeight in 2421, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2421)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Research and advanced technology for digital libraries : 10th European conference, proceedings / ECDL 2006, Alicante, Spain, September 17 - 22, 2006
  4. Tang, X.-B.; Liu, G.-C.; Yang, J.; Wei, W.: Knowledge-based financial statement fraud detection system : based on an ontology and a decision tree (2018) 0.01
    0.010110098 = product of:
      0.020220196 = sum of:
        0.020220196 = product of:
          0.04044039 = sum of:
            0.04044039 = weight(_text_:22 in 4306) [ClassicSimilarity], result of:
              0.04044039 = score(doc=4306,freq=2.0), product of:
                0.17420639 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.04974725 = queryNorm
                0.23214069 = fieldWeight in 4306, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4306)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    21. 6.2018 10:22:43
