Search (5 results, page 1 of 1)

  • × author_ss:"Lu, W."
  1. Lu, W.; Ding, H.; Jiang, J.: ¬A document expansion framework for tag-based image retrieval (2018) 0.08
    0.08113074 = product of:
      0.16226149 = sum of:
        0.16226149 = sum of:
          0.12618683 = weight(_text_:q in 4630) [ClassicSimilarity], result of:
            0.12618683 = score(doc=4630,freq=2.0), product of:
              0.34876913 = queryWeight, product of:
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.05325213 = queryNorm
              0.3618062 = fieldWeight in 4630, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4630)
          0.036074646 = weight(_text_:22 in 4630) [ClassicSimilarity], result of:
            0.036074646 = score(doc=4630,freq=2.0), product of:
              0.1864799 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05325213 = queryNorm
              0.19345059 = fieldWeight in 4630, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4630)
      0.5 = coord(1/2)
    
    Abstract
    Purpose The purpose of this paper is to utilize document expansion techniques for improving image representation and retrieval. This paper proposes a concise framework for tag-based image retrieval (TBIR). Design/methodology/approach The proposed approach includes three core components: a strategy of selecting expansion (similar) images from the whole corpus (e.g. cluster-based or nearest neighbor-based); a technique for assessing image similarity, which is adopted for selecting expansion images (text, image, or mixed); and a model for matching the expanded image representation with the search query (merging or separate). Findings The results show that applying the proposed method yields significant improvements in effectiveness, and the method obtains better performance on the top of the rank and makes a great improvement on some topics with zero score in baseline. Moreover, nearest neighbor-based expansion strategy outperforms the cluster-based expansion strategy, and using image features for selecting expansion images is better than using text features in most cases, and the separate method for calculating the augmented probability P(q|RD) is able to erase the negative influences of error images in RD. Research limitations/implications Despite these methods only outperform on the top of the rank instead of the entire rank list, TBIR on mobile platforms still can benefit from this approach. Originality/value Unlike former studies addressing the sparsity, vocabulary mismatch, and tag relatedness in TBIR individually, the approach proposed by this paper addresses all these issues with a single document expansion framework. It is a comprehensive investigation of document expansion techniques in TBIR.
    Date
    20. 1.2015 18:30:22
  2. Zhang, L.; Lu, W.; Yang, J.: LAGOS-AND : a large gold standard dataset for scholarly author name disambiguation (2023) 0.04
    0.039653096 = sum of:
      0.021615773 = product of:
        0.086463094 = sum of:
          0.086463094 = weight(_text_:authors in 883) [ClassicSimilarity], result of:
            0.086463094 = score(doc=883,freq=4.0), product of:
              0.24276656 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.05325213 = queryNorm
              0.35615736 = fieldWeight in 883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
        0.25 = coord(1/4)
      0.018037323 = product of:
        0.036074646 = sum of:
          0.036074646 = weight(_text_:22 in 883) [ClassicSimilarity], result of:
            0.036074646 = score(doc=883,freq=2.0), product of:
              0.1864799 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.05325213 = queryNorm
              0.19345059 = fieldWeight in 883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=883)
        0.5 = coord(1/2)
    
    Abstract
    In this article, we present a method to automatically build large labeled datasets for the author ambiguity problem in the academic world by leveraging the authoritative academic resources, ORCID and DOI. Using the method, we built LAGOS-AND, two large, gold-standard sub-datasets for author name disambiguation (AND), of which LAGOS-AND-BLOCK is created for clustering-based AND research and LAGOS-AND-PAIRWISE is created for classification-based AND research. Our LAGOS-AND datasets are substantially different from the existing ones. The initial versions of the datasets (v1.0, released in February 2021) include 7.5 M citations authored by 798 K unique authors (LAGOS-AND-BLOCK) and close to 1 M instances (LAGOS-AND-PAIRWISE). And both datasets show close similarities to the whole Microsoft Academic Graph (MAG) across validations of six facets. In building the datasets, we reveal the variation degrees of last names in three literature databases, PubMed, MAG, and Semantic Scholar, by comparing author names hosted to the authors' official last names shown on the ORCID pages. Furthermore, we evaluate several baseline disambiguation methods as well as the MAG's author IDs system on our datasets, and the evaluation helps identify several interesting findings. We hope the datasets and findings will bring new insights for future studies. The code and datasets are publicly available.
    Date
    22. 1.2023 18:40:36
  3. Lu, W.; Li, X.; Liu, Z.; Cheng, Q.: How do author-selected keywords function semantically in scientific manuscripts? (2019) 0.03
    0.03154671 = product of:
      0.06309342 = sum of:
        0.06309342 = product of:
          0.12618683 = sum of:
            0.12618683 = weight(_text_:q in 5453) [ClassicSimilarity], result of:
              0.12618683 = score(doc=5453,freq=2.0), product of:
                0.34876913 = queryWeight, product of:
                  6.5493927 = idf(docFreq=171, maxDocs=44218)
                  0.05325213 = queryNorm
                0.3618062 = fieldWeight in 5453, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.5493927 = idf(docFreq=171, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5453)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  4. Huang, S.; Qian, J.; Huang, Y.; Lu, W.; Bu, Y.; Yang, J.; Cheng, Q.: Disclosing the relationship between citation structure and future impact of a publication (2022) 0.03
    0.03154671 = product of:
      0.06309342 = sum of:
        0.06309342 = product of:
          0.12618683 = sum of:
            0.12618683 = weight(_text_:q in 621) [ClassicSimilarity], result of:
              0.12618683 = score(doc=621,freq=2.0), product of:
                0.34876913 = queryWeight, product of:
                  6.5493927 = idf(docFreq=171, maxDocs=44218)
                  0.05325213 = queryNorm
                0.3618062 = fieldWeight in 621, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.5493927 = idf(docFreq=171, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=621)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  5. Jiang, Y.; Meng, R.; Huang, Y.; Lu, W.; Liu, J.: Generating keyphrases for readers : a controllable keyphrase generation framework (2023) 0.01
    0.009018661 = product of:
      0.018037323 = sum of:
        0.018037323 = product of:
          0.036074646 = sum of:
            0.036074646 = weight(_text_:22 in 1012) [ClassicSimilarity], result of:
              0.036074646 = score(doc=1012,freq=2.0), product of:
                0.1864799 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.05325213 = queryNorm
                0.19345059 = fieldWeight in 1012, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1012)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 6.2023 14:55:20