Search (26 results, page 2 of 2)

  • × author_ss:"Greenberg, J."
  • × language_ss:"e"
  1. Newby, G.B.; Greenberg, J.; Jones, P.: Open source software development and Lotka's law : bibliometric patterns in programming (2003) 0.00
    0.0014390396 = product of:
      0.0028780792 = sum of:
        0.0028780792 = product of:
          0.0057561584 = sum of:
            0.0057561584 = weight(_text_:a in 5140) [ClassicSimilarity], result of:
              0.0057561584 = score(doc=5140,freq=6.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.13239266 = fieldWeight in 5140, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5140)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Newby, Greenberg, and Jones analyze programming productivity of open source software by counting registered developers contributions found in the Linux Software Map and in Scourceforge. Using seven years of data from a subset of the Linux directory tree LSM data provided 4503 files with 3341 unique author names. The distribution follows Lotka's Law with an exponent of 2.82 as verified by the Kolmolgorov-Smirnov one sample goodness of fit test. Scourceforge data is broken into developers and administrators, but when both were used as authors the Lotka distribution exponent of 2.55 produces the lowest error. This would not be significant by the K-S test but the 3.54% maximum error would indicate a fit and calls into question the appropriateness of K-S for large populations of authors.
    Type
    a
  2. Grabus, S.; Logan, P.M.; Greenberg, J.: Temporal concept drift and alignment : an empirical approach to comparing knowledge organization systems over time (2022) 0.00
    0.0013847164 = product of:
      0.0027694327 = sum of:
        0.0027694327 = product of:
          0.0055388655 = sum of:
            0.0055388655 = weight(_text_:a in 1100) [ClassicSimilarity], result of:
              0.0055388655 = score(doc=1100,freq=8.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.12739488 = fieldWeight in 1100, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1100)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This research explores temporal concept drift and temporal alignment in knowledge organization systems (KOS). A comparative analysis is pursued using the 1910 Library of Congress Subject Headings, 2020 FAST Topical, and automatic indexing. The use case involves a sample of 90 nineteenth-century Encyclopedia Britannica entries. The entries were indexed using two approaches: 1) full-text indexing; 2) Named Entity Recognition was performed upon the entries with Stanza, Stanford's NLP toolkit, and entities were automatically indexed with the Helping Interdisciplinary Vocabulary application (HIVE), using both 1910 LCSH and FAST Topical. The analysis focused on three goals: 1) identifying results that were exclusive to the 1910 LCSH output; 2) identifying terms in the exclusive set that have been deprecated from the contemporary LCSH, demonstrating temporal concept drift; and 3) exploring the historical significance of these deprecated terms. Results confirm that historical vocabularies can be used to generate anachronistic subject headings representing conceptual drift across time in KOS and historical resources. A methodological contribution is made demonstrating how to study changes in KOS over time and improve the contextualization historical humanities resources.
    Type
    a
  3. Greenberg, J.: Advancing Semantic Web via library functions (2006) 0.00
    0.0011749709 = product of:
      0.0023499418 = sum of:
        0.0023499418 = product of:
          0.0046998835 = sum of:
            0.0046998835 = weight(_text_:a in 244) [ClassicSimilarity], result of:
              0.0046998835 = score(doc=244,freq=4.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.10809815 = fieldWeight in 244, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=244)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    This article explores the applicability primary library functions (collection development, cataloging, reference, and circulation) to the Semantic Web. The article defines the Semantic Web, identifies similarities between the library institution and the Semantic Web, and presents research questions guiding the inquiry. The article addresses each library function and demonstrates the applicability of each function's polices to Semantic Web development. Results indicate that library functions are applicable to Semantic Web, with "collection development" translating to "Semantic Web selection;" "cataloging" translating to "Semantic Web 'semantic' representation;" "reference" translating to "Semantic Web service," and circulation translating to "Semantic Web resource use." The last part of this article includes a discussion about the lack of embrace between the library and the Semantic Web communities, recommendations for improving this gap, and research conclusions.
    Type
    a
  4. Greenberg, J.: Metadata and the World Wide Web (2002) 0.00
    9.791424E-4 = product of:
      0.0019582848 = sum of:
        0.0019582848 = product of:
          0.0039165695 = sum of:
            0.0039165695 = weight(_text_:a in 4264) [ClassicSimilarity], result of:
              0.0039165695 = score(doc=4264,freq=4.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.090081796 = fieldWeight in 4264, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=4264)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Metadata is of paramount importance for persons, organizations, and endeavors of every dimension that are increasingly turning to the World Wide Web (hereafter referred to as the Web) as a chief conduit for accessing and disseminating information. This is evidenced by the development and implementation of metadata schemas supporting projects ranging from restricted corporate intranets, data warehouses, and consumer-oriented electronic commerce enterprises to freely accessible digital libraries, educational initiatives, virtual museums, and other public Web sites. Today's metadata activities are unprecedented because they extend beyond the traditional library environment in an effort to deal with the Web's exponential growth. This article considers metadata in today's Web environment. The article defines metadata, examines the relationship between metadata and cataloging, provides definitions for key metadata vocabulary terms, and explores the topic of metadata generation. Metadata is an extensive and expanding subject that is prevalent in many environments. For practical reasons, this article has elected to concentrate an the information resource domain, which is defined by electronic textual documents, graphical images, archival materials, museum artifacts, and other objects found in both digital and physical information centers (e.g., libraries, museums, record centers, and archives). To show the extent and larger application of metadata, several examples are also drawn from the data warehouse, electronic commerce, open source, and medical communities.
    Type
    a
  5. Greenberg, J.: Subject control of ephemera : MARC format options (1996) 0.00
    9.693015E-4 = product of:
      0.001938603 = sum of:
        0.001938603 = product of:
          0.003877206 = sum of:
            0.003877206 = weight(_text_:a in 543) [ClassicSimilarity], result of:
              0.003877206 = score(doc=543,freq=2.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.089176424 = fieldWeight in 543, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=543)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a
  6. Greenberg, J.: Optimal query expansion (QE) processing methods with semantically encoded structured thesaurus terminology (2001) 0.00
    8.308299E-4 = product of:
      0.0016616598 = sum of:
        0.0016616598 = product of:
          0.0033233196 = sum of:
            0.0033233196 = weight(_text_:a in 5750) [ClassicSimilarity], result of:
              0.0033233196 = score(doc=5750,freq=2.0), product of:
                0.043477926 = queryWeight, product of:
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.037706986 = queryNorm
                0.07643694 = fieldWeight in 5750, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  1.153047 = idf(docFreq=37942, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5750)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Type
    a