Search (3 results, page 1 of 1)

  • × theme_ss:"Automatisches Klassifizieren"
  • × author_ss:"Golub, K."
  1. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 0.02
    0.018957332 = product of:
      0.037914664 = sum of:
        0.037914664 = product of:
          0.07582933 = sum of:
            0.07582933 = weight(_text_:organization in 4558) [ClassicSimilarity], result of:
              0.07582933 = score(doc=4558,freq=6.0), product of:
                0.18523255 = queryWeight, product of:
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.051953442 = queryNorm
                0.40937364 = fieldWeight in 4558, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.046875 = fieldNorm(doc=4558)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    While automated methods for information organization have been around for several decades now, exponential growth of the World Wide Web has put them into the forefront of research in different communities, within which several approaches can be identified: 1) machine learning (algorithms that allow computers to improve their performance based on learning from pre-existing data); 2) document clustering (algorithms for unsupervised document organization and automated topic extraction); and 3) string matching (algorithms that match given strings within larger text). Here the aim was to automatically organize textual documents into hierarchical structures for subject browsing. The string-matching approach was tested using a controlled vocabulary (containing pre-selected and pre-defined authorized terms, each corresponding to only one concept). The results imply that an appropriate controlled vocabulary, with a sufficient number of entry terms designating classes, could in itself be a solution for automated classification. Then, if the same controlled vocabulary had an appropriate hierarchical structure, it would at the same time provide a good browsing structure for the collection of automatically classified documents.
    Source
    Knowledge organization. 38(2011) no.3, pp. 230-244
  2. Golub, K.: Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations (2006) 0.01
    0.010945019 = product of:
      0.021890039 = sum of:
        0.021890039 = product of:
          0.043780077 = sum of:
            0.043780077 = weight(_text_:organization in 5897) [ClassicSimilarity], result of:
              0.043780077 = score(doc=5897,freq=2.0), product of:
                0.18523255 = queryWeight, product of:
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.051953442 = queryNorm
                0.23635197 = fieldWeight in 5897, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5897)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Content
    Contribution to a special issue on "Knowledge organization systems and services"
  3. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.01
    0.010945019 = product of:
      0.021890039 = sum of:
        0.021890039 = product of:
          0.043780077 = sum of:
            0.043780077 = weight(_text_:organization in 1461) [ClassicSimilarity], result of:
              0.043780077 = score(doc=1461,freq=2.0), product of:
                0.18523255 = queryWeight, product of:
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.051953442 = queryNorm
                0.23635197 = fieldWeight in 1461, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5653565 = idf(docFreq=3399, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1461)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Knowledge organization. 34(2007) no.4, pp. 247-263
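
The score breakdowns listed with each result can be reproduced by hand. The sketch below assumes the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))) and takes queryNorm, fieldNorm, and the two coord factors directly from the listing. The function names are hypothetical and not part of any Lucene API.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=()):
    """Rebuild one weight(...) node of the explain tree.

    query_norm, field_norm and coords are copied from the listing,
    not derived here.
    """
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                   # tf(freq=...)
    query_weight = idf * query_norm        # queryWeight
    field_weight = tf * idf * field_norm   # fieldWeight
    score = query_weight * field_weight    # score(doc=..., freq=...)
    for coord in coords:                   # outer coord(1/2) factors
        score *= coord
    return score


if __name__ == "__main__":
    # Values shared by all three results in the listing.
    common = dict(doc_freq=3399, max_docs=44218,
                  query_norm=0.051953442, field_norm=0.046875,
                  coords=(0.5, 0.5))
    # Result 1 has freq=6.0; results 2 and 3 have freq=2.0.
    print(classic_term_score(freq=6.0, **common))
    print(classic_term_score(freq=2.0, **common))
```

With these inputs the two printed values should agree with the listed totals (0.018957332 and 0.010945019) to within floating-point rounding. Since idf, queryNorm, and fieldNorm are identical across the three records, the only factor separating result 1 from results 2 and 3 is the term frequency of "organization".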
