Search (5 results, page 1 of 1)

  • × author_ss:"Ardö, A."
  1. Golub, K.; Hamon, T.; Ardö, A.: Automated classification of textual documents based on a controlled vocabulary in engineering (2007) 0.01
    0.012924549 = product of:
      0.051698197 = sum of:
        0.051698197 = weight(_text_:digital in 1461) [ClassicSimilarity], result of:
          0.051698197 = score(doc=1461,freq=2.0), product of:
            0.19770671 = queryWeight, product of:
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.050121464 = queryNorm
            0.26148933 = fieldWeight in 1461, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.944552 = idf(docFreq=2326, maxDocs=44218)
              0.046875 = fieldNorm(doc=1461)
      0.25 = coord(1/4)
    
    Abstract
    Automated subject classification has been a challenging research issue for many years now, receiving particular attention in the past decade due to rapid increase of digital documents. The most frequent approach to automated classification is machine learning. It, however, requires training documents and performs well on new documents only if these are similar enough to the former. We explore a string-matching algorithm based on a controlled vocabulary, which does not require training documents - instead it reuses the intellectual work put into creating the controlled vocabulary. Terms from the Engineering Information thesaurus and classification scheme were matched against title and abstract of engineering papers from the Compendex database. Simple string-matching was enhanced by several methods such as term weighting schemes and cut-offs, exclusion of certain terms, and en- richment of the controlled vocabulary with automatically extracted terms. The best results are 76% recall when the controlled vocabulary is enriched with new terms, and 79% precision when certain terms are excluded. Precision of individual classes is up to 98%. These results are comparable to state-of-the-art machine-learning algorithms.
  2. Koch, T.; Ardö, A.; Falcoz, F.; Nielsen, M.; Dandfær, M.: Improving resource discovery and retrieval on the Internet : the Nordic WAIS/World Wide Web Project and the classification of WAIS databases (1995) 0.01
    0.012332911 = product of:
      0.049331643 = sum of:
        0.049331643 = product of:
          0.098663285 = sum of:
            0.098663285 = weight(_text_:project in 3370) [ClassicSimilarity], result of:
              0.098663285 = score(doc=3370,freq=2.0), product of:
                0.21156175 = queryWeight, product of:
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.050121464 = queryNorm
                0.4663569 = fieldWeight in 3370, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3370)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
  3. Ardö, A.; Godby, J.; Houghton, A.; Koch, T.; Reighart, R.; Thompson, R.; Vizine-Goetz, D.: Browsing engineering resources on the Web : a general knowledge organization scheme (Dewey) vs. a special scheme (EI) (2000) 0.01
    0.010464822 = product of:
      0.041859288 = sum of:
        0.041859288 = product of:
          0.083718576 = sum of:
            0.083718576 = weight(_text_:project in 86) [ClassicSimilarity], result of:
              0.083718576 = score(doc=86,freq=4.0), product of:
                0.21156175 = queryWeight, product of:
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.050121464 = queryNorm
                0.39571697 = fieldWeight in 86, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.046875 = fieldNorm(doc=86)
          0.5 = coord(1/2)
      0.25 = coord(1/4)
    
    Abstract
    Under the auspices of the Desire II project, researchers at NetLab and OCLC are providing searching and browsing of a test collection of engineering documents on the Web. The goal of the project is to explore simple methods of automatic classification to provide subject browsing of a robot-generated engineering index. At NetLab the documents are automatically classified and organized using an engineering-specific scheme, the Engineering Index (Ei) Thesaurus and Classification; at OCLC the Dewey Decimal Classification (DDC), a general knowledge organization scheme, is being used
  4. Ardö, A.; Koch, T.: Wide-area information server (WAIS) as the hub of an electronic library service at Lund University (1993) 0.01
    0.009946881 = product of:
      0.039787523 = sum of:
        0.039787523 = weight(_text_:library in 8459) [ClassicSimilarity], result of:
          0.039787523 = score(doc=8459,freq=6.0), product of:
            0.1317883 = queryWeight, product of:
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.050121464 = queryNorm
            0.30190483 = fieldWeight in 8459, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.6293786 = idf(docFreq=8668, maxDocs=44218)
              0.046875 = fieldNorm(doc=8459)
      0.25 = coord(1/4)
    
    Abstract
    Electronic information sources are being collected at the Lund University Library within the areas of computer science, Internet use and environmental studies. Within each area there are several different types of sources e.g. the environment area has bibliographic information, journal content pages, a local directory database on environmental related research projects and an archive of articles from relevant electronic conferences. A seed-bank database is planned in collaboration with the Nordic Gene Bank. The popularity of the wide area information server is growing and there are several hundred available sources today, however, improvements need to be done with the possibilities to select sources and in the search and relevance-ranking algorithms
    Source
    Opportunity 2000: understanding and serving users in an electronic library; 15th Int. Symp., 12.-15.10.1992; Festschrift in honour of Herbert S. White. Ed.: A.H. Helal
  5. Koch, T.; Ardö, A.; Brümmer, A.: ¬The building and maintenance of robot based internet search services : A review of current indexing and data collection methods. Prepared to meet the requirements of Work Package 3 of EU Telematics for Research, project DESIRE. Version D3.11v0.3 (Draft version 3) (1996) 0.00
    0.0049331645 = product of:
      0.019732658 = sum of:
        0.019732658 = product of:
          0.039465316 = sum of:
            0.039465316 = weight(_text_:project in 1669) [ClassicSimilarity], result of:
              0.039465316 = score(doc=1669,freq=2.0), product of:
                0.21156175 = queryWeight, product of:
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.050121464 = queryNorm
                0.18654276 = fieldWeight in 1669, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.220981 = idf(docFreq=1764, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1669)
          0.5 = coord(1/2)
      0.25 = coord(1/4)