Document (#24553)

Author
Cortez, E.M.
Title
Planning and implementing a high performance knowledge base
Source
Knowledge: creation, organization and use. Proceedings of the 62nd Annual Meeting of the American Society for Information Science, 31 October-4 November 1999. Ed.: L. Woods
Imprint
Medford, NJ : Information Today
Year
1999
Pages
pp. 161-171
Series
Proceedings of the American Society for Information Science; vol.36
Abstract
This paper discusses the conceptual framework for developing a rapid-prototype, high-performance knowledge base for the four mission agencies of the U.S. Department of Agriculture and their university partners. These agencies are the Cooperative State Research, Education, and Extension Service (CSREES), the Agricultural Research Service (ARS), the Economic Research Service (ERS), and the National Agricultural Statistics Service (NASS). The knowledge base, known as REEIS (Research, Education, and Economics Information System), is a data mining application: data are extracted from text and converted into different formats, after which the data mining features run inferences, visualize connections, and so on, all generated automatically. The paper describes alternative data mining models along with the generalized approach to building a warehouse architecture. It also describes the methodology used for translating system requirements into specifications and for building the REEIS prototype. The method, known as the "Rational Unified Process", is one of iteration, quality assessment, and visual modeling. The two major obstacles in the project were the normalization of disparate data repositories and the achievement of an acceptable level of semantic interoperability. A metadata vocabulary model is presented to address these obstacles.

Similar documents (author)

  1. Cortez, E.M.: Use of metadata vocabularies in data retrieval (1999) 5.99
    5.989656 = sum of:
      5.989656 = weight(author_txt:cortez in 5058) [ClassicSimilarity], result of:
        5.989656 = fieldWeight in 5058, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.625 = fieldNorm(doc=5058)
    
  2. Cortez, E.; Smorch, T.: Planning second generation automated library systems (1993) 4.79
    4.7917247 = sum of:
      4.7917247 = weight(author_txt:cortez in 7491) [ClassicSimilarity], result of:
        4.7917247 = fieldWeight in 7491, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.5 = fieldNorm(doc=7491)
    
  3. Cortez, E.; Rice, R.: An investigation into the role of public libraries with online reference service (1994) 4.79
    4.7917247 = sum of:
      4.7917247 = weight(author_txt:cortez in 2580) [ClassicSimilarity], result of:
        4.7917247 = fieldWeight in 2580, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.5 = fieldNorm(doc=2580)
    
  4. Cortez, E.M.; Park, S.C.; Kim, S.: The hybrid application of an inductive learning method and a neural network (1995) 3.59
    3.5937934 = sum of:
      3.5937934 = weight(author_txt:cortez in 3107) [ClassicSimilarity], result of:
        3.5937934 = fieldWeight in 3107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.375 = fieldNorm(doc=3107)
    
  5. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: A flexible approach for extracting metadata from bibliographic citations (2009) 2.40
    2.3958623 = sum of:
      2.3958623 = weight(author_txt:cortez in 4849) [ClassicSimilarity], result of:
        2.3958623 = fieldWeight in 4849, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.583449 = idf(docFreq=7, maxDocs=42740)
          0.25 = fieldNorm(doc=4849)
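
The author scores above are each a single fieldWeight, the product of tf, idf, and fieldNorm. The sketch below reproduces that arithmetic for the first entry. The formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))) are an assumption chosen because they match the numbers printed in this record; the function names are illustrative and not part of any library.

```python
import math


def classic_tf(freq):
    # Term-frequency factor: square root of the raw count (assumed formula).
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Inverse document frequency (assumed formula that matches the
    # idf values shown in the explanation trees of this record).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the trees above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from entry 1 (doc=5058): freq=1.0, docFreq=7,
# maxDocs=42740, fieldNorm=0.625.
author_score = field_weight(1.0, 7, 42740, 0.625)
print(f"{author_score:.2f}")  # 5.99
```

Since tf and idf are identical for all five entries, the ranking among them is decided entirely by fieldNorm (0.625, 0.5, 0.5, 0.375, 0.25).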
    

Similar documents (content)

  1. Chen, J.: Artificial intelligence (2009) 0.13
    0.13481043 = sum of:
      0.13481043 = product of:
        0.48146585 = sum of:
          0.056779683 = weight(abstract_txt:education in 748) [ClassicSimilarity], result of:
            0.056779683 = score(doc=748,freq=1.0), product of:
              0.14117227 = queryWeight, product of:
                1.3604139 = boost
                5.1481776 = idf(docFreq=674, maxDocs=42740)
                0.02015695 = queryNorm
              0.40220138 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1481776 = idf(docFreq=674, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.05000134 = weight(abstract_txt:knowledge in 748) [ClassicSimilarity], result of:
            0.05000134 = score(doc=748,freq=3.0), product of:
              0.10294341 = queryWeight, product of:
                1.4227916 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.02015695 = queryNorm
              0.48571676 = fieldWeight in 748, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.06722539 = weight(abstract_txt:building in 748) [ClassicSimilarity], result of:
            0.06722539 = score(doc=748,freq=1.0), product of:
              0.15799487 = queryWeight, product of:
                1.4391891 = boost
                5.4462843 = idf(docFreq=500, maxDocs=42740)
                0.02015695 = queryNorm
              0.42549098 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4462843 = idf(docFreq=500, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.03907557 = weight(abstract_txt:research in 748) [ClassicSimilarity], result of:
            0.03907557 = score(doc=748,freq=2.0), product of:
              0.11004179 = queryWeight, product of:
                1.6985964 = boost
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.02015695 = queryNorm
              0.35509753 = fieldWeight in 748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.039881863 = weight(abstract_txt:data in 748) [ClassicSimilarity], result of:
            0.039881863 = score(doc=748,freq=1.0), product of:
              0.15139715 = queryWeight, product of:
                2.2275386 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02015695 = queryNorm
              0.26342544 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.07919217 = weight(abstract_txt:service in 748) [ClassicSimilarity], result of:
            0.07919217 = score(doc=748,freq=1.0), product of:
              0.22203371 = queryWeight, product of:
                2.4127972 = boost
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.02015695 = queryNorm
              0.3566673 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
          0.14930984 = weight(abstract_txt:mining in 748) [ClassicSimilarity], result of:
            0.14930984 = score(doc=748,freq=1.0), product of:
              0.3078767 = queryWeight, product of:
                2.46054 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.02015695 = queryNorm
              0.48496637 = fieldWeight in 748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.078125 = fieldNorm(doc=748)
        0.28 = coord(7/25)
    
  2. Chen, R.-S.; Hu, Y.-C.: A novel method for discovering Fuzzy sequential patterns using the simple Fuzzy partition method (2003) 0.13
    0.12672965 = sum of:
      0.12672965 = product of:
        0.45260587 = sum of:
          0.057311088 = weight(abstract_txt:described in 2615) [ClassicSimilarity], result of:
            0.057311088 = score(doc=2615,freq=2.0), product of:
              0.13083076 = queryWeight, product of:
                1.3096381 = boost
                4.956028 = idf(docFreq=817, maxDocs=42740)
                0.02015695 = queryNorm
              0.43805513 = fieldWeight in 2615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.956028 = idf(docFreq=817, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.03266074 = weight(abstract_txt:knowledge in 2615) [ClassicSimilarity], result of:
            0.03266074 = score(doc=2615,freq=2.0), product of:
              0.10294341 = queryWeight, product of:
                1.4227916 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.02015695 = queryNorm
              0.31726888 = fieldWeight in 2615, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.05378031 = weight(abstract_txt:building in 2615) [ClassicSimilarity], result of:
            0.05378031 = score(doc=2615,freq=1.0), product of:
              0.15799487 = queryWeight, product of:
                1.4391891 = boost
                5.4462843 = idf(docFreq=500, maxDocs=42740)
                0.02015695 = queryNorm
              0.34039277 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4462843 = idf(docFreq=500, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.068510786 = weight(abstract_txt:prototype in 2615) [ClassicSimilarity], result of:
            0.068510786 = score(doc=2615,freq=1.0), product of:
              0.18566644 = queryWeight, product of:
                1.5601382 = boost
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.02015695 = queryNorm
              0.3689993 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.031905487 = weight(abstract_txt:data in 2615) [ClassicSimilarity], result of:
            0.031905487 = score(doc=2615,freq=1.0), product of:
              0.15139715 = queryWeight, product of:
                2.2275386 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02015695 = queryNorm
              0.21074034 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.08898958 = weight(abstract_txt:base in 2615) [ClassicSimilarity], result of:
            0.08898958 = score(doc=2615,freq=1.0), product of:
              0.25301754 = queryWeight, product of:
                2.2305775 = boost
                5.627409 = idf(docFreq=417, maxDocs=42740)
                0.02015695 = queryNorm
              0.35171306 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.627409 = idf(docFreq=417, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
          0.11944788 = weight(abstract_txt:mining in 2615) [ClassicSimilarity], result of:
            0.11944788 = score(doc=2615,freq=1.0), product of:
              0.3078767 = queryWeight, product of:
                2.46054 = boost
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.02015695 = queryNorm
              0.3879731 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2075696 = idf(docFreq=233, maxDocs=42740)
                0.0625 = fieldNorm(doc=2615)
        0.28 = coord(7/25)
    
  3. Cortez, E.M.: Use of metadata vocabularies in data retrieval (1999) 0.12
    0.12465979 = sum of:
      0.12465979 = product of:
        0.62329894 = sum of:
          0.06813562 = weight(abstract_txt:education in 5058) [ClassicSimilarity], result of:
            0.06813562 = score(doc=5058,freq=1.0), product of:
              0.14117227 = queryWeight, product of:
                1.3604139 = boost
                5.1481776 = idf(docFreq=674, maxDocs=42740)
                0.02015695 = queryNorm
              0.48264164 = fieldWeight in 5058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1481776 = idf(docFreq=674, maxDocs=42740)
                0.09375 = fieldNorm(doc=5058)
          0.102766186 = weight(abstract_txt:prototype in 5058) [ClassicSimilarity], result of:
            0.102766186 = score(doc=5058,freq=1.0), product of:
              0.18566644 = queryWeight, product of:
                1.5601382 = boost
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.02015695 = queryNorm
              0.553499 = fieldWeight in 5058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.903989 = idf(docFreq=316, maxDocs=42740)
                0.09375 = fieldNorm(doc=5058)
          0.046890683 = weight(abstract_txt:research in 5058) [ClassicSimilarity], result of:
            0.046890683 = score(doc=5058,freq=2.0), product of:
              0.11004179 = queryWeight, product of:
                1.6985964 = boost
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.02015695 = queryNorm
              0.42611706 = fieldWeight in 5058, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2139761 = idf(docFreq=4669, maxDocs=42740)
                0.09375 = fieldNorm(doc=5058)
          0.23813553 = weight(abstract_txt:agriculture in 5058) [ClassicSimilarity], result of:
            0.23813553 = score(doc=5058,freq=1.0), product of:
              0.32512426 = queryWeight, product of:
                2.0645294 = boost
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.02015695 = queryNorm
              0.73244464 = fieldWeight in 5058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.09375 = fieldNorm(doc=5058)
          0.16737093 = weight(abstract_txt:economic in 5058) [ClassicSimilarity], result of:
            0.16737093 = score(doc=5058,freq=1.0), product of:
              0.29420522 = queryWeight, product of:
                2.4052885 = boost
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.02015695 = queryNorm
              0.56889176 = fieldWeight in 5058, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.09375 = fieldNorm(doc=5058)
        0.2 = coord(5/25)
    
  4. Campbell, D.G.: Global abstractions : the Classification of International Economic Data for bibliographic and statistical purposes (2003) 0.11
    0.11430745 = sum of:
      0.11430745 = product of:
        0.7144216 = sum of:
          0.09842315 = weight(abstract_txt:agricultural in 524) [ClassicSimilarity], result of:
            0.09842315 = score(doc=524,freq=1.0), product of:
              0.1616872 = queryWeight, product of:
                1.029483 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.02015695 = queryNorm
              0.6087257 = fieldWeight in 524, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.078125 = fieldNorm(doc=524)
          0.2806454 = weight(abstract_txt:agriculture in 524) [ClassicSimilarity], result of:
            0.2806454 = score(doc=524,freq=2.0), product of:
              0.32512426 = queryWeight, product of:
                2.0645294 = boost
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.02015695 = queryNorm
              0.8631942 = fieldWeight in 524, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.078125 = fieldNorm(doc=524)
          0.056401465 = weight(abstract_txt:data in 524) [ClassicSimilarity], result of:
            0.056401465 = score(doc=524,freq=2.0), product of:
              0.15139715 = queryWeight, product of:
                2.2275386 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02015695 = queryNorm
              0.3725398 = fieldWeight in 524, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.078125 = fieldNorm(doc=524)
          0.27895156 = weight(abstract_txt:economic in 524) [ClassicSimilarity], result of:
            0.27895156 = score(doc=524,freq=4.0), product of:
              0.29420522 = queryWeight, product of:
                2.4052885 = boost
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.02015695 = queryNorm
              0.948153 = fieldWeight in 524, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.068179 = idf(docFreq=268, maxDocs=42740)
                0.078125 = fieldNorm(doc=524)
        0.16 = coord(4/25)
    
  5. Liang, A.C.; Sini, M.: Mapping AGROVOC and the Chinese Agricultural Thesaurus : definitions, tools, procedures (2006) 0.11
    0.11042428 = sum of:
      0.11042428 = product of:
        0.69015175 = sum of:
          0.28930384 = weight(abstract_txt:agricultural in 708) [ClassicSimilarity], result of:
            0.28930384 = score(doc=708,freq=6.0), product of:
              0.1616872 = queryWeight, product of:
                1.029483 = boost
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.02015695 = queryNorm
              1.7892811 = fieldWeight in 708, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7916894 = idf(docFreq=47, maxDocs=42740)
                0.09375 = fieldNorm(doc=708)
          0.23813553 = weight(abstract_txt:agriculture in 708) [ClassicSimilarity], result of:
            0.23813553 = score(doc=708,freq=1.0), product of:
              0.32512426 = queryWeight, product of:
                2.0645294 = boost
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.02015695 = queryNorm
              0.73244464 = fieldWeight in 708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8127427 = idf(docFreq=46, maxDocs=42740)
                0.09375 = fieldNorm(doc=708)
          0.06768176 = weight(abstract_txt:data in 708) [ClassicSimilarity], result of:
            0.06768176 = score(doc=708,freq=2.0), product of:
              0.15139715 = queryWeight, product of:
                2.2275386 = boost
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.02015695 = queryNorm
              0.44704777 = fieldWeight in 708, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3718455 = idf(docFreq=3987, maxDocs=42740)
                0.09375 = fieldNorm(doc=708)
          0.09503059 = weight(abstract_txt:service in 708) [ClassicSimilarity], result of:
            0.09503059 = score(doc=708,freq=1.0), product of:
              0.22203371 = queryWeight, product of:
                2.4127972 = boost
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.02015695 = queryNorm
              0.42800075 = fieldWeight in 708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5653415 = idf(docFreq=1208, maxDocs=42740)
                0.09375 = fieldNorm(doc=708)
        0.16 = coord(4/25)
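
The content scores above combine several matching terms: each term contributes queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum is scaled by coord, the fraction of query terms that matched. The sketch below recomputes entry 5 (doc=708) from the values printed in its tree. The tf and idf formulas are assumptions that happen to reproduce the listed numbers, and all function and variable names are hypothetical.

```python
import math

MAX_DOCS = 42740         # maxDocs, as printed in the trees
QUERY_NORM = 0.02015695  # queryNorm, as printed in the trees


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf formula; it matches the idf values shown in this record.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(boost, freq, doc_freq, field_norm, query_norm=QUERY_NORM):
    # score = queryWeight * fieldWeight for one matching term.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    # Sum the per-term scores, then apply coord = matched / total.
    raw = sum(term_score(b, f, df, field_norm) for b, f, df in terms)
    return raw * (matched / total)


# (boost, termFreq, docFreq) for the four terms matched in doc=708.
liang_sini_terms = [
    (1.029483, 6.0, 47),     # agricultural
    (2.0645294, 1.0, 46),    # agriculture
    (2.2275386, 2.0, 3987),  # data
    (2.4127972, 1.0, 1208),  # service
]
score = document_score(liang_sini_terms, field_norm=0.09375, matched=4, total=25)
print(f"{score:.2f}")  # 0.11
```

The coord factor explains why entries with high per-term scores can still rank low: only 4 of the 25 query terms match doc=708, so its summed score is multiplied by 0.16.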