Document (#4892)

Author
Haas, S.
He, S.
Title
Toward the automatic identification of sublanguage vocabulary
Source
Information processing and management. 29(1993) no.6, S.721-744
Year
1993
Abstract
Describes a method developed for automatic identification of sublanguage vocabulary words as they occur in abstracts. Describes the sublanguage vocabulary identification procedures using abstracts from computer science and library and information science as sublanguage sources. Evaluates the results using three criteria. Discuss the practical and theoretical significance of this research and plans for further experiments
Theme
Automatisches Indexieren

Similar documents (author)

  1. Haas, S.W.: ¬A feasibility study of the case hierarchy model for the construction and porting of natural language interfaces (1990) 5.48
    5.4764457 = sum of:
      5.4764457 = weight(author_txt:haas in 71) [ClassicSimilarity], result of:
        5.4764457 = fieldWeight in 71, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.762313 = idf(docFreq=17, maxDocs=42306)
          0.625 = fieldNorm(doc=71)
    
  2. Haas, S.W.: Disciplinary variation in automatic sublanguage term identification (1997) 5.48
    5.4764457 = sum of:
      5.4764457 = weight(author_txt:haas in 6569) [ClassicSimilarity], result of:
        5.4764457 = fieldWeight in 6569, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.762313 = idf(docFreq=17, maxDocs=42306)
          0.625 = fieldNorm(doc=6569)
    
  3. Haas, S.W.: ¬A text filter for the automatic identification of empirical articles (1996) 5.48
    5.4764457 = sum of:
      5.4764457 = weight(author_txt:haas in 6867) [ClassicSimilarity], result of:
        5.4764457 = fieldWeight in 6867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.762313 = idf(docFreq=17, maxDocs=42306)
          0.625 = fieldNorm(doc=6867)
    
  4. Haas, S.W.: Natural language processing : toward large-scale, robust systems (1996) 5.48
    5.4764457 = sum of:
      5.4764457 = weight(author_txt:haas in 485) [ClassicSimilarity], result of:
        5.4764457 = fieldWeight in 485, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.762313 = idf(docFreq=17, maxDocs=42306)
          0.625 = fieldNorm(doc=485)
    
  5. Haas, S.: Metadata mania : an overview (1998) 5.48
    5.4764457 = sum of:
      5.4764457 = weight(author_txt:haas in 3223) [ClassicSimilarity], result of:
        5.4764457 = fieldWeight in 3223, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.762313 = idf(docFreq=17, maxDocs=42306)
          0.625 = fieldNorm(doc=3223)
    

Similar documents (content)

  1. Haas, S.W.: Disciplinary variation in automatic sublanguage term identification (1997) 0.30
    0.30341563 = sum of:
      0.30341563 = product of:
        0.8428212 = sum of:
          0.019034684 = weight(abstract_txt:method in 6569) [ClassicSimilarity], result of:
            0.019034684 = score(doc=6569,freq=2.0), product of:
              0.04749033 = queryWeight, product of:
                1.051676 = boost
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.009958128 = queryNorm
              0.4008118 = fieldWeight in 6569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.534668 = idf(docFreq=1233, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.016357828 = weight(abstract_txt:practical in 6569) [ClassicSimilarity], result of:
            0.016357828 = score(doc=6569,freq=1.0), product of:
              0.054083962 = queryWeight, product of:
                1.1223121 = boost
                4.8392396 = idf(docFreq=909, maxDocs=42306)
                0.009958128 = queryNorm
              0.30245247 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8392396 = idf(docFreq=909, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.017321521 = weight(abstract_txt:theoretical in 6569) [ClassicSimilarity], result of:
            0.017321521 = score(doc=6569,freq=1.0), product of:
              0.05618781 = queryWeight, product of:
                1.1439326 = boost
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.009958128 = queryNorm
              0.308279 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.043627873 = weight(abstract_txt:occur in 6569) [ClassicSimilarity], result of:
            0.043627873 = score(doc=6569,freq=1.0), product of:
              0.104014546 = queryWeight, product of:
                1.5564187 = boost
                6.711042 = idf(docFreq=139, maxDocs=42306)
                0.009958128 = queryNorm
              0.41944012 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.711042 = idf(docFreq=139, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.01579047 = weight(abstract_txt:describes in 6569) [ClassicSimilarity], result of:
            0.01579047 = score(doc=6569,freq=1.0), product of:
              0.06655665 = queryWeight, product of:
                1.7607192 = boost
                3.7959774 = idf(docFreq=2582, maxDocs=42306)
                0.009958128 = queryNorm
              0.23724858 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7959774 = idf(docFreq=2582, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.017469693 = weight(abstract_txt:science in 6569) [ClassicSimilarity], result of:
            0.017469693 = score(doc=6569,freq=1.0), product of:
              0.07119534 = queryWeight, product of:
                1.8210429 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.009958128 = queryNorm
              0.24537691 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.060981594 = weight(abstract_txt:abstracts in 6569) [ClassicSimilarity], result of:
            0.060981594 = score(doc=6569,freq=1.0), product of:
              0.16383018 = queryWeight, product of:
                2.7624304 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.009958128 = queryNorm
              0.37222442 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.124182716 = weight(abstract_txt:identification in 6569) [ClassicSimilarity], result of:
            0.124182716 = score(doc=6569,freq=2.0), product of:
              0.23914203 = queryWeight, product of:
                4.087596 = boost
                5.875032 = idf(docFreq=322, maxDocs=42306)
                0.009958128 = queryNorm
              0.51928437 = fieldWeight in 6569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.875032 = idf(docFreq=322, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
          0.5280548 = weight(abstract_txt:sublanguage in 6569) [ClassicSimilarity], result of:
            0.5280548 = score(doc=6569,freq=1.0), product of:
              0.8704103 = queryWeight, product of:
                9.004745 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.009958128 = queryNorm
              0.60667336 = fieldWeight in 6569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.0625 = fieldNorm(doc=6569)
        0.36 = coord(9/25)
    
  2. Losee, R.M.; Haas, S.W.: Sublanguage terms : dictionaries, usage, and automatic classification (1995) 0.28
    0.2812938 = sum of:
      0.2812938 = product of:
        1.7580863 = sum of:
          0.018356029 = weight(abstract_txt:using in 2719) [ClassicSimilarity], result of:
            0.018356029 = score(doc=2719,freq=1.0), product of:
              0.05615473 = queryWeight, product of:
                1.6172887 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.009958128 = queryNorm
              0.32688302 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.09375 = fieldNorm(doc=2719)
          0.02620454 = weight(abstract_txt:science in 2719) [ClassicSimilarity], result of:
            0.02620454 = score(doc=2719,freq=1.0), product of:
              0.07119534 = queryWeight, product of:
                1.8210429 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.009958128 = queryNorm
              0.36806536 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.09375 = fieldNorm(doc=2719)
          0.12936148 = weight(abstract_txt:abstracts in 2719) [ClassicSimilarity], result of:
            0.12936148 = score(doc=2719,freq=2.0), product of:
              0.16383018 = queryWeight, product of:
                2.7624304 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.009958128 = queryNorm
              0.78960717 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.09375 = fieldNorm(doc=2719)
          1.5841643 = weight(abstract_txt:sublanguage in 2719) [ClassicSimilarity], result of:
            1.5841643 = score(doc=2719,freq=4.0), product of:
              0.8704103 = queryWeight, product of:
                9.004745 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.009958128 = queryNorm
              1.8200201 = fieldWeight in 2719, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.09375 = fieldNorm(doc=2719)
        0.16 = coord(4/25)
    
  3. Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.09
    0.090968095 = sum of:
      0.090968095 = product of:
        1.1371012 = sum of:
          0.08099165 = weight(abstract_txt:automatic in 4778) [ClassicSimilarity], result of:
            0.08099165 = score(doc=4778,freq=1.0), product of:
              0.12470051 = queryWeight, product of:
                2.4100635 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.009958128 = queryNorm
              0.64948934 = fieldWeight in 4778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.125 = fieldNorm(doc=4778)
          1.0561095 = weight(abstract_txt:sublanguage in 4778) [ClassicSimilarity], result of:
            1.0561095 = score(doc=4778,freq=1.0), product of:
              0.8704103 = queryWeight, product of:
                9.004745 = boost
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.009958128 = queryNorm
              1.2133467 = fieldWeight in 4778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.706774 = idf(docFreq=6, maxDocs=42306)
                0.125 = fieldNorm(doc=4778)
        0.08 = coord(2/25)
    
  4. Salton, G.: Automatic processing of foreign language documents (1985) 0.09
    0.090216465 = sum of:
      0.090216465 = product of:
        0.22554116 = sum of:
          0.008678527 = weight(abstract_txt:computer in 4651) [ClassicSimilarity], result of:
            0.008678527 = score(doc=4651,freq=1.0), product of:
              0.04293794 = queryWeight, product of:
                4.3118486 = idf(docFreq=1541, maxDocs=42306)
                0.009958128 = queryNorm
              0.2021179 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3118486 = idf(docFreq=1541, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.01147797 = weight(abstract_txt:further in 4651) [ClassicSimilarity], result of:
            0.01147797 = score(doc=4651,freq=1.0), product of:
              0.051735334 = queryWeight, product of:
                1.097673 = boost
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.009958128 = queryNorm
              0.2218594 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7330003 = idf(docFreq=1011, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.012268372 = weight(abstract_txt:practical in 4651) [ClassicSimilarity], result of:
            0.012268372 = score(doc=4651,freq=1.0), product of:
              0.054083962 = queryWeight, product of:
                1.1223121 = boost
                4.8392396 = idf(docFreq=909, maxDocs=42306)
                0.009958128 = queryNorm
              0.22683936 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8392396 = idf(docFreq=909, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.012991141 = weight(abstract_txt:theoretical in 4651) [ClassicSimilarity], result of:
            0.012991141 = score(doc=4651,freq=1.0), product of:
              0.05618781 = queryWeight, product of:
                1.1439326 = boost
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.009958128 = queryNorm
              0.23120925 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.932464 = idf(docFreq=828, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.016680954 = weight(abstract_txt:words in 4651) [ClassicSimilarity], result of:
            0.016680954 = score(doc=4651,freq=1.0), product of:
              0.06637805 = queryWeight, product of:
                1.2433449 = boost
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.009958128 = queryNorm
              0.25130227 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.018898804 = weight(abstract_txt:criteria in 4651) [ClassicSimilarity], result of:
            0.018898804 = score(doc=4651,freq=1.0), product of:
              0.07213844 = queryWeight, product of:
                1.2961724 = boost
                5.588899 = idf(docFreq=429, maxDocs=42306)
                0.009958128 = queryNorm
              0.26197964 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.588899 = idf(docFreq=429, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.012979671 = weight(abstract_txt:using in 4651) [ClassicSimilarity], result of:
            0.012979671 = score(doc=4651,freq=2.0), product of:
              0.05615473 = queryWeight, product of:
                1.6172887 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.009958128 = queryNorm
              0.2311412 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.01310227 = weight(abstract_txt:science in 4651) [ClassicSimilarity], result of:
            0.01310227 = score(doc=4651,freq=1.0), product of:
              0.07119534 = queryWeight, product of:
                1.8210429 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.009958128 = queryNorm
              0.18403268 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.052605618 = weight(abstract_txt:automatic in 4651) [ClassicSimilarity], result of:
            0.052605618 = score(doc=4651,freq=3.0), product of:
              0.12470051 = queryWeight, product of:
                2.4100635 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.009958128 = queryNorm
              0.4218557 = fieldWeight in 4651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
          0.06585783 = weight(abstract_txt:identification in 4651) [ClassicSimilarity], result of:
            0.06585783 = score(doc=4651,freq=1.0), product of:
              0.23914203 = queryWeight, product of:
                4.087596 = boost
                5.875032 = idf(docFreq=322, maxDocs=42306)
                0.009958128 = queryNorm
              0.27539212 = fieldWeight in 4651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.875032 = idf(docFreq=322, maxDocs=42306)
                0.046875 = fieldNorm(doc=4651)
        0.4 = coord(10/25)
    
  5. Hmeidi, I.; Kanaan, G.; Evens, M.: Design and implementation of automatic indexing for information retrieval with Arabic documents (1997) 0.08
    0.082891166 = sum of:
      0.082891166 = product of:
        0.29603988 = sum of:
          0.014464211 = weight(abstract_txt:computer in 2661) [ClassicSimilarity], result of:
            0.014464211 = score(doc=2661,freq=1.0), product of:
              0.04293794 = queryWeight, product of:
                4.3118486 = idf(docFreq=1541, maxDocs=42306)
                0.009958128 = queryNorm
              0.33686316 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3118486 = idf(docFreq=1541, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.02780159 = weight(abstract_txt:words in 2661) [ClassicSimilarity], result of:
            0.02780159 = score(doc=2661,freq=1.0), product of:
              0.06637805 = queryWeight, product of:
                1.2433449 = boost
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.009958128 = queryNorm
              0.4188371 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.361115 = idf(docFreq=539, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.027975775 = weight(abstract_txt:experiments in 2661) [ClassicSimilarity], result of:
            0.027975775 = score(doc=2661,freq=1.0), product of:
              0.06665501 = queryWeight, product of:
                1.2459362 = boost
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.009958128 = queryNorm
              0.41971 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.372288 = idf(docFreq=533, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.026494645 = weight(abstract_txt:using in 2661) [ClassicSimilarity], result of:
            0.026494645 = score(doc=2661,freq=3.0), product of:
              0.05615473 = queryWeight, product of:
                1.6172887 = boost
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.009958128 = queryNorm
              0.471815 = fieldWeight in 2661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.486752 = idf(docFreq=3518, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.021837117 = weight(abstract_txt:science in 2661) [ClassicSimilarity], result of:
            0.021837117 = score(doc=2661,freq=1.0), product of:
              0.07119534 = queryWeight, product of:
                1.8210429 = boost
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.009958128 = queryNorm
              0.30672115 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9260306 = idf(docFreq=2267, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.10123957 = weight(abstract_txt:automatic in 2661) [ClassicSimilarity], result of:
            0.10123957 = score(doc=2661,freq=4.0), product of:
              0.12470051 = queryWeight, product of:
                2.4100635 = boost
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.009958128 = queryNorm
              0.8118617 = fieldWeight in 2661, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1959147 = idf(docFreq=636, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
          0.076226994 = weight(abstract_txt:abstracts in 2661) [ClassicSimilarity], result of:
            0.076226994 = score(doc=2661,freq=1.0), product of:
              0.16383018 = queryWeight, product of:
                2.7624304 = boost
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.009958128 = queryNorm
              0.46528053 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9555907 = idf(docFreq=297, maxDocs=42306)
                0.078125 = fieldNorm(doc=2661)
        0.28 = coord(7/25)