Search (8 results, page 1 of 1)

Tsuji, K.; Kageura, K.: Automatic generation of Japanese-English bilingual thesauri based on bilingual corpora (2006) 0.02
```
0.023268566 = sum of:
  0.021536238 = product of:
    0.086144954 = sum of:
      0.086144954 = weight(_text_:authors in 5061) [ClassicSimilarity], result of:
        0.086144954 = score(doc=5061,freq=4.0), product of:
          0.2418733 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.053056188 = queryNorm
          0.35615736 = fieldWeight in 5061, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5061)
    0.25 = coord(1/4)
  0.0017323275 = product of:
    0.003464655 = sum of:
      0.003464655 = weight(_text_:s in 5061) [ClassicSimilarity], result of:
        0.003464655 = score(doc=5061,freq=2.0), product of:
          0.057684682 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.053056188 = queryNorm
          0.060061958 = fieldWeight in 5061, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5061)
    0.5 = coord(1/2)
```
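The score tree above can be re-derived by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and are not part of the search system. All numeric inputs are copied from the explanation for doc 5061.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # ClassicSimilarity inverse document frequency (assumed formula).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    # score = queryWeight * fieldWeight, as in the explain tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Values taken from the explain output for doc 5061.
MAX_DOCS = 44218
QUERY_NORM = 0.053056188
FIELD_NORM = 0.0390625

authors_score = term_score(4.0, 1258, MAX_DOCS, QUERY_NORM, FIELD_NORM)
s_score = term_score(2.0, 40523, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Each clause is scaled by its coord factor before summing.
total = authors_score * 0.25 + s_score * 0.5
print(f"{total:.9f}")
```

Lucene computes these values in 32-bit floats, so the result may differ from the listed 0.023268566 in the last digits.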
Abstract

The authors propose a method for automatically generating Japanese-English bilingual thesauri based on bilingual corpora. The term bilingual thesaurus refers to a set of bilingual equivalent words and their synonyms. Most of the methods proposed so far for extracting bilingual equivalent word clusters from bilingual corpora depend heavily on word frequency and are not effective for dealing with low-frequency clusters. These low-frequency bilingual clusters are worth extracting because they contain many newly coined terms that are in demand but are not listed in existing bilingual thesauri. Assuming that single language-pair-independent methods such as frequency-based ones have reached their limitations and that a language-pair-dependent method used in combination with other methods shows promise, the authors propose the following approach: (a) Extract translation pairs based on transliteration patterns; (b) remove the pairs from among the candidate words; (c) extract translation pairs based on word frequency from the remaining candidate words; and (d) generate bilingual clusters based on the extracted pairs using a graph-theoretic method. The proposed method has been found to be significantly more effective than other methods.
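Step (d) of the abstract can be illustrated with a small sketch. The abstract does not say which graph-theoretic method is used, so this example assumes the simplest one: treating each translation pair as an edge and taking connected components as clusters. The function name and the word pairs are made up for illustration.

```python
from collections import defaultdict

def bilingual_clusters(pairs):
    """Group (japanese, english) pairs into clusters via connected components.

    Assumption: connected components stand in for the unspecified
    graph-theoretic method mentioned in the abstract.
    """
    graph = defaultdict(set)
    for ja, en in pairs:
        # Tag nodes by language so identical strings never collide.
        ja_node, en_node = ("ja", ja), ("en", en)
        graph[ja_node].add(en_node)
        graph[en_node].add(ja_node)

    seen = set()
    clusters = []
    for start in graph:
        if start in seen:
            continue
        # Depth-first traversal collects one component.
        stack, component = [start], set()
        seen.add(start)
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        clusters.append(component)
    return clusters

# Toy pairs (hypothetical, not taken from the paper).
toy_pairs = [
    ("コンピュータ", "computer"),
    ("計算機", "computer"),
    ("計算機", "calculator"),
    ("辞書", "dictionary"),
]
toy_clusters = bilingual_clusters(toy_pairs)
```

With these toy pairs the three pairs that share a word fall into one cluster, and the unrelated pair forms its own.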

Source

Journal of the American Society for Information Science and Technology. 57(2006) no.7, S.891-906
Yoshikane, F.; Kageura, K.; Tsuji, K.: A method for the comparative analysis of concentration of author productivity, giving consideration to the effect of sample size dependency of statistical measures (2003) 0.02
```
0.023268566 = sum of:
  0.021536238 = product of:
    0.086144954 = sum of:
      0.086144954 = weight(_text_:authors in 5123) [ClassicSimilarity], result of:
        0.086144954 = score(doc=5123,freq=4.0), product of:
          0.2418733 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.053056188 = queryNorm
          0.35615736 = fieldWeight in 5123, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5123)
    0.25 = coord(1/4)
  0.0017323275 = product of:
    0.003464655 = sum of:
      0.003464655 = weight(_text_:s in 5123) [ClassicSimilarity], result of:
        0.003464655 = score(doc=5123,freq=2.0), product of:
          0.057684682 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.053056188 = queryNorm
          0.060061958 = fieldWeight in 5123, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.0390625 = fieldNorm(doc=5123)
    0.5 = coord(1/2)
```
Abstract

Studies of the concentration of author productivity based upon counts of papers by individual authors will produce measures that change systematically with sample size. Yoshikane, Kageura, and Tsuji seek a statistical framework which will avoid this scale effect problem. Using the number of authors in a field as an absolute concentration measure, and Gini's index as a relative concentration measure, they describe four literatures from both viewpoints with measures insensitive to one another. Both measures will increase with sample size. They then plot profiles of the two measures on the basis of a Monte-Carlo simulation of 1000 trials for 20 equally spaced intervals and compare the characteristics of the literatures. Using data from conferences hosted by four academic societies between 1992 and 1997, they find a coefficient of loss exceeding 0.15, indicating that the measures will depend highly on sample size. The simulation shows that a larger sample size leads to lower absolute concentration and higher relative concentration. Comparisons made at the same sample size present quite different results from the original data and allow direct comparison of population characteristics.
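The two measures and the sampling profile described in this abstract can be sketched as follows. This is a rough illustration, not the authors' procedure: it assumes that the data are a flat list of authorship tokens (one author identifier per paper credit) and that each trial draws tokens without replacement. The names `gini` and `profile` are hypothetical.

```python
import random
from collections import Counter

def gini(counts):
    """Gini index of a collection of non-negative counts (0 = perfectly even)."""
    xs = sorted(counts)
    n, total = len(xs), sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n

def profile(tokens, steps=20, trials=1000, seed=0):
    """Mean author count and mean Gini index at equally spaced sample sizes.

    Assumption: each trial samples authorship tokens without replacement.
    """
    rng = random.Random(seed)
    rows = []
    for k in range(1, steps + 1):
        size = len(tokens) * k // steps
        author_sum = gini_sum = 0.0
        for _ in range(trials):
            papers_per_author = Counter(rng.sample(tokens, size))
            author_sum += len(papers_per_author)           # absolute measure
            gini_sum += gini(papers_per_author.values())   # relative measure
        rows.append((size, author_sum / trials, gini_sum / trials))
    return rows
```

Calling `profile(tokens)` on two literatures and comparing the rows at equal `size` mirrors the idea of comparing at the same sample size rather than on the raw totals.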

Source

Journal of the American Society for Information Science and Technology. 54(2003) no.6, S.519-527
Kageura, K.: The dynamics of terminology : a descriptive theory of term formation and terminological growth (2002) 0.01
```
0.010210417 = product of:
  0.020420834 = sum of:
    0.020420834 = sum of:
      0.002449881 = weight(_text_:s in 1787) [ClassicSimilarity], result of:
        0.002449881 = score(doc=1787,freq=4.0), product of:
          0.057684682 = queryWeight, product of:
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.053056188 = queryNorm
          0.042470217 = fieldWeight in 1787, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            1.0872376 = idf(docFreq=40523, maxDocs=44218)
            0.01953125 = fieldNorm(doc=1787)
      0.017970953 = weight(_text_:22 in 1787) [ClassicSimilarity], result of:
        0.017970953 = score(doc=1787,freq=2.0), product of:
          0.18579373 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.053056188 = queryNorm
          0.09672529 = fieldWeight in 1787, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.01953125 = fieldNorm(doc=1787)
  0.5 = coord(1/2)
```
Date

22. 3.2008 18:18:53

Footnote

Rez. in: Knowledge organization 30(2003) no.2, S.112-113 (L. Bowker): "Terminology is generally understood to be the activity that is concerned with the identification, collection and processing of terms; terms are the lexical items used to describe concepts in specialized subject fields. Terminology is not always acknowledged as a discipline in its own right; it is sometimes considered to be a subfield of related disciplines such as lexicography or translation. However, a growing number of researchers are beginning to argue that terminology should be recognized as an autonomous discipline with its own theoretical underpinnings. Kageura's book is a valuable contribution to the formulation of a theory of terminology and will help to establish this discipline as an independent field of research. The general aim of this text is to present a theory of term formation and terminological growth by identifying conceptual regularities in term creation and by laying the foundations for the analysis of terminological growth patterns. The approach used is a descriptive one, which means that it is based on observations taken from a corpus. It is also synchronic in nature and therefore does not attempt to account for the evolution of terms over a given period of time (though it does endeavour to provide a means for predicting possible formation patterns of new terms). The descriptive, corpus-based approach is becoming very popular in terminology circles; however, it does pose certain limitations. To compensate for this, Kageura complements his descriptive analysis of conceptual patterns with a quantitative analysis of the patterns of the growth of terminology. Many existing investigations treat only a limited number of terms, using these for exemplification purposes. Kageura argues strongly (p. 31) that any theory of terms or terminology must be based on the examination of the terminology of a domain (i.e., a specialized subject field) in its entirety since it is only with respect to an individual domain that the concept of "term" can be established. To demonstrate the viability of his theoretical approach, Kageura has chosen to investigate and describe the domain of documentation, using Japanese terminological data. The data in the corpus are derived from a glossary (Wersig and Neveling 1984), and although this glossary is somewhat outdated (a fact acknowledged by the author), the data provided are nonetheless sufficient for demonstrating the viability of the approach, which can later be extended and applied to other languages and domains.

Pages

viii, 322 S

Kageura, K.: Theories of terminology : a quest for a framework for the study of term formation (1999) 0.00

```
0.0024252585 = product of:
  0.004850517 = sum of:
    0.004850517 = product of:
      0.009701034 = sum of:
        0.009701034 = weight(_text_:s in 6290) [ClassicSimilarity], result of:
          0.009701034 = score(doc=6290,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.16817348 = fieldWeight in 6290, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.109375 = fieldNorm(doc=6290)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Source: Terminology. 5(1998/1999) no.1, S.21-40

Kageura, K.; Tsuji, K.; Takusa, A.: Some statistical characterizations of terminological and non-terminological elements : evaluation and examination in Japanese technical abstracts (1996) 0.00

```
0.0020787928 = product of:
  0.0041575856 = sum of:
    0.0041575856 = product of:
      0.008315171 = sum of:
        0.008315171 = weight(_text_:s in 6332) [ClassicSimilarity], result of:
          0.008315171 = score(doc=6332,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.14414869 = fieldWeight in 6332, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.09375 = fieldNorm(doc=6332)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Pages: S.130-138

Tsuji, K.; Kageura, K.: Analysis of word structure of medical synonyms (1996) 0.00

```
0.0020787928 = product of:
  0.0041575856 = sum of:
    0.0041575856 = product of:
      0.008315171 = sum of:
        0.008315171 = weight(_text_:s in 6338) [ClassicSimilarity], result of:
          0.008315171 = score(doc=6338,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.14414869 = fieldWeight in 6338, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.09375 = fieldNorm(doc=6338)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Pages: S.190-196

Fukuda, M.; Kageura, K.: Research into 'see also' references in the dictionary of terminology : using semantic relations between entries (1993) 0.00

```
0.001385862 = product of:
  0.002771724 = sum of:
    0.002771724 = product of:
      0.005543448 = sum of:
        0.005543448 = weight(_text_:s in 1050) [ClassicSimilarity], result of:
          0.005543448 = score(doc=1050,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.09609913 = fieldWeight in 1050, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=1050)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Source: Library and information science. 1993, no.31, S.1-23

Kageura, K.: Terminological semantics : an examination of 'concept' and 'meaning' in the study of terms (1995) 0.00

```
0.001385862 = product of:
  0.002771724 = sum of:
    0.002771724 = product of:
      0.005543448 = sum of:
        0.005543448 = weight(_text_:s in 4561) [ClassicSimilarity], result of:
          0.005543448 = score(doc=4561,freq=2.0), product of:
            0.057684682 = queryWeight, product of:
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.053056188 = queryNorm
            0.09609913 = fieldWeight in 4561, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.0872376 = idf(docFreq=40523, maxDocs=44218)
              0.0625 = fieldNorm(doc=4561)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Source: International forum on information and documentation. 20(1995) no.4, S.25-31
