Search (7 results, page 1 of 1)

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.04

0.036499076 = product of:
  0.18249537 = sum of:
    0.05658752 = weight(_text_:23 in 6265) [ClassicSimilarity], result of:
      0.05658752 = score(doc=6265,freq=4.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.78401303 = fieldWeight in 6265, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.109375 = fieldNorm(doc=6265)
    0.05658752 = weight(_text_:23 in 6265) [ClassicSimilarity], result of:
      0.05658752 = score(doc=6265,freq=4.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.78401303 = fieldWeight in 6265, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.109375 = fieldNorm(doc=6265)
    0.05658752 = weight(_text_:23 in 6265) [ClassicSimilarity], result of:
      0.05658752 = score(doc=6265,freq=4.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.78401303 = fieldWeight in 6265, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.109375 = fieldNorm(doc=6265)
    0.012732802 = product of:
      0.038198404 = sum of:
        0.038198404 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
          0.038198404 = score(doc=6265,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.5416616 = fieldWeight in 6265, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6265)
      0.33333334 = coord(1/3)
  0.2 = coord(4/20)
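
The leaf weights in the explain tree above can be recomputed by hand. The following is a minimal Python sketch, assuming the usual ClassicSimilarity (TF-IDF) definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The queryNorm and fieldNorm values are copied from the listing rather than derived, and the function names are illustrative, not part of any Lucene API.

```python
import math

def tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw frequency (assumed definition).
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)) (assumed definition).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_weight(freq, doc_freq, max_docs, query_norm, field_norm):
    # weight = queryWeight * fieldWeight, mirroring the tree in the listing.
    i = idf(doc_freq, max_docs)
    query_weight = i * query_norm          # idf * queryNorm
    field_weight = tf(freq) * i * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight

# Values taken from the explain output for doc 6265.
QUERY_NORM = 0.02013827
w23 = term_weight(4.0, 3336, 44218, QUERY_NORM, 0.109375)  # term "23"
w22 = term_weight(2.0, 3622, 44218, QUERY_NORM, 0.109375)  # term "22"
print(w23, w22)
```

The results agree with the listed 0.05658752 and 0.038198404 up to single-precision rounding, since the listing reports 32-bit floats while Python computes in double precision.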

Date: 19. 9.2005 12:23:04
Source: Information outlook. 9(2005) no.8, pp. 22-23

Mongin, L.; Fu, Y.Y.; Mostafa, J.: Open Archives data Service prototype and automated subject indexing using D-Lib archive content as a testbed (2003) 0.01

0.013370992 = product of:
  0.08913994 = sum of:
    0.029713312 = weight(_text_:software in 1167) [ClassicSimilarity], result of:
      0.029713312 = score(doc=1167,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3719205 = fieldWeight in 1167, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.046875 = fieldNorm(doc=1167)
    0.029713312 = weight(_text_:software in 1167) [ClassicSimilarity], result of:
      0.029713312 = score(doc=1167,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3719205 = fieldWeight in 1167, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.046875 = fieldNorm(doc=1167)
    0.029713312 = weight(_text_:software in 1167) [ClassicSimilarity], result of:
      0.029713312 = score(doc=1167,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3719205 = fieldWeight in 1167, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.046875 = fieldNorm(doc=1167)
  0.15 = coord(3/20)
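
The top level of the tree above sums the matching clause scores and scales the sum by the coordination factor, the fraction of query clauses that matched. A small Python sketch of that step, assuming this reading of `coord(3/20)`; the helper name `combine` is hypothetical:

```python
def combine(clause_scores, total_clauses):
    # Sum the scores of the clauses that matched, then multiply by
    # coord = matched clauses / total clauses in the query.
    coord = len(clause_scores) / total_clauses
    return sum(clause_scores) * coord

# Three identical "software" clauses matched out of 20 query clauses (doc 1167).
software_weight = 0.029713312
doc_1167_score = combine([software_weight] * 3, total_clauses=20)
print(doc_1167_score)
```

This gives a value that agrees with the listed 0.013370992 to within single-precision rounding.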

Abstract: The Indiana University School of Library and Information Science opened a new research laboratory in January 2003: the Indiana University School of Library and Information Science Information Processing Laboratory [IU IP Lab]. The purpose of the new laboratory is to facilitate collaboration between scientists in the department in the areas of information retrieval (IR) and information visualization (IV) research. The lab has several areas of focus. These include grid and cluster computing, and a standard Java-based software platform to support plug-and-play research datasets, a selection of standard IR modules and standard IV algorithms. Future development includes software to enable researchers to contribute datasets, IR algorithms, and visualization algorithms into the standard environment. We decided early on to use OAI-PMH as a resource discovery tool because it is consistent with our mission.

Witschel, H.F.: Terminology extraction and automatic indexing : comparison and qualitative evaluation of methods (2005) 0.00

0.0011352972 = product of:
  0.022705944 = sum of:
    0.022705944 = product of:
      0.04541189 = sum of:
        0.04541189 = weight(_text_:engineering in 1842) [ClassicSimilarity], result of:
          0.04541189 = score(doc=1842,freq=4.0), product of:
            0.10819342 = queryWeight, product of:
              5.372528 = idf(docFreq=557, maxDocs=44218)
              0.02013827 = queryNorm
            0.41972876 = fieldWeight in 1842, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              5.372528 = idf(docFreq=557, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1842)
      0.5 = coord(1/2)
  0.05 = coord(1/20)

Abstract: Many terminology engineering processes involve the task of automatic terminology extraction: before the terminology of a given domain can be modelled, organised or standardised, important concepts (or terms) of this domain have to be identified and fed into terminological databases. These serve in further steps as a starting point for compiling dictionaries, thesauri or maybe even terminological ontologies for the domain. For the extraction of the initial concepts, extraction methods are needed that operate on specialised language texts. On the other hand, many machine learning or information retrieval applications require automatic indexing techniques. In machine learning applications concerned with the automatic clustering or classification of texts, feature vectors are often needed that describe the contents of a given text briefly but meaningfully. These feature vectors typically consist of a fairly small set of index terms together with weights indicating their importance. Short but meaningful descriptions of document contents as provided by good index terms are also useful to humans: some knowledge management applications (e.g. topic maps) use them as a set of basic concepts (topics). The author believes that the tasks of terminology extraction and automatic indexing have much in common and can thus benefit from the same set of basic algorithms. It is the goal of this paper to outline some methods that may be used in both contexts, but also to find the discriminating factors between the two tasks that call for the variation of parameters or application of different techniques. The discussion of these methods will be based on statistical, syntactical and especially morphological properties of (index) terms. The paper concludes with the presentation of some qualitative and quantitative results comparing statistical and morphological methods.

Source: TKE 2005: Proc. of Terminology and Knowledge Engineering (TKE) 2005

Chung, Y.M.; Lee, J.Y.: A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.00

3.441531E-4 = product of:
  0.0068830615 = sum of:
    0.0068830615 = product of:
      0.013766123 = sum of:
        0.013766123 = weight(_text_:29 in 5769) [ClassicSimilarity], result of:
          0.013766123 = score(doc=5769,freq=2.0), product of:
            0.070840135 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.02013827 = queryNorm
            0.19432661 = fieldWeight in 5769, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5769)
      0.5 = coord(1/2)
  0.05 = coord(1/20)
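
When clauses are nested, as in the tree above, each level applies its own coord factor. One way to check such a tree is to treat it as nested "sum of" and "product of" nodes and evaluate it recursively. The sketch below is illustrative only: the tuple representation and the `evaluate` helper are assumptions made for this example, not a Lucene data structure.

```python
from math import prod

def evaluate(node):
    # A node is either a number (leaf) or a pair (op, children),
    # where op is "sum" or "product", as in the explain output.
    if isinstance(node, (int, float)):
        return float(node)
    op, children = node
    values = [evaluate(child) for child in children]
    if op == "sum":
        return sum(values)
    if op == "product":
        return prod(values)
    raise ValueError(f"unknown node type: {op}")

# Explain tree for doc 5769: one matching term "29" inside a nested clause.
tree_5769 = ("product", [
    ("sum", [
        ("product", [
            ("sum", [0.013766123]),  # weight(_text_:29 in 5769)
            0.5,                     # coord(1/2)
        ]),
    ]),
    0.05,                            # coord(1/20)
])
score_5769 = evaluate(tree_5769)
print(score_5769)
```

The result agrees with the listed 3.441531E-4 to within rounding.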

Date: 29. 9.2001 14:01:18

Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001) 0.00

3.441531E-4 = product of:
  0.0068830615 = sum of:
    0.0068830615 = product of:
      0.013766123 = sum of:
        0.013766123 = weight(_text_:29 in 6029) [ClassicSimilarity], result of:
          0.013766123 = score(doc=6029,freq=2.0), product of:
            0.070840135 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.02013827 = queryNorm
            0.19432661 = fieldWeight in 6029, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=6029)
      0.5 = coord(1/2)
  0.05 = coord(1/20)

Date: 29. 9.2001 14:02:50

Goller, C.; Löning, J.; Will, T.; Wolff, W.: Automatic document classification : a thorough evaluation of various methods (2000) 0.00

3.330612E-4 = product of:
  0.006661224 = sum of:
    0.006661224 = weight(_text_:der in 5480) [ClassicSimilarity], result of:
      0.006661224 = score(doc=5480,freq=2.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.14807922 = fieldWeight in 5480, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.046875 = fieldNorm(doc=5480)
  0.05 = coord(1/20)

Source: Informationskompetenz - Basiskompetenz in der Informationsgesellschaft: Proceedings des 7. Internationalen Symposiums für Informationswissenschaft (ISI 2000), ed. by G. Knorz and R. Kuhlen

Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.00

3.1832006E-4 = product of:
  0.006366401 = sum of:
    0.006366401 = product of:
      0.019099202 = sum of:
        0.019099202 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
          0.019099202 = score(doc=5291,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.2708308 = fieldWeight in 5291, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=5291)
      0.33333334 = coord(1/3)
  0.05 = coord(1/20)

Date: 22. 7.2006 17:32:00
