Search (48 results, page 1 of 3)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.10

0.101439916 = sum of:
  0.08076982 = product of:
    0.24230945 = sum of:
      0.24230945 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
        0.24230945 = score(doc=562,freq=2.0), product of:
          0.43114176 = queryWeight, product of:
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.050854117 = queryNorm
          0.56201804 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.33333334 = coord(1/3)
  0.020670092 = product of:
    0.041340183 = sum of:
      0.041340183 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
        0.041340183 = score(doc=562,freq=2.0), product of:
          0.17808245 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.050854117 = queryNorm
          0.23214069 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.5 = coord(1/2)

Content: Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32
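
The score breakdown above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function and variable names are illustrative and are not part of the search system. The queryNorm, fieldNorm, and coord values are copied from the tree rather than derived.

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency (assumed formula).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as laid out in the explain tree.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                     # idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explain output for doc 562.
QUERY_NORM = 0.050854117
MAX_DOCS = 44218
FIELD_NORM = 0.046875

# Each clause is scaled by its coord factor (1/3 and 1/2 in the tree).
clause_3a = term_score(2.0, 24, MAX_DOCS, QUERY_NORM, FIELD_NORM) * (1 / 3)
clause_22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM) * (1 / 2)
total = clause_3a + clause_22

print(f"3a clause: {clause_3a:.6f}")
print(f"22 clause: {clause_22:.6f}")
print(f"total:     {total:.6f}")
```

The result should agree with the listed 0.101439916 to several decimal places; small differences in the last digits are expected because Lucene computes in 32-bit floats while this sketch uses Python's 64-bit floats.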

Bátori, I.: ¬Der sprachliche Verarbeitungsprozeß als paradigmatischer Kern der linguistischen Datenverarbeitung (1982) 0.05

0.04871808 = product of:
  0.09743616 = sum of:
    0.09743616 = product of:
      0.19487232 = sum of:
        0.19487232 = weight(_text_:90 in 8422) [ClassicSimilarity], result of:
          0.19487232 = score(doc=8422,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.7127794 = fieldWeight in 8422, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.09375 = fieldNorm(doc=8422)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: S.71-90

Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.04

0.04038491 = product of:
  0.08076982 = sum of:
    0.08076982 = product of:
      0.24230945 = sum of:
        0.24230945 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
          0.24230945 = score(doc=862,freq=2.0), product of:
            0.43114176 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.050854117 = queryNorm
            0.56201804 = fieldWeight in 862, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=862)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Source: https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN

Gillaspie, L.: ¬The role of linguistic phenomena in retrieval performance (1995) 0.03

0.032478724 = product of:
  0.06495745 = sum of:
    0.06495745 = product of:
      0.1299149 = sum of:
        0.1299149 = weight(_text_:90 in 3861) [ClassicSimilarity], result of:
          0.1299149 = score(doc=3861,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.4751863 = fieldWeight in 3861, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.0625 = fieldNorm(doc=3861)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pages: S.90-96

Warner, A.J.: Natural language processing (1987) 0.03

0.027560122 = product of:
  0.055120245 = sum of:
    0.055120245 = product of:
      0.11024049 = sum of:
        0.11024049 = weight(_text_:22 in 337) [ClassicSimilarity], result of:
          0.11024049 = score(doc=337,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.61904186 = fieldWeight in 337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=337)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Annual review of information science and technology. 22(1987), S.79-108

Rahmstorf, G.: Rückkehr von Ordnung in die Informationstechnik? (2000) 0.02

0.02435904 = product of:
  0.04871808 = sum of:
    0.04871808 = product of:
      0.09743616 = sum of:
        0.09743616 = weight(_text_:90 in 5504) [ClassicSimilarity], result of:
          0.09743616 = score(doc=5504,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.3563897 = fieldWeight in 5504, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.046875 = fieldNorm(doc=5504)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information und Öffentlichkeit: 1. Gemeinsamer Kongress der Bundesvereinigung Deutscher Bibliotheksverbände e.V. (BDB) und der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI), Leipzig, 20.-23.3.2000. Zugleich 90. Deutscher Bibliothekartag, 52. Jahrestagung der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI). Hrsg.: G. Ruppelt u. H. Neißer

Chowdhury, G.G.: Natural language processing (2002) 0.02

0.02435904 = product of:
  0.04871808 = sum of:
    0.04871808 = product of:
      0.09743616 = sum of:
        0.09743616 = weight(_text_:90 in 4284) [ClassicSimilarity], result of:
          0.09743616 = score(doc=4284,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.3563897 = fieldWeight in 4284, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.046875 = fieldNorm(doc=4284)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Annual review of information science and technology. 37(2003), S.51-90

Navarretta, C.; Pedersen, B.S.; Hansen, D.H.: Language technology in knowledge-organization systems (2006) 0.02
```
0.02435904 = product of:
  0.04871808 = sum of:
    0.04871808 = product of:
      0.09743616 = sum of:
        0.09743616 = weight(_text_:90 in 5706) [ClassicSimilarity], result of:
          0.09743616 = score(doc=5706,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.3563897 = fieldWeight in 5706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.046875 = fieldNorm(doc=5706)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This paper describes the language technology methods developed in the Danish research project VID to extract, from Danish text material, information relevant for populating knowledge organization systems (KOS) within specific corporate domains. The results achieved by applying these methods to a prototype search engine tuned to the patent and trademark domain indicate that human language technology can support the construction of a linguistically based KOS, and that linguistic information in search improves recall substantially without harming precision (near 90%). Finally, we describe two research experiments in which (1) linguistic analysis of Danish compounds is exploited to improve search strategies on these, and (2) linguistic knowledge is used to model corporate knowledge into a language-based ontology.

Al-Shawakfa, E.; Al-Badarneh, A.; Shatnawi, S.; Al-Rabab'ah, K.; Bani-Ismail, B.: ¬A comparison study of some Arabic root finding algorithms (2010) 0.02
```
0.02435904 = product of:
  0.04871808 = sum of:
    0.04871808 = product of:
      0.09743616 = sum of:
        0.09743616 = weight(_text_:90 in 3457) [ClassicSimilarity], result of:
          0.09743616 = score(doc=3457,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.3563897 = fieldWeight in 3457, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.046875 = fieldNorm(doc=3457)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Arabic has a complex structure, which makes it difficult to apply natural language processing (NLP). Much research on Arabic NLP (ANLP) does exist; however, it is not as mature as that of other languages. Finding Arabic roots is an important step toward conducting effective research on most ANLP applications. The authors have studied and compared six root-finding algorithms with success rates of over 90%. The algorithms in this study did not all use the same testing corpus and/or benchmarking measures. The authors unified the testing process by implementing the algorithms from their published descriptions and building a corpus out of 3823 triliteral roots, applying 73 triliteral patterns and 18 affixes, producing around 27.6 million words. They tested the algorithms with the generated corpus and obtained interesting results; they offer to share the corpus freely for benchmarking and ANLP research.

Chowdhury, A.; Mccabe, M.C.: Improving information retrieval systems using part of speech tagging (1993) 0.02
```
0.02435904 = product of:
  0.04871808 = sum of:
    0.04871808 = product of:
      0.09743616 = sum of:
        0.09743616 = weight(_text_:90 in 1061) [ClassicSimilarity], result of:
          0.09743616 = score(doc=1061,freq=2.0), product of:
            0.2733978 = queryWeight, product of:
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.050854117 = queryNorm
            0.3563897 = fieldWeight in 1061, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.376119 = idf(docFreq=555, maxDocs=44218)
              0.046875 = fieldNorm(doc=1061)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The object of Information Retrieval is to retrieve all relevant documents for a user query and only those relevant documents. Much research has focused on achieving this objective with little regard for storage overhead or performance. In the paper we evaluate the use of Part of Speech Tagging to improve the index storage overhead and general speed of the system with only a minimal reduction in precision/recall measurements. We tagged 500 MB of the Los Angeles Times 1990 and 1989 document collection provided by TREC for parts of speech. We then experimented to find the most relevant part of speech to index. We show that 90% of precision/recall is achieved with 40% of the document collection's terms. We also show that this is an improvement in overhead with only a 1% reduction in precision/recall.

McMahon, J.G.; Smith, F.J.: Improved statistical language model performance with automatic generated word hierarchies (1996) 0.02

0.024115108 = product of:
  0.048230216 = sum of:
    0.048230216 = product of:
      0.09646043 = sum of:
        0.09646043 = weight(_text_:22 in 3164) [ClassicSimilarity], result of:
          0.09646043 = score(doc=3164,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.5416616 = fieldWeight in 3164, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3164)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Computational linguistics. 22(1996) no.2, S.217-248

Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.02

0.024115108 = product of:
  0.048230216 = sum of:
    0.048230216 = product of:
      0.09646043 = sum of:
        0.09646043 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
          0.09646043 = score(doc=4506,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.5416616 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=4506)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 8.10.2000 11:52:22

Somers, H.: Example-based machine translation : Review article (1999) 0.02

0.024115108 = product of:
  0.048230216 = sum of:
    0.048230216 = product of:
      0.09646043 = sum of:
        0.09646043 = weight(_text_:22 in 6672) [ClassicSimilarity], result of:
          0.09646043 = score(doc=6672,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.5416616 = fieldWeight in 6672, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6672)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19

Baayen, R.H.; Lieber, H.: Word frequency distributions and lexical semantics (1997) 0.02

0.024115108 = product of:
  0.048230216 = sum of:
    0.048230216 = product of:
      0.09646043 = sum of:
        0.09646043 = weight(_text_:22 in 3117) [ClassicSimilarity], result of:
          0.09646043 = score(doc=3117,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.5416616 = fieldWeight in 3117, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=3117)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 28. 2.1999 10:48:22

¬Der Student aus dem Computer (2023) 0.02

0.024115108 = product of:
  0.048230216 = sum of:
    0.048230216 = product of:
      0.09646043 = sum of:
        0.09646043 = weight(_text_:22 in 1079) [ClassicSimilarity], result of:
          0.09646043 = score(doc=1079,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.5416616 = fieldWeight in 1079, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=1079)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 27. 1.2023 16:22:55

Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999) 0.02

0.020670092 = product of:
  0.041340183 = sum of:
    0.041340183 = product of:
      0.08268037 = sum of:
        0.08268037 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
          0.08268037 = score(doc=4483,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.46428138 = fieldWeight in 4483, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=4483)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 15. 3.2000 10:22:37

Monnerjahn, P.: Vorsprung ohne Technik : Übersetzen: Computer und Qualität (2000) 0.02

0.020670092 = product of:
  0.041340183 = sum of:
    0.041340183 = product of:
      0.08268037 = sum of:
        0.08268037 = weight(_text_:22 in 5429) [ClassicSimilarity], result of:
          0.08268037 = score(doc=5429,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.46428138 = fieldWeight in 5429, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=5429)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: c't. 2000, H.22, S.230-231

Hutchins, J.: From first conception to first demonstration : the nascent years of machine translation, 1947-1954. A chronology (1997) 0.02

0.017225077 = product of:
  0.034450155 = sum of:
    0.034450155 = product of:
      0.06890031 = sum of:
        0.06890031 = weight(_text_:22 in 1463) [ClassicSimilarity], result of:
          0.06890031 = score(doc=1463,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.38690117 = fieldWeight in 1463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1463)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19

Kuhlmann, U.; Monnerjahn, P.: Sprache auf Knopfdruck : Sieben automatische Übersetzungsprogramme im Test (2000) 0.02

0.017225077 = product of:
  0.034450155 = sum of:
    0.034450155 = product of:
      0.06890031 = sum of:
        0.06890031 = weight(_text_:22 in 5428) [ClassicSimilarity], result of:
          0.06890031 = score(doc=5428,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.38690117 = fieldWeight in 5428, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=5428)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: c't. 2000, H.22, S.220-229

Lezius, W.; Rapp, R.; Wettler, M.: ¬A morphology-system and part-of-speech tagger for German (1996) 0.02

0.017225077 = product of:
  0.034450155 = sum of:
    0.034450155 = product of:
      0.06890031 = sum of:
        0.06890031 = weight(_text_:22 in 1693) [ClassicSimilarity], result of:
          0.06890031 = score(doc=1693,freq=2.0), product of:
            0.17808245 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.050854117 = queryNorm
            0.38690117 = fieldWeight in 1693, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1693)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 22. 3.2015 9:37:18
