Search (131 results, page 1 of 7)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.05

0.054003473 = product of:
  0.08100521 = sum of:
    0.0691992 = product of:
      0.20759758 = sum of:
        0.20759758 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
          0.20759758 = score(doc=562,freq=2.0), product of:
            0.36937886 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043569047 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.33333334 = coord(1/3)
    0.011806009 = product of:
      0.035418026 = sum of:
        0.035418026 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
          0.035418026 = score(doc=562,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.23214069 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.33333334 = coord(1/3)
  0.6666667 = coord(2/3)

Content: Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32

Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.05

0.04806631 = product of:
  0.14419892 = sum of:
    0.14419892 = product of:
      0.21629839 = sum of:
        0.13365632 = weight(_text_:network in 4506) [ClassicSimilarity], result of:
          0.13365632 = score(doc=4506,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.6888462 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.109375 = fieldNorm(doc=4506)
        0.08264206 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
          0.08264206 = score(doc=4506,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.5416616 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=4506)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Date: 8.10.2000 11:52:22

Fóris, A.: Network theory and terminology (2013) 0.03
```
0.027774185 = product of:
  0.083322555 = sum of:
    0.083322555 = product of:
      0.124983825 = sum of:
        0.095468804 = weight(_text_:network in 1365) [ClassicSimilarity], result of:
          0.095468804 = score(doc=1365,freq=8.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.492033 = fieldWeight in 1365, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1365)
        0.029515022 = weight(_text_:22 in 1365) [ClassicSimilarity], result of:
          0.029515022 = score(doc=1365,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.19345059 = fieldWeight in 1365, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1365)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)
```
Abstract

The paper aims to present the relations of network theory and terminology. The model of scale-free networks, which has been recently developed and widely applied since, can be effectively used in terminology research as well. Operation based on the principle of networks is a universal characteristic of complex systems. Networks are governed by general laws. The model of scale-free networks can be viewed as a statistical-probability model, and it can be described with mathematical tools. Its main feature is that "everything is connected to everything else," that is, every node is reachable (in a few steps) starting from any other node; this phenomena is called "the small world phenomenon." The existence of a linguistic network and the general laws of the operation of networks enable us to place issues of language use in the complex system of relations that reveal the deeper connection s between phenomena with the help of networks embedded in each other. The realization of the metaphor that language also has a network structure is the basis of the classification methods of the terminological system, and likewise of the ways of creating terminology databases, which serve the purpose of providing easy and versatile accessibility to specialised knowledge.

Date

2. 9.2014 21:22:48

Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001) 0.02

0.024116507 = product of:
  0.07234952 = sum of:
    0.07234952 = product of:
      0.10852428 = sum of:
        0.06682816 = weight(_text_:network in 1595) [ClassicSimilarity], result of:
          0.06682816 = score(doc=1595,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.3444231 = fieldWeight in 1595, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1595)
        0.041696113 = weight(_text_:29 in 1595) [ClassicSimilarity], result of:
          0.041696113 = score(doc=1595,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.27205724 = fieldWeight in 1595, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1595)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Abstract: This paper presents a method that exploits the hierarchical structure of an indexing vocabulary to guide the development and training of machine learning methods for automatic text categorization. We present the design of a hierarchical classifier based an the divide-and-conquer principle. The method is evaluated using backpropagation neural networks, such as the machine learning algorithm, that leam to assign MeSH categories to a subset of MEDLINE records. Comparisons with traditional Rocchio's algorithm adapted for text categorization, as well as flat neural network classifiers, are provided. The results indicate that the use of hierarchical structures improves Performance significantly.
Date: 11. 5.2003 18:29:44

Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.02

0.0230664 = product of:
  0.0691992 = sum of:
    0.0691992 = product of:
      0.20759758 = sum of:
        0.20759758 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
          0.20759758 = score(doc=862,freq=2.0), product of:
            0.36937886 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043569047 = queryNorm
            0.56201804 = fieldWeight in 862, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=862)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Source: https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN

Sidhom, S.; Hassoun, M.: Morpho-syntactic parsing for a text mining environment : An NP recognition model for knowledge visualization and information retrieval (2002) 0.02

0.020671291 = product of:
  0.062013872 = sum of:
    0.062013872 = product of:
      0.093020804 = sum of:
        0.057281278 = weight(_text_:network in 1852) [ClassicSimilarity], result of:
          0.057281278 = score(doc=1852,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.29521978 = fieldWeight in 1852, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.046875 = fieldNorm(doc=1852)
        0.035739526 = weight(_text_:29 in 1852) [ClassicSimilarity], result of:
          0.035739526 = score(doc=1852,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.23319192 = fieldWeight in 1852, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=1852)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Abstract: Sidhom and Hassoun discuss the crucial role of NLP tools in Knowledge Extraction and Management as well as in the design of Information Retrieval Systems. The authors focus more specifically an the morpho-syntactic issues by describing their morpho-syntactic analysis platform, which has been implemented to cover the automatic indexing and information retrieval topics. To this end they implemented the Cascaded "Augmented Transition Network (ATN)". They used this formalism in order to analyse French text descriptions of Multimedia documents. An implementation of an ATN parsing automaton is briefly described. The Platform in its logical operation is considered as an investigative tool towards the knowledge organization (based an an NP recognition model) and management of multiform e-documents (text, multimedia, audio, image) using their text descriptions.
Source: Knowledge organization. 29(2002) nos.3/4, S.171-180

Snajder, J.; Almic, P.: Modeling semantic compositionality of Croatian multiword expressions (2015) 0.02

0.020671291 = product of:
  0.062013872 = sum of:
    0.062013872 = product of:
      0.093020804 = sum of:
        0.057281278 = weight(_text_:network in 2920) [ClassicSimilarity], result of:
          0.057281278 = score(doc=2920,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.29521978 = fieldWeight in 2920, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.046875 = fieldNorm(doc=2920)
        0.035739526 = weight(_text_:29 in 2920) [ClassicSimilarity], result of:
          0.035739526 = score(doc=2920,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.23319192 = fieldWeight in 2920, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=2920)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Abstract: A distinguishing feature of many multiword expressions (MWEs) is their semantic non-compositionality. Determining the semantic compositionality of MWEs is important for many natural language processing tasks. We address the task of modeling semantic compositionality of Croatian MWEs. We adopt a composition-based approach within the distributional semantics framework. We build and evaluate models based on Latent Semantic Analysis and the recently proposed neural network-based Skip-gram model, and experiment with different composition functions. We show that the compositionality scores predicted by the Skip-gram additive models correlate well with human judgments (=0.50). When framed as a classification task, the model achieves an accuracy of 0.64.
Date: 29. 4.2016 12:42:17

Melby, A.: Some notes on 'The proper place of men and machines in language translation' (1997) 0.02

0.018448254 = product of:
  0.05534476 = sum of:
    0.05534476 = product of:
      0.08301714 = sum of:
        0.041696113 = weight(_text_:29 in 330) [ClassicSimilarity], result of:
          0.041696113 = score(doc=330,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.27205724 = fieldWeight in 330, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0546875 = fieldNorm(doc=330)
        0.04132103 = weight(_text_:22 in 330) [ClassicSimilarity], result of:
          0.04132103 = score(doc=330,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.2708308 = fieldWeight in 330, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=330)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Date: 31. 7.1996 9:22:19
Source: Machine translation. 12(1997) nos.1/2, S.29-34

Doszkocs, T.E.; Zamora, A.: Dictionary services and spelling aids for Web searching (2004) 0.02

0.01589411 = product of:
  0.047682326 = sum of:
    0.047682326 = product of:
      0.07152349 = sum of:
        0.029782942 = weight(_text_:29 in 2541) [ClassicSimilarity], result of:
          0.029782942 = score(doc=2541,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.19432661 = fieldWeight in 2541, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2541)
        0.041740544 = weight(_text_:22 in 2541) [ClassicSimilarity], result of:
          0.041740544 = score(doc=2541,freq=4.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.27358043 = fieldWeight in 2541, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2541)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)

Date: 14. 8.2004 17:22:56
Source: Online. 28(2004) no.3, S.22-29

Yang, C.C.; Luk, J.: Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws (2003) 0.02
```
0.015092259 = product of:
  0.045276776 = sum of:
    0.045276776 = product of:
      0.067915164 = sum of:
        0.047254648 = weight(_text_:network in 1616) [ClassicSimilarity], result of:
          0.047254648 = score(doc=1616,freq=4.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.24354391 = fieldWeight in 1616, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1616)
        0.020660516 = weight(_text_:22 in 1616) [ClassicSimilarity], result of:
          0.020660516 = score(doc=1616,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.1354154 = fieldWeight in 1616, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1616)
      0.6666667 = coord(2/3)
  0.33333334 = coord(1/3)
```
Abstract

The information available in languages other than English in the World Wide Web is increasing significantly. According to a report from Computer Economics in 1999, 54% of Internet users are English speakers ("English Will Dominate Web for Only Three More Years," Computer Economics, July 9, 1999, http://www.computereconomics. com/new4/pr/pr990610.html). However, it is predicted that there will be only 60% increase in Internet users among English speakers verses a 150% growth among nonEnglish speakers for the next five years. By 2005, 57% of Internet users will be non-English speakers. A report by CNN.com in 2000 showed that the number of Internet users in China had been increased from 8.9 million to 16.9 million from January to June in 2000 ("Report: China Internet users double to 17 million," CNN.com, July, 2000, http://cnn.org/2000/TECH/computing/07/27/ china.internet.reut/index.html). According to Nielsen/ NetRatings, there was a dramatic leap from 22.5 millions to 56.6 millions Internet users from 2001 to 2002. China had become the second largest global at-home Internet population in 2002 (US's Internet population was 166 millions) (Robyn Greenspan, "China Pulls Ahead of Japan," Internet.com, April 22, 2002, http://cyberatias.internet.com/big-picture/geographics/article/0,,5911_1013841,00. html). All of the evidences reveal the importance of crosslingual research to satisfy the needs in the near future. Digital library research has been focusing in structural and semantic interoperability in the past. Searching and retrieving objects across variations in protocols, formats and disciplines are widely explored (Schatz, B., & Chen, H. (1999). Digital libraries: technological advances and social impacts. IEEE Computer, Special Issue an Digital Libraries, February, 32(2), 45-50.; Chen, H., Yen, J., & Yang, C.C. (1999). International activities: development of Asian digital libraries. IEEE Computer, Special Issue an Digital Libraries, 32(2), 48-49.). However, research in crossing language boundaries, especially across European languages and Oriental languages, is still in the initial stage. In this proposal, we put our focus an cross-lingual semantic interoperability by developing automatic generation of a cross-lingual thesaurus based an English/Chinese parallel corpus. When the searchers encounter retrieval problems, Professional librarians usually consult the thesaurus to identify other relevant vocabularies. In the problem of searching across language boundaries, a cross-lingual thesaurus, which is generated by co-occurrence analysis and Hopfield network, can be used to generate additional semantically relevant terms that cannot be obtained from dictionary. In particular, the automatically generated cross-lingual thesaurus is able to capture the unknown words that do not exist in a dictionary, such as names of persons, organizations, and events. Due to Hong Kong's unique history background, both English and Chinese are used as official languages in all legal documents. Therefore, English/Chinese cross-lingual information retrieval is critical for applications in courts and the government. In this paper, we develop an automatic thesaurus by the Hopfield network based an a parallel corpus collected from the Web site of the Department of Justice of the Hong Kong Special Administrative Region (HKSAR) Government. Experiments are conducted to measure the precision and recall of the automatic generated English/Chinese thesaurus. The result Shows that such thesaurus is a promising tool to retrieve relevant terms, especially in the language that is not the same as the input term. The direct translation of the input term can also be retrieved in most of the cases.

Fellbaum, C.: ¬A semantic network of English : the mother of all WordNets (1998) 0.01

0.014850704 = product of:
  0.04455211 = sum of:
    0.04455211 = product of:
      0.13365632 = sum of:
        0.13365632 = weight(_text_:network in 6416) [ClassicSimilarity], result of:
          0.13365632 = score(doc=6416,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.6888462 = fieldWeight in 6416, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.109375 = fieldNorm(doc=6416)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Kuhlen, R.: Morphologische Relationen durch Reduktionsalgorithmen (1974) 0.01

0.013103826 = product of:
  0.039311476 = sum of:
    0.039311476 = product of:
      0.11793443 = sum of:
        0.11793443 = weight(_text_:29 in 4251) [ClassicSimilarity], result of:
          0.11793443 = score(doc=4251,freq=4.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.7694941 = fieldWeight in 4251, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.109375 = fieldNorm(doc=4251)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Date: 29. 1.2011 14:56:29

Radev, D.R.; Joseph, M.T.; Gibson, B.; Muthukrishnan, P.: ¬A bibliometric and network analysis of the field of computational linguistics (2016) 0.01
```
0.012861087 = product of:
  0.03858326 = sum of:
    0.03858326 = product of:
      0.11574978 = sum of:
        0.11574978 = weight(_text_:network in 2764) [ClassicSimilarity], result of:
          0.11574978 = score(doc=2764,freq=6.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.59655833 = fieldWeight in 2764, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2764)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

The ACL Anthology is a large collection of research papers in computational linguistics. Citation data were obtained using text extraction from a collection of PDF files with significant manual postprocessing performed to clean up the results. Manual annotation of the references was then performed to complete the citation network. We analyzed the networks of paper citations, author citations, and author collaborations in an attempt to identify the most central papers and authors. The analysis includes general network statistics, PageRank, metrics across publication years and venues, the impact factor and h-index, as well as other measures.

Barthel, J.; Ciesielski, R.: Regeln zu ChatGPT an Unis oft unklar : KI in der Bildung (2023) 0.01

0.01146346 = product of:
  0.03439038 = sum of:
    0.03439038 = product of:
      0.10317113 = sum of:
        0.10317113 = weight(_text_:29 in 925) [ClassicSimilarity], result of:
          0.10317113 = score(doc=925,freq=6.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.6731671 = fieldWeight in 925, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.078125 = fieldNorm(doc=925)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Date: 29. 3.2023 13:23:26
29. 3.2023 13:29:19

Roberts, C.W.; Popping, R.: Computer-supported content analysis : some recent developments (1993) 0.01

0.010607645 = product of:
  0.031822935 = sum of:
    0.031822935 = product of:
      0.095468804 = sum of:
        0.095468804 = weight(_text_:network in 4236) [ClassicSimilarity], result of:
          0.095468804 = score(doc=4236,freq=2.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.492033 = fieldWeight in 4236, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.078125 = fieldNorm(doc=4236)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Abstract: Presents an overview of some recent developments in the clause-based content analysis of linguistic data. Introduces network analysis of evaluative texts, for the analysis of cognitive maps, and linguistic content analysis. Focuses on the types of substantive inferences afforded by the three approaches

Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: ¬A deep network model for paraphrase detection in short text messages (2018) 0.01
```
0.010607645 = product of:
  0.031822935 = sum of:
    0.031822935 = product of:
      0.095468804 = sum of:
        0.095468804 = weight(_text_:network in 5043) [ClassicSimilarity], result of:
          0.095468804 = score(doc=5043,freq=8.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.492033 = fieldWeight in 5043, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5043)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

This paper is concerned with paraphrase detection, i.e., identifying sentences that are semantically identical. The ability to detect similar sentences written in natural language is crucial for several applications, such as text mining, text summarization, plagiarism detection, authorship authentication and question answering. Recognizing this importance, we study in particular how to address the challenges with detecting paraphrases in user generated short texts, such as Twitter, which often contain language irregularity and noise, and do not necessarily contain as much semantic information as longer clean texts. We propose a novel deep neural network-based approach that relies on coarse-grained sentence modelling using a convolutional neural network (CNN) and a recurrent neural network (RNN) model, combined with a specific fine-grained word-level similarity matching model. More specifically, we develop a new architecture, called DeepParaphrase, which enables to create an informative semantic representation of each sentence by (1) using CNN to extract the local region information in form of important n-grams from the sentence, and (2) applying RNN to capture the long-term dependency information. In addition, we perform a comparative study on state-of-the-art approaches within paraphrase detection. An important insight from this study is that existing paraphrase approaches perform well when applied on clean texts, but they do not necessarily deliver good performance against noisy texts, and vice versa. In contrast, our evaluation has shown that the proposed DeepParaphrase-based approach achieves good results in both types of texts, thus making it more robust and generic than the existing approaches.

Wettler, M.; Rapp, R.; Ferber, R.: Freie Assoziationen und Kontiguitäten von Wörtern in Texten (1993) 0.01

0.010589491 = product of:
  0.03176847 = sum of:
    0.03176847 = product of:
      0.095305406 = sum of:
        0.095305406 = weight(_text_:29 in 2140) [ClassicSimilarity], result of:
          0.095305406 = score(doc=2140,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.6218451 = fieldWeight in 2140, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.125 = fieldNorm(doc=2140)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Date: 4.11.1998 14:30:29

Toutanova, K.; Klein, D.; Manning, C.D.; Singer, Y.: Feature-rich Part-of-Speech Tagging with a cyclic dependency network (2003) 0.01
```
0.010501034 = product of:
  0.0315031 = sum of:
    0.0315031 = product of:
      0.094509296 = sum of:
        0.094509296 = weight(_text_:network in 1059) [ClassicSimilarity], result of:
          0.094509296 = score(doc=1059,freq=4.0), product of:
            0.19402927 = queryWeight, product of:
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.043569047 = queryNorm
            0.48708782 = fieldWeight in 1059, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.4533744 = idf(docFreq=1398, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1059)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)
```
Abstract

We present a new part-of-speech tagger that demonstrates the following ideas: (i) explicit use of both preceding and following tag contexts via a dependency network representation, (ii) broad use of lexical features, including jointly conditioning on multiple consecutive words, (iii) effective use of priors in conditional loglinear models, and (iv) fine-grained modeling of unknown word features. Using these ideas together, the resulting tagger gives a 97.24%accuracy on the Penn TreebankWSJ, an error reduction of 4.4% on the best previous single automatically learned tagging result.

Warner, A.J.: Natural language processing (1987) 0.01

0.01049423 = product of:
  0.03148269 = sum of:
    0.03148269 = product of:
      0.09444807 = sum of:
        0.09444807 = weight(_text_:22 in 337) [ClassicSimilarity], result of:
          0.09444807 = score(doc=337,freq=2.0), product of:
            0.15257138 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043569047 = queryNorm
            0.61904186 = fieldWeight in 337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=337)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Source: Annual review of information science and technology. 22(1987), S.79-108

Hahn, U.; Reimer, U.: Informationslinguistische Konzepte der Volltextverarbeitung in TOPIC (1983) 0.01

0.009265803 = product of:
  0.027797408 = sum of:
    0.027797408 = product of:
      0.083392225 = sum of:
        0.083392225 = weight(_text_:29 in 450) [ClassicSimilarity], result of:
          0.083392225 = score(doc=450,freq=2.0), product of:
            0.15326229 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.043569047 = queryNorm
            0.5441145 = fieldWeight in 450, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.109375 = fieldNorm(doc=450)
      0.33333334 = coord(1/3)
  0.33333334 = coord(1/3)

Source: Deutscher Dokumentartag 1982, Lübeck-Travemünde, 29.-30.9.1982: Fachinformation im Zeitalter der Informationsindustrie. Bearb.: H. Strohl-Goebel

Search (131 results, page 1 of 7)

Authors

Years

Languages

Types

Themes

Subjects

Classifications