Search (31 results, page 2 of 2)

Shaalan, K.; Raza, H.: NERA: Named Entity Recognition for Arabic (2009) 0.01
```
0.007890998 = product of:
  0.015781997 = sum of:
    0.015781997 = product of:
      0.031563994 = sum of:
        0.031563994 = weight(_text_:b in 2953) [ClassicSimilarity], result of:
          0.031563994 = score(doc=2953,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.19572285 = fieldWeight in 2953, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2953)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Name identification has been worked on quite intensively for the past few years, and has been incorporated into several products revolving around natural language processing tasks. Many researchers have attacked the name identification problem in a variety of languages, but only a few limited research efforts have focused on named entity recognition for Arabic script. This is due to the lack of resources for Arabic named entities and the limited amount of progress made in Arabic natural language processing in general. In this article, we present the results of our attempt at the recognition and extraction of the 10 most important categories of named entities in Arabic script: the person name, location, company, date, time, price, measurement, phone number, ISBN, and file name. We developed the system Named Entity Recognition for Arabic (NERA) using a rule-based approach. The resources created are: a Whitelist representing a dictionary of names, and a grammar, in the form of regular expressions, which are responsible for recognizing the named entities. A filtration mechanism is used that serves two different purposes: (a) revision of the results from a named entity extractor by using metadata, in terms of a Blacklist or rejecter, about ill-formed named entities and (b) disambiguation of identical or overlapping textual matches returned by different named entity extractors to get the correct choice. In NERA, we addressed major challenges posed by NER in the Arabic language arising due to the complexity of the language, peculiarities in the Arabic orthographic system, nonstandardization of the written text, ambiguity, and lack of resources. NERA has been effectively evaluated using our own tagged corpus; it achieved satisfactory results in terms of precision, recall, and F-measure.
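
The score breakdown listed for this record follows the pattern of Lucene's ClassicSimilarity (TF-IDF). As a minimal sketch, assuming the usual ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), the listed values can be recomputed in Python. The function names `classic_idf` and `classic_score` are illustrative and not part of any library, and Lucene works in single precision, so the last digits may differ slightly.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_score(freq, doc_freq, max_docs, query_norm, field_norm, coords=(0.5, 0.5)):
    """Recompute one branch of the explain tree (hypothetical helper)."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                  # tf(freq=2.0) is about 1.4142135
    query_weight = idf * query_norm       # "queryWeight" line of the tree
    field_weight = tf * idf * field_norm  # "fieldWeight" line of the tree
    score = query_weight * field_weight
    for coord in coords:                  # the two coord(1/2) factors
        score *= coord
    return score


# Values copied from the explain tree for doc 2953 (term "b").
score = classic_score(
    freq=2.0,
    doc_freq=3476,
    max_docs=44218,
    query_norm=0.045518078,
    field_norm=0.0390625,
)
print(f"{score:.6f}")  # close to the 0.007890998 reported for this record
```

The same call with `doc_freq=3622` reproduces the branches for the term "22" further down the list, since only the document frequency and the field norm change between records.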

Li, Q.; Chen, Y.P.; Myaeng, S.-H.; Jin, Y.; Kang, B.-Y.: Concept unification of terms in different languages via web mining for Information Retrieval (2009) 0.01

```
0.007890998 = product of:
  0.015781997 = sum of:
    0.015781997 = product of:
      0.031563994 = sum of:
        0.031563994 = weight(_text_:b in 4215) [ClassicSimilarity], result of:
          0.031563994 = score(doc=4215,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.19572285 = fieldWeight in 4215, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4215)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Sienel, J.; Weiss, M.; Laube, M.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts (2000) 0.01

```
0.0077088396 = product of:
  0.015417679 = sum of:
    0.015417679 = product of:
      0.030835358 = sum of:
        0.030835358 = weight(_text_:22 in 5557) [ClassicSimilarity], result of:
          0.030835358 = score(doc=5557,freq=2.0), product of:
            0.15939656 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045518078 = queryNorm
            0.19345059 = fieldWeight in 5557, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5557)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Date: 26.12.2000 13:22:17

Pinker, S.: Wörter und Regeln : Die Natur der Sprache (2000) 0.01

```
0.0077088396 = product of:
  0.015417679 = sum of:
    0.015417679 = product of:
      0.030835358 = sum of:
        0.030835358 = weight(_text_:22 in 734) [ClassicSimilarity], result of:
          0.030835358 = score(doc=734,freq=2.0), product of:
            0.15939656 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045518078 = queryNorm
            0.19345059 = fieldWeight in 734, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=734)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Date: 19.7.2002 14:22:31

Computational linguistics for the new millennium : divergence or synergy? Proceedings of the International Symposium held at the Ruprecht-Karls Universität Heidelberg, 21-22 July 2000. Festschrift in honour of Peter Hellwig on the occasion of his 60th birthday (2002) 0.01

```
0.0077088396 = product of:
  0.015417679 = sum of:
    0.015417679 = product of:
      0.030835358 = sum of:
        0.030835358 = weight(_text_:22 in 4900) [ClassicSimilarity], result of:
          0.030835358 = score(doc=4900,freq=2.0), product of:
            0.15939656 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045518078 = queryNorm
            0.19345059 = fieldWeight in 4900, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4900)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Rösener, C.: Die Stecknadel im Heuhaufen : Natürlichsprachlicher Zugang zu Volltextdatenbanken (2005) 0.01
```
0.0063127987 = product of:
  0.012625597 = sum of:
    0.012625597 = product of:
      0.025251195 = sum of:
        0.025251195 = weight(_text_:b in 548) [ClassicSimilarity], result of:
          0.025251195 = score(doc=548,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.15657827 = fieldWeight in 548, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.03125 = fieldNorm(doc=548)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Content

5: Interaction 5.1 Question-answering and dialogue systems: research and projects 5.2 Representation and visualization of knowledge 5.3 The dialogue system within the LeWi project 5.4 Presentation of results and answers in the LeWi context 6: Test environments and results 7: Results and outlook 7.1 Initial situation 7.2 Conclusions 7.3 Outlook Appendix A Excerpts from the coarse and fine classification of the BMM Appendix B MPRO - Formal description of the most important features ... Appendix C Question typology with example sentences (excerpt) Appendix D Semantic features in the morphological lexicon (excerpt) Appendix E Example rules for question-type assignment Appendix F List of possible searches in the LeWi dialogue module (excerpt) Appendix G Complete dialogue tree at the start of the project Appendix H Status states for determining follow-up questions (excerpt)
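
Among the records matched on the term "b", every factor in the score tree is identical except `fieldNorm`, the length-normalization factor in ClassicSimilarity, which is smaller for longer fields. A short sketch, using only numbers copied from the explain trees on this page, shows that the ranking of these records follows the field norm alone. The dictionary and variable names are illustrative.

```python
# Factors shared by all records matched on the term "b" (copied from the explain trees).
query_weight = 0.16126883        # idf * queryNorm
tf_times_idf = 1.4142135 * 3.542962
coord = 0.5 * 0.5                # two nested coord(1/2) factors

# fieldNorm per document id, as listed for docs 2953, 548 and 691.
field_norms = {2953: 0.0390625, 548: 0.03125, 691: 0.0234375}

# The final score is linear in fieldNorm, so the ordering follows it directly.
scores = {
    doc: coord * query_weight * tf_times_idf * norm
    for doc, norm in field_norms.items()
}

for doc, value in sorted(scores.items(), key=lambda item: item[1], reverse=True):
    print(doc, f"{value:.6f}")
```

The ratio between two scores equals the ratio of their field norms, for example 0.03125 / 0.0390625 = 0.8 for doc 548 relative to doc 2953.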

Kiss, T.: Anmerkungen zur scheinbaren Konkurrenz von numerischen und symbolischen Verfahren in der Computerlinguistik (2002) 0.01

```
0.0063127987 = product of:
  0.012625597 = sum of:
    0.012625597 = product of:
      0.025251195 = sum of:
        0.025251195 = weight(_text_:b in 1752) [ClassicSimilarity], result of:
          0.025251195 = score(doc=1752,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.15657827 = fieldWeight in 1752, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.03125 = fieldNorm(doc=1752)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Source: Computerlinguistik -- Was geht, was kommt? = Computational linguistics : achievements and perspectives; Festschrift für Winfried Lenders. Eds.: G. Willée, B. Schröder & H.-C. Schmitz

Schürmann, H.: Software scannt Radio- und Fernsehsendungen : Recherche in Nachrichtenarchiven erleichtert (2001) 0.01

```
0.0053961873 = product of:
  0.010792375 = sum of:
    0.010792375 = product of:
      0.02158475 = sum of:
        0.02158475 = weight(_text_:22 in 5759) [ClassicSimilarity], result of:
          0.02158475 = score(doc=5759,freq=2.0), product of:
            0.15939656 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045518078 = queryNorm
            0.1354154 = fieldWeight in 5759, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=5759)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Source: Handelsblatt. No. 79, 24.4.2001, p. 22

Melzer, C.: Der Maschine anpassen : PC-Spracherkennung - Programme sind mittlerweile alltagsreif (2005) 0.01

```
0.0053961873 = product of:
  0.010792375 = sum of:
    0.010792375 = product of:
      0.02158475 = sum of:
        0.02158475 = weight(_text_:22 in 4044) [ClassicSimilarity], result of:
          0.02158475 = score(doc=4044,freq=2.0), product of:
            0.15939656 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045518078 = queryNorm
            0.1354154 = fieldWeight in 4044, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4044)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Date: 3.5.1997 8:44:22

Conceptual structures : logical, linguistic, and computational issues. 8th International Conference on Conceptual Structures, ICCS 2000, Darmstadt, Germany, August 14-18, 2000 (2000) 0.00

```
0.004734599 = product of:
  0.009469198 = sum of:
    0.009469198 = product of:
      0.018938396 = sum of:
        0.018938396 = weight(_text_:b in 691) [ClassicSimilarity], result of:
          0.018938396 = score(doc=691,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.117433704 = fieldWeight in 691, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.0234375 = fieldNorm(doc=691)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Editor: Ganter, B.

Sprachtechnologie, mobile Kommunikation und linguistische Ressourcen : Beiträge zur GLDV Tagung 2005 in Bonn (2005) 0.00

```
0.004734599 = product of:
  0.009469198 = sum of:
    0.009469198 = product of:
      0.018938396 = sum of:
        0.018938396 = weight(_text_:b in 3578) [ClassicSimilarity], result of:
          0.018938396 = score(doc=3578,freq=2.0), product of:
            0.16126883 = queryWeight, product of:
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.045518078 = queryNorm
            0.117433704 = fieldWeight in 3578, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.542962 = idf(docFreq=3476, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3578)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```

Editor: Fisseni, B. et al.
