Search (190 results, page 1 of 10)

  • × theme_ss:"Computerlinguistik"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    0.065473676 = sum of:
      0.05333997 = product of:
        0.21335988 = sum of:
          0.21335988 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.21335988 = score(doc=562,freq=2.0), product of:
              0.37963173 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.044778395 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.012133708 = product of:
        0.036401123 = sum of:
          0.036401123 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.036401123 = score(doc=562,freq=2.0), product of:
              0.1568063 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.044778395 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.33333334 = coord(1/3)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Chibout, K.; Vilnat, A.: Primitive sémantiques, classification des verbes et polysémie (1999) 0.05
    0.047213987 = product of:
      0.09442797 = sum of:
        0.09442797 = product of:
          0.14164196 = sum of:
            0.07859619 = weight(_text_:f in 6229) [ClassicSimilarity], result of:
              0.07859619 = score(doc=6229,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.4403713 = fieldWeight in 6229, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.078125 = fieldNorm(doc=6229)
            0.06304577 = weight(_text_:k in 6229) [ClassicSimilarity], result of:
              0.06304577 = score(doc=6229,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.39440846 = fieldWeight in 6229, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.078125 = fieldNorm(doc=6229)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Language
    f
  3. Liu, S.; Liu, F.; Yu, C.; Meng, W.: ¬An effective approach to document retrieval via utilizing WordNet and recognizing phrases (2004) 0.05
    0.047213987 = product of:
      0.09442797 = sum of:
        0.09442797 = product of:
          0.14164196 = sum of:
            0.07859619 = weight(_text_:f in 4078) [ClassicSimilarity], result of:
              0.07859619 = score(doc=4078,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.4403713 = fieldWeight in 4078, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4078)
            0.06304577 = weight(_text_:k in 4078) [ClassicSimilarity], result of:
              0.06304577 = score(doc=4078,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.39440846 = fieldWeight in 4078, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.078125 = fieldNorm(doc=4078)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a
  4. Kocijan, K.: Visualizing natural language resources (2015) 0.05
    0.047213987 = product of:
      0.09442797 = sum of:
        0.09442797 = product of:
          0.14164196 = sum of:
            0.07859619 = weight(_text_:f in 2995) [ClassicSimilarity], result of:
              0.07859619 = score(doc=2995,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.4403713 = fieldWeight in 2995, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2995)
            0.06304577 = weight(_text_:k in 2995) [ClassicSimilarity], result of:
              0.06304577 = score(doc=2995,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.39440846 = fieldWeight in 2995, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2995)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Re:inventing information science in the networked society: Proceedings of the 14th International Symposium on Information Science, Zadar/Croatia, 19th-21st May 2015. Eds.: F. Pehar, C. Schloegl u. C. Wolff
  5. Rieger, F.: Lügende Computer (2023) 0.04
    0.037137263 = product of:
      0.074274525 = sum of:
        0.074274525 = product of:
          0.11141179 = sum of:
            0.062876955 = weight(_text_:f in 912) [ClassicSimilarity], result of:
              0.062876955 = score(doc=912,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.35229704 = fieldWeight in 912, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0625 = fieldNorm(doc=912)
            0.048534833 = weight(_text_:22 in 912) [ClassicSimilarity], result of:
              0.048534833 = score(doc=912,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.30952093 = fieldWeight in 912, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=912)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    16. 3.2023 19:22:55
  6. Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia : Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000 Fachhochschule Köln (2000) 0.04
    0.036595136 = product of:
      0.07319027 = sum of:
        0.07319027 = product of:
          0.10978541 = sum of:
            0.039298095 = weight(_text_:f in 5527) [ClassicSimilarity], result of:
              0.039298095 = score(doc=5527,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.22018565 = fieldWeight in 5527, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5527)
            0.07048731 = weight(_text_:k in 5527) [ClassicSimilarity], result of:
              0.07048731 = score(doc=5527,freq=10.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.44096208 = fieldWeight in 5527, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5527)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Content
    Enthält die Beiträge: WRIGHT, S.E.: Leveraging terminology resources across application boundaries: accessing resources in future integrated environments; PALME, K.: E-Commerce: Verhindert Sprache Business-to-business?; RÜEGGER, R.: Die qualität der virtuellen Information als Wettbewerbsvorteil: Information im Internet ist Sprache - noch; SCHIRMER, K. u. J. HALLER: Zugang zu mehrsprachigen Nachrichten im Internet; WEISS, A. u. W. WIEDEN: Die Herstellung mehrsprachiger Informations- und Wissensressourcen in Unternehmen; FULFORD, H.: Monolingual or multilingual web sites? An exploratory study of UK SMEs; SCHMIDTKE-NIKELLA, M.: Effiziente Hypermediaentwicklung: Die Autorenentlastung durch eine Engine; SCHMIDT, R.: Maschinelle Text-Ton-Synchronisation in Wissenschaft und Wirtschaft; HELBIG, H. u.a.: Natürlichsprachlicher Zugang zu Informationsanbietern im Internet und zu lokalen Datenbanken; SIENEL, J. u.a.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts; ERBACH, G.: Sprachdialogsysteme für Telefondienste: Stand der Technik und zukünftige Entwicklungen; SUSEN, A.: Spracherkennung: Akteulle Einsatzmöglichkeiten im Bereich der Telekommunikation; BENZMÜLLER, R.: Logox WebSpeech: die neue Technologie für sprechende Internetseiten; JAARANEN, K. u.a.: Webtran tools for in-company language support; SCHMITZ, K.-D.: Projektforschung und Infrastrukturen im Bereich der Terminologie: Wie kann die Wirtschaft davon profitieren?; SCHRÖTER, F. u. U. MEYER: Entwicklung sprachlicher Handlungskompetenz in englisch mit hilfe eines Multimedia-Sprachlernsystems; KLEIN, A.: Der Einsatz von Sprachverarbeitungstools beim Sprachenlernen im Intranet; HAUER, M.: Knowledge Management braucht Terminologie Management; HEYER, G. u.a.: Texttechnologische Anwendungen am Beispiel Text Mining
    Editor
    Schmitz, K.-D.
  7. Semantik, Lexikographie und Computeranwendungen : Workshop ... (Bonn) : 1995.01.27-28 (1996) 0.04
    0.036310155 = product of:
      0.07262031 = sum of:
        0.07262031 = product of:
          0.10893046 = sum of:
            0.07859619 = weight(_text_:f in 190) [ClassicSimilarity], result of:
              0.07859619 = score(doc=190,freq=8.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.4403713 = fieldWeight in 190, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=190)
            0.030334271 = weight(_text_:22 in 190) [ClassicSimilarity], result of:
              0.030334271 = score(doc=190,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19345059 = fieldWeight in 190, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=190)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Classification
    Spr F 510
    Spr F 87 / Lexikographie
    Date
    14. 4.2007 10:04:22
    SBB
    Spr F 510
    Spr F 87 / Lexikographie
  8. Schneider, J.W.; Borlund, P.: ¬A bibliometric-based semiautomatic approach to identification of candidate thesaurus terms : parsing and filtering of noun phrases from citation contexts (2005) 0.03
    0.032495104 = product of:
      0.06499021 = sum of:
        0.06499021 = product of:
          0.09748531 = sum of:
            0.055017333 = weight(_text_:f in 156) [ClassicSimilarity], result of:
              0.055017333 = score(doc=156,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.3082599 = fieldWeight in 156, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=156)
            0.042467978 = weight(_text_:22 in 156) [ClassicSimilarity], result of:
              0.042467978 = score(doc=156,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.2708308 = fieldWeight in 156, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=156)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    8. 3.2007 19:55:22
    Source
    Context: nature, impact and role. 5th International Conference an Conceptions of Library and Information Sciences, CoLIS 2005 Glasgow, UK, June 2005. Ed. by F. Crestani u. I. Ruthven
  9. Natural language processing and speech technology : Results of the 3rd KONVENS Conference, Bielefeld, October 1996 (1996) 0.03
    0.028328393 = product of:
      0.056656785 = sum of:
        0.056656785 = product of:
          0.084985174 = sum of:
            0.047157712 = weight(_text_:f in 7291) [ClassicSimilarity], result of:
              0.047157712 = score(doc=7291,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.26422277 = fieldWeight in 7291, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7291)
            0.037827462 = weight(_text_:k in 7291) [ClassicSimilarity], result of:
              0.037827462 = score(doc=7291,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.23664509 = fieldWeight in 7291, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=7291)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Content
    Enthält u.a. die Beiträge: HILDEBRANDT, B. u.a.: Kognitive Modellierung von Sprach- und Bildverstehen; KELLER, F.: How do humans deal with ungrammatical input? Experimental evidence and computational modelling; MARX, J:: Die 'Computer-Talk-These' in der Sprachgenerierung: Hinweise zur Gestaltung natürlichsprachlicher Zustandsanzeigen in multimodalen Informationssystemen; SCHULTZ, T. u. H. SOLTAU: Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen; WAUSCHKUHN, O.: Ein Werkzeug zur partiellen syntaktischen Analyse deutscher Textkorpora; LEZIUS, W., R. RAPP u. M. WETTLER: A morphology-system and part-of-speech tagger for German; KONRAD, K. u.a.: CLEARS: ein Werkzeug für Ausbildung und Forschung in der Computerlinguistik
  10. Schröter, F.; Meyer, U.: Entwicklung sprachlicher Handlungskompetenz in Englisch mit Hilfe eines Multimedia-Sprachlernsystems (2000) 0.03
    0.028328393 = product of:
      0.056656785 = sum of:
        0.056656785 = product of:
          0.084985174 = sum of:
            0.047157712 = weight(_text_:f in 5567) [ClassicSimilarity], result of:
              0.047157712 = score(doc=5567,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.26422277 = fieldWeight in 5567, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5567)
            0.037827462 = weight(_text_:k in 5567) [ClassicSimilarity], result of:
              0.037827462 = score(doc=5567,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.23664509 = fieldWeight in 5567, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.046875 = fieldNorm(doc=5567)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Source
    Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
  11. Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.03
    0.026669985 = product of:
      0.05333997 = sum of:
        0.05333997 = product of:
          0.21335988 = sum of:
            0.21335988 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
              0.21335988 = score(doc=862,freq=2.0), product of:
                0.37963173 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.044778395 = queryNorm
                0.56201804 = fieldWeight in 862, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=862)
          0.25 = coord(1/4)
      0.5 = coord(1/2)
    
    Source
    https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN
  12. Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001) 0.02
    0.023606993 = product of:
      0.047213987 = sum of:
        0.047213987 = product of:
          0.07082098 = sum of:
            0.039298095 = weight(_text_:f in 6029) [ClassicSimilarity], result of:
              0.039298095 = score(doc=6029,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.22018565 = fieldWeight in 6029, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6029)
            0.031522885 = weight(_text_:k in 6029) [ClassicSimilarity], result of:
              0.031522885 = score(doc=6029,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19720423 = fieldWeight in 6029, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6029)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  13. Shaalan, K.; Raza, H.: NERA: Named Entity Recognition for Arabic (2009) 0.02
    0.023606993 = product of:
      0.047213987 = sum of:
        0.047213987 = product of:
          0.07082098 = sum of:
            0.039298095 = weight(_text_:f in 2953) [ClassicSimilarity], result of:
              0.039298095 = score(doc=2953,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.22018565 = fieldWeight in 2953, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2953)
            0.031522885 = weight(_text_:k in 2953) [ClassicSimilarity], result of:
              0.031522885 = score(doc=2953,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19720423 = fieldWeight in 2953, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2953)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Abstract
    Name identification has been worked on quite intensively for the past few years, and has been incorporated into several products revolving around natural language processing tasks. Many researchers have attacked the name identification problem in a variety of languages, but only a few limited research efforts have focused on named entity recognition for Arabic script. This is due to the lack of resources for Arabic named entities and the limited amount of progress made in Arabic natural language processing in general. In this article, we present the results of our attempt at the recognition and extraction of the 10 most important categories of named entities in Arabic script: the person name, location, company, date, time, price, measurement, phone number, ISBN, and file name. We developed the system Named Entity Recognition for Arabic (NERA) using a rule-based approach. The resources created are: a Whitelist representing a dictionary of names, and a grammar, in the form of regular expressions, which are responsible for recognizing the named entities. A filtration mechanism is used that serves two different purposes: (a) revision of the results from a named entity extractor by using metadata, in terms of a Blacklist or rejecter, about ill-formed named entities and (b) disambiguation of identical or overlapping textual matches returned by different name entity extractors to get the correct choice. In NERA, we addressed major challenges posed by NER in the Arabic language arising due to the complexity of the language, peculiarities in the Arabic orthographic system, nonstandardization of the written text, ambiguity, and lack of resources. NERA has been effectively evaluated using our own tagged corpus; it achieved satisfactory results in terms of precision, recall, and F-measure.
  14. Laparra, E.; Binford-Walsh, A.; Emerson, K.; Miller, M.L.; López-Hoffman, L.; Currim, F.; Bethard, S.: Addressing structural hurdles for metadata extraction from environmental impact statements (2023) 0.02
    0.023606993 = product of:
      0.047213987 = sum of:
        0.047213987 = product of:
          0.07082098 = sum of:
            0.039298095 = weight(_text_:f in 1042) [ClassicSimilarity], result of:
              0.039298095 = score(doc=1042,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.22018565 = fieldWeight in 1042, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1042)
            0.031522885 = weight(_text_:k in 1042) [ClassicSimilarity], result of:
              0.031522885 = score(doc=1042,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19720423 = fieldWeight in 1042, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1042)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
  15. Luo, L.; Ju, J.; Li, Y.-F.; Haffari, G.; Xiong, B.; Pan, S.: ChatRule: mining logical rules with large language models for knowledge graph reasoning (2023) 0.02
    0.02321079 = product of:
      0.04642158 = sum of:
        0.04642158 = product of:
          0.06963237 = sum of:
            0.039298095 = weight(_text_:f in 1171) [ClassicSimilarity], result of:
              0.039298095 = score(doc=1171,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.22018565 = fieldWeight in 1171, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1171)
            0.030334271 = weight(_text_:22 in 1171) [ClassicSimilarity], result of:
              0.030334271 = score(doc=1171,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19345059 = fieldWeight in 1171, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1171)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    23.11.2023 19:07:22
  16. Vichot, F.; Wolinksi, F.; Tomeh, J.; Guennou, S.; Dillet, B.; Aydjian, S.: High precision hypertext navigation based on NLP automation extractions (1997) 0.02
    0.02223036 = product of:
      0.04446072 = sum of:
        0.04446072 = product of:
          0.13338216 = sum of:
            0.13338216 = weight(_text_:f in 733) [ClassicSimilarity], result of:
              0.13338216 = score(doc=733,freq=4.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.74733484 = fieldWeight in 733, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.09375 = fieldNorm(doc=733)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  17. Sienel, J.; Weiss, M.; Laube, M.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts (2000) 0.02
    0.020619053 = product of:
      0.041238107 = sum of:
        0.041238107 = product of:
          0.061857156 = sum of:
            0.031522885 = weight(_text_:k in 5557) [ClassicSimilarity], result of:
              0.031522885 = score(doc=5557,freq=2.0), product of:
                0.15984893 = queryWeight, product of:
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19720423 = fieldWeight in 5557, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.569778 = idf(docFreq=3384, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5557)
            0.030334271 = weight(_text_:22 in 5557) [ClassicSimilarity], result of:
              0.030334271 = score(doc=5557,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.19345059 = fieldWeight in 5557, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5557)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    26.12.2000 13:22:17
    Source
    Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
  18. Rötzer, F.: KI-Programm besser als Menschen im Verständnis natürlicher Sprache (2018) 0.02
    0.018568631 = product of:
      0.037137263 = sum of:
        0.037137263 = product of:
          0.055705894 = sum of:
            0.031438477 = weight(_text_:f in 4217) [ClassicSimilarity], result of:
              0.031438477 = score(doc=4217,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.17614852 = fieldWeight in 4217, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4217)
            0.024267416 = weight(_text_:22 in 4217) [ClassicSimilarity], result of:
              0.024267416 = score(doc=4217,freq=2.0), product of:
                0.1568063 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.044778395 = queryNorm
                0.15476047 = fieldWeight in 4217, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4217)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    22. 1.2018 11:32:44
  19. Latzer, F.-M.: Yo Computa! (1997) 0.02
    0.018339112 = product of:
      0.036678225 = sum of:
        0.036678225 = product of:
          0.11003467 = sum of:
            0.11003467 = weight(_text_:f in 6005) [ClassicSimilarity], result of:
              0.11003467 = score(doc=6005,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.6165198 = fieldWeight in 6005, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6005)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  20. Blanchon, E.: Terminology software : pt.1.2 (1995) 0.02
    0.018339112 = product of:
      0.036678225 = sum of:
        0.036678225 = product of:
          0.11003467 = sum of:
            0.11003467 = weight(_text_:f in 6408) [ClassicSimilarity], result of:
              0.11003467 = score(doc=6408,freq=2.0), product of:
                0.1784771 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.044778395 = queryNorm
                0.6165198 = fieldWeight in 6408, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6408)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Language
    f

Languages

  • e 134
  • d 47
  • f 6
  • m 2
  • chi 1
  • More… Less…

Types

  • a 154
  • m 22
  • el 13
  • s 11
  • x 4
  • p 2
  • d 1
  • More… Less…

Subjects

Classifications