Document (#26854)

Author
Sidhom, S.
Hassoun, M.
Title
Morpho-syntactic parsing for a text mining environment : An NP recognition model for knowledge visualization and information retrieval
Source
Knowledge organization. 29(2002) nos.3/4, S.171-180
Year
2002
Abstract
Sidhom and Hassoun discuss the crucial role of NLP tools in Knowledge Extraction and Management as well as in the design of Information Retrieval Systems. The authors focus more specifically an the morpho-syntactic issues by describing their morpho-syntactic analysis platform, which has been implemented to cover the automatic indexing and information retrieval topics. To this end they implemented the Cascaded "Augmented Transition Network (ATN)". They used this formalism in order to analyse French text descriptions of Multimedia documents. An implementation of an ATN parsing automaton is briefly described. The Platform in its logical operation is considered as an investigative tool towards the knowledge organization (based an an NP recognition model) and management of multiform e-documents (text, multimedia, audio, image) using their text descriptions.
Theme
Computerlinguistik

Similar documents (author)

  1. Sidhom, S.; Hassoun, M.: Morpho-syntactic parsing to text mining environment : NP recognition model to knowledge visualization and information (2003) 4.94
    4.9355655 = sum of:
      4.9355655 = weight(author_txt:hassoun in 4547) [ClassicSimilarity], result of:
        4.9355655 = fieldWeight in 4547, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.5 = fieldNorm(doc=4547)
    
  2. Bernaoui, R.; Hassoun, M.: Knowledge awareness and standards in agricultural research in Algeria : prerequisites for a national information system of high added value (2011) 4.94
    4.9355655 = sum of:
      4.9355655 = weight(author_txt:hassoun in 1740) [ClassicSimilarity], result of:
        4.9355655 = fieldWeight in 1740, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.5 = fieldNorm(doc=1740)
    
  3. Bernaoui, R.; Hassoun, M.: User expectations, reality and delineation of agricultural information systems in the Maghreb (2012) 4.94
    4.9355655 = sum of:
      4.9355655 = weight(author_txt:hassoun in 2869) [ClassicSimilarity], result of:
        4.9355655 = fieldWeight in 2869, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.5 = fieldNorm(doc=2869)
    
  4. Bernaoui, R.; Ohly, H.P.; Salim, K.; Hassoun, M.: ¬The mobile phone between challenge and expectations : a potential for information sharing between Algerian breeders and veterinarians (2018) 3.08
    3.0847285 = sum of:
      3.0847285 = weight(author_txt:hassoun in 860) [ClassicSimilarity], result of:
        3.0847285 = fieldWeight in 860, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.871131 = idf(docFreq=5, maxDocs=42740)
          0.3125 = fieldNorm(doc=860)
    

Similar documents (content)

  1. Vilares, J.; Alonso, M.A.; Vilares, M.: Extraction of complex index terms in non-English IR : a shallow parsing based approach (2008) 0.25
    0.25406992 = sum of:
      0.25406992 = product of:
        0.9073926 = sum of:
          0.011297738 = weight(abstract_txt:information in 4108) [ClassicSimilarity], result of:
            0.011297738 = score(doc=4108,freq=3.0), product of:
              0.042946324 = queryWeight, product of:
                1.1277642 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015670499 = queryNorm
              0.26306647 = fieldWeight in 4108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.02986845 = weight(abstract_txt:documents in 4108) [ClassicSimilarity], result of:
            0.02986845 = score(doc=4108,freq=2.0), product of:
              0.08211203 = queryWeight, product of:
                1.2732482 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.015670499 = queryNorm
              0.36375242 = fieldWeight in 4108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.023203958 = weight(abstract_txt:management in 4108) [ClassicSimilarity], result of:
            0.023203958 = score(doc=4108,freq=1.0), product of:
              0.08742783 = queryWeight, product of:
                1.3138161 = boost
                4.246512 = idf(docFreq=1662, maxDocs=42740)
                0.015670499 = queryNorm
              0.265407 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246512 = idf(docFreq=1662, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.018905636 = weight(abstract_txt:retrieval in 4108) [ClassicSimilarity], result of:
            0.018905636 = score(doc=4108,freq=1.0), product of:
              0.08730376 = queryWeight, product of:
                1.6079472 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.015670499 = queryNorm
              0.21655008 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.14220259 = weight(abstract_txt:parsing in 4108) [ClassicSimilarity], result of:
            0.14220259 = score(doc=4108,freq=1.0), product of:
              0.29278353 = queryWeight, product of:
                2.4042687 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.015670499 = queryNorm
              0.48569188 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.28185728 = weight(abstract_txt:syntactic in 4108) [ClassicSimilarity], result of:
            0.28185728 = score(doc=4108,freq=5.0), product of:
              0.30926794 = queryWeight, product of:
                3.0263753 = boost
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.015670499 = queryNorm
              0.9113692 = fieldWeight in 4108, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
          0.40005696 = weight(abstract_txt:morpho in 4108) [ClassicSimilarity], result of:
            0.40005696 = score(doc=4108,freq=1.0), product of:
              0.6679131 = queryWeight, product of:
                4.447493 = boost
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.015670499 = queryNorm
              0.5989656 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.583449 = idf(docFreq=7, maxDocs=42740)
                0.0625 = fieldNorm(doc=4108)
        0.28 = coord(7/25)
    
  2. Chowdhury, G.G.: Natural language processing and information retrieval : pt.1: basic issues; pt.2: major applications (1991) 0.16
    0.16170478 = sum of:
      0.16170478 = product of:
        0.8085239 = sum of:
          0.01630688 = weight(abstract_txt:information in 3313) [ClassicSimilarity], result of:
            0.01630688 = score(doc=3313,freq=1.0), product of:
              0.042946324 = queryWeight, product of:
                1.1277642 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015670499 = queryNorm
              0.37970376 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.15625 = fieldNorm(doc=3313)
          0.04726409 = weight(abstract_txt:retrieval in 3313) [ClassicSimilarity], result of:
            0.04726409 = score(doc=3313,freq=1.0), product of:
              0.08730376 = queryWeight, product of:
                1.6079472 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.015670499 = queryNorm
              0.5413752 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.15625 = fieldNorm(doc=3313)
          0.07432051 = weight(abstract_txt:knowledge in 3313) [ClassicSimilarity], result of:
            0.07432051 = score(doc=3313,freq=2.0), product of:
              0.09370035 = queryWeight, product of:
                1.6658118 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.015670499 = queryNorm
              0.7931722 = fieldWeight in 3313, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.15625 = fieldNorm(doc=3313)
          0.35550645 = weight(abstract_txt:parsing in 3313) [ClassicSimilarity], result of:
            0.35550645 = score(doc=3313,freq=1.0), product of:
              0.29278353 = queryWeight, product of:
                2.4042687 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.015670499 = queryNorm
              1.2142297 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.15625 = fieldNorm(doc=3313)
          0.31512597 = weight(abstract_txt:syntactic in 3313) [ClassicSimilarity], result of:
            0.31512597 = score(doc=3313,freq=1.0), product of:
              0.30926794 = queryWeight, product of:
                3.0263753 = boost
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.015670499 = queryNorm
              1.0189416 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.15625 = fieldNorm(doc=3313)
        0.2 = coord(5/25)
    
  3. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 0.13
    0.13499118 = sum of:
      0.13499118 = product of:
        0.48211136 = sum of:
          0.015977414 = weight(abstract_txt:information in 2928) [ClassicSimilarity], result of:
            0.015977414 = score(doc=2928,freq=6.0), product of:
              0.042946324 = queryWeight, product of:
                1.1277642 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015670499 = queryNorm
              0.3720322 = fieldWeight in 2928, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.021120183 = weight(abstract_txt:documents in 2928) [ClassicSimilarity], result of:
            0.021120183 = score(doc=2928,freq=1.0), product of:
              0.08211203 = queryWeight, product of:
                1.2732482 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.015670499 = queryNorm
              0.2572118 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.023203958 = weight(abstract_txt:management in 2928) [ClassicSimilarity], result of:
            0.023203958 = score(doc=2928,freq=1.0), product of:
              0.08742783 = queryWeight, product of:
                1.3138161 = boost
                4.246512 = idf(docFreq=1662, maxDocs=42740)
                0.015670499 = queryNorm
              0.265407 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246512 = idf(docFreq=1662, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.021021014 = weight(abstract_txt:knowledge in 2928) [ClassicSimilarity], result of:
            0.021021014 = score(doc=2928,freq=1.0), product of:
              0.09370035 = queryWeight, product of:
                1.6658118 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.015670499 = queryNorm
              0.22434297 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.14220259 = weight(abstract_txt:parsing in 2928) [ClassicSimilarity], result of:
            0.14220259 = score(doc=2928,freq=1.0), product of:
              0.29278353 = queryWeight, product of:
                2.4042687 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.015670499 = queryNorm
              0.48569188 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.0402605 = weight(abstract_txt:text in 2928) [ClassicSimilarity], result of:
            0.0402605 = score(doc=2928,freq=1.0), product of:
              0.15905151 = queryWeight, product of:
                2.5060723 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015670499 = queryNorm
              0.2531287 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
          0.2183257 = weight(abstract_txt:syntactic in 2928) [ClassicSimilarity], result of:
            0.2183257 = score(doc=2928,freq=3.0), product of:
              0.30926794 = queryWeight, product of:
                3.0263753 = boost
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.015670499 = queryNorm
              0.7059435 = fieldWeight in 2928, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.0625 = fieldNorm(doc=2928)
        0.28 = coord(7/25)
    
  4. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.13
    0.1348196 = sum of:
      0.1348196 = product of:
        0.48149854 = sum of:
          0.009224565 = weight(abstract_txt:information in 4896) [ClassicSimilarity], result of:
            0.009224565 = score(doc=4896,freq=2.0), product of:
              0.042946324 = queryWeight, product of:
                1.1277642 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015670499 = queryNorm
              0.21479288 = fieldWeight in 4896, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.019718966 = weight(abstract_txt:model in 4896) [ClassicSimilarity], result of:
            0.019718966 = score(doc=4896,freq=1.0), product of:
              0.078438826 = queryWeight, product of:
                1.2444437 = boost
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.015670499 = queryNorm
              0.25139293 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.022287 = idf(docFreq=2080, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.021021014 = weight(abstract_txt:knowledge in 4896) [ClassicSimilarity], result of:
            0.021021014 = score(doc=4896,freq=1.0), product of:
              0.09370035 = queryWeight, product of:
                1.6658118 = boost
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.015670499 = queryNorm
              0.22434297 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5894876 = idf(docFreq=3207, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.123020515 = weight(abstract_txt:recognition in 4896) [ClassicSimilarity], result of:
            0.123020515 = score(doc=4896,freq=3.0), product of:
              0.18431172 = queryWeight, product of:
                1.9075949 = boost
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.015670499 = queryNorm
              0.667459 = fieldWeight in 4896, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1657224 = idf(docFreq=243, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.14220259 = weight(abstract_txt:parsing in 4896) [ClassicSimilarity], result of:
            0.14220259 = score(doc=4896,freq=1.0), product of:
              0.29278353 = queryWeight, product of:
                2.4042687 = boost
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.015670499 = queryNorm
              0.48569188 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.77107 = idf(docFreq=48, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.0402605 = weight(abstract_txt:text in 4896) [ClassicSimilarity], result of:
            0.0402605 = score(doc=4896,freq=1.0), product of:
              0.15905151 = queryWeight, product of:
                2.5060723 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015670499 = queryNorm
              0.2531287 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
          0.1260504 = weight(abstract_txt:syntactic in 4896) [ClassicSimilarity], result of:
            0.1260504 = score(doc=4896,freq=1.0), product of:
              0.30926794 = queryWeight, product of:
                3.0263753 = boost
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.015670499 = queryNorm
              0.40757668 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.521227 = idf(docFreq=170, maxDocs=42740)
                0.0625 = fieldNorm(doc=4896)
        0.28 = coord(7/25)
    
  5. Lee, N.S.: InfoStation: a multimedia access system for library automation (1990) 0.13
    0.12547554 = sum of:
      0.12547554 = product of:
        0.52281475 = sum of:
          0.090293154 = weight(abstract_txt:audio in 7070) [ClassicSimilarity], result of:
            0.090293154 = score(doc=7070,freq=1.0), product of:
              0.10814711 = queryWeight, product of:
                1.0332422 = boost
                6.679284 = idf(docFreq=145, maxDocs=42740)
                0.015670499 = queryNorm
              0.8349105 = fieldWeight in 7070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.679284 = idf(docFreq=145, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
          0.01844913 = weight(abstract_txt:information in 7070) [ClassicSimilarity], result of:
            0.01844913 = score(doc=7070,freq=2.0), product of:
              0.042946324 = queryWeight, product of:
                1.1277642 = boost
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.015670499 = queryNorm
              0.42958575 = fieldWeight in 7070, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.430104 = idf(docFreq=10226, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
          0.03781127 = weight(abstract_txt:retrieval in 7070) [ClassicSimilarity], result of:
            0.03781127 = score(doc=7070,freq=1.0), product of:
              0.08730376 = queryWeight, product of:
                1.6079472 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.015670499 = queryNorm
              0.43310016 = fieldWeight in 7070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
          0.12148521 = weight(abstract_txt:multimedia in 7070) [ClassicSimilarity], result of:
            0.12148521 = score(doc=7070,freq=2.0), product of:
              0.1318036 = queryWeight, product of:
                1.6131448 = boost
                5.214001 = idf(docFreq=631, maxDocs=42740)
                0.015670499 = queryNorm
              0.9217139 = fieldWeight in 7070, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.214001 = idf(docFreq=631, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
          0.17425498 = weight(abstract_txt:platform in 7070) [ClassicSimilarity], result of:
            0.17425498 = score(doc=7070,freq=1.0), product of:
              0.2112087 = queryWeight, product of:
                2.0420463 = boost
                6.6002955 = idf(docFreq=157, maxDocs=42740)
                0.015670499 = queryNorm
              0.82503694 = fieldWeight in 7070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6002955 = idf(docFreq=157, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
          0.080521 = weight(abstract_txt:text in 7070) [ClassicSimilarity], result of:
            0.080521 = score(doc=7070,freq=1.0), product of:
              0.15905151 = queryWeight, product of:
                2.5060723 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015670499 = queryNorm
              0.5062574 = fieldWeight in 7070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.125 = fieldNorm(doc=7070)
        0.24 = coord(6/25)