Document (#25463)

Author
Riggs, K.R.
Title
XML and free text
Source
Journal of the American Society for Information Science and technology. 53(2002) no.6, S.526-528
Year
2002
Abstract
We show several problems with marking free text, text that is either natural language or semigrammatical but unstructured. These problems prevent well-formed XML from marking text for readily available meaning. A solution is proposed to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML.
Object
XML

Similar documents (author)

  1. Riggs, F.W.: Information and social science : the need for onomantics (1989) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:riggs in 2911) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 2911, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=2911)
    
  2. Riggs, F.W.: Onomantics and terminology : pt.1: their contributions to knowledge organization (1996) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:riggs in 3819) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 3819, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=3819)
    
  3. Riggs, F.W.: Onomantics and terminology : pt.2: core concepts (1996) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:riggs in 5456) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 5456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=5456)
    
  4. Riggs, F.W.: Onomantics and terminology : pt.3: formats, borrowed terms and omissions (1996) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:riggs in 6109) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 6109, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=6109)
    
  5. Riggs, F.W.: Onomantics and terminology : pt.4: neologisms, neoterisms, meta-terms, phrases, and pleonisms (1997) 6.07
    6.0731125 = sum of:
      6.0731125 = weight(author_txt:riggs in 536) [ClassicSimilarity], result of:
        6.0731125 = fieldWeight in 536, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.71698 = idf(docFreq=6, maxDocs=42740)
          0.625 = fieldNorm(doc=536)
    

Similar documents (content)

  1. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.15
    0.14566888 = sum of:
      0.14566888 = product of:
        0.45521528 = sum of:
          0.037633516 = weight(abstract_txt:language in 3317) [ClassicSimilarity], result of:
            0.037633516 = score(doc=3317,freq=3.0), product of:
              0.066315606 = queryWeight, product of:
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.015812747 = queryNorm
              0.5674911 = fieldWeight in 3317, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.036816247 = weight(abstract_txt:show in 3317) [ClassicSimilarity], result of:
            0.036816247 = score(doc=3317,freq=2.0), product of:
              0.074809365 = queryWeight, product of:
                1.0621115 = boost
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.015812747 = queryNorm
              0.4921342 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4542904 = idf(docFreq=1350, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.027907776 = weight(abstract_txt:several in 3317) [ClassicSimilarity], result of:
            0.027907776 = score(doc=3317,freq=1.0), product of:
              0.07835916 = queryWeight, product of:
                1.0870187 = boost
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.015812747 = queryNorm
              0.35615206 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.011441019 = weight(abstract_txt:that in 3317) [ClassicSimilarity], result of:
            0.011441019 = score(doc=3317,freq=2.0), product of:
              0.04324303 = queryWeight, product of:
                1.1419976 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.015812747 = queryNorm
              0.2645749 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.013295531 = weight(abstract_txt:with in 3317) [ClassicSimilarity], result of:
            0.013295531 = score(doc=3317,freq=2.0), product of:
              0.047798038 = queryWeight, product of:
                1.2006382 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.015812747 = queryNorm
              0.2781606 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.06809415 = weight(abstract_txt:natural in 3317) [ClassicSimilarity], result of:
            0.06809415 = score(doc=3317,freq=3.0), product of:
              0.098470405 = queryWeight, product of:
                1.2185546 = boost
                5.1103826 = idf(docFreq=700, maxDocs=42740)
                0.015812747 = queryNorm
              0.6915189 = fieldWeight in 3317, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1103826 = idf(docFreq=700, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.10432503 = weight(abstract_txt:meaning in 3317) [ClassicSimilarity], result of:
            0.10432503 = score(doc=3317,freq=1.0), product of:
              0.23779824 = queryWeight, product of:
                2.6780055 = boost
                5.6155186 = idf(docFreq=422, maxDocs=42740)
                0.015812747 = queryNorm
              0.4387124 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6155186 = idf(docFreq=422, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
          0.155702 = weight(abstract_txt:free in 3317) [ClassicSimilarity], result of:
            0.155702 = score(doc=3317,freq=1.0), product of:
              0.35550264 = queryWeight, product of:
                4.0102754 = boost
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.015812747 = queryNorm
              0.43797705 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.078125 = fieldNorm(doc=3317)
        0.32 = coord(8/25)
    
  2. Ashford, J.H.: Free text retrieval in the Welsh language : problems, and proposed working practice (1995) 0.14
    0.13535042 = sum of:
      0.13535042 = product of:
        0.6767521 = sum of:
          0.034764353 = weight(abstract_txt:language in 6509) [ClassicSimilarity], result of:
            0.034764353 = score(doc=6509,freq=1.0), product of:
              0.066315606 = queryWeight, product of:
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.015812747 = queryNorm
              0.52422583 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.125 = fieldNorm(doc=6509)
          0.047110695 = weight(abstract_txt:proposed in 6509) [ClassicSimilarity], result of:
            0.047110695 = score(doc=6509,freq=1.0), product of:
              0.08120934 = queryWeight, product of:
                1.1066114 = boost
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.015812747 = queryNorm
              0.58011425 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.640914 = idf(docFreq=1120, maxDocs=42740)
                0.125 = fieldNorm(doc=6509)
          0.07459497 = weight(abstract_txt:problems in 6509) [ClassicSimilarity], result of:
            0.07459497 = score(doc=6509,freq=1.0), product of:
              0.13899824 = queryWeight, product of:
                2.047443 = boost
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.015812747 = queryNorm
              0.5366612 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.125 = fieldNorm(doc=6509)
          0.2491232 = weight(abstract_txt:free in 6509) [ClassicSimilarity], result of:
            0.2491232 = score(doc=6509,freq=1.0), product of:
              0.35550264 = queryWeight, product of:
                4.0102754 = boost
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.015812747 = queryNorm
              0.7007633 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.125 = fieldNorm(doc=6509)
          0.27115884 = weight(abstract_txt:text in 6509) [ClassicSimilarity], result of:
            0.27115884 = score(doc=6509,freq=3.0), product of:
              0.3092372 = queryWeight, product of:
                4.82862 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015812747 = queryNorm
              0.87686354 = fieldWeight in 6509, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.125 = fieldNorm(doc=6509)
        0.2 = coord(5/25)
    
  3. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.13
    0.13317242 = sum of:
      0.13317242 = product of:
        0.47561577 = sum of:
          0.030727638 = weight(abstract_txt:language in 3636) [ClassicSimilarity], result of:
            0.030727638 = score(doc=3636,freq=2.0), product of:
              0.066315606 = queryWeight, product of:
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.015812747 = queryNorm
              0.46335456 = fieldWeight in 3636, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1938066 = idf(docFreq=1752, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.008090023 = weight(abstract_txt:that in 3636) [ClassicSimilarity], result of:
            0.008090023 = score(doc=3636,freq=1.0), product of:
              0.04324303 = queryWeight, product of:
                1.1419976 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.015812747 = queryNorm
              0.18708271 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.00940136 = weight(abstract_txt:with in 3636) [ClassicSimilarity], result of:
            0.00940136 = score(doc=3636,freq=1.0), product of:
              0.047798038 = queryWeight, product of:
                1.2006382 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.015812747 = queryNorm
              0.19668923 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.05559864 = weight(abstract_txt:natural in 3636) [ClassicSimilarity], result of:
            0.05559864 = score(doc=3636,freq=2.0), product of:
              0.098470405 = queryWeight, product of:
                1.2185546 = boost
                5.1103826 = idf(docFreq=700, maxDocs=42740)
                0.015812747 = queryNorm
              0.5646228 = fieldWeight in 3636, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1103826 = idf(docFreq=700, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.04662185 = weight(abstract_txt:problems in 3636) [ClassicSimilarity], result of:
            0.04662185 = score(doc=3636,freq=1.0), product of:
              0.13899824 = queryWeight, product of:
                2.047443 = boost
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.015812747 = queryNorm
              0.33541325 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.155702 = weight(abstract_txt:free in 3636) [ClassicSimilarity], result of:
            0.155702 = score(doc=3636,freq=1.0), product of:
              0.35550264 = queryWeight, product of:
                4.0102754 = boost
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.015812747 = queryNorm
              0.43797705 = fieldWeight in 3636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
          0.16947427 = weight(abstract_txt:text in 3636) [ClassicSimilarity], result of:
            0.16947427 = score(doc=3636,freq=3.0), product of:
              0.3092372 = queryWeight, product of:
                4.82862 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015812747 = queryNorm
              0.54803973 = fieldWeight in 3636, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=3636)
        0.28 = coord(7/25)
    
  4. Gagan, D.: Scanning: a survival guide : 6: text scanning - editing and performance (1993) 0.13
    0.1321721 = sum of:
      0.1321721 = product of:
        0.8260756 = sum of:
          0.04465244 = weight(abstract_txt:several in 6302) [ClassicSimilarity], result of:
            0.04465244 = score(doc=6302,freq=1.0), product of:
              0.07835916 = queryWeight, product of:
                1.0870187 = boost
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.015812747 = queryNorm
              0.5698433 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.125 = fieldNorm(doc=6302)
          0.015042176 = weight(abstract_txt:with in 6302) [ClassicSimilarity], result of:
            0.015042176 = score(doc=6302,freq=1.0), product of:
              0.047798038 = queryWeight, product of:
                1.2006382 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.015812747 = queryNorm
              0.31470278 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.125 = fieldNorm(doc=6302)
          0.54498076 = weight(abstract_txt:marking in 6302) [ClassicSimilarity], result of:
            0.54498076 = score(doc=6302,freq=1.0), product of:
              0.5233478 = queryWeight, product of:
                3.9728515 = boost
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.015812747 = queryNorm
              1.0413357 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.330686 = idf(docFreq=27, maxDocs=42740)
                0.125 = fieldNorm(doc=6302)
          0.22140026 = weight(abstract_txt:text in 6302) [ClassicSimilarity], result of:
            0.22140026 = score(doc=6302,freq=2.0), product of:
              0.3092372 = queryWeight, product of:
                4.82862 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.015812747 = queryNorm
              0.7159561 = fieldWeight in 6302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.125 = fieldNorm(doc=6302)
        0.16 = coord(4/25)
    
  5. Karamuftuoglu, M.: Collaborative information retrieval : toward a social informatics view of IR interaction (1998) 0.13
    0.12888491 = sum of:
      0.12888491 = product of:
        0.46030328 = sum of:
          0.027907776 = weight(abstract_txt:several in 3152) [ClassicSimilarity], result of:
            0.027907776 = score(doc=3152,freq=1.0), product of:
              0.07835916 = queryWeight, product of:
                1.0870187 = boost
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.015812747 = queryNorm
              0.35615206 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5587463 = idf(docFreq=1216, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.011441019 = weight(abstract_txt:that in 3152) [ClassicSimilarity], result of:
            0.011441019 = score(doc=3152,freq=2.0), product of:
              0.04324303 = queryWeight, product of:
                1.1419976 = boost
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.015812747 = queryNorm
              0.2645749 = fieldWeight in 3152, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3946586 = idf(docFreq=10595, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.00940136 = weight(abstract_txt:with in 3152) [ClassicSimilarity], result of:
            0.00940136 = score(doc=3152,freq=1.0), product of:
              0.047798038 = queryWeight, product of:
                1.2006382 = boost
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.015812747 = queryNorm
              0.19668923 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5176222 = idf(docFreq=9369, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.10490425 = weight(abstract_txt:readily in 3152) [ClassicSimilarity], result of:
            0.10490425 = score(doc=3152,freq=1.0), product of:
              0.18943854 = queryWeight, product of:
                1.6901541 = boost
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.015812747 = queryNorm
              0.55376405 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0881796 = idf(docFreq=96, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.04662185 = weight(abstract_txt:problems in 3152) [ClassicSimilarity], result of:
            0.04662185 = score(doc=3152,freq=1.0), product of:
              0.13899824 = queryWeight, product of:
                2.047443 = boost
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.015812747 = queryNorm
              0.33541325 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2932897 = idf(docFreq=1586, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.10432503 = weight(abstract_txt:meaning in 3152) [ClassicSimilarity], result of:
            0.10432503 = score(doc=3152,freq=1.0), product of:
              0.23779824 = queryWeight, product of:
                2.6780055 = boost
                5.6155186 = idf(docFreq=422, maxDocs=42740)
                0.015812747 = queryNorm
              0.4387124 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6155186 = idf(docFreq=422, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
          0.155702 = weight(abstract_txt:free in 3152) [ClassicSimilarity], result of:
            0.155702 = score(doc=3152,freq=1.0), product of:
              0.35550264 = queryWeight, product of:
                4.0102754 = boost
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.015812747 = queryNorm
              0.43797705 = fieldWeight in 3152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6061063 = idf(docFreq=426, maxDocs=42740)
                0.078125 = fieldNorm(doc=3152)
        0.28 = coord(7/25)