Document (#25463)

Author
Riggs, K.R.
Title
XML and free text
Source
Journal of the American Society for Information Science and technology. 53(2002) no.6, S.526-528
Year
2002
Abstract
We show several problems with marking free text, text that is either natural language or semigrammatical but unstructured. These problems prevent well-formed XML from marking text for readily available meaning. A solution is proposed to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML.
Object
XML

Similar documents (author)

  1. Riggs, F.W.: Information and social science : the need for onomantics (1989) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:riggs in 3911) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 3911, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=3911)
    
  2. Riggs, F.W.: Onomantics and terminology : pt.1: their contributions to knowledge organization (1996) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:riggs in 4819) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 4819, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=4819)
    
  3. Riggs, F.W.: Onomantics and terminology : pt.2: core concepts (1996) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:riggs in 6456) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 6456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=6456)
    
  4. Riggs, F.W.: Onomantics and terminology : pt.3: formats, borrowed terms and omissions (1996) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:riggs in 109) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 109, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=109)
    
  5. Riggs, F.W.: Onomantics and terminology : pt.4: neologisms, neoterisms, meta-terms, phrases, and pleonisms (1997) 6.08
    6.0805845 = sum of:
      6.0805845 = weight(author_txt:riggs in 536) [ClassicSimilarity], result of:
        6.0805845 = fieldWeight in 536, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.728935 = idf(docFreq=6, maxDocs=43254)
          0.625 = fieldNorm(doc=536)
    

Similar documents (content)

  1. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.15
    0.14525686 = sum of:
      0.14525686 = product of:
        0.4539277 = sum of:
          0.037601493 = weight(abstract_txt:language in 3317) [ClassicSimilarity], result of:
            0.037601493 = score(doc=3317,freq=3.0), product of:
              0.066285156 = queryWeight, product of:
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.015811684 = queryNorm
              0.56726867 = fieldWeight in 3317, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.036526956 = weight(abstract_txt:show in 3317) [ClassicSimilarity], result of:
            0.036526956 = score(doc=3317,freq=2.0), product of:
              0.07442502 = queryWeight, product of:
                1.0596229 = boost
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.015811684 = queryNorm
              0.49078867 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.442112 = idf(docFreq=1383, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.027822271 = weight(abstract_txt:several in 3317) [ClassicSimilarity], result of:
            0.027822271 = score(doc=3317,freq=1.0), product of:
              0.07820749 = queryWeight, product of:
                1.0862156 = boost
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.015811684 = queryNorm
              0.35574943 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.0113127325 = weight(abstract_txt:that in 3317) [ClassicSimilarity], result of:
            0.0113127325 = score(doc=3317,freq=2.0), product of:
              0.042923816 = queryWeight, product of:
                1.138036 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.015811684 = queryNorm
              0.26355374 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.013189707 = weight(abstract_txt:with in 3317) [ClassicSimilarity], result of:
            0.013189707 = score(doc=3317,freq=2.0), product of:
              0.04754922 = queryWeight, product of:
                1.1977842 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.015811684 = queryNorm
              0.2773906 = fieldWeight in 3317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.06797181 = weight(abstract_txt:natural in 3317) [ClassicSimilarity], result of:
            0.06797181 = score(doc=3317,freq=3.0), product of:
              0.09836308 = queryWeight, product of:
                1.21817 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.015811684 = queryNorm
              0.6910297 = fieldWeight in 3317, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.10372402 = weight(abstract_txt:meaning in 3317) [ClassicSimilarity], result of:
            0.10372402 = score(doc=3317,freq=1.0), product of:
              0.2369097 = queryWeight, product of:
                2.6736114 = boost
                5.6041074 = idf(docFreq=432, maxDocs=43254)
                0.015811684 = queryNorm
              0.43782088 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6041074 = idf(docFreq=432, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
          0.15577869 = weight(abstract_txt:free in 3317) [ClassicSimilarity], result of:
            0.15577869 = score(doc=3317,freq=1.0), product of:
              0.35565788 = queryWeight, product of:
                4.012072 = boost
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.015811684 = queryNorm
              0.4380015 = fieldWeight in 3317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.078125 = fieldNorm(doc=3317)
        0.32 = coord(8/25)
    
  2. Ashford, J.H.: Free text retrieval in the Welsh language : problems, and proposed working practice (1995) 0.14
    0.13532694 = sum of:
      0.13532694 = product of:
        0.67663467 = sum of:
          0.03473477 = weight(abstract_txt:language in 6509) [ClassicSimilarity], result of:
            0.03473477 = score(doc=6509,freq=1.0), product of:
              0.066285156 = queryWeight, product of:
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.015811684 = queryNorm
              0.5240204 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.125 = fieldNorm(doc=6509)
          0.046687007 = weight(abstract_txt:proposed in 6509) [ClassicSimilarity], result of:
            0.046687007 = score(doc=6509,freq=1.0), product of:
              0.080730446 = queryWeight, product of:
                1.103597 = boost
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.015811684 = queryNorm
              0.57830733 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6264586 = idf(docFreq=1150, maxDocs=43254)
                0.125 = fieldNorm(doc=6509)
          0.07478468 = weight(abstract_txt:problems in 6509) [ClassicSimilarity], result of:
            0.07478468 = score(doc=6509,freq=1.0), product of:
              0.1392489 = queryWeight, product of:
                2.0497587 = boost
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.015811684 = queryNorm
              0.53705764 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.125 = fieldNorm(doc=6509)
          0.24924591 = weight(abstract_txt:free in 6509) [ClassicSimilarity], result of:
            0.24924591 = score(doc=6509,freq=1.0), product of:
              0.35565788 = queryWeight, product of:
                4.012072 = boost
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.015811684 = queryNorm
              0.70080245 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.125 = fieldNorm(doc=6509)
          0.27118233 = weight(abstract_txt:text in 6509) [ClassicSimilarity], result of:
            0.27118233 = score(doc=6509,freq=3.0), product of:
              0.30928853 = queryWeight, product of:
                4.8301296 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.015811684 = queryNorm
              0.876794 = fieldWeight in 6509, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.125 = fieldNorm(doc=6509)
        0.2 = coord(5/25)
    
  3. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.13
    0.13314956 = sum of:
      0.13314956 = product of:
        0.47553414 = sum of:
          0.03070149 = weight(abstract_txt:language in 4636) [ClassicSimilarity], result of:
            0.03070149 = score(doc=4636,freq=2.0), product of:
              0.066285156 = queryWeight, product of:
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.015811684 = queryNorm
              0.46317294 = fieldWeight in 4636, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.192163 = idf(docFreq=1776, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.00799931 = weight(abstract_txt:that in 4636) [ClassicSimilarity], result of:
            0.00799931 = score(doc=4636,freq=1.0), product of:
              0.042923816 = queryWeight, product of:
                1.138036 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.015811684 = queryNorm
              0.18636064 = fieldWeight in 4636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.0093265325 = weight(abstract_txt:with in 4636) [ClassicSimilarity], result of:
            0.0093265325 = score(doc=4636,freq=1.0), product of:
              0.04754922 = queryWeight, product of:
                1.1977842 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.015811684 = queryNorm
              0.19614479 = fieldWeight in 4636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.055498753 = weight(abstract_txt:natural in 4636) [ClassicSimilarity], result of:
            0.055498753 = score(doc=4636,freq=2.0), product of:
              0.09836308 = queryWeight, product of:
                1.21817 = boost
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.015811684 = queryNorm
              0.5642234 = fieldWeight in 4636, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106767 = idf(docFreq=711, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.046740428 = weight(abstract_txt:problems in 4636) [ClassicSimilarity], result of:
            0.046740428 = score(doc=4636,freq=1.0), product of:
              0.1392489 = queryWeight, product of:
                2.0497587 = boost
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.015811684 = queryNorm
              0.33566102 = fieldWeight in 4636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.15577869 = weight(abstract_txt:free in 4636) [ClassicSimilarity], result of:
            0.15577869 = score(doc=4636,freq=1.0), product of:
              0.35565788 = queryWeight, product of:
                4.012072 = boost
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.015811684 = queryNorm
              0.4380015 = fieldWeight in 4636, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
          0.16948895 = weight(abstract_txt:text in 4636) [ClassicSimilarity], result of:
            0.16948895 = score(doc=4636,freq=3.0), product of:
              0.30928853 = queryWeight, product of:
                4.8301296 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.015811684 = queryNorm
              0.5479962 = fieldWeight in 4636, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.078125 = fieldNorm(doc=4636)
        0.28 = coord(7/25)
    
  4. Gagan, D.: Scanning: a survival guide : 6: text scanning - editing and performance (1993) 0.13
    0.13253851 = sum of:
      0.13253851 = product of:
        0.82836574 = sum of:
          0.044515632 = weight(abstract_txt:several in 6302) [ClassicSimilarity], result of:
            0.044515632 = score(doc=6302,freq=1.0), product of:
              0.07820749 = queryWeight, product of:
                1.0862156 = boost
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.015811684 = queryNorm
              0.5691991 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.125 = fieldNorm(doc=6302)
          0.014922451 = weight(abstract_txt:with in 6302) [ClassicSimilarity], result of:
            0.014922451 = score(doc=6302,freq=1.0), product of:
              0.04754922 = queryWeight, product of:
                1.1977842 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.015811684 = queryNorm
              0.31383166 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.125 = fieldNorm(doc=6302)
          0.5475082 = weight(abstract_txt:marking in 6302) [ClassicSimilarity], result of:
            0.5475082 = score(doc=6302,freq=1.0), product of:
              0.52502143 = queryWeight, product of:
                3.9801128 = boost
                8.342641 = idf(docFreq=27, maxDocs=43254)
                0.015811684 = queryNorm
              1.0428301 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.342641 = idf(docFreq=27, maxDocs=43254)
                0.125 = fieldNorm(doc=6302)
          0.22141944 = weight(abstract_txt:text in 6302) [ClassicSimilarity], result of:
            0.22141944 = score(doc=6302,freq=2.0), product of:
              0.30928853 = queryWeight, product of:
                4.8301296 = boost
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.015811684 = queryNorm
              0.7158993 = fieldWeight in 6302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.125 = fieldNorm(doc=6302)
        0.16 = coord(4/25)
    
  5. Karamuftuoglu, M.: Collaborative information retrieval : toward a social informatics view of IR interaction (1998) 0.13
    0.12872115 = sum of:
      0.12872115 = product of:
        0.4597184 = sum of:
          0.027822271 = weight(abstract_txt:several in 4152) [ClassicSimilarity], result of:
            0.027822271 = score(doc=4152,freq=1.0), product of:
              0.07820749 = queryWeight, product of:
                1.0862156 = boost
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.015811684 = queryNorm
              0.35574943 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5535927 = idf(docFreq=1237, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.0113127325 = weight(abstract_txt:that in 4152) [ClassicSimilarity], result of:
            0.0113127325 = score(doc=4152,freq=2.0), product of:
              0.042923816 = queryWeight, product of:
                1.138036 = boost
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.015811684 = queryNorm
              0.26355374 = fieldWeight in 4152, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3854163 = idf(docFreq=10822, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.0093265325 = weight(abstract_txt:with in 4152) [ClassicSimilarity], result of:
            0.0093265325 = score(doc=4152,freq=1.0), product of:
              0.04754922 = queryWeight, product of:
                1.1977842 = boost
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.015811684 = queryNorm
              0.19614479 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.5106533 = idf(docFreq=9548, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.10501375 = weight(abstract_txt:readily in 4152) [ClassicSimilarity], result of:
            0.10501375 = score(doc=4152,freq=1.0), product of:
              0.18959087 = queryWeight, product of:
                1.6912218 = boost
                7.0898776 = idf(docFreq=97, maxDocs=43254)
                0.015811684 = queryNorm
              0.55389667 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0898776 = idf(docFreq=97, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.046740428 = weight(abstract_txt:problems in 4152) [ClassicSimilarity], result of:
            0.046740428 = score(doc=4152,freq=1.0), product of:
              0.1392489 = queryWeight, product of:
                2.0497587 = boost
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.015811684 = queryNorm
              0.33566102 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.296461 = idf(docFreq=1600, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.10372402 = weight(abstract_txt:meaning in 4152) [ClassicSimilarity], result of:
            0.10372402 = score(doc=4152,freq=1.0), product of:
              0.2369097 = queryWeight, product of:
                2.6736114 = boost
                5.6041074 = idf(docFreq=432, maxDocs=43254)
                0.015811684 = queryNorm
              0.43782088 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6041074 = idf(docFreq=432, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
          0.15577869 = weight(abstract_txt:free in 4152) [ClassicSimilarity], result of:
            0.15577869 = score(doc=4152,freq=1.0), product of:
              0.35565788 = queryWeight, product of:
                4.012072 = boost
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.015811684 = queryNorm
              0.4380015 = fieldWeight in 4152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6064196 = idf(docFreq=431, maxDocs=43254)
                0.078125 = fieldNorm(doc=4152)
        0.28 = coord(7/25)