Document (#38748)

Author
Lorenz, S.
Title
Konzeption und prototypische Realisierung einer begriffsbasierten Texterschließung
Imprint
Trier : Universität
Year
2006
Pages
XV, 147 S
Abstract
Im Rahmen dieser Arbeit wird eine Vorgehensweise entwickelt, die die Fixierung auf das Wort und die damit verbundenen Schwächen überwindet. Sie gestattet die Extraktion von Informationen anhand der repräsentierten Begriffe und bildet damit die Basis einer inhaltlichen Texterschließung. Die anschließende prototypische Realisierung dient dazu, die Konzeption zu überprüfen sowie ihre Möglichkeiten und Grenzen abzuschätzen und zu bewerten. Arbeiten zum Information Extraction widmen sich fast ausschließlich dem Englischen, wobei insbesondere im Bereich der Named Entities sehr gute Ergebnisse erzielt werden. Deutlich schlechter sehen die Resultate für weniger regelmäßige Sprachen wie beispielsweise das Deutsche aus. Aus diesem Grund sowie praktischen Erwägungen wie insbesondere der Vertrautheit des Autors damit, soll diese Sprache primär Gegenstand der Untersuchungen sein. Die Lösung von einer engen Termorientierung bei gleichzeitiger Betonung der repräsentierten Begriffe legt nahe, dass nicht nur die verwendeten Worte sekundär werden sondern auch die verwendete Sprache. Um den Rahmen dieser Arbeit nicht zu sprengen wird bei der Untersuchung dieses Punktes das Augenmerk vor allem auf die mit unterschiedlichen Sprachen verbundenen Schwierigkeiten und Besonderheiten gelegt.
Content
Dissertation an der Universität Trier - Fachbereich IV - zur Erlangung der Würde eines Doktors der Wirtschafts- und Sozialwissenschaften. Vgl.: http://ubt.opus.hbz-nrw.de/volltexte/2006/377/pdf/LorenzSaschaDiss.pdf.
Theme
Computerlinguistik
Automatisches Indexieren

Similar documents (author)

  1. Lorenz, B.: Anmerkung zu: Bernd Maaßen: Inhaltserschließung und Innovationsbereitschaft (Erwiderung) (1985) 4.89
    4.8942966 = sum of:
      4.8942966 = weight(author_txt:lorenz in 71) [ClassicSimilarity], result of:
        4.8942966 = fieldWeight in 71, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8308744 = idf(docFreq=45, maxDocs=42596)
          0.625 = fieldNorm(doc=71)
    
  2. Lorenz, B.: Klassifikation im Bibliothekenverbund : Das Beispiel der Regensburger Aufstellungssystematiken (1992) 4.89
    4.8942966 = sum of:
      4.8942966 = weight(author_txt:lorenz in 211) [ClassicSimilarity], result of:
        4.8942966 = fieldWeight in 211, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8308744 = idf(docFreq=45, maxDocs=42596)
          0.625 = fieldNorm(doc=211)
    
  3. Lorenz, B.: Bibliotheksklassifikation im Verbund: Notizen zur Anwendung der Regensburger Aufstellungssystematiken (1989) 4.89
    4.8942966 = sum of:
      4.8942966 = weight(author_txt:lorenz in 548) [ClassicSimilarity], result of:
        4.8942966 = fieldWeight in 548, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8308744 = idf(docFreq=45, maxDocs=42596)
          0.625 = fieldNorm(doc=548)
    
  4. Lorenz, B.: Bibliothekarisches Zusammenwirken im Systematikverbund : Bemerkungen auf dem Hintergrund der Arbeit im Verbund der Anwender der Regensburger Aufstellungssystematiken (1986) 4.89
    4.8942966 = sum of:
      4.8942966 = weight(author_txt:lorenz in 626) [ClassicSimilarity], result of:
        4.8942966 = fieldWeight in 626, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8308744 = idf(docFreq=45, maxDocs=42596)
          0.625 = fieldNorm(doc=626)
    
  5. Lorenz, B.: Systematische Aufstellung in Bibliotheken als Werkzeug wissenschaftlicher Arbeit (1986) 4.89
    4.8942966 = sum of:
      4.8942966 = weight(author_txt:lorenz in 671) [ClassicSimilarity], result of:
        4.8942966 = fieldWeight in 671, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.8308744 = idf(docFreq=45, maxDocs=42596)
          0.625 = fieldNorm(doc=671)
    

Similar documents (content)

  1. Knorz, G.; Müller, J.: Wissensbasiertes Hochschulportal (2004) 0.11
    0.10721409 = sum of:
      0.10721409 = product of:
        0.53607047 = sum of:
          0.04918803 = weight(abstract_txt:sowie in 3306) [ClassicSimilarity], result of:
            0.04918803 = score(doc=3306,freq=1.0), product of:
              0.08460618 = queryWeight, product of:
                1.0312138 = boost
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.017640304 = queryNorm
              0.5813763 = fieldWeight in 3306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.125 = fieldNorm(doc=3306)
          0.07874689 = weight(abstract_txt:rahmen in 3306) [ClassicSimilarity], result of:
            0.07874689 = score(doc=3306,freq=1.0), product of:
              0.11578477 = queryWeight, product of:
                1.2063502 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.017640304 = queryNorm
              0.68011445 = fieldWeight in 3306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.125 = fieldNorm(doc=3306)
          0.04368439 = weight(abstract_txt:einer in 3306) [ClassicSimilarity], result of:
            0.04368439 = score(doc=3306,freq=1.0), product of:
              0.089483656 = queryWeight, product of:
                1.2988684 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.017640304 = queryNorm
              0.48818287 = fieldWeight in 3306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.125 = fieldNorm(doc=3306)
          0.17697926 = weight(abstract_txt:konzeption in 3306) [ClassicSimilarity], result of:
            0.17697926 = score(doc=3306,freq=1.0), product of:
              0.19866014 = queryWeight, product of:
                1.5801672 = boost
                7.126916 = idf(docFreq=92, maxDocs=42596)
                0.017640304 = queryNorm
              0.8908645 = fieldWeight in 3306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.126916 = idf(docFreq=92, maxDocs=42596)
                0.125 = fieldNorm(doc=3306)
          0.18747193 = weight(abstract_txt:realisierung in 3306) [ClassicSimilarity], result of:
            0.18747193 = score(doc=3306,freq=1.0), product of:
              0.20643657 = queryWeight, product of:
                1.6107976 = boost
                7.2650666 = idf(docFreq=80, maxDocs=42596)
                0.017640304 = queryNorm
              0.9081333 = fieldWeight in 3306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2650666 = idf(docFreq=80, maxDocs=42596)
                0.125 = fieldNorm(doc=3306)
        0.2 = coord(5/25)
    
  2. Rolland, M.T.: Grammatikstandardisierung im Bereich der Sprachverarbeitung (1996) 0.09
    0.09029601 = sum of:
      0.09029601 = product of:
        0.45148003 = sum of:
          0.06522628 = weight(abstract_txt:insbesondere in 5425) [ClassicSimilarity], result of:
            0.06522628 = score(doc=5425,freq=1.0), product of:
              0.1237096 = queryWeight, product of:
                1.246951 = boost
                5.6240344 = idf(docFreq=417, maxDocs=42596)
                0.017640304 = queryNorm
              0.5272532 = fieldWeight in 5425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6240344 = idf(docFreq=417, maxDocs=42596)
                0.09375 = fieldNorm(doc=5425)
          0.03276329 = weight(abstract_txt:einer in 5425) [ClassicSimilarity], result of:
            0.03276329 = score(doc=5425,freq=1.0), product of:
              0.089483656 = queryWeight, product of:
                1.2988684 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.017640304 = queryNorm
              0.36613715 = fieldWeight in 5425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.09375 = fieldNorm(doc=5425)
          0.16065495 = weight(abstract_txt:sprache in 5425) [ClassicSimilarity], result of:
            0.16065495 = score(doc=5425,freq=4.0), product of:
              0.1421339 = queryWeight, product of:
                1.3365848 = boost
                6.0283036 = idf(docFreq=278, maxDocs=42596)
                0.017640304 = queryNorm
              1.130307 = fieldWeight in 5425, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.0283036 = idf(docFreq=278, maxDocs=42596)
                0.09375 = fieldNorm(doc=5425)
          0.12606844 = weight(abstract_txt:sprachen in 5425) [ClassicSimilarity], result of:
            0.12606844 = score(doc=5425,freq=1.0), product of:
              0.19195196 = queryWeight, product of:
                1.5532593 = boost
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.017640304 = queryNorm
              0.6567708 = fieldWeight in 5425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.005555 = idf(docFreq=104, maxDocs=42596)
                0.09375 = fieldNorm(doc=5425)
          0.06676708 = weight(abstract_txt:damit in 5425) [ClassicSimilarity], result of:
            0.06676708 = score(doc=5425,freq=1.0), product of:
              0.14383359 = queryWeight, product of:
                1.646734 = boost
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.017640304 = queryNorm
              0.46419674 = fieldWeight in 5425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.09375 = fieldNorm(doc=5425)
        0.2 = coord(5/25)
    
  3. Studer, R.; Studer, H.-P.; Studer, A.: Semantisches Knowledge Retrieval (2001) 0.08
    0.078020535 = sum of:
      0.078020535 = product of:
        0.39010268 = sum of:
          0.030742517 = weight(abstract_txt:sowie in 323) [ClassicSimilarity], result of:
            0.030742517 = score(doc=323,freq=1.0), product of:
              0.08460618 = queryWeight, product of:
                1.0312138 = boost
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.017640304 = queryNorm
              0.3633602 = fieldWeight in 323, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.078125 = fieldNorm(doc=323)
          0.054355238 = weight(abstract_txt:insbesondere in 323) [ClassicSimilarity], result of:
            0.054355238 = score(doc=323,freq=1.0), product of:
              0.1237096 = queryWeight, product of:
                1.246951 = boost
                5.6240344 = idf(docFreq=417, maxDocs=42596)
                0.017640304 = queryNorm
              0.4393777 = fieldWeight in 323, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6240344 = idf(docFreq=417, maxDocs=42596)
                0.078125 = fieldNorm(doc=323)
          0.047289737 = weight(abstract_txt:einer in 323) [ClassicSimilarity], result of:
            0.047289737 = score(doc=323,freq=3.0), product of:
              0.089483656 = queryWeight, product of:
                1.2988684 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.017640304 = queryNorm
              0.52847344 = fieldWeight in 323, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.078125 = fieldNorm(doc=323)
          0.07868576 = weight(abstract_txt:damit in 323) [ClassicSimilarity], result of:
            0.07868576 = score(doc=323,freq=2.0), product of:
              0.14383359 = queryWeight, product of:
                1.646734 = boost
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.017640304 = queryNorm
              0.5470611 = fieldWeight in 323, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.078125 = fieldNorm(doc=323)
          0.17902945 = weight(abstract_txt:verbundenen in 323) [ClassicSimilarity], result of:
            0.17902945 = score(doc=323,freq=2.0), product of:
              0.21736126 = queryWeight, product of:
                1.6528702 = boost
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.017640304 = queryNorm
              0.8236493 = fieldWeight in 323, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.078125 = fieldNorm(doc=323)
        0.2 = coord(5/25)
    
  4. Mühlbacher, S.: Information literacy in enterprises (2009) 0.08
    0.07606894 = sum of:
      0.07606894 = product of:
        0.3803447 = sum of:
          0.078559406 = weight(abstract_txt:arbeit in 3582) [ClassicSimilarity], result of:
            0.078559406 = score(doc=3582,freq=3.0), product of:
              0.109648034 = queryWeight, product of:
                1.1739459 = boost
                5.294765 = idf(docFreq=580, maxDocs=42596)
                0.017640304 = queryNorm
              0.71646893 = fieldWeight in 3582, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.294765 = idf(docFreq=580, maxDocs=42596)
                0.078125 = fieldNorm(doc=3582)
          0.04921681 = weight(abstract_txt:rahmen in 3582) [ClassicSimilarity], result of:
            0.04921681 = score(doc=3582,freq=1.0), product of:
              0.11578477 = queryWeight, product of:
                1.2063502 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.017640304 = queryNorm
              0.42507154 = fieldWeight in 3582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.078125 = fieldNorm(doc=3582)
          0.047289737 = weight(abstract_txt:einer in 3582) [ClassicSimilarity], result of:
            0.047289737 = score(doc=3582,freq=3.0), product of:
              0.089483656 = queryWeight, product of:
                1.2988684 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.017640304 = queryNorm
              0.52847344 = fieldWeight in 3582, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.078125 = fieldNorm(doc=3582)
          0.07868576 = weight(abstract_txt:damit in 3582) [ClassicSimilarity], result of:
            0.07868576 = score(doc=3582,freq=2.0), product of:
              0.14383359 = queryWeight, product of:
                1.646734 = boost
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.017640304 = queryNorm
              0.5470611 = fieldWeight in 3582, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.078125 = fieldNorm(doc=3582)
          0.12659295 = weight(abstract_txt:verbundenen in 3582) [ClassicSimilarity], result of:
            0.12659295 = score(doc=3582,freq=1.0), product of:
              0.21736126 = queryWeight, product of:
                1.6528702 = boost
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.017640304 = queryNorm
              0.5824081 = fieldWeight in 3582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.078125 = fieldNorm(doc=3582)
        0.2 = coord(5/25)
    
  5. Horvarth, P.: Fachinformationspolitik ohne Geschichtswissenschaft oder Was ist eigentlich aus FIZ 14 geworden (1997) 0.07
    0.072192825 = sum of:
      0.072192825 = product of:
        0.36096412 = sum of:
          0.03689102 = weight(abstract_txt:sowie in 667) [ClassicSimilarity], result of:
            0.03689102 = score(doc=667,freq=1.0), product of:
              0.08460618 = queryWeight, product of:
                1.0312138 = boost
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.017640304 = queryNorm
              0.43603224 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6510105 = idf(docFreq=1105, maxDocs=42596)
                0.09375 = fieldNorm(doc=667)
          0.05906017 = weight(abstract_txt:rahmen in 667) [ClassicSimilarity], result of:
            0.05906017 = score(doc=667,freq=1.0), product of:
              0.11578477 = queryWeight, product of:
                1.2063502 = boost
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.017640304 = queryNorm
              0.5100858 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4409156 = idf(docFreq=501, maxDocs=42596)
                0.09375 = fieldNorm(doc=667)
          0.04633429 = weight(abstract_txt:einer in 667) [ClassicSimilarity], result of:
            0.04633429 = score(doc=667,freq=2.0), product of:
              0.089483656 = queryWeight, product of:
                1.2988684 = boost
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.017640304 = queryNorm
              0.5177961 = fieldWeight in 667, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.905463 = idf(docFreq=2330, maxDocs=42596)
                0.09375 = fieldNorm(doc=667)
          0.06676708 = weight(abstract_txt:damit in 667) [ClassicSimilarity], result of:
            0.06676708 = score(doc=667,freq=1.0), product of:
              0.14383359 = queryWeight, product of:
                1.646734 = boost
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.017640304 = queryNorm
              0.46419674 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9514318 = idf(docFreq=818, maxDocs=42596)
                0.09375 = fieldNorm(doc=667)
          0.15191154 = weight(abstract_txt:verbundenen in 667) [ClassicSimilarity], result of:
            0.15191154 = score(doc=667,freq=1.0), product of:
              0.21736126 = queryWeight, product of:
                1.6528702 = boost
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.017640304 = queryNorm
              0.6988897 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.454823 = idf(docFreq=66, maxDocs=42596)
                0.09375 = fieldNorm(doc=667)
        0.2 = coord(5/25)