Search (23 results, page 1 of 2)

Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 0.01
```
0.011498496 = product of:
  0.10923571 = sum of:
    0.054617856 = weight(_text_:web in 953) [ClassicSimilarity], result of:
      0.054617856 = score(doc=953,freq=18.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.64902663 = fieldWeight in 953, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=953)
    0.054617856 = weight(_text_:web in 953) [ClassicSimilarity], result of:
      0.054617856 = score(doc=953,freq=18.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.64902663 = fieldWeight in 953, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=953)
  0.10526316 = coord(2/19)
```
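The explain tree above can be checked by hand. The following is a minimal sketch, not the catalogue's own code: it assumes the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures shown, and it takes `queryNorm` and `fieldNorm` directly from the output rather than deriving them. The function names are illustrative.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed classic formula; it matches idf(docFreq=4597, maxDocs=44218) = 3.2635105.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in the explain tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched_clauses, total_clauses):
    # Sum of the matching clause scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched_clauses / total_clauses)


# Values copied from the explain output for doc 953 (term "web", listed twice).
web = term_score(freq=18.0, doc_freq=4597, max_docs=44218,
                 query_norm=0.025786186, field_norm=0.046875)
score = document_score([web, web], matched_clauses=2, total_clauses=19)
print(f"term score: {web:.6f}, document score: {score:.7f}")
```

Small differences in the last digits are expected, because the search engine computes these values in single precision.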
Abstract

Due to a large variety of noisy information embedded in Web pages, Web-page classification is much more difficult than pure-text classification. In this paper, we propose to improve the Web-page classification performance by removing the noise through summarization techniques. We first give empirical evidence that ideal Web-page summaries generated by human editors can indeed improve the performance of Web-page classification algorithms. We then put forward a new Web-page summarization algorithm based on Web-page layout and evaluate it along with several other state-of-the-art text summarization algorithms on the LookSmart Web directory. Experimental results show that the classification algorithms (NB or SVM) augmented by any summarization approach can achieve an improvement by more than 5.0% as compared to pure-text-based classification algorithms. We further introduce an ensemble method to combine the different summarization algorithms. The ensemble summarization method achieves more than 12.0% improvement over pure-text based methods.

Endres-Niggemeyer, B.: Kognitive Modellierung des Abstracting (1991) 0.01

```
0.0093203485 = product of:
  0.17708662 = sum of:
    0.17708662 = weight(_text_:modellierung in 23) [ClassicSimilarity], result of:
      0.17708662 = score(doc=23,freq=2.0), product of:
        0.18558519 = queryWeight, product of:
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.025786186 = queryNorm
        0.9542067 = fieldWeight in 23, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.09375 = fieldNorm(doc=23)
  0.05263158 = coord(1/19)
```

Endres-Niggemeyer, B.; Jauris-Heipke, S.; Pinsky, S.M.; Ulbricht, U.: Wissen gewinnen durch Wissen : Ontologiebasierte Informationsextraktion (2006) 0.01
```
0.007339875 = product of:
  0.13945763 = sum of:
    0.13945763 = weight(_text_:ontologie in 6016) [ClassicSimilarity], result of:
      0.13945763 = score(doc=6016,freq=8.0), product of:
        0.18041065 = queryWeight, product of:
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.025786186 = queryNorm
        0.7730011 = fieldWeight in 6016, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=6016)
  0.05263158 = coord(1/19)
```
Abstract

The ontology-based information extraction reported here is part of an automatic summarization system that is modeled on the procedures of competent humans. The underlying assumption is that people can adopt a system's results more easily when those results were produced with methods they use themselves. The first application domain is bone marrow transplantation (BMT). At the core of the Summit-BMT system (Summarize It in Bone Marrow Transplantation) is an ontology of the domain. It is implemented as a MySQL database and supplies human users and system components with knowledge. Summit-BMT supports query formulation with an empirically grounded scenario interface. The retrieval results are preselected by text passage retrieval and then submitted to cognitively grounded agents, which use their knowledge base / ontology to check more closely whether the propositions from the user's question are matched. The relevant text clips from the source document are entered into the scenario form and presented with a link to their occurrence in the original. This article focuses on the ontology and its use for knowledge-based information extraction. The ontology database holds different types of knowledge ready so that they can be combined easily: concepts, propositions and their syntactic-semantic schemata, unifiers, paraphrases, and definitions of question scenarios. The system agents, which execute summarization strategies adapted from humans, rely on them. Deficiencies in other processing steps lead to losses, but the actual quality of the results stands or falls with the quality of the ontology. Initial tests of extraction performance turn out astonishingly positive.
Yulianti, E.; Huspi, S.; Sanderson, M.: Tweet-biased summarization (2016) 0.01
```
0.0063880538 = product of:
  0.06068651 = sum of:
    0.030343255 = weight(_text_:web in 2926) [ClassicSimilarity], result of:
      0.030343255 = score(doc=2926,freq=8.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.36057037 = fieldWeight in 2926, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2926)
    0.030343255 = weight(_text_:web in 2926) [ClassicSimilarity], result of:
      0.030343255 = score(doc=2926,freq=8.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.36057037 = fieldWeight in 2926, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2926)
  0.10526316 = coord(2/19)
```
Abstract

We examined whether the microblog comments given by people after reading a web document could be exploited to improve the accuracy of a web document summarization system. We examined the effect of social information (i.e., tweets) on the accuracy of the generated summaries by comparing the user preference for TBS (tweet-biased summary) with that for GS (generic summary). The result of a crowdsourcing-based evaluation shows that the user preference for TBS was significantly higher than for GS. We also took random samples of the documents to assess the summaries in a traditional evaluation using ROUGE, in which TBS was, in general, also shown to be better than GS. We further analyzed the influence of the number of tweets pointing to a web document on summarization accuracy, finding a moderate positive correlation between the number of tweets pointing to a web document and the performance of the generated TBS as measured by user preference. The results show that incorporating social information into the summary generation process can improve the accuracy of the summary. The reasons people gave for choosing one summary over another in the crowdsourcing-based evaluation are also presented in this article.
Endres-Niggemeyer, B.; Ziegert, C.: SummIt-BMT : (Summarize It in BMT) in Diagnose und Therapie, Abschlussbericht (2002) 0.01
```
0.0063565187 = product of:
  0.12077385 = sum of:
    0.12077385 = weight(_text_:ontologie in 4497) [ClassicSimilarity], result of:
      0.12077385 = score(doc=4497,freq=6.0), product of:
        0.18041065 = queryWeight, product of:
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.025786186 = queryNorm
        0.6694386 = fieldWeight in 4497, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=4497)
  0.05263158 = coord(1/19)
```
Abstract

SummIt-BMT (Summarize It in Bone Marrow Transplantation), the target system of the project, is intended to give physicians in bone marrow transplantation rapid access to information through cognitively grounded summarization (Endres-Niggemeyer, 1998) of material from the WWW. The subproject reported on here, funded by the BMBF, focuses on clinical questions. The central component of SummIt-BMT is a BMT ontology. Fig. 1 illustrates the system workflow: users enter their information need into a structured scenario, drawing on terms from the ontology. Queries to search engines are derived from the scenario. The Summit-BMT metasearch engine triggers Google and searches Medline, the central literature database of medicine. The search result is processed; links to full texts are followed and the full texts are retrieved. The retrieved documents are examined by keyword retrieval for passages in which search concepts from the question / ontology cluster. These passages are proposed for summarization. The statements in them are analyzed syntactically, and the system agents examine them. If statements can be linked to the question by a semantic relation, and thus contribute to answering it, they are included in the summary unless other agents raise objections, e.g. redundancy. The result of the summarization is integrated into the question/answer scenario. Excerpts from the source documents are presented, each with a link that gives immediate access to the source. SummIt-BMT is ready for the next round of information search and summarization as soon as the user wishes.
Endres-Niggemeyer, B.: Bessere Information durch Zusammenfassen aus dem WWW (1999) 0.01
```
0.0058719004 = product of:
  0.111566104 = sum of:
    0.111566104 = weight(_text_:ontologie in 4496) [ClassicSimilarity], result of:
      0.111566104 = score(doc=4496,freq=2.0), product of:
        0.18041065 = queryWeight, product of:
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.025786186 = queryNorm
        0.6184009 = fieldWeight in 4496, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.996407 = idf(docFreq=109, maxDocs=44218)
          0.0625 = fieldNorm(doc=4496)
  0.05263158 = coord(1/19)
```
Abstract

Using bone marrow transplantation, a medical specialty, as an example, the following shows how users can be relieved of a large part of the effort of acquiring knowledge by summarizing search results from the Net in relation to the question. This makes it possible to take in new knowledge in time-critical situations, which are commonplace in diagnosis and therapy. An overview of the state of text summarization and ontology development is followed by a system sketch in which information search on the WWW is complemented by a cognitively grounded summarization system. For this purpose, a domain ontology is proposed that organizes and represents the required knowledge.

Robin, J.; McKeown, K.: Empirically designing and evaluating a new revision-based model for summary generation (1996) 0.00

```
0.004704834 = product of:
  0.04469592 = sum of:
    0.03072123 = weight(_text_:services in 6751) [ClassicSimilarity], result of:
      0.03072123 = score(doc=6751,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.3245064 = fieldWeight in 6751, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.0625 = fieldNorm(doc=6751)
    0.013974689 = product of:
      0.027949378 = sum of:
        0.027949378 = weight(_text_:22 in 6751) [ClassicSimilarity], result of:
          0.027949378 = score(doc=6751,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.30952093 = fieldWeight in 6751, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6751)
      0.5 = coord(1/2)
  0.10526316 = coord(2/19)
```
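This record's tree has a nested clause: the match on "22" sits inside a sub-query that carries its own coord(1/2) before the outer coord(2/19) is applied. A rough sketch of that nesting follows, assuming the classic formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) that reproduce the displayed values; the helper name and constants are illustrative, with `queryNorm` and `fieldNorm` copied from the output.

```python
import math


def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # queryWeight * fieldWeight, with the idf formula assumed from the figures shown.
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    return (idf * query_norm) * (math.sqrt(freq) * idf * field_norm)


MAX_DOCS = 44218
QUERY_NORM = 0.025786186
FIELD_NORM = 0.0625  # fieldNorm(doc=6751)

services = classic_term_score(2.0, 3057, MAX_DOCS, QUERY_NORM, FIELD_NORM)
term_22 = classic_term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM)

# Inner sub-query: one of its two clauses matched, so it is scaled by coord(1/2).
inner = term_22 * (1 / 2)

# Outer query: two of nineteen clauses matched, so the sum is scaled by coord(2/19).
total = (services + inner) * (2 / 19)
print(f"services={services:.6f} inner={inner:.6f} total={total:.7f}")
```

The nested coord factor explains why a match on "22" contributes only half as much as its raw term score would suggest.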

Abstract: Presents a system for summarizing quantitative data in natural language, focusing on the use of a corpus of basketball game summaries, drawn from online news services, to empirically shape the system design and to evaluate the approach. Initial corpus analysis revealed characteristics of textual summaries that challenge the capabilities of current language generation systems. A revision-based corpus analysis was used to identify and encode the revision rules of the system. Presents a quantitative evaluation, using several test corpora, to measure the robustness of the new revision-based model.
Date: 6. 3.1997 16:22:15

Liang, S.-F.; Devlin, S.; Tait, J.: Investigating sentence weighting components for automatic summarisation (2007) 0.00

```
0.003832832 = product of:
  0.036411904 = sum of:
    0.018205952 = weight(_text_:web in 899) [ClassicSimilarity], result of:
      0.018205952 = score(doc=899,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 899, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=899)
    0.018205952 = weight(_text_:web in 899) [ClassicSimilarity], result of:
      0.018205952 = score(doc=899,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 899, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=899)
  0.10526316 = coord(2/19)
```

Abstract: The work described here initially formed part of a triangulation exercise to establish the effectiveness of the Query Term Order algorithm. It subsequently proved to be a reliable indicator for summarising English web documents. We utilised the human summaries from the Document Understanding Conference data, and generated queries automatically for testing the QTO algorithm. Six sentence weighting schemes that made use of Query Term Frequency and QTO were constructed to produce system summaries, and this paper explains the process of combining and balancing the weighting components. The summaries produced were evaluated by the ROUGE-1 metric, and the results showed that using QTO in a weighting combination resulted in the best performance. We also found that using a combination of more weighting components always produced improved performance compared to any single weighting component.

Xu, D.; Cheng, G.; Qu, Y.: Preferences in Wikipedia abstracts : empirical findings and implications for automatic entity summarization (2014) 0.00
```
0.003832832 = product of:
  0.036411904 = sum of:
    0.018205952 = weight(_text_:web in 2700) [ClassicSimilarity], result of:
      0.018205952 = score(doc=2700,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 2700, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2700)
    0.018205952 = weight(_text_:web in 2700) [ClassicSimilarity], result of:
      0.018205952 = score(doc=2700,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 2700, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=2700)
  0.10526316 = coord(2/19)
```
Abstract

The volume of entity-centric structured data grows rapidly on the Web. The description of an entity, composed of property-value pairs (a.k.a. features), has become very large in many applications. To avoid information overload, efforts have been made to automatically select a limited number of features to be shown to the user based on certain criteria, which is called automatic entity summarization. However, to the best of our knowledge, there is a lack of extensive studies on how humans rank and select features in practice, which can provide empirical support and inspire future research. In this article, we present a large-scale statistical analysis of the descriptions of entities provided by DBpedia and the abstracts of their corresponding Wikipedia articles, to empirically study, along several different dimensions, which kinds of features are preferable when humans summarize. Implications for automatic entity summarization are drawn from the findings.
Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.00
```
0.0031940269 = product of:
  0.030343255 = sum of:
    0.0151716275 = weight(_text_:web in 657) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=657,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 657, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=657)
    0.0151716275 = weight(_text_:web in 657) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=657,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 657, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=657)
  0.10526316 = coord(2/19)
```
Abstract

Purpose - The purpose of this research is to develop a method for automatic construction of multi-document summaries of sets of news articles that might be retrieved by a web search engine in response to a user query. Design/methodology/approach - Based on the cross-document discourse analysis, an event-based framework is proposed for integrating and organizing information extracted from different news articles. It has a hierarchical structure in which the summarized information is presented at the top level and more detailed information given at the lower levels. A tree-view interface was implemented for displaying a multi-document summary based on the framework. A preliminary user evaluation was performed by comparing the framework-based summaries against the sentence-based summaries. Findings - In a small evaluation, all the human subjects preferred the framework-based summaries to the sentence-based summaries. It indicates that the event-based framework is an effective way to summarize a set of news articles reporting an event or a series of relevant events. Research limitations/implications - Limited to event-based news articles only, not applicable to news critiques and other kinds of news articles. A summarization system based on the event-based framework is being implemented. Practical implications - Multi-document summarization of news articles can adopt the proposed event-based framework. Originality/value - An event-based framework for summarizing sets of news articles was developed and evaluated using a tree-view interface for displaying such summaries.
Ou, S.; Khoo, S.G.; Goh, D.H.: Automatic multidocument summarization of research abstracts : design and user evaluation (2007) 0.00
```
0.0031940269 = product of:
  0.030343255 = sum of:
    0.0151716275 = weight(_text_:web in 522) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=522,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 522, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=522)
    0.0151716275 = weight(_text_:web in 522) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=522,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 522, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=522)
  0.10526316 = coord(2/19)
```
Abstract

The purpose of this study was to develop a method for automatic construction of multidocument summaries of sets of research abstracts that may be retrieved by a digital library or search engine in response to a user query. Sociology dissertation abstracts were selected as the sample domain in this study. A variable-based framework was proposed for integrating and organizing research concepts and relationships as well as research methods and contextual relations extracted from different dissertation abstracts. Based on the framework, a new summarization method was developed, which parses the discourse structure of abstracts, extracts research concepts and relationships, integrates the information across different abstracts, and organizes and presents them in a Web-based interface. The focus of this article is on the user evaluation that was performed to assess the overall quality and usefulness of the summaries. Two types of variable-based summaries generated using the summarization method (with or without the use of a taxonomy) were compared against a sentence-based summary that lists only the research-objective sentences extracted from each abstract and another sentence-based summary generated using the MEAD system that extracts important sentences. The evaluation results indicate that the majority of sociological researchers (70%) and general users (64%) preferred the variable-based summaries generated with the use of the taxonomy.
Ruda, S.: Abstracting: eine Auswahlbibliographie (1992) 0.00
```
0.0030604713 = product of:
  0.058148954 = sum of:
    0.058148954 = weight(_text_:semantische in 6603) [ClassicSimilarity], result of:
      0.058148954 = score(doc=6603,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.41761774 = fieldWeight in 6603, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6603)
  0.05263158 = coord(1/19)
```
Abstract

This selective bibliography is divided into 9 subject areas. The first section contains literature that deals with abstracts and abstracting procedures in general and gives an overview of the state of research. The next section reviews papers that describe the historical development of abstracting. The third part lists the abstracting guidelines of various institutions. Lexical, syntactic, and semantic text condensation methods are the subject of the works presented in section 4. Text structures of abstracts are considered under point 5, and the works in the next subject area deal with the problem of writing abstracts. The seventh section lists so-called 'machine' and machine-assisted abstracting methods. Subsequently, 'machine' and machine-assisted abstracting procedures, abstracts in comparison with their primary texts, and abstracts in general are evaluated. Bibliographies form the conclusion.

Dammeyer, A.; Jürgensen, W.; Krüwel, C.; Poliak, E.; Ruttkowski, S.; Schäfer, Th.; Sirava, M.; Hermes, T.: Videoanalyse mit DiVA (1998) 0.00

```
0.0022457521 = product of:
  0.042669293 = sum of:
    0.042669293 = weight(_text_:suche in 23) [ClassicSimilarity], result of:
      0.042669293 = score(doc=23,freq=2.0), product of:
        0.12883182 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.025786186 = queryNorm
        0.3312015 = fieldWeight in 23, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.046875 = fieldNorm(doc=23)
  0.05263158 = coord(1/19)
```

Source: Inhaltsbezogene Suche von Bildern und Videosequenzen in digitalen multimedialen Archiven: Beiträge eines Workshops der KI'98 am 16./17.9.1998 in Bremen. Ed.: N. Luth

Gomez, J.; Allen, K.; Matney, M.; Awopetu, T.; Shafer, S.: Experimenting with a machine generated annotations pipeline (2020) 0.00
```
0.0016169068 = product of:
  0.03072123 = sum of:
    0.03072123 = weight(_text_:services in 657) [ClassicSimilarity], result of:
      0.03072123 = score(doc=657,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.3245064 = fieldWeight in 657, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.0625 = fieldNorm(doc=657)
  0.05263158 = coord(1/19)
```
Abstract

The UCLA Library reorganized its software developers into focused subteams with one, the Labs Team, dedicated to conducting experiments. In this article we describe our first attempt at conducting a software development experiment, in which we attempted to improve our digital library's search results with metadata from cloud-based image tagging services. We explore the findings and discuss the lessons learned from our first attempt at running an experiment.
Haag, M.: Automatic text summarization : Evaluation des Copernic Summarizer und mögliche Einsatzfelder in der Fachinformation der DaimlerChrysler AG (2002) 0.00
```
0.0012126801 = product of:
  0.023040922 = sum of:
    0.023040922 = weight(_text_:services in 649) [ClassicSimilarity], result of:
      0.023040922 = score(doc=649,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.2433798 = fieldWeight in 649, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.046875 = fieldNorm(doc=649)
  0.05263158 = coord(1/19)
```
Abstract

This work presents an evaluation of the Copernic Summarizer, software for automatically summarizing text in various data formats. It assesses whether and how the Copernic Summarizer can reasonably be used in the DaimlerChrysler Information Division to enhance the quality of its information services. First, an introduction to automatic text summarization is given and the Copernic Summarizer is presented. Various methods for evaluating automatic text summarization systems and software ergonomics are then described. Two evaluation forms are developed with which the employees of the Information Division evaluate the quality and relevance of the extracted keywords and summaries as well as the software's usability. The quality and relevance assessment is done by comparing the original text to the summaries. Finally, a recommendation is given concerning the use of the Copernic Summarizer.
Wang, W.; Hwang, D.: Abstraction Assistant : an automatic text abstraction system (2010) 0.00
```
0.0012126801 = product of:
  0.023040922 = sum of:
    0.023040922 = weight(_text_:services in 3981) [ClassicSimilarity], result of:
      0.023040922 = score(doc=3981,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.2433798 = fieldWeight in 3981, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.046875 = fieldNorm(doc=3981)
  0.05263158 = coord(1/19)
```
Abstract

In the interest of standardization and quality assurance, it is desirable for authors and staff of access services to follow the American National Standards Institute (ANSI) guidelines in preparing abstracts. Using a statistical approach, an extraction system (the Abstraction Assistant) was developed to generate informative abstracts that meet the ANSI guidelines for structural content elements. The system's performance is evaluated by comparing the system-generated abstracts with the authors' original abstracts and the manually enhanced system abstracts on three criteria: balance (satisfaction of the ANSI standards), fluency (text coherence), and understandability (clarity). The results suggest that it is possible to use the system output directly without manual modification, but there are issues that need to be addressed in further studies to make the system a better tool.

Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.00

```
7.3550997E-4 = product of:
  0.013974689 = sum of:
    0.013974689 = product of:
      0.027949378 = sum of:
        0.027949378 = weight(_text_:22 in 6599) [ClassicSimilarity], result of:
          0.027949378 = score(doc=6599,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.30952093 = fieldWeight in 6599, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6599)
      0.5 = coord(1/2)
  0.05263158 = coord(1/19)
```

Date: 26. 2.1997 10:22:43

Jones, P.A.; Bradbeer, P.V.G.: Discovery of optimal weights in a concept selection system (1996) 0.00

```
7.3550997E-4 = product of:
  0.013974689 = sum of:
    0.013974689 = product of:
      0.027949378 = sum of:
        0.027949378 = weight(_text_:22 in 6974) [ClassicSimilarity], result of:
          0.027949378 = score(doc=6974,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.30952093 = fieldWeight in 6974, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=6974)
      0.5 = coord(1/2)
  0.05263158 = coord(1/19)
```

Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Vanderwende, L.; Suzuki, H.; Brockett, J.M.; Nenkova, A.: Beyond SumBasic : task-focused summarization with sentence simplification and lexical expansion (2007) 0.00
```
5.516325E-4 = product of:
  0.010481017 = sum of:
    0.010481017 = product of:
      0.020962033 = sum of:
        0.020962033 = weight(_text_:22 in 948) [ClassicSimilarity], result of:
          0.020962033 = score(doc=948,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.23214069 = fieldWeight in 948, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=948)
      0.5 = coord(1/2)
  0.05263158 = coord(1/19)
```
Abstract

In recent years, there has been increased interest in topic-focused multi-document summarization. In this task, automatic summaries are produced in response to a specific information request, or topic, stated by the user. The system we have designed to accomplish this task comprises four main components: a generic extractive summarization system, a topic-focusing component, sentence simplification, and lexical expansion of topic words. This paper details each of these components, together with experiments designed to quantify their individual contributions. We include an analysis of our results on two large datasets commonly used to evaluate task-focused summarization, the DUC2005 and DUC2006 datasets, using automatic metrics. Additionally, we include an analysis of our results on the DUC2006 task according to human evaluation metrics. In the human evaluation of system summaries compared to human summaries, i.e., the Pyramid method, our system ranked first out of 22 systems in terms of overall mean Pyramid score; and in the human evaluation of summary responsiveness to the topic, our system ranked third out of 35 systems.

Wu, Y.-f.B.; Li, Q.; Bot, R.S.; Chen, X.: Finding nuggets in documents : a machine learning approach (2006) 0.00

```
4.5969372E-4 = product of:
  0.008734181 = sum of:
    0.008734181 = product of:
      0.017468361 = sum of:
        0.017468361 = weight(_text_:22 in 5290) [ClassicSimilarity], result of:
          0.017468361 = score(doc=5290,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.19345059 = fieldWeight in 5290, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5290)
      0.5 = coord(1/2)
  0.05263158 = coord(1/19)
```

Date: 22. 7.2006 17:25:48
