Search (27 results, page 1 of 2)

Mayr, P.: Re-Ranking auf Basis von Bradfordizing für die verteilte Suche in Digitalen Bibliotheken (2009) 0.03
```
0.027726589 = product of:
  0.17560174 = sum of:
    0.046991456 = weight(_text_:semantische in 4302) [ClassicSimilarity], result of:
      0.046991456 = score(doc=4302,freq=4.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.33748612 = fieldWeight in 4302, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.03125 = fieldNorm(doc=4302)
    0.040228993 = weight(_text_:suche in 4302) [ClassicSimilarity], result of:
      0.040228993 = score(doc=4302,freq=4.0), product of:
        0.12883182 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.025786186 = queryNorm
        0.31225976 = fieldWeight in 4302, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.03125 = fieldNorm(doc=4302)
    0.08838129 = weight(_text_:anwendungsbereich in 4302) [ClassicSimilarity], result of:
      0.08838129 = score(doc=4302,freq=2.0), product of:
        0.22708645 = queryWeight, product of:
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.025786186 = queryNorm
        0.38919666 = fieldWeight in 4302, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.03125 = fieldNorm(doc=4302)
  0.15789473 = coord(3/19)
```
Abstract

Trotz großer Dokumentmengen für datenbankübergreifende Literaturrecherchen erwarten akademische Nutzer einen möglichst hohen Anteil an relevanten und qualitativen Dokumenten in den Trefferergebnissen. Insbesondere die Reihenfolge und Struktur der gelisteten Ergebnisse (Ranking) spielt, neben dem direkten Volltextzugriff auf die Dokumente, inzwischen eine entscheidende Rolle beim Design von Suchsystemen. Nutzer erwarten weiterhin flexible Informationssysteme, die es unter anderem zulassen, Einfluss auf das Ranking der Dokumente zu nehmen bzw. alternative Rankingverfahren zu verwenden. In dieser Arbeit werden zwei Mehrwertverfahren für Suchsysteme vorgestellt, die die typischen Probleme bei der Recherche nach wissenschaftlicher Literatur behandeln und damit die Recherchesituation messbar verbessern können. Die beiden Mehrwertdienste semantische Heterogenitätsbehandlung am Beispiel Crosskonkordanzen und Re-Ranking auf Basis von Bradfordizing, die in unterschiedlichen Phasen der Suche zum Einsatz kommen, werden hier ausführlich beschrieben und im empirischen Teil der Arbeit bzgl. der Effektivität für typische fachbezogene Recherchen evaluiert. Vorrangiges Ziel der Promotion ist es, zu untersuchen, ob das hier vorgestellte alternative Re-Rankingverfahren Bradfordizing im Anwendungsbereich bibliographischer Datenbanken zum einen operabel ist und zum anderen voraussichtlich gewinnbringend in Informationssystemen eingesetzt und dem Nutzer angeboten werden kann. Für die Tests wurden Fragestellungen und Daten aus zwei Evaluationsprojekten (CLEF und KoMoHe) verwendet. Die intellektuell bewerteten Dokumente stammen aus insgesamt sieben wissenschaftlichen Fachdatenbanken der Fächer Sozialwissenschaften, Politikwissenschaft, Wirtschaftswissenschaften, Psychologie und Medizin. Die Evaluation der Crosskonkordanzen (insgesamt 82 Fragestellungen) zeigt, dass sich die Retrievalergebnisse signifikant für alle Crosskonkordanzen verbessern; es zeigt sich zudem, dass interdisziplinäre Crosskonkordanzen den stärksten (positiven) Effekt auf die Suchergebnisse haben. Die Evaluation des Re-Ranking nach Bradfordizing (insgesamt 164 Fragestellungen) zeigt, dass die Dokumente der Kernzone (Kernzeitschriften) für die meisten Testreihen eine signifikant höhere Precision als Dokumente der Zone 2 und Zone 3 (Peripheriezeitschriften) ergeben. Sowohl für Zeitschriften als auch für Monographien kann dieser Relevanzvorteil nach Bradfordizing auf einer sehr breiten Basis von Themen und Fragestellungen an zwei unabhängigen Dokumentkorpora empirisch nachgewiesen werden.

Theme

Semantische Interoperabilität
Mayr, P.; Mutschke, P.; Petras, V.: Reducing semantic complexity in distributed digital libraries : Treatment of term vagueness and document re-ranking (2008) 0.02
```
0.021820541 = product of:
  0.103647575 = sum of:
    0.021455921 = weight(_text_:web in 1909) [ClassicSimilarity], result of:
      0.021455921 = score(doc=1909,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.25496176 = fieldWeight in 1909, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1909)
    0.021455921 = weight(_text_:web in 1909) [ClassicSimilarity], result of:
      0.021455921 = score(doc=1909,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.25496176 = fieldWeight in 1909, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1909)
    0.019200768 = weight(_text_:services in 1909) [ClassicSimilarity], result of:
      0.019200768 = score(doc=1909,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.2028165 = fieldWeight in 1909, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1909)
    0.041534968 = weight(_text_:semantische in 1909) [ClassicSimilarity], result of:
      0.041534968 = score(doc=1909,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.2982984 = fieldWeight in 1909, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1909)
  0.21052632 = coord(4/19)
```
Abstract

Purpose - The general science portal "vascoda" merges structured, high-quality information collections from more than 40 providers on the basis of search engine technology (FAST) and a concept which treats semantic heterogeneity between different controlled vocabularies. First experiences with the portal show some weaknesses of this approach which come out in most metadata-driven Digital Libraries (DLs) or subject specific portals. The purpose of the paper is to propose models to reduce the semantic complexity in heterogeneous DLs. The aim is to introduce value-added services (treatment of term vagueness and document re-ranking) that gain a certain quality in DLs if they are combined with heterogeneity components established in the project "Competence Center Modeling and Treatment of Semantic Heterogeneity". Design/methodology/approach - Two methods, which are derived from scientometrics and network analysis, will be implemented with the objective to re-rank result sets by the following structural properties: the ranking of the results by core journals (so-called Bradfordizing) and ranking by centrality of authors in co-authorship networks. Findings - The methods, which will be implemented, focus on the query and on the result side of a search and are designed to positively influence each other. Conceptually, they will improve the search quality and guarantee that the most relevant documents in result sets will be ranked higher. Originality/value - The central impact of the paper focuses on the integration of three structural value-adding methods, which aim at reducing the semantic complexity represented in distributed DLs at several stages in the information retrieval process: query construction, search and ranking and re-ranking.

Footnote

Beitrag eines Themenheftes "Digital libraries and the semantic web: context, applications and research".

Theme

Semantic Web
Semantische Interoperabilität

Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.02

0.021087546 = product of:
  0.13355446 = sum of:
    0.031533636 = weight(_text_:web in 3392) [ClassicSimilarity], result of:
      0.031533636 = score(doc=3392,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.37471575 = fieldWeight in 3392, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3392)
    0.031533636 = weight(_text_:web in 3392) [ClassicSimilarity], result of:
      0.031533636 = score(doc=3392,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.37471575 = fieldWeight in 3392, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=3392)
    0.07048718 = weight(_text_:semantische in 3392) [ClassicSimilarity], result of:
      0.07048718 = score(doc=3392,freq=4.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.50622916 = fieldWeight in 3392, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.046875 = fieldNorm(doc=3392)
  0.15789473 = coord(3/19)

Abstract: Im August 2009 wurde SKOS "Simple Knowledge Organization System" als neuer Standard für web-basierte kontrollierte Vokabulare durch das W3C veröffentlicht1. SKOS dient als Datenmodell, um kontrollierte Vokabulare über das Web anzubieten sowie technisch und semantisch interoperabel zu machen. Perspektivisch kann die heterogene Landschaft der Erschließungsvokabulare über SKOS vereinheitlicht und vor allem die Inhalte der klassischen Datenbanken (Bereich Fachinformation) für Anwendungen des Semantic Web, beispielsweise als Linked Open Data2 (LOD), zugänglich und stär-ker miteinander vernetzt werden. Vokabulare im SKOS-Format können dabei eine relevante Funktion einnehmen, indem sie als standardisiertes Brückenvokabular dienen und semantische Verlinkung zwischen erschlossenen, veröffentlichten Daten herstellen. Die folgende Fallstudie skizziert ein Szenario mit drei thematisch verwandten Thesauri, die ins SKOS-Format übertragen und inhaltlich über Crosskonkordanzen aus dem Projekt KoMoHe verbunden werden. Die Mapping Properties von SKOS bieten dazu standardisierte Relationen, die denen der Crosskonkordanzen entsprechen. Die beteiligten Thesauri der Fallstudie sind a) TheSoz (Thesaurus Sozialwissenschaften, GESIS), b) STW (Standard-Thesaurus Wirtschaft, ZBW) und c) IBLK-Thesaurus (SWP).
Theme: Semantische Interoperabilität

Mayr, P.; Walter, A.-K.: Einsatzmöglichkeiten von Crosskonkordanzen (2007) 0.02

0.018158708 = product of:
  0.11500516 = sum of:
    0.024274603 = weight(_text_:web in 162) [ClassicSimilarity], result of:
      0.024274603 = score(doc=162,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2884563 = fieldWeight in 162, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0625 = fieldNorm(doc=162)
    0.024274603 = weight(_text_:web in 162) [ClassicSimilarity], result of:
      0.024274603 = score(doc=162,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2884563 = fieldWeight in 162, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0625 = fieldNorm(doc=162)
    0.06645595 = weight(_text_:semantische in 162) [ClassicSimilarity], result of:
      0.06645595 = score(doc=162,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.47727743 = fieldWeight in 162, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0625 = fieldNorm(doc=162)
  0.15789473 = coord(3/19)

Abstract: Der Beitrag stellt Einsatzmöglichkeiten und spezifische Problembereiche von Crosskonkordanzen (CK) im Projekt "Kompetenznetzwerk Modellbildung und Heterogenitätsbehand lung" (KoMoHe) so wie das Netz der bis dato entstandenen Terminologie-Überstiege vor. Die am IZ entstandenen CK sollen künftig über einen Terminologie-Service als Web Service genutzt werden, dieser wird im Beitrag exemplarisch vorgestellt. Des Weiteren wird ein Testszenario samt Evaluationsdesign beschrieben über das der Mehrwert von Crosskonkordanzen empirisch untersucht werden kann.
Theme: Semantische Interoperabilität

Lauser, B.; Johannsen, G.; Caracciolo, C.; Hage, W.R. van; Keizer, J.; Mayr, P.: Comparing human and automatic thesaurus mapping approaches in the agricultural domain (2008) 0.02

0.016971033 = product of:
  0.080612406 = sum of:
    0.0151716275 = weight(_text_:web in 2627) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=2627,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 2627, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2627)
    0.0151716275 = weight(_text_:web in 2627) [ClassicSimilarity], result of:
      0.0151716275 = score(doc=2627,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.18028519 = fieldWeight in 2627, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2627)
    0.041534968 = weight(_text_:semantische in 2627) [ClassicSimilarity], result of:
      0.041534968 = score(doc=2627,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.2982984 = fieldWeight in 2627, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2627)
    0.008734181 = product of:
      0.017468361 = sum of:
        0.017468361 = weight(_text_:22 in 2627) [ClassicSimilarity], result of:
          0.017468361 = score(doc=2627,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.19345059 = fieldWeight in 2627, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2627)
      0.5 = coord(1/2)
  0.21052632 = coord(4/19)

Abstract: Knowledge organization systems (KOS), like thesauri and other controlled vocabularies, are used to provide subject access to information systems across the web. Due to the heterogeneity of these systems, mapping between vocabularies becomes crucial for retrieving relevant information. However, mapping thesauri is a laborious task, and thus big efforts are being made to automate the mapping process. This paper examines two mapping approaches involving the agricultural thesaurus AGROVOC, one machine-created and one human created. We are addressing the basic question "What are the pros and cons of human and automatic mapping and how can they complement each other?" By pointing out the difficulties in specific cases or groups of cases and grouping the sample into simple and difficult types of mappings, we show the limitations of current automatic methods and come up with some basic recommendations on what approach to use when.
Source: Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Theme: Semantische Interoperabilität

Mayr, P.; Walter, A.-K.: Mapping Knowledge Organization Systems (2008) 0.02

0.016878802 = product of:
  0.10689908 = sum of:
    0.018205952 = weight(_text_:web in 1676) [ClassicSimilarity], result of:
      0.018205952 = score(doc=1676,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 1676, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=1676)
    0.018205952 = weight(_text_:web in 1676) [ClassicSimilarity], result of:
      0.018205952 = score(doc=1676,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.21634221 = fieldWeight in 1676, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=1676)
    0.07048718 = weight(_text_:semantische in 1676) [ClassicSimilarity], result of:
      0.07048718 = score(doc=1676,freq=4.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.50622916 = fieldWeight in 1676, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.046875 = fieldNorm(doc=1676)
  0.15789473 = coord(3/19)

Abstract: Die Vernetzung der Informationssysteme und Datenbanken aus dem wissenschaftlichen Fachinformationsbereich lässt bislang den Aspekt der Kompatibilität und Konkordanz zwischen kontrollierten Vokabularen (semantische Heterogenität) weitgehend unberücksichtigt. Gerade aber für den inhaltlichen Zugang sachlich heterogen erschlössener Bestände spielen für den Nutzer die semantischen Querverbindungen (Mappings /Crosskonkordanzen) zwischen den zugrunde liegenden Knowledge Organization Systems (KOS) der Datenbanken eine entscheidende Rolle. Der Beitrag stellt Einsatzmöglichkeiten und Beispiele von Crosskonkordanzen (CK) im Projekt "Kompetenznetzwerk Modellbildung und Heterogenitätsbehandlung" (KoMoHe) sowie das Netz der bis dato entstandenen Terminolögie-Überstiege vor. Die am IZ entstandenen CK sollen künftig über einen Terminolögie-Service als Web Service genutzt werden, dieser wird im Beitrag exemplarisch vorgestellt.
Theme: Semantische Interoperabilität

Mayr, P.; Walter, A.-K.: Zum Stand der Heterogenitätsbehandlung in vascoda : Bestandsaufnahme und Ausblick (2007) 0.02

0.01588887 = product of:
  0.10062951 = sum of:
    0.02124028 = weight(_text_:web in 59) [ClassicSimilarity], result of:
      0.02124028 = score(doc=59,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.25239927 = fieldWeight in 59, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=59)
    0.02124028 = weight(_text_:web in 59) [ClassicSimilarity], result of:
      0.02124028 = score(doc=59,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.25239927 = fieldWeight in 59, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=59)
    0.058148954 = weight(_text_:semantische in 59) [ClassicSimilarity], result of:
      0.058148954 = score(doc=59,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.41761774 = fieldWeight in 59, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0546875 = fieldNorm(doc=59)
  0.15789473 = coord(3/19)

Abstract: Der Beitrag stellt das Verfahren zur Erstellung von Crosskonkordanzen (CK) im Projekt "Kompetenznetzwerk Modellbildung und Heterogenitätsbehandlung" (KoMoHe)1 sowie das Netz der bis dato entstandenen Terminologie-Überstiege vor. Neben CK zwischen Indexierungssprachen innerhalb eines Anwendungsgebiets (z.B. Sozial- und Politikwissenschaften), werden Termbeispiele vorgestellt, die Fächer unterschiedlicher Fachgebiete verknüpfen. Es werden weiterhin typische Einsatzszenarien der CK innerhalb von Informationssystemen präsentiert. Die am IZ entstandenen CK sollen künftig über einen Terminologie-Service als Web Service genutzt werden. Der sog. Heterogenitätsservice, der als Term-Umschlüsselungs-Dienst fungieren soll, wird exemplarisch anhand konkreter Fragestellungen vorgeführt.
Theme: Semantische Interoperabilität

Mayr, P.: Informationsangebote für das Wissenschaftsportal vascoda : eine Bestandsaufnahme (2006) 0.01
```
0.01456687 = product of:
  0.13838527 = sum of:
    0.049841963 = weight(_text_:semantische in 154) [ClassicSimilarity], result of:
      0.049841963 = score(doc=154,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.35795808 = fieldWeight in 154, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.046875 = fieldNorm(doc=154)
    0.08854331 = weight(_text_:modellierung in 154) [ClassicSimilarity], result of:
      0.08854331 = score(doc=154,freq=2.0), product of:
        0.18558519 = queryWeight, product of:
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.025786186 = queryNorm
        0.47710335 = fieldWeight in 154, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.1970778 = idf(docFreq=89, maxDocs=44218)
          0.046875 = fieldNorm(doc=154)
  0.10526316 = coord(2/19)
```
Abstract

Der Arbeitsbericht ist eine vorläufige Bestandsaufnahme der Informationsangebote der Virtuellen Fachbibliotheken und Informationsverbünde, die in das interdisziplinäre Wissenschaftsportal vascoda integriert werden sollen. Die strukturierte Beschreibung der heterogenen Informationsangebote, insbesondere Internetquellen, Fachdatenbanken/Bibliographien, SSG Online-Contents, OPACs, Volltextserver und Digitalisate konzentriert sich auf ausgewählte Aspekte, die eine Grundlage für weitere Arbeiten und Analysen im Projekt "Modellbildung und Heterogenitätsbehandlung" sind. Die Bestandsaufnahme liegt in der Version 2 vor. Neben der knappen Charakterisierung der Informationstypen und Fachinformationsanbieter wird vor allem der IST-Stand der strukturellen und semantischen Heterogenität der analysierten Bestände beschrieben. Zu diesem Zweck wurden die einzelnen Informationsangebote über deren Eingangswebseiten untersucht und zusätzlich bestehende Daten aus vorherigen Erhebungen einbezogen. Die Bestandsaufnahme der Informationsangebote und Kollektionen zeigt eine große Vielfalt an unterschiedlichen formalen und inhaltlichen Erschließungsformen. Die beobachtbare strukturelle und semantische Heterogenität zwischen den einzelnen Beständen hat weit reichende Folgen für die kontrollierte und begründete Integration und Modellierung der Dokumente. Der Bericht ist verfügbar unter http://www.gesis.orq/Publikationen/Berichte/IZ Arbeitsberichte/pdf/ab 37.pdf und kann über den IZ-Bestellservice als Broschüre angefordert werden.
Mayr, P.; Petras, V.; Walter, A.-K.: Results from a German terminology mapping effort : intra- and interdisciplinary cross-concordances between controlled vocabularies (2007) 0.01
```
0.013422168 = product of:
  0.063755296 = sum of:
    0.01062014 = weight(_text_:web in 542) [ClassicSimilarity], result of:
      0.01062014 = score(doc=542,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.12619963 = fieldWeight in 542, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.02734375 = fieldNorm(doc=542)
    0.01062014 = weight(_text_:web in 542) [ClassicSimilarity], result of:
      0.01062014 = score(doc=542,freq=2.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.12619963 = fieldWeight in 542, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.02734375 = fieldNorm(doc=542)
    0.013440539 = weight(_text_:services in 542) [ClassicSimilarity], result of:
      0.013440539 = score(doc=542,freq=2.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.14197156 = fieldWeight in 542, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.02734375 = fieldNorm(doc=542)
    0.029074477 = weight(_text_:semantische in 542) [ClassicSimilarity], result of:
      0.029074477 = score(doc=542,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.20880887 = fieldWeight in 542, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.02734375 = fieldNorm(doc=542)
  0.21052632 = coord(4/19)
```
Abstract

In 2004, the German Federal Ministry for Education and Research funded a major terminology mapping initiative at the GESIS Social Science Information Centre in Bonn (GESIS-IZ), which will find its conclusion this year. The task of this terminology mapping initiative was to organize, create and manage 'crossconcordances' between major controlled vocabularies (thesauri, classification systems, subject heading lists) centred around the social sciences but quickly extending to other subject areas. Cross-concordances are intellectually (manually) created crosswalks that determine equivalence, hierarchy, and association relations between terms from two controlled vocabularies. Most vocabularies have been related bilaterally, that is, there is a cross-concordance relating terms from vocabulary A to vocabulary B as well as a cross-concordance relating terms from vocabulary B to vocabulary A (bilateral relations are not necessarily symmetrical). Till August 2007, 24 controlled vocabularies from 11 disciplines will be connected with vocabulary sizes ranging from 2,000 - 17,000 terms per vocabulary. To date more than 260,000 relations are generated. A database including all vocabularies and cross-concordances was built and a 'heterogeneity service' developed, a web service, which makes the cross-concordances available for other applications. Many cross-concordances are already implemented and utilized for the German Social Science Information Portal Sowiport (www.sowiport.de), which searches bibliographical and other information resources (incl. 13 databases with 10 different vocabularies and ca. 2.5 million references).

Content

Präsentation während der Veranstaltung "Networked Knowledge Organization Systems and Services: The 6th European Networked Knowledge Organization Systems (NKOS) Workshop, Workshop at the 11th ECDL Conference, Budapest, Hungary, September 21st 2007".

Theme

Semantische Interoperabilität

Mayr, P.: Information Retrieval-Mehrwertdienste für Digitale Bibliotheken: : Crosskonkordanzen und Bradfordizing (2010) 0.01

0.011911207 = product of:
  0.11315647 = sum of:
    0.07048718 = weight(_text_:semantische in 4910) [ClassicSimilarity], result of:
      0.07048718 = score(doc=4910,freq=4.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.50622916 = fieldWeight in 4910, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.046875 = fieldNorm(doc=4910)
    0.042669293 = weight(_text_:suche in 4910) [ClassicSimilarity], result of:
      0.042669293 = score(doc=4910,freq=2.0), product of:
        0.12883182 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.025786186 = queryNorm
        0.3312015 = fieldWeight in 4910, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.046875 = fieldNorm(doc=4910)
  0.10526316 = coord(2/19)

Abstract: In dieser Arbeit werden zwei Mehrwertdienste für Suchsysteme vorgestellt, die typische Probleme bei der Recherche nach wissenschaftlicher Literatur behandeln können. Die beiden Mehrwertdienste semantische Heterogenitätsbehandlung am Beispiel Crosskonkordanzen und Re-Ranking auf Basis von Bradfordizing, die in unterschiedlichen Phasen der Suche zum Einsatz kommen, werden in diesem Buch ausführlich beschrieben und evaluiert. Für die Tests wurden Fragestellungen und Daten aus zwei Evaluationsprojekten (CLEF und KoMoHe) verwendet. Die intellektuell bewerteten Dokumente stammen aus insgesamt sieben Fachdatenbanken der Fächer Sozialwissenschaften, Politikwissenschaft, Wirtschaftswissenschaften, Psychologie und Medizin. Die Ergebnisse dieser Arbeit sind in das GESIS-Projekt IRM eingeflossen.
Theme: Semantische Interoperabilität

Mayr, P.; Schaer, P.; Mutschke, P.: ¬A science model driven retrieval prototype (2011) 0.01

0.010669793 = product of:
  0.10136303 = sum of:
    0.05152107 = weight(_text_:services in 649) [ClassicSimilarity], result of:
      0.05152107 = score(doc=649,freq=10.0), product of:
        0.094670646 = queryWeight, product of:
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.025786186 = queryNorm
        0.5442138 = fieldWeight in 649, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.6713707 = idf(docFreq=3057, maxDocs=44218)
          0.046875 = fieldNorm(doc=649)
    0.049841963 = weight(_text_:semantische in 649) [ClassicSimilarity], result of:
      0.049841963 = score(doc=649,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.35795808 = fieldWeight in 649, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.046875 = fieldNorm(doc=649)
  0.10526316 = coord(2/19)

Abstract: This paper is about a better understanding of the structure and dynamics of science and the usage of these insights for compensating the typical problems that arises in metadata-driven Digital Libraries. Three science model driven retrieval services are presented: co-word analysis based query expansion, re-ranking via Bradfordizing and author centrality. The services are evaluated with relevance assessments from which two important implications emerge: (1) precision values of the retrieval services are the same or better than the tf-idf retrieval baseline and (2) each service retrieved a disjoint set of documents. The different services each favor quite other - but still relevant - documents than pure term-frequency based rankings. The proposed models and derived retrieval services therefore open up new viewpoints on the scientific knowledge space and provide an alternative framework to structure scholarly information systems.
Theme: Semantische Interoperabilität

Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.01
```
0.010100399 = product of:
  0.09595379 = sum of:
    0.047976896 = weight(_text_:web in 3752) [ClassicSimilarity], result of:
      0.047976896 = score(doc=3752,freq=20.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.5701118 = fieldWeight in 3752, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3752)
    0.047976896 = weight(_text_:web in 3752) [ClassicSimilarity], result of:
      0.047976896 = score(doc=3752,freq=20.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.5701118 = fieldWeight in 3752, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3752)
  0.10526316 = coord(2/19)
```
Abstract

Purpose: To provide a critical review of Bergman's 2001 study on the Deep Web. In addition, we bring a new concept into the discussion, the Academic Invisible Web (AIW). We define the Academic Invisible Web as consisting of all databases and collections relevant to academia but not searchable by the general-purpose internet search engines. Indexing this part of the Invisible Web is central to scien-tific search engines. We provide an overview of approaches followed thus far. Design/methodology/approach: Discussion of measures and calculations, estima-tion based on informetric laws. Literature review on approaches for uncovering information from the Invisible Web. Findings: Bergman's size estimate of the Invisible Web is highly questionable. We demonstrate some major errors in the conceptual design of the Bergman paper. A new (raw) size estimate is given. Research limitations/implications: The precision of our estimate is limited due to a small sample size and lack of reliable data. Practical implications: We can show that no single library alone will be able to index the Academic Invisible Web. We suggest collaboration to accomplish this task. Originality/value: Provides library managers and those interested in developing academic search engines with data on the size and attributes of the Academic In-visible Web.

Content

Bezug zu: Bergman, M.K.: The Deep Web: surfacing hidden value. In: Journal of Electronic Publishing. 7(2001) no.1, S.xxx-xxx. [Vgl. unter: http://www.press.umich.edu/jep/07-01/bergman.html].
Lewandowski, D.; Mayr, P.: Exploring the academic invisible Web (2006) 0.01
```
0.00958208 = product of:
  0.091029756 = sum of:
    0.045514878 = weight(_text_:web in 2580) [ClassicSimilarity], result of:
      0.045514878 = score(doc=2580,freq=18.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.5408555 = fieldWeight in 2580, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2580)
    0.045514878 = weight(_text_:web in 2580) [ClassicSimilarity], result of:
      0.045514878 = score(doc=2580,freq=18.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.5408555 = fieldWeight in 2580, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2580)
  0.10526316 = coord(2/19)
```
Abstract

Purpose: To provide a critical review of Bergman's 2001 study on the deep web. In addition, we bring a new concept into the discussion, the academic invisible web (AIW). We define the academic invisible web as consisting of all databases and collections relevant to academia but not searchable by the general-purpose internet search engines. Indexing this part of the invisible web is central to scientific search engines. We provide an overview of approaches followed thus far. Design/methodology/approach: Discussion of measures and calculations, estimation based on informetric laws. Literature review on approaches for uncovering information from the invisible web. Findings: Bergman's size estimate of the invisible web is highly questionable. We demonstrate some major errors in the conceptual design of the Bergman paper. A new (raw) size estimate is given. Research limitations/implications: The precision of our estimate is limited due to a small sample size and lack of reliable data. Practical implications: We can show that no single library alone will be able to index the academic invisible web. We suggest collaboration to accomplish this task. Originality/value: Provides library managers and those interested in developing academic search engines with data on the size and attributes of the academic invisible web.

Mayr, P.; Tosques, F.: Webometrische Analysen mit Hilfe der Google Web APIs (2005) 0.01

0.0077451034 = product of:
  0.073578484 = sum of:
    0.036789242 = weight(_text_:web in 3189) [ClassicSimilarity], result of:
      0.036789242 = score(doc=3189,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.43716836 = fieldWeight in 3189, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3189)
    0.036789242 = weight(_text_:web in 3189) [ClassicSimilarity], result of:
      0.036789242 = score(doc=3189,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.43716836 = fieldWeight in 3189, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3189)
  0.10526316 = coord(2/19)

Abstract: Der Report stellt die Möglichkeiten und Einschränkungen der Google Web APIs (Google API) dar. Die Implementierung der Google API anhand einzelner informationswissenschaftlicher Untersuchungen aus der Webometrie ergibt, dass die Google API mit Einschränkungen für internetbezogene Untersuchungen eingesetzt werden können. Vergleiche der Trefferergebnisse über die beiden Google-Schnittstellen Google API und die Standard Weboberfläche Google.com (Google Web) zeigen Unterschiede bezüglich der Reichweite, der Zusammensetzung und Verfügbarkeit. Die Untersuchung basiert auf einfachen und erweiterten Suchanfragen in den Sprachen Deutsch und Englisch. Die analysierten Treffermengen der Google API bestätigen tendenziell frühere Internet-Studien.

Mayr, P.; Petras, V.: Building a Terminology Network for Search : the KoMoHe project (2008) 0.01

0.007408085 = product of:
  0.070376806 = sum of:
    0.058148954 = weight(_text_:semantische in 2618) [ClassicSimilarity], result of:
      0.058148954 = score(doc=2618,freq=2.0), product of:
        0.13923967 = queryWeight, product of:
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.025786186 = queryNorm
        0.41761774 = fieldWeight in 2618, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.399778 = idf(docFreq=542, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2618)
    0.012227853 = product of:
      0.024455706 = sum of:
        0.024455706 = weight(_text_:22 in 2618) [ClassicSimilarity], result of:
          0.024455706 = score(doc=2618,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.2708308 = fieldWeight in 2618, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2618)
      0.5 = coord(1/2)
  0.10526316 = coord(2/19)

Source: Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Theme: Semantische Interoperabilität

Daquino, M.; Peroni, S.; Shotton, D.; Colavizza, G.; Ghavimi, B.; Lauscher, A.; Mayr, P.; Romanello, M.; Zumstein, P.: ¬The OpenCitations Data Model (2020) 0.01
```
0.00663866 = product of:
  0.06306727 = sum of:
    0.031533636 = weight(_text_:web in 38) [ClassicSimilarity], result of:
      0.031533636 = score(doc=38,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.37471575 = fieldWeight in 38, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=38)
    0.031533636 = weight(_text_:web in 38) [ClassicSimilarity], result of:
      0.031533636 = score(doc=38,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.37471575 = fieldWeight in 38, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.046875 = fieldNorm(doc=38)
  0.10526316 = coord(2/19)
```
Abstract

A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data supplier or context application. In this paper we present the OpenCitations Data Model (OCDM), a generic data model for describing bibliographic entities and citations, developed using Semantic Web technologies. We also evaluate the effective reusability of OCDM according to ontology evaluation practices, mention existing users of OCDM, and discuss the use and impact of OCDM in the wider open science community.

Content

Erschienen in: The Semantic Web - ISWC 2020, 19th International Semantic Web Conference, Athens, Greece, November 2-6, 2020, Proceedings, Part II. Vgl.: DOI: 10.1007/978-3-030-62466-8_28.
Reichert, S.; Mayr, P.: Untersuchung von Relevanzeigenschaften in einem kontrollierten Eyetracking-Experiment (2012) 0.01
```
0.0055947695 = product of:
  0.05315031 = sum of:
    0.042669293 = weight(_text_:suche in 328) [ClassicSimilarity], result of:
      0.042669293 = score(doc=328,freq=2.0), product of:
        0.12883182 = queryWeight, product of:
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.025786186 = queryNorm
        0.3312015 = fieldWeight in 328, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.996156 = idf(docFreq=812, maxDocs=44218)
          0.046875 = fieldNorm(doc=328)
    0.010481017 = product of:
      0.020962033 = sum of:
        0.020962033 = weight(_text_:22 in 328) [ClassicSimilarity], result of:
          0.020962033 = score(doc=328,freq=2.0), product of:
            0.09029883 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.025786186 = queryNorm
            0.23214069 = fieldWeight in 328, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=328)
      0.5 = coord(1/2)
  0.10526316 = coord(2/19)
```
Abstract

In diesem Artikel wird ein Eyetracking-Experiment beschrieben, bei dem untersucht wurde, wann und auf Basis welcher Informationen Relevanzentscheidungen bei der themenbezogenen Dokumentenbewertung fallen und welche Faktoren auf die Relevanzentscheidung einwirken. Nach einer kurzen Einführung werden relevante Studien aufgeführt, in denen Eyetracking als Untersuchungsmethode für Interaktionsverhalten mit Ergebnislisten (Information Seeking Behavior) verwendet wurde. Nutzerverhalten wird hierbei vor allem durch unterschiedliche Aufgaben-Typen, dargestellte Informationen und durch das Ranking eines Ergebnisses beeinflusst. Durch EyetrackingUntersuchungen lassen sich Nutzer außerdem in verschiedene Klassen von Bewertungs- und Lesetypen einordnen. Diese Informationen können als implizites Feedback genutzt werden, um so die Suche zu personalisieren und um die Relevanz von Suchergebnissen ohne aktives Zutun des Users zu erhöhen. In einem explorativen Eyetracking-Experiment mit 12 Studenten der Hochschule Darmstadt werden anhand der Länge der Gesamtbewertung, Anzahl der Fixationen, Anzahl der besuchten Metadatenelemente und Länge des Scanpfades zwei typische Bewertungstypen identifiziert. Das Metadatenfeld Abstract wird im Experiment zuverlässig als wichtigste Dokumenteigenschaft für die Zuordnung von Relevanz ermittelt.

Date

22. 7.2012 19:25:54
Schaer, P.; Mayr, P.; Sünkler, S.; Lewandowski, D.: How relevant is the long tail? : a relevance assessment study on million short (2016) 0.01
```
0.0055322167 = product of:
  0.05255606 = sum of:
    0.02627803 = weight(_text_:web in 3144) [ClassicSimilarity], result of:
      0.02627803 = score(doc=3144,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.3122631 = fieldWeight in 3144, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3144)
    0.02627803 = weight(_text_:web in 3144) [ClassicSimilarity], result of:
      0.02627803 = score(doc=3144,freq=6.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.3122631 = fieldWeight in 3144, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3144)
  0.10526316 = coord(2/19)
```
Abstract

Users of web search engines are known to mostly focus on the top ranked results of the search engine result page. While many studies support this well known information seeking pattern only few studies concentrate on the question what users are missing by neglecting lower ranked results. To learn more about the relevance distributions in the so-called long tail we conducted a relevance assessment study with the Million Short long-tail web search engine. While we see a clear difference in the content between the head and the tail of the search engine result list we see no statistical significant differences in the binary relevance judgments and weak significant differences when using graded relevance. The tail contains different but still valuable results. We argue that the long tail can be a rich source for the diversification of web search engine result lists but it needs more evaluation to clearly describe the differences.
Mayr, P.: Google Scholar als akademische Suchmaschine (2009) 0.00
```
0.003613629 = product of:
  0.034329474 = sum of:
    0.017164737 = weight(_text_:web in 3023) [ClassicSimilarity], result of:
      0.017164737 = score(doc=3023,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2039694 = fieldWeight in 3023, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=3023)
    0.017164737 = weight(_text_:web in 3023) [ClassicSimilarity], result of:
      0.017164737 = score(doc=3023,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2039694 = fieldWeight in 3023, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=3023)
  0.10526316 = coord(2/19)
```
Abstract

Neben den klassischen Informationsanbietern Bibliothek, Fachinformation und den Verlagen sind Internetsuchmaschinen inzwischen fester Bestandteil bei der Recherche nach wissenschaftlicher Information. Scirus (Elsevier, 2004) und Google Scholar sind zwei Beispiele für Suchdienste kommerzieller Suchmaschinen-Unternehmen, die eine Einschränkung auf den wissenschaftlichen Dokumentenraum anstreben und nennenswerte Dokumentzahlen in allen Disziplinen generieren. Der Vergleich der Treffermengen für beliebige Suchthemen zeigt, dass die Wahl des Suchsystems, des Dokumentenpools und der Dokumenttypen entscheidenden Einfluss auf die Relevanz und damit letztlich auch die Akzeptanz des Suchergebnisses hat. Tabelle 1 verdeutlicht die Mengenunterschiede am Beispiel der Trefferergebnisse für die Suchbegriffe "search engines" bzw. "Suchmaschinen" in der allgemeinen Internetsuchmaschine Google, der wissenschaftlichen Suchmaschine Google Scholar (GS) und der größten fachübergreifenden bibliographischen Literaturdatenbank Web of Science (WoS). Der Anteil der Dokumente, die in diesem Fall eindeutig der Wissenschaft zuzuordnen sind (siehe GS und insbesondere WoS in Tabelle 1), liegt gegenüber der allgemeinen Websuche lediglich im Promille-Bereich. Dieses Beispiel veranschaulicht, dass es ausgesprochen problematisch sein kann, fachwissenschaftliche Fragestellungen ausschließlich mit Internetsuchmaschinen zu recherchieren. Der Anteil der fachwissenschaftlich relevanten Dokumente in diesem Trefferpool ist i. d. R. sehr gering. Damit sinkt die Wahrscheinlichkeit, wissenschaftlich relevantes (z. B. einen Zeitschriftenaufsatz) auf den ersten Trefferseiten zu finden, deutlich ab.
Die drei oben genannten Suchsysteme (Google, GS und WoS) unterscheiden sich in mehrerlei Hinsicht fundamental und eignen sich daher gut, um in die Grundthematik dieses Artikels einzuleiten. Die obigen Suchsysteme erschließen zunächst unterschiedliche Suchräume, und dies auf sehr spezifische Weise. Während Google frei zugängliche und über Hyperlink adressierbare Dokumente im Internet erfasst, gehen die beiden akademischen Suchsysteme deutlich selektiver bei der Inhaltserschließung vor. Google Scholar erfasst neben frei zugänglichen elektronischen Publikationstypen im Internet hauptsächlich wissenschaftliche Dokumente, die direkt von den akademischen Verlagen bezogen werden. Das WoS, das auf den unterschiedlichen bibliographischen Datenbanken und Zitationsindizes des ehemaligen "Institute for Scientific Information" (ISI) basiert, selektiert gegenüber den rein automatischen brute-force-Ansätzen der Internetsuchmaschine über einen qualitativen Ansatz. In den Datenbanken des WoS werden ausschließlich internationale Fachzeitschriften erfasst, die ein kontrolliertes Peer-Review durchlaufen. Insgesamt werden ca. 12.000 Zeitschriften ausgewertet und über die Datenbank verfügbar gemacht. Wie bereits erwähnt, spielt neben der Abgrenzung der Suchräume und Dokumenttypen die Zugänglichkeit und Relevanz der Dokumente eine entscheidende Bedeutung für den Benutzer. Die neueren technologischen Entwicklungen des Web Information Retrieval (IR), wie sie Google oder GS implementieren, werten insbesondere frei zugängliche Dokumente mit ihrer gesamten Text- und Linkinformation automatisch aus. Diese Verfahren sind vor allem deshalb erfolgreich, weil sie Ergebnislisten nach Relevanz gerankt darstellen, einfach und schnell zu recherchieren sind und direkt auf die Volltexte verweisen. Die qualitativen Verfahren der traditionellen Informationsanbieter (z. B. WoS) hingegen zeigen genau bei diesen Punkten (Ranking, Einfachheit und Volltextzugriff) Schwächen, überzeugen aber vor allem durch ihre Stringenz, in diesem Fall die selektive Aufnahme von qualitätsgeprüften Dokumenten in das System und die inhaltliche Erschließung der Dokumente (siehe dazu Mayr und Petras, 2008).
Hobert, A.; Jahn, N.; Mayr, P.; Schmidt, B.; Taubert, N.: Open access uptake in Germany 2010-2018 : adoption in a diverse research landscape (2021) 0.00
```
0.003613629 = product of:
  0.034329474 = sum of:
    0.017164737 = weight(_text_:web in 250) [ClassicSimilarity], result of:
      0.017164737 = score(doc=250,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2039694 = fieldWeight in 250, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=250)
    0.017164737 = weight(_text_:web in 250) [ClassicSimilarity], result of:
      0.017164737 = score(doc=250,freq=4.0), product of:
        0.08415349 = queryWeight, product of:
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.025786186 = queryNorm
        0.2039694 = fieldWeight in 250, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.2635105 = idf(docFreq=4597, maxDocs=44218)
          0.03125 = fieldNorm(doc=250)
  0.10526316 = coord(2/19)
```
Abstract

Es handelt sich um eine bibliometrische Untersuchung der Entwicklung der Open-Access-Verfügbarkeit wissenschaftlicher Zeitschriftenartikel in Deutschland, die im Zeitraum 2010-18 erschienen und im Web of Science indexiert sind. Ein besonderes Augenmerk der Analyse lag auf der Frage, ob und inwiefern sich die Open-Access-Profile der Universitäten und außeruniversitären Wissenschaftseinrichtungen in Deutschland voneinander unterscheiden.

Content

This study investigates the development of open access (OA) to journal articles from authors affiliated with German universities and non-university research institutions in the period 2010-2018. Beyond determining the overall share of openly available articles, a systematic classification of distinct categories of OA publishing allowed us to identify different patterns of adoption of OA. Taking into account the particularities of the German research landscape, variations in terms of productivity, OA uptake and approaches to OA are examined at the meso-level and possible explanations are discussed. The development of the OA uptake is analysed for the different research sectors in Germany (universities, non-university research institutes of the Helmholtz Association, Fraunhofer Society, Max Planck Society, Leibniz Association, and government research agencies). Combining several data sources (incl. Web of Science, Unpaywall, an authority file of standardised German affiliation information, the ISSN-Gold-OA 3.0 list, and OpenDOAR), the study confirms the growth of the OA share mirroring the international trend reported in related studies. We found that 45% of all considered articles during the observed period were openly available at the time of analysis. Our findings show that subject-specific repositories are the most prevalent type of OA. However, the percentages for publication in fully OA journals and OA via institutional repositories show similarly steep increases. Enabling data-driven decision-making regarding the implementation of OA in Germany at the institutional level, the results of this study furthermore can serve as a baseline to assess the impact recent transformative agreements with major publishers will likely have on scholarly communication.

Search (27 results, page 1 of 2)

Authors

Years

Languages

Types

Themes

Classifications