Search (32 results, page 2 of 2)

Laparra, E.; Binford-Walsh, A.; Emerson, K.; Miller, M.L.; López-Hoffman, L.; Currim, F.; Bethard, S.: Addressing structural hurdles for metadata extraction from environmental impact statements (2023) 0.00
```
0.0016913437 = product of:
  0.0033826875 = sum of:
    0.0033826875 = product of:
      0.006765375 = sum of:
        0.006765375 = weight(_text_:a in 1042) [ClassicSimilarity], result of:
          0.006765375 = score(doc=1042,freq=8.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.12739488 = fieldWeight in 1042, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1042)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Natural language processing techniques can be used to analyze the linguistic content of a document to extract missing pieces of metadata. However, accurate metadata extraction may not depend solely on the linguistics, but also on structural problems such as extremely large documents, unordered multi-file documents, and inconsistency in manually labeled metadata. In this work, we start from two standard machine learning solutions to extract pieces of metadata from Environmental Impact Statements, environmental policy documents that are regularly produced under the US National Environmental Policy Act of 1969. We present a series of experiments where we evaluate how these standard approaches are affected by different issues derived from real-world data. We find that metadata extraction can be strongly influenced by nonlinguistic factors such as document length and volume ordering and that the standard machine learning solutions often do not scale well to long documents. We demonstrate how such solutions can be better adapted to these scenarios, and conclude with suggestions for other NLP practitioners cataloging large document collections.

Type

a

Dogtas, G.; Ibitz, M.-P.; Jonitz, F.; Kocher, V.; Poyer, A.,; Stapf, L.: Kritik an rassifizierenden und diskriminierenden Titeln und Metadaten : Praxisorientierte Lösungsansätze (2022) 0.00

0.001674345 = product of:
  0.00334869 = sum of:
    0.00334869 = product of:
      0.00669738 = sum of:
        0.00669738 = weight(_text_:a in 1828) [ClassicSimilarity], result of:
          0.00669738 = score(doc=1828,freq=4.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.12611452 = fieldWeight in 1828, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1828)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Heng, G.; Cole, T.W.; Tian, T.(C.); Han, M.-J.: Rethinking authority reconciliation process (2022) 0.00
```
0.001674345 = product of:
  0.00334869 = sum of:
    0.00334869 = product of:
      0.00669738 = sum of:
        0.00669738 = weight(_text_:a in 727) [ClassicSimilarity], result of:
          0.00669738 = score(doc=727,freq=4.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.12611452 = fieldWeight in 727, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=727)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Entity identity management and name reconciliation are intrinsic to both Linked Open Data (LOD) and traditional library authority control. Does this mean that LOD sources can facilitate authority control? This Emblematica Online case study examines the utility of five LOD sources for name reconciliation, comparing design differences regarding ontologies, linking models, and entity properties. It explores the challenges of name reconciliation in the LOD environment and provides lessons learned during a semi-automated name reconciliation process. It also briefly discusses the potential values and benefits of LOD authorities to the authority reconciliation process itself and library services in general.

Type

a
Nabavi, M.; Karimi, E.: Metadata elements for children in theory and practice (2022) 0.00
```
0.0014351527 = product of:
  0.0028703054 = sum of:
    0.0028703054 = product of:
      0.005740611 = sum of:
        0.005740611 = weight(_text_:a in 1110) [ClassicSimilarity], result of:
          0.005740611 = score(doc=1110,freq=4.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.10809815 = fieldWeight in 1110, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=1110)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

This research aimed to investigate the status of children-specific metadata elements in theory (existing literature) and practice (metadata standards and children's digital libraries). Literature reviews as well as two cases, including children's online national libraries of Iran, and Singapore, are used to identify children-specific metadata elements and their application. The results revealed that descriptive metadata types had been mentioned more than analytical, social, and relational types; the DCMI metadata standard, besides LOM and ALTO metadata standards, can be used to develop an application profile for children's library catalogs. Two cases showed that they partially cover children-specific metadata elements, and neither has covered relational metadata elements. A deeper analysis of the children-specific metadata elements suggests that children's catalogs should be semantic and social. The results of this study can be insightful for children's book catalogers and children's book publishers (for marketing purposes).

Type

a

Skare, R.: Paratext (2020) 0.00

0.001353075 = product of:
  0.00270615 = sum of:
    0.00270615 = product of:
      0.0054123 = sum of:
        0.0054123 = weight(_text_:a in 20) [ClassicSimilarity], result of:
          0.0054123 = score(doc=20,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.10191591 = fieldWeight in 20, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=20)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Laczny, J.: Fit for Purpose : Standardisierung von inhaltserschließenden Informationen durch Richtlinien für Metadaten (2021) 0.00

0.001353075 = product of:
  0.00270615 = sum of:
    0.00270615 = product of:
      0.0054123 = sum of:
        0.0054123 = weight(_text_:a in 363) [ClassicSimilarity], result of:
          0.0054123 = score(doc=363,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.10191591 = fieldWeight in 363, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=363)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Assfalg, R.: Metadaten (2023) 0.00

0.001353075 = product of:
  0.00270615 = sum of:
    0.00270615 = product of:
      0.0054123 = sum of:
        0.0054123 = weight(_text_:a in 787) [ClassicSimilarity], result of:
          0.0054123 = score(doc=787,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.10191591 = fieldWeight in 787, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0625 = fieldNorm(doc=787)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Lorenzo, L.; Mak, L.; Smeltekop, N.: FAST Headings in MODS : Michigan State University libraries digital repository case study (2023) 0.00

0.0011839407 = product of:
  0.0023678814 = sum of:
    0.0023678814 = product of:
      0.0047357627 = sum of:
        0.0047357627 = weight(_text_:a in 1177) [ClassicSimilarity], result of:
          0.0047357627 = score(doc=1177,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.089176424 = fieldWeight in 1177, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1177)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Qualität in der Inhaltserschließung (2021) 0.00

9.567685E-4 = product of:
  0.001913537 = sum of:
    0.001913537 = product of:
      0.003827074 = sum of:
        0.003827074 = weight(_text_:a in 753) [ClassicSimilarity], result of:
          0.003827074 = score(doc=753,freq=4.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.072065435 = fieldWeight in 753, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.03125 = fieldNorm(doc=753)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Editor: Franke-Maier, M., A. Kasprzik, A. Ledl u. H. Schürmann

Markus, K.: Metadatenschemata für Forschungsdaten : Generische Standards und Spezifika in der Biologie und den Ingenieurwissenschaften (2020) 0.00

8.4567186E-4 = product of:
  0.0016913437 = sum of:
    0.0016913437 = product of:
      0.0033826875 = sum of:
        0.0033826875 = weight(_text_:a in 133) [ClassicSimilarity], result of:
          0.0033826875 = score(doc=133,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.06369744 = fieldWeight in 133, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=133)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Neudecker, C.; Zaczynska, K.; Baierer, K.; Rehm, G.; Gerber, M.; Moreno Schneider, J.: Methoden und Metriken zur Messung von OCR-Qualität für die Kuratierung von Daten und Metadaten (2021) 0.00

8.4567186E-4 = product of:
  0.0016913437 = sum of:
    0.0016913437 = product of:
      0.0033826875 = sum of:
        0.0033826875 = weight(_text_:a in 369) [ClassicSimilarity], result of:
          0.0033826875 = score(doc=369,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.06369744 = fieldWeight in 369, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=369)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Qin, C.; Liu, Y.; Ma, X.; Chen, J.; Liang, H.: Designing for serendipity in online knowledge communities : an investigation of tag presentation formats and openness to experience (2022) 0.00

8.4567186E-4 = product of:
  0.0016913437 = sum of:
    0.0016913437 = product of:
      0.0033826875 = sum of:
        0.0033826875 = weight(_text_:a in 664) [ClassicSimilarity], result of:
          0.0033826875 = score(doc=664,freq=2.0), product of:
            0.053105544 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046056706 = queryNorm
            0.06369744 = fieldWeight in 664, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=664)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: a

Search (32 results, page 2 of 2)

Authors

Languages

Types

Themes

Subjects

Classifications