2017-08-09 | WCJS #810

Linked Open Data in Jewish Studies

Sharing Data

The Case of Jewish Funerary Inscriptions Databases

Data: https://github.com/KollatzThomas/2017_Jerusalem_WCJS
Slides: https://kollatzthomas.github.io/2017_Jerusalem_WCJS
Thomas Kollatz | @kol_t | Twitter KollatzThomas | CC-BY 4.0
Mainz – Academy of Science and Literature | Digital Academy
Essen – Steinheim-Institute for German-Jewish History

Table of Contents

01
databases & structured access
02
formats & controlled vocabularies
03
linked open data & sharing

01

databases & structured access

Epigraphic databases

              IIP – Inscriptions of Israel | Palestine    epidat – Database of Jewish Epigraphy
online
inscriptions  3,423                                       33,065
time          500 BCE - 640 CE                            mid 11th - 20th century CE
space         Israel | Palestine                          Germany | Netherlands | Czechia | Lithuania
content       different types of inscriptions             funerary inscriptions

IIP – access options

text-based search
  • text | translation
  • description | place | id
  • figures
index-based search
  • places
  • dates
  • type of inscription
  • form of inscription | physical type
  • language
  • religion

epidat – access options

Browse
  • chronological
    • by year
    • by period
  • topographical
    • by place
    • by region
    • by map
  • visual – by images
Access
Analyze w/ embedded web services

02

data & controlled vocabularies

shared format

one format for (two) epigraphic databases

EpiDoc: Epigraphic documents in TEI XML

IIP
The texts are extensively marked-up as part of their addition to the database. We are using the Epidoc schema and Guidelines to mark up our inscriptions. Epidoc is a customization of the Text Encoding Initiative schema that was developed specifically for marking up inscribed objects. Rather than treating the text as the primary object (with the goal of moving it relatively easily to publication), our mark-up treats the inscribed object as primary.
Inscriptions may be accessed via an API

epidat
epidat records are provided in different data formats
  • as HTML
  • as EpiDoc: Epigraphic Documents in TEI XML
  • as plain text
epidat records are machine-readable and can be harvested: How to harvest epidat records

definitions

What is XML

XML
Extensible Markup Language
  • semantics instead of typography
  • device-independent and system-independent storing and processing of texts in electronic form
  • interchange and communication format
Generalizing … we define markup, or (synonymously) encoding, as any means of making explicit an interpretation of a text.

A Gentle Introduction to XML

What is TEI XML

TEI
Text Encoding Initiative
[The TEI] is a consortium which collectively develops and maintains a standard for the representation of texts in digital form. Its chief deliverable is a set of Guidelines which specify encoding methods for machine-readable texts

http://www.tei-c.org

TEI delivers Guidelines

example

Something is [rotten] in the state of Denmark

<p>
 Something is 
   <damage agent="inkstain"
           extent="6" 
           unit="chars">
       <supplied cert="high" 
                 confidence="100" 
                 reason="damage" 
                 evidence="external"
                 resp="editio_princeps">
           rotten
       </supplied>
   </damage> in the state of 
   <placeName ref="http://vocab.getty.edu/tgn/1000066">
       Denmark
   </placeName>
</p>
Use of TEI XML is Close Reading
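
The relation between the bracketed reading "Something is [rotten] in the state of Denmark" and the markup can be sketched in a few lines of Python. This is only an illustration, assuming the snippet is parsed without a namespace, exactly as printed on the slide; the function names render and reading_text are made up for this sketch and are not part of any TEI or EpiDoc tool.

```python
import xml.etree.ElementTree as ET

# The slide's example, copied as printed (no TEI namespace declared).
SNIPPET = """<p>
 Something is
   <damage agent="inkstain" extent="6" unit="chars">
       <supplied cert="high" confidence="100" reason="damage"
                 evidence="external" resp="editio_princeps">
           rotten
       </supplied>
   </damage> in the state of
   <placeName ref="http://vocab.getty.edu/tgn/1000066">
       Denmark
   </placeName>
</p>"""


def render(elem):
    """Flatten an element to text; editorially supplied text goes in [brackets]."""
    parts = [elem.text or ""]
    for child in elem:
        inner = render(child)
        if child.tag == "supplied":
            inner = "[" + inner.strip() + "]"
        parts.append(inner)
        parts.append(child.tail or "")
    return "".join(parts)


def reading_text(xml_string):
    """Return the reading text with whitespace collapsed to single spaces."""
    return " ".join(render(ET.fromstring(xml_string)).split())


print(reading_text(SNIPPET))
```

The brackets of the printed edition are derived from the supplied element, while the cause of the damage and the place identifier stay available in the markup for other uses.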

What is EpiDoc

EpiDoc TEI XML
Epigraphic Documents in TEI XML
EpiDoc is an international, collaborative effort that provides guidelines and tools for encoding scholarly and educational editions of ancient documents. It uses a subset of the Text Encoding Initiative's standard for the representation of texts in digital form and was developed initially for the publication of digital editions of ancient inscriptions [...] It addresses not only the transcription and editorial treatment of texts themselves, but also the history and materiality of the objects on which the texts appear.

https://sourceforge.net/p/epidoc/wiki/Home/
http://www.stoa.org/epidoc/gl/latest/

example

The same XML paths work for both databases

<!-- path to edition -->
/TEI//body//div [@type='edition']

<!-- path to settlement -->
/TEI/teiHeader/fileDesc/sourceDesc/msDesc/history//settlement

<!-- metadata -->
/TEI/teiHeader 
                    
<!-- data -->
/TEI/text
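
How such paths might be evaluated can be sketched with Python's standard library. The sample document below is invented for illustration ("Example Town" and the sample texts are placeholders, not data from IIP or epidat), and it assumes the records use the TEI namespace http://www.tei-c.org/ns/1.0. ElementTree only supports a subset of XPath, so the paths are written relative to the root element.

```python
import xml.etree.ElementTree as ET

NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Invented minimal record; real EpiDoc files carry far more metadata.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <sourceDesc>
        <msDesc>
          <history>
            <origin><origPlace><settlement>Example Town</settlement></origPlace></origin>
          </history>
        </msDesc>
      </sourceDesc>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <div type="edition"><ab>sample inscription text</ab></div>
      <div type="translation"><ab>sample translation</ab></div>
    </body>
  </text>
</TEI>"""


def settlements(root):
    """Counterpart of /TEI/teiHeader/fileDesc/sourceDesc/msDesc/history//settlement."""
    path = ("tei:teiHeader/tei:fileDesc/tei:sourceDesc/"
            "tei:msDesc/tei:history//tei:settlement")
    return [el.text for el in root.findall(path, NS)]


def edition_text(root):
    """Counterpart of /TEI//body//div[@type='edition']."""
    div = root.find(".//tei:body//tei:div[@type='edition']", NS)
    return "".join(div.itertext()).strip() if div is not None else None


root = ET.fromstring(SAMPLE)
print(settlements(root), edition_text(root))
```

Because both projects follow the same schema, the two functions would not need to know which database a record came from.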

authority files & controlled vocabularies

IIP

epidat

work in progress – physical form | symbols | material

By using controlled vocabularies, project-specific findings are addressed by unique identifiers [digital shelf mark]

example

Grabmal

  sameAs 
                            
מצבה

  sameAs 

τάφος

  sameAs 

sepulchral monument

  sameAs                            

http://vocab.getty.edu/aat/300005909
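
The effect of the sameAs chain can be sketched as a simple lookup: every project-specific label resolves to the same identifier. The dictionary and function names are assumptions made for this illustration; only the four labels and the AAT address come from the slide.

```python
AAT_SEPULCHRAL_MONUMENT = "http://vocab.getty.edu/aat/300005909"

# Labels in German, Hebrew, Greek and English pointing at one concept.
SAME_AS = {
    "Grabmal": AAT_SEPULCHRAL_MONUMENT,
    "מצבה": AAT_SEPULCHRAL_MONUMENT,
    "τάφος": AAT_SEPULCHRAL_MONUMENT,
    "sepulchral monument": AAT_SEPULCHRAL_MONUMENT,
}


def concept_uri(label):
    """Return the shared identifier for a label, or None if it is unmapped."""
    return SAME_AS.get(label.strip())


def labels_for(uri):
    """Return all known labels for an identifier, sorted for stable output."""
    return sorted(label for label, target in SAME_AS.items() if target == uri)


print(concept_uri("Grabmal") == concept_uri("τάφος"))
```

A search for any one of the labels can then be answered with records that were described in another language.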

example

<!-- person -->
<profileDesc>
 <particDesc>
    <listPerson>
        <person xml:id="ffb-80-1" sex="1">
            <persName ref="http://d-nb.info/gnd/11879132X">
             Meir Rothschild ben Anschel Rothschild
            </persName>
            <death when="1812-09-19"/> 
        </person>
    </listPerson>
 </particDesc>
 <langUsage>
    <language ident="he" usage="100">Hebrew</language>
 </langUsage>
 </profileDesc>

http://steinheim-institut.de/cgi-bin/epidat?id=ffb-80-teip5
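
Reading the person entry by machine could look roughly like this. The sketch parses the fragment exactly as printed above, without a TEI namespace declaration; a complete record would probably need namespace-aware paths. The function name persons and the dictionary keys are chosen for this illustration only.

```python
import xml.etree.ElementTree as ET

# xml:id lives in the predefined XML namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

PROFILE = """<profileDesc>
 <particDesc>
    <listPerson>
        <person xml:id="ffb-80-1" sex="1">
            <persName ref="http://d-nb.info/gnd/11879132X">
             Meir Rothschild ben Anschel Rothschild
            </persName>
            <death when="1812-09-19"/>
        </person>
    </listPerson>
 </particDesc>
 <langUsage>
    <language ident="he" usage="100">Hebrew</language>
 </langUsage>
</profileDesc>"""


def persons(xml_string):
    """Collect id, name, authority reference and death date of each person."""
    result = []
    for person in ET.fromstring(xml_string).iter("person"):
        name = person.find("persName")
        death = person.find("death")
        has_name = name is not None and name.text
        result.append({
            "id": person.get(XML_ID),
            "name": " ".join(name.text.split()) if has_name else None,
            "authority": name.get("ref") if name is not None else None,
            "death": death.get("when") if death is not None else None,
        })
    return result


print(persons(PROFILE))
```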

Good preconditions for Linked Open Data

IIP and epidat provide research data
  • system-independent
  • in a documented standard format
  • with controlled vocabularies
  • via machine readable interfaces
  • under an open licence

03

linked open data

Explicit Semantics

                                                 
                 
Subject    <http://steinheim-institut.de/cgi-bin/epidat?id=ffb-80-1> 
             
Predicate  sameAs 
                        
Object     <http://d-nb.info/gnd/11879132X> 

.
Linked [Open] Data needs access to data, and the relationships between those data should be made explicit

more: https://www.w3.org/standards/semanticweb/data
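
Written out as a machine-readable statement, the triple above might be serialized as one line of N-Triples. The slide only names the predicate "sameAs"; reading it as owl:sameAs is an assumption of this sketch, and the helper ntriple is hypothetical.

```python
# Assumption: the slide's "sameAs" stands for the OWL property of that name.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

SUBJECT = "http://steinheim-institut.de/cgi-bin/epidat?id=ffb-80-1"
OBJECT = "http://d-nb.info/gnd/11879132X"


def ntriple(subject, predicate, obj):
    """Serialize three IRIs as a single N-Triples statement."""
    return f"<{subject}> <{predicate}> <{obj}> ."


print(ntriple(SUBJECT, OWL_SAME_AS, OBJECT))
```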

Linked Open Data – in practice

Once the GND identifier is provided, further information about the person is retrieved on the fly by harvesting other online databases that refer to the same person by the unique identifier 11879132X

  • easy
  • effective
  • inspiring new | further research
  • vice versa: our data are available outside our limited domain
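
The principle behind this can be sketched without any network access: records from independent sources are grouped by the identifier they share. The second source and its label are invented placeholders, and merge_by_identifier is a hypothetical helper, not the mechanism epidat actually uses.

```python
from collections import defaultdict

GND_PREFIX = "http://d-nb.info/gnd/"


def gnd_id(uri):
    """Return the bare GND number of a GND address, or None for other URIs."""
    return uri[len(GND_PREFIX):] if uri.startswith(GND_PREFIX) else None


def merge_by_identifier(*sources):
    """Group (source name, label) pairs under the GND number they refer to."""
    merged = defaultdict(list)
    for source_name, records in sources:
        for record in records:
            key = gnd_id(record["same_as"])
            if key is not None:
                merged[key].append((source_name, record["label"]))
    return dict(merged)


epidat = ("epidat", [
    {"label": "Meir Rothschild ben Anschel Rothschild",
     "same_as": "http://d-nb.info/gnd/11879132X"},
])
# Invented stand-in for any other database that uses the same identifier.
other = ("other database (placeholder)", [
    {"label": "placeholder entry",
     "same_as": "http://d-nb.info/gnd/11879132X"},
])

print(merge_by_identifier(epidat, other))
```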

challenge

How to dis|un|cover the unexpected?

Names, persons, symbols, ornaments and decoration found on Jewish headstones, biblical and rabbinic quotations or poetry in the epitaphs, the materials of the headstones and the type of script used are not strictly limited to the field of epigraphy or Jewish Studies.
  • Linked Open Data could help to disseminate more or less accidental spin-off results
  • Linked Open Data is a way to make all kinds of facts uncovered in epigraphic research known to neighbouring fields of knowledge (Jewish Studies, Cultural Studies, History of Art, and Literature)

source: Rufus Pollock

Linked Open Data

Modelling – From EpiDoc TEI XML to CIDOC CRM

CIDOC CRM
The CIDOC Conceptual Reference Model (CRM) provides definitions and a formal structure for describing the implicit and explicit concepts and relationships used in cultural heritage documentation.

http://www.cidoc-crm.org

Linked Data – Triples

[Figure: graph d1e255. Nodes and edges recoverable from the figure, one statement per line]
  <http://www.steinheim-institut.de/cgi-bin/epidat?id=aha-13> is a crm:E19_Physical_Object
  <http://www.steinheim-institut.de/cgi-bin/epidat?id=aha-13> crm:P3_has_note "Breite, hochrechteckige Sandsteinstele mit geradem Abschluss. Das ver [...]"
  <http://www.steinheim-institut.de/cgi-bin/epidat?id=aha-13> crm:P131i_identifies A0
  A0 is a crm:E21_Person
  A0 crm:P2_has_type A1
  A0 crm:P131_is_identified_by A2
  A1 is a crm:E55_Type
  A1 crm:P3_has_note "Mann"
  A2 is a crm:E82_Actor_Appellation
  A2 crm:P3_has_note "Elieser ben Josef Hakohen"
  <http://www.steinheim-institut.de/cgi-bin/epidat?id=aha-13> crm:P131i_identifies A3
  A3 is a crm:E21_Person
  A3 crm:P2_has_type A4
  A3 crm:P131_is_identified_by A5
  A4 is a crm:E55_Type
  A4 crm:P3_has_note "Mann"
  A5 is a crm:E82_Actor_Appellation
  A5 crm:P3_has_note "Jehuda"
demo: http://xtriples.spatialhumanities.de
code: https://github.com/spatialhumanities/xtriples
presentation: Torsten Schrade (Digitale Akademie Mainz): CIDOC-CRM Modellierung epigraphischer Fachdaten mit dem XTriples Webservice - Werkstattbericht aus dem DARIAH UseCase 6.1 (Steinheim-Institut & ADWLM)
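
What such triples make possible can be sketched by walking the graph shown above in plain Python. The tuples repeat the edges of the figure; the helper functions are hypothetical and have nothing to do with how the XTriples webservice itself is implemented.

```python
STONE = "http://www.steinheim-institut.de/cgi-bin/epidat?id=aha-13"

# (subject, predicate, object) edges taken from the graph on this slide.
TRIPLES = [
    (STONE, "crm:P131i_identifies", "A0"),
    ("A0", "crm:P2_has_type", "A1"),
    ("A0", "crm:P131_is_identified_by", "A2"),
    ("A1", "crm:P3_has_note", "Mann"),
    ("A2", "crm:P3_has_note", "Elieser ben Josef Hakohen"),
    (STONE, "crm:P131i_identifies", "A3"),
    ("A3", "crm:P2_has_type", "A4"),
    ("A3", "crm:P131_is_identified_by", "A5"),
    ("A4", "crm:P3_has_note", "Mann"),
    ("A5", "crm:P3_has_note", "Jehuda"),
]


def objects(subject, predicate):
    """Return every object linked to the subject by the given predicate."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def names_on(stone):
    """Follow object -> person -> appellation -> note to list the names."""
    names = []
    for person in objects(stone, "crm:P131i_identifies"):
        for appellation in objects(person, "crm:P131_is_identified_by"):
            names.extend(objects(appellation, "crm:P3_has_note"))
    return names


print(names_on(STONE))
```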

F I N I S

Thank you
