The paper describes the main challenges faced, and the solutions adopted in the frame of the project DASI - Digital Archive for the study of pre-Islamic Arabian inscriptions. In particular, the methodological and technological issues emerged in the conversion from a domain-specific text-based project of digital edition of an epigraphic corpus, to an objective-driven archive for the study and dissemination of inscriptions in different languages and scripts are discussed. With a view to keeping pace with, and possibly fostering reasoning on best practices in the community of digital epigraphers beyond each specific cultural/linguistic domain, special attention is devoted to: the modelling of data and encoding (XML annotation vs database approach; the conceptual model for the valorization of the material aspect of the epigraph; the textual encoding for critical editions); interoperability (pros and cons of compliance to standards; harmonization of metadata; openness; semantic interoperability); lexicography (tools for under-resourced languages; translations).
This article discusses the challenges addressed in the digital scholarly encoding of the fragmentary texts of the languages of Ancient Italy according to the TEI/EpiDoc Guidelines in XML format. It describes the solutions and customisations that have been adopted for dealing with the peculiarities of our epigraphical documentation and with the formalisation of epigraphical information deemed interesting for data retrieval in a historical linguistic perspective. The making of a digital corpus consisting of new critical editions of selected inscriptions is a work carried out in the context of the project “Languages and Cultures of Ancient Italy. Historical Linguistics and Digital Models”, which aims to investigate the languages of Ancient Italy by combining the traditional methods, proper to historical linguistics, with methods and technology proper to the digital humanities and computational lexicography. More specifically, the purpose of the project is to create a set of interrelated digital language resources which comprise: (1) a digital corpus of texts editions; (2) a computational lexicon compliant with the Web Semantic requirements; (3) a relevant bibliographic reference dataset encoded according to the FRBRoo/LRMoo specifications. Additionally, selected textual data and scientific interpretations will be encoded using CIDOC CRM and its extensions, namely CRMtex and CRMinf. The present contribution thus tackles one of the main aspects of the project, and proposes significant innovations in the encoding of critical editions for epigraphic texts of fragmentary languages, which will hopefully foster future interoperability and integration with other external datasets, a paramount concern of the project.
Study of Roman Sicily is well established and has a long tradition, with the two most authoritative and well-established epigraphic corpora –CIL X (1883) and IG XIV (1890)– dating to the late 19th century. While I.Sicily was conceived to offer easy and up-to-date access to the evergrowing but increasingly scattered epigraphic evidence of Sicily, its digital nature also enables the adoption of new approaches and the pursuit of novel research questions. The open-access dataset has recently been expanded to include institutional annotations, which hold great promise for research, particularly in fields that rely on extensive and detailed datasets, such as administrative and onomastic history (prosopographic annotation will follow). This paper aims to demonstrate both the potential and the limitations of a digitally annotated dataset as a tool for historical research, through a preliminary case study on the practice of dedications to the Roman emperor in Sicily. Recent scholarship suggests that provincial subjects also contributed to shaping the notion and the expectations around emperorship, which were not only imposed from above. The data-driven approach facilitated by an annotated corpus is well-suited to the new bottom-up perspective, but it is not without methodological pitfalls, which will be highlighted in this paper.