EpiHub - Open Digital Epigraphy Hub

Palladino, Chiara, and Tariq Yousef. 2024. “Development of Robust NER Models and Named Entity Tagsets for Ancient Greek.” In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, edited by Rachele Sprugnoli and Marco Passarotti, 89–97. Torino, Italia: ELRA and ICCL. https://aclanthology.org/2024.lt4hala-1.11/.

This contribution presents a novel approach to the development and evaluation of transformer-based models for Named Entity Recognition and Classification in Ancient Greek texts. We trained two models with annotated datasets by consolidating potentially ambiguous entity types under a harmonized set of classes. Then, we tested their performance with out-of-domain texts, reproducing a real-world use case. Both models performed very well under these conditions, with the multilingual model being slightly superior to the monolingual one. In the conclusion, we emphasize current limitations due to the scarcity of high-quality annotated corpora and to the lack of cohesive annotation strategies for ancient languages.

Panciera, Silvio, and Silvia Orlandi. 2017. “EAGLE: Past, Present, and Future.” In Digital and Traditional Epigraphy in Context. Proceedings of the EAGLE 2016 International Conference, edited by Silvia Orlandi, Raffaella Santucci, Francesco Mambrini, and Pietro Maria Liuzzo, 1–10. Roma: Sapienza Università Editrice. doi:10.13133/978-88-9377-021-7.

Polis, Stéphane, Anne-Claude Honnay, and Jean Winand. 2013. “Building an Annotated Corpus of Late Egyptian. The Ramses Project: Review and Perspectives.” In Texts, Languages & Information Technology in Egyptology. Selected Papers from the Meeting of the Computer Working Group of the International Association of Egyptologists (Informatique & Égyptologie), Liège, 6-8 July 2010, edited by Stéphane Polis and Jean Winand, 25–44. Ægyptiaca Leodiensia 9. Liège, Belgium: Presses Universitaires de Liège. https://hdl.handle.net/2268/110307.

This paper reviews the experience of the Ramses Project in constructing a richly annotated corpus of Late Egyptian that consists of 300 000 words in 2011 (and is expected to grow to more than 1 million words in coming years). During the first five years of the project, this corpus has been encoded in hieroglyphic script, translated into French or English, and annotated for part-of-speech information, lemmatization, and morphological analysis. The methodology and working tools that have been developed in order to build this corpus are discussed here, and future developments are presented.

Polis, Stéphane, and Serge Rosmorduc. 2013. “Building a Construction-Based Treebank of Late Egyptian. The Syntactic Layer in Ramsès.” In Texts, Languages & Information Technology in Egyptology. Selected Papers from the Meeting of the Computer Working Group of the International Association of Egyptologists (Informatique & Égyptologie), Liège, 6-8 July 2010, edited by Stéphane Polis and Jean Winand, 45–59. Ægyptiaca Leodiensia 9. Liège, Belgium: Presses Universitaires de Liège. https://hdl.handle.net/2268/110307.

This paper reports on the construction-based Treebank currently under development in the framework of the Ramses Project, which aims at building a multifaceted annotated corpus of Late Egyptian texts. We describe the specifications that have been implemented and we introduce the syntactic formalism and the related representation format that are used for the syntactic annotation. Furthermore, the annotation scheme is discussed with particular attention paid to its evolutionary nature. Finally, we explain the methods as well as the annotating tool, called SyntaxEditor; we conclude by addressing the question of forthcoming developments, especially the search engine and a context-sensitive parser.

Rosmorduc, Serge, Stéphane Polis, and Jean Winand. 2009. “Ramses. A New Research Tool in Philology and Linguistics.” In Information Technology and Egyptology in 2008. Proceedings of the Meeting of the Computer Working Group of the International Association of Egyptologists (Informatique et Egyptologie), Vienna, 8–11 July 2008, edited by Nigel Strudwick, 133–42. Bible in Technology 2. Vienna, Austria: Gorgias Press. https://hdl.handle.net/2268/26438.

This paper introduces Ramses, a database of Late Egyptian texts, currently under development at the University of Liège (Belgium). Ramses sets out to be a new and powerful research tool. Its main applications are linguistically and philologically orientated. After a general overview of the structure of the database, the search engines are described in some detail.

Rossi, Irene. 2025. “Building an Ecosystem of Digital Resources on the Written Heritage of Ancient Arabia.” Archeologia e Calcolatori 36 (1): 469–80. doi:10.19282/ac.36.1.2025.26.

The Digital Archive for the Study of pre-Islamic Arabian Inscriptions (DASI, https://dasi.cnr.it/) currently provides open access to the digital editions of nearly 8800 ancient epigraphic texts from the Arabian Peninsula. After presenting an outline of the DASI ecosystem through its 25-year history, this paper focuses on the recent enrichment of its data model, carried out within a pilot project of the E-RIHS infrastructure under the H2IOSC programme. The aim was to optimise DASI as an up-to-date tool for the digital critical edition of a broad spectrum of epigraphic sources from ancient Arabia, including graffiti, instrumenta inscripta, coins, and inscribed sticks, alongside ‘monumental’ inscriptions. Most of the interventions targeted the description of the visual aspect of writing and related contextual information, enhancing the digital representation of the material dimension of written heritage, which is often overlooked in philological studies. Ongoing work is targeting the FAIRification of DASI data, which has so far resulted in the sharing of an extensive bibliography of 1800 records through Zotero.

Salomon, Corinna. 2024. “Lexicon Leponticum – Concept and Implementation.” In Cisalpine Celtic Literacy – Proceedings of the International Symposium Maynooth 23–24 June 2022, edited by Corinna Salomon and David Stifter. Hagen.

Schiettecatte, Jérémie, and Irene Rossi. 2021. “Mapping and Synthesizing Ancient Arabia: The Maparabia Project (2019-23).” The International Association for the Study of Arabia Bulletin 25: 17–18. https://shs.hal.science/halshs-03619697.

Based on archaeological data and large epigraphic corpora (DASI, OCIANA), the project aims to develop three free online research instruments, adhering to Open Science and FAIR principles: (1) a digital atlas of ancient Arabia; (2) a gazetteer of ancient Arabia; and (3) a Thematic Dictionary of Ancient Arabia (TDAA).

Seales, W. Brent, and Christy Y. Chapman. 2023. “From Stone to Silicon: Technical Advances in Epigraphy.” International Journal on Digital Libraries 24 (2): 129–38. doi:10.1007/s00799-023-00362-5.

Through the annals of time, writing has slowly scrawled its way from the painted surfaces of stone walls to the grooves of inscriptions to the strokes of quill, pen, and ink. While we still inscribe stone (tombstones, monuments) and we continue to write on skin (tattoos abound), our quotidian method of writing on paper is increasingly abandoned in favor of the quick-to-generate digital text. And even though the stone-inscribed text of epigraphy offers demonstrably better permanence than that of writing on skin and paper—even better than that of the memory system of the modern computer (Bollacker in Am Sci 98:106, 2010)—this field of study has also made the digital leap. Today’s scholarly analyses of epigraphic content increasingly rely on high-tech approaches involving data science and computer models. This essay discusses how advances in a number of exciting technologies are enabling the digital analysis of epigraphic texts and accelerating the ability of scholars to preserve, renew, and reinvigorate the study of the inscriptions that remain from throughout history.

Sövegjártó, Szilvia, and Márton Vér, eds. 2024. Exploring Multilingualism and Multiscriptism in Written Artefacts. Studies in Manuscript Cultures 38. Berlin; Boston: De Gruyter. doi:10.1515/9783111380544.

Tabin, J. 2022. “From Papyrus to Pixels: Optical Character Recognition Applied to Ancient Egyptian Hieratic.” https://doi.org/10.6082/uchicago.3695.

Tupman, Charlotte. 2021. “Where Can Our Inscriptions Take Us? Harnessing the Potential of Linked Open Data for Epigraphy.” In Epigraphy in the Digital Age. Opportunities and Challenges in the Recording, Analysis and Dissemination of Inscriptions, edited by Isabel Velázquez Soriano and David Espinosa Espinosa, 115–28. Oxford: Archaeopress.

University of Chicago, and Krisztián Vértes. 2014. Digital Epigraphy. The University of Chicago Oriental Institute Publications. Chicago: The Epigraphic Survey at Oriental Institute of the University of Chicago.

Vagionakis, Irene. 2021. “Cretan Institutional Inscriptions: A New EpiDoc Database.” Journal of the Text Encoding Initiative. doi:10.4000/jtei.3570.

Waters, Donald J. 2023. “The Emerging Digital Infrastructure for Research in the Humanities.” International Journal on Digital Libraries 24 (2): 87–102. doi:10.1007/s00799-022-00332-3.

This article advances the thesis that three decades of investments by national and international funders, combined with those of scholars, technologists, librarians, archivists, and their institutions, have resulted in a digital infrastructure in the humanities that is now capable of supporting end-to-end research workflows. The article refers to key developments in the epigraphy and paleography of the premodern period. It draws primarily on work in classical studies but also highlights related work in the adjacent disciplines of Egyptology, ancient Near East studies, and medieval studies. The argument makes a case that much has been achieved but it does not declare “mission accomplished.” The capabilities of the infrastructure remain unevenly distributed within and across disciplines, institutions, and regions. Moreover, the components, including the links between steps in the workflow, are generally far from user-friendly and seamless in operation. Because further refinements and additional capacities are still much needed, the article concludes with a discussion of key priorities for future work.

Williams, Henrik, Marco Bianchi, and Christiane Zimmermann. 2021. “Corpus Editions of Runic Inscriptions in Supranational Databases.” Futhark: International Journal of Runic Studies 12: 117–35. doi:10.33063/diva-491882.

Winand, Jean, Stéphane Polis, and Serge Rosmorduc. 2015. “Ramses. An Annotated Corpus of Late Egyptian.” In Proceedings of the 10th International Congress of Egyptologists. University of the Aegean, Rhodes 22-29 May 2008, edited by P. Kousoulis and N. Lazaridis, 1513–21. Orientalia Lovaniensia Analecta 241. Leuven – Paris – Bristol, CT: Peeters. https://hdl.handle.net/2268/23960.

First official presentation of the "Ramses Project", a richly annotated corpus of Late Egyptian [Paper submitted in 2008/2009].

Yousef, Tariq, Chiara Palladino, and Farnoosh Shamsian. 2023. “Classical Philology in the Time of AI: Exploring the Potential of Parallel Corpora in Ancient Language.” In Proceedings of the Ancient Language Processing Workshop, edited by Adam Anderson, Shai Gordin, Bin Li, Yudong Liu, and Marco C. Passarotti, 179–92. Varna, Bulgaria: INCOMA Ltd., Shoumen, Bulgaria. https://aclanthology.org/2023.alp-1.21/.

This paper provides an overview of diverse applications of parallel corpora in ancient languages, particularly Ancient Greek. In the first part, we provide the fundamental principles of parallel corpora and a short overview of their applications in the study of ancient texts. In the second part, we illustrate how to leverage parallel corpora to perform various NLP tasks, including automatic translation alignment, dynamic lexica induction, and Named Entity Recognition. In the conclusions, we emphasize current limitations and future work.

Bibliography

Search and browse EpiHub's bibliography
