Collana |
Studi di archivistica, bibliografia, paleografia
Miscellanea | Models of Data Extraction and Architecture in Relational Databases of Early Modern Private Political Archives
Capitolo | Cracking the Historical Code
Abstract
The chapter addresses a methodological approach to unstructured data and discusses the potential that structured data offers in the field of historical research. The dataset, which initially consists of textual content sourced from digital collections at the Portuguese Overseas Archives in Lisbon, undergoes a preprocessing phase that forms the basis for the extraction of structured data. The authors combine history, social sciences, and computer science to convert the correspondence repository into a machine‑processable form. This transformation is supported by an interdisciplinary strategy in which they weave together elements of effective content management, topic modelling, and social network analysis.
Presentato: 03 Ottobre 2023 | Accettato: 18 Gennaio 2024 | Pubblicato 22 Maggio 2025 | Lingua: en
Keywords Public correspondence • Structured data • Historical dataset • Digital infrastructure • Colonial Portuguese Empire
Copyright © 2025 Agata Bloch, Michał Bojanowski, Clodomir Santana, Demival Vasques Filho. This is an open-access work distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction is permitted, provided that the original author(s) and the copyright owner(s) are credited and that the original publication is cited, in accordance with accepted academic practice. The license allows for commercial use. No use, distribution or reproduction is permitted which does not comply with these terms.
Permalink http://doi.org/10.30687/978-88-6969-919-1/006