HTR model for Latin and French Medieval Documentary Manuscripts (12th-15th)

TORRES AGUILAR, Sergio Octavio; Jolivet, Vincent

doi:10.5281/zenodo.7547438

No full text

Software (Computer developments)

HTR model for Latin and French Medieval Documentary Manuscripts (12th-15th)

TORRES AGUILAR, Sergio Octavio; Jolivet, Vincent

2023

Dataset

Permalink
https://hdl.handle.net/10993/59647

DOI
10.5281/zenodo.7547438

Files (0)Send to Details Statistics Bibliography Similar publications

Files

Full Text

No document available.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

handwritting text recognition; image-text model; IA applied to historical texts

Abstract :

[en] This HTR model operates in a multilingual environment (Latin and Old French) and it is able to recognize several Latin script families (mostly Textualis and Cursiva) in documents produced in ca. 12th - 15th centuries. During the evaluation the models shows an accuracy of 94.1% on the validation set and a CER (character error ratio) of about 0.12 to 0.17 on four external unseen datasets. A fine-tuning exercise using 10 ground-truth pages can raise these results to a CER between 0.06 to 0.10 respectively.

Disciplines :

Arts & humanities: Multidisciplinary, general & others

Author, co-author :

TORRES AGUILAR, Sergio Octavio ; University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Humanities (DHUM) > History

Jolivet, Vincent

Language :

English

Title :

HTR model for Latin and French Medieval Documentary Manuscripts (12th-15th)

Publication date :

January 2023

Technical description :

This is Handwritting Text Recognition model trained on a charters and registers dataset from the Late-medieval period (12th-15th). The model uses an CNN+RNN+CTC approach backed by kraken.

Focus Area :

Computational Sciences

Additional URL :

https://hal.science/hal-03892163

Data Set :

https://doi.org/10.5281/zenodo.7547438

Available on ORBilu :

since 12 January 2024

Statistics

Number of views

212 (1 by Unilu)

Number of downloads

0 (0 by Unilu)

More statistics

OpenAlex citations