digital humanities; NLP for latin; Named entities recognition
Precision for document type :
Review article
Disciplines :
History
Author, co-author :
Chastang, Pierre
TORRES AGUILAR, Sergio Octavio ; University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Humanities (DHUM) > History
Tannier, Xavier
External co-authors :
yes
Language :
English
Title :
A Named Entity Recognition Model for Medieval Latin Charters
Abacha et al. 2011 Abacha, A. B, Zweigenbaum, Pierre. (2011). “Medical entity recognition: A comparison of semantic and statistical methods.” In Proceedings of BioNLP 2011 Workshop. Association for Computational Linguistics, p. 56-64.
Bange 1984 Bange, F. (1984). “L'ager et la villa: structures du paysage et du peuplement dans la région mâconnaise à la fin du Haut Moyen Age (IXe-XIe siècles).” Annales. Economies, sociétés, civilisations, 39(03), pp.529-569.
Barthélemy 1997 Barthélemy, D. (1997). “ La mutation de l'an mil, a-t-elle eu lieu?” En Annales. Histoire, Sciences Sociales. Cambridge University Press, p. 767-777.
Billy 1995 Billy, P. (1995). “Nommer en Basse-Normandie aux Xle-XVe siècles.” Cahier des Annales de Normandie, 26(1), pp.223-232.
Bollacker at al. 2008 Bollacker, K., Evans, C., Paritosh, P., Sturge, T. and Taylor, J. (2008). “Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge.” Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, ACM, pp. 1247-1250.
Bourin 1996 Bourin, Monique. (1996). “France du Midi et France du Nord: deux systèmes anthroponymiques? L’anthroponymie document de l’histoire sociale des mondes méditerranéens médiévaux.” Publications de l'École française de Rome, 226(1), pp.179-202.
Brooke et al. 2016 Brooke, J., Hammond, A., Baldwin, T. (2016). “Bootstrapped text-level named entity recognition for literature.” In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 344-350.
Brunner 2009 Brunner, T. (2009). “Le passage aux langues vernaculaires dans les actes de la pratique en Occident.” Le Moyen Age, vol. 115, no 1, pp. 29-72.
Budassi et al. 2016 BUDASSI, M., PASSAROTTI, M. (2016). “Nomen Omen. Enhancing the Latin Morphological Analyser Lemlat with an Onomasticon.” In Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities. pp. 90-94.
Chastang 2007 Chastang, P. (2007). “Du locus au territorium. Quelques remarques sur l’évolution des catégories en usage dans le classement des cartulaires méridionaux au XIIe siècle.” In Annales du Midi: revue archéologique, historique et philologique de la France méridionale, Tome 119, N°260, pp. 457-474.
Cohen et al. 2004 Cohen, W. W., and Sarawagi, S. (2004). “Exploiting dictionaries in named entity extraction: combining semi-markov extraction processes and data integration methods.” In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 89-98.
Corrarati 1994 Corrarati, P. (1994). “Nomi, individui, famiglie a Milano nel secolo XI. Mélanges de l'Ecole française de Rome.” Moyen-Age, 106(2), pp. 459-474.
Curran et al. 2003 Curran, J. R., Clark, S. (2003). “Language independent NER using a maximum entropy tagger.” In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, p. 164-167.
Duby 1953 Duby, Georges (1953). La société aux XIe et XIIe siècles dans la région mâconnaise. Bibliothèque Générale de l’Ecole Pratique des Hautes Etudes, 6e section, Paris.
Durrell 2007 Durrell, M. (2007). “GerManC: a historical corpus of German 1650-1800: Full Research Report.” ESRC End of Award Report, RES-000-22-1609.
Eger et al. 2015 Eger, S., Vor der Brück, T., Mehler, A. (2015). “Lexicon-assisted tagging and lemmatization in Latin: A comparison of six taggers and two lemmatization methods.” In Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH 2015), pp. 105-113.
Ehrmann et al. 2016 Ehrmann, Maud, et al. (2016). Diachronic “Evaluation of NER Systems on Old Newspapers.” In Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)). “Bochumer Linguistische Arbeitsberichte”, p. 97-107.
Elson et al. 2010 Elson, D. K., Dames, N., McKeown, K. R. (2010). “Extracting social networks from literary fiction.” In Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics, p. 138-147.
Eltyeb et al. 2014 Eltyeb, S. and Salim, N. (2014). “Chemical named entities recognition: a review on approaches and applications.” Journal of cheminformatics, vol. 6, no 1, p. 17.
Erdmann et al. 2016 Erdmann, A., Brown, C., Joseph, B., Janse, M., Ajaka, P., Elsner, M., and de Marneffe, M. C. (2016). “Challenges and Solutions for Latin Named Entity Recognition.” LT4DH 2016, 85.
Frontini et al. 2016 Frontini, F., Brando, C., Riguet, M., Jacquot, C., and Jolivet, V. (2016). “Annotation of Toponyms in TEI Digital Literary Editions and Linking to the Web of Data,” MATLIT: Materialidades da Literatura, 4(2), pp. 49-75.
Grover et al. 2008 Grover, C., Givon, S., Tobin, R., and Ball, J. (2008). “Named Entity Recognition for Digitised Historical Texts.” In LREC.
Guyotjeannin 1997 Guyotjeannin, Olivier (1997). “‘Penuria scriptorum’: le mythe de l’anarchie documentaire dans la France du Nord (Xe-première moitié du XIe siècle),” Bibliothèque de l'école des chartes. Tome 155, livraison 1, pp. 11-44.
Hoffart et al. 2013 Hoffart, J., Suchanek, F. M., Berberich, K. and Weikum, G. (2013). “YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia Artificial Intelligence,” special issue on Wikipedia and Semi-Structured Resources.
Iogna-Prat et al. 2013 Iogna-Prat, D., et al. (2013), Cluny: les moines et la société au premier âge féodal. Presses universitaires de Rennes. Collection « Art et Société ».
Klein et al. 2014 Klein, E., Alex, B., Clifford, J. (2014). “Bootstrapping a historical commodities lexicon with SKOS and DBpedia.” In Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH), pp. 13-21.
Lafferty et al. 2001 Lafferty, J., McCallum, A., and Pereira, F. (2001). “Conditional random fields: Probabilistic models for segmenting and labeling sequence data.” In Proceedings of the eighteenth international conference on machine learning, ICML, Vol. 1, pp. 282-289.
Lavergne et al. 2010 Lavergne, T., Cappé, O., and Yvon, F. (2010), “Practical very large scale CRFs.” In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, pp. 504-513.
Lehmann et al. 2013 Lehmann, J.; Isele, R.; Jakob, M.; Jentzsch, A.; Kontokostas, D.; Mendes, P. N.; Hellmann, S.; Morsey, M.; van Kleef, P.; Auer, Sö. & Bizer, C. “DBpedia-A Large-scale, Multilingual Knowledge Base Extracted” from Wikipedia Semantic Web Journal, 2013
Li et al. 2016 Li, H., and Shi, J. (2016). “Linking Named Entity in a Question with DBpedia Knowledge Base.” In Joint International Semantic Technology Conference, Springer International Publishing, pp. 263-270.
Magnani 2002 Magnani, Eliana. (2002). “Le don au moyen âge.” Revue du MAUSS, no 1, pp. 309-322.
Mosallam et al. 2014 Mosallam, Y., Abi-Haidar, A., Ganascia, J. (2014). “Unsupervised named entity recognition and disambiguation: an application to old French journals.” In Industrial Conference on Data Mining. Springer, Cham, pp. 12-23.
Nadeau et al. 2007 Nadeau, D., Sekine, S. (2007). “A survey of named entity recognition and classification.” Lingvisticae Investigationes, vol. 30, no 1, pp. 3-26.
Neelakantan et al. 2015 Neelakantan, A., Collins, M. (2015). Learning dictionaries for named entity recognition using minimal supervision. arXiv preprint arXiv:1504.06650.
Nouvel et al. 2016 Nouvel, D., Ehrmann, M., Rosset, S. (2016). Named Entities for Computational Linguistics. ISTE, cognitive science series.
Passarotti 2014 Passarotti, Marco. (2014). “From Syntax to Semantics. First Steps Towards Tectogrammatical Annotation of Latin.” In Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH). pp. 100-109.
Plank 2016 Plank, B. (2016). What to do about non-standard (or non-canonical) language in NLP. arXiv preprint arXiv:1608.07836.
Pochampally et al. 2016 Pochampally, Y., Karlapalem, K., Yarrabelly, N. (2016), “Semi-Supervised Automatic Generation of Wikipedia Articles for Named Entities.” In Wiki@ ICWSM.
Rayson et al. 2017 Rayson, P., et al. (2017). “A deeply annotated testbed for geographical text analysis: The Corpus of Lake District Writing.” In Proceedings of the 1st ACM SIGSPATIAL Workshop on Geospatial Humanities. ACM, pp. 9-15.
Rizzo et al. 2011 Rizzo, G., Troncy, R. (2011). “Nerd: evaluating named entity recognition tools in the web of data.” In Workshop on Web Scale Knowledge Extraction (WEKEX’11).
Rosé 2007 Rosé, I. (2007). “Panorama de l’écrit diplomatique en Bourgogne: autour des cartulaires (XIe-XVIIIe siècles).” Bulletin du centre d’études médiévales d’Auxerre, BUCEMA, (11).
Sopena 1996 Sopena, P. M. (1996). “L'anthroponymie de l'Espagne chrétienne entre le IXe et le XIIe siècle.” L’anthroponymie document de l’histoire sociale des mondes méditerranéens médiévaux, Publications de l'École française de Rome, 226(1), pp. 63-85.
Wallach 2004 Wallach, H. M. (2004), “Conditional random fields: An introduction.” Technical Reports (CIS), pp. 22.
Won et al. 2018 Won, M., Murrieta-Flores, P. and Martins, B. (2018) “Ensemble Named Entity Recognition (NER): Evaluating NER Tools in the Identification of Place Names in Historical Corpora.” Front. Digit. Humanit. 5:2. doi: 10.3389/fdigh.2018.00002
Zimmermann 2003 Zimmermann 2003 Zimmermann, M. (2003), Écrire et lire en Catalogne: IXe-XIIe siècle (Vol. 1). Casa de Velázquez, Madrid, pp.251-284.