T. Bluche, S. Hamel, C. Kermorvant, J. Puigcerver, D. Stutzmann et al., Preparatory kws experiments for large-scale indexing of a vast medieval manuscript collection in the himanis project, 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol.01, pp.311-316, 2017.
URL : https://hal.archives-ouvertes.fr/halshs-01853682

L. George, C. Campbell, and . Moseley, Jean-Baptiste Camps, Thibault Clérice, Mike Kestemont, and Enrique Manjavacas. Pandora, a (language independent) tagger lemmatizer for latin and the vernacular, 2012.

J. Camps, T. Clérice, and A. Pinche, Stylometry for noisy medieval data: Evaluating paul meyer's hagiographic hypothesis, DH2019, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02182737

J. Camps, A. Cochet, E. Albarran, L. Ing, P. L. Jean-baptiste-camps et al., Geste: un corpus de chansons de geste, 2016.

K. Ceynowa, Monumenta Germanica Historica, 2019.

X. Chen, X. Qiu, C. Zhu, P. Liu, and X. Huang, Long short-term memory neural networks for chinese word segmentation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.1197-1206, 2015.

. Thibault-clérice, Ponteineptique/boudams: Article release, 2019.

. Thibault-clérice, Ponteineptique/mufidecode: v0.1.0, 2019.

G. R. Crane, T. Clérice, L. Cerrato, B. Almas, N. Jovanovi? et al.,

, , 2019.

P. Depreux, M. Munson, M. Pica, and C. Faye, Formulae -litterae -chartae, 2019.

M. Grossel, Vie en prose romane de saint thibaut, d'après le manuscrit fr. 23686 de la bibliothèque nationale de france, 2019.

S. Hochreiter, J. Schmidhuber, ;. Chu-ren, T. Huang, P. Yo et al., A realistic and robust model for chinese word segmentation, Grant Jenks. Wordsegment (1.3.1), vol.9, pp.1735-1780, 1997.

, Alexei Lavrentiev. Corpus BFMMSS, 2019.

Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998.

E. Manjavacas, T. Clérice, and M. Kestemont, , 2019.

C. Marchello-nizia, A. Lavrentiev, I. Vedrenne-fajolles, and S. Heiden, Queste du graal d'après bibliothèque municipale de lyon, ms. arts 77, 2019.

T. Oliphant, NumPy: A guide to NumPy, 2006.

A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury et al., Pytorch: An imperative style, highperformance deep learning library, Advances in Neural Information Processing Systems, vol.32, pp.8024-8035, 2019.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in python, Journal of machine learning research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

A. Pinche, Édition nativement numérique des oeuvres hagiographiques 'Li Seint Confessor' de Wauchier de Denain, d'après le manuscrit 412 de la bibliothèque nationale de France. 40 ans du laboratoire du CIHAM et de la création du pôle de Lyon de l'EHESS, 2017.

A. Pinche, C. Andrieux, L. Vieillon, M. Morillon, M. Schmied et al., Exercices TEI du master Technologies Numériques Appliquéesà l'Histoire, 2019.

C. R. Sneddon, Old french corpus, 2019.

M. Straka and J. Straková, Tokenizing, pos tagging, lemmatizing and parsing ud 2.0 with udpipe, Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp.88-99, 2017.

D. Stutzmann, Words as graphic and linguistic structures: word spacing in psalm 101 domine exaudi orationem meam (11th-15th c.). In 13e symposium annuel de la Société Internationale des Médiévistes, 2016.

B. Trevett, Pytorch seq2seq, 2019.

C. Tse, M. J. Yap, Y. Chan, W. P. Sze, C. Shaoul et al., The chinese lexicon project: A megastudy of lexical decision performance for 25,000+ traditional chinese two-character compound words, Behavior Research Methods, vol.49, issue.4, pp.1503-1519, 2017.

F. Wahlberg, M. Dahllöf, L. Mårtensson, and A. Brun, Spotting words in medieval manuscripts, Studia Neophilologica, vol.86, issue.sup1, pp.171-186, 2014.

C. Witschel, G. Alföldy, M. S. James, F. Cowey, B. Feraudi-gruénais et al., Epigraphic Database Heidelberg, 2019.

C. Yu, S. Wang, and J. Guo, Learning chinese word segmentation based on bidirectional gru-crf and cnn network model, International Journal of Technology and Human Interaction (IJTHI), vol.15, issue.3, pp.47-62, 2019.

, Lecture,écriture et morphologie latines en irlande aux viiè et viiiè siècles. Archivum Latinitatis Medii Aevi-Bulletin du Cange (ALMA), 1998.