Skip to content

Gallic(orpor)a

The Gallic(orpor)a project develops a pipeline for mass digitisation of historical documents. It focuses on French documents written between the 15th and the 18th centuries, may they be manuscripts, incunabula or prints. It focuses on three main tasks: document layout analysis, handwritten text recognition and linguistic annotation (lemmata, POS tags and morphology).

Project investigators

  • Simon Gabay, Université de Genève
  • Ariane Pinche, PSL | École nationale des chartes
  • Jean-Baptiste Camps, PSL | École nationale des chartes
  • Pedro Ortiz Suárez, Inria
  • Rachel Bawden, Inria
  • Benoît Sagot, Inria
  • Laurent Romary, Inria

Project participants

  • Nicola Carboni, Université de Genève
  • Kelly Christensen, INRIA/PSL | École nationale des chartes
  • Noé Leroy, PSL | École nationale des chartes
  • Matenia Vlachou, PSL | École nationale des chartes
  • Eliott Fabert, PSL | École nationale des chartes
  • Johannes Laroche, PSL | École nationale des chartes
  • Maeva Nguyen, PSL | École nationale des chartes

Past members

  • Claire Jahan, PSL | École nationale des chartes
  • Juliette Janès, PSL | École nationale des chartes
  • Alexandre Bartz, Sorbonne Université

You feel find on this website all the information about our work. For data and code, please have a look at our GitHub repo.

logo INRIA logo UniGE logo ENC logo BNF