Grailles Bénédicte, Université Angers
Aït El Mekki Touria, Université Angers
The Pêle-Mél project, supported by the French Ministry of Culture and led by the University of Angers, has tested strategies for exploring corpora of French emails in order to promote their evaluation and exploitation.
The Pêle-Mél project addressed the issue of electronic mailboxes from a resolutely interdisciplinary perspective, combining expertise in artificial intelligence (Leria laboratory) and archival science (Temos laboratory), and in collaboration with the archives mission of the Ministry of Health and the École des Chartes.
Pêle-mél’s objective is to help the archivist to interpret and contextualize an individual mailbox or several mailboxes by categorizing and classifying the mails. To do this, it has developed a method that involves the extraction of named entities, mainly individuals and companies, the extraction of terms, the classification of terms and messages via word embeddings and document embeddings, i.e. by mobilizing a number of tools and techniques of natural language processing. The method is based on supervised machine learning and uses neural network models. It takes into account the linguistic specificities of French.
The program has developed two prototypes, one to classify, the other to query metadata and explore content. Although the experimentation was based on a specific field – the Ministry of Health – the results are generalizable to other environments.
Event Timeslots (1)
Day Two – June 14
Grailles Bénédicte, Aït El Mekki Touria