Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project - Institut d’Histoire des Représentations et des Idées dans les Modernités Accéder directement au contenu
Article Dans Une Revue Digital Humanities Quarterly Année : 2021

Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project

Résumé

The ANTRACT project is a cross-disciplinary apparatus dedicated to the analysis of the French newsreel company Les Actualités Françaises (1945-1969) and its film productions. Founded during the liberation of France, this state-owned company filmed more than 20,000 news reports shown in French cinemas and throughout the world over its 24 years of activity. The project brings together research organizations with a dual historical and technological perspective. ANTRACT’s goal is to study the production process, the film content, the way historical events are represented and the audience reception of Les Actualités Françaises newsreels using innovative AI-based data processing tools developed by partners specialized in image, audio, and text analysis. This article focuses on the data processing apparatus and tools of the project. Automatic content analysis is used to select data, to segment video units and typescript images, and to align them with their archival description. Automatic speech recognition provides a textual representation and natural language processing can extract named entities from the voice-over recording; automatic visual analysis is applied to detect and recognize faces of well-known characters in videos. These multifaceted data can then be queried and explored with the TXM text-mining platform. The results of these automatic analysis processes are feeding the Okapi platform, a client-server software that integrates documentation, information retrieval, and hypermedia capabilities within a single environment based on the Semantic Web standards. The complete corpus of Les Actualités Françaises, enriched with data and metadata, will be made available to the scientific community by the end of the project.
Fichier principal
Vignette du fichier
antract_carrive_al_dhq21_200712.pdf (7.15 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03166755 , version 1 (11-03-2021)

Licence

Paternité - Pas de modifications

Identifiants

  • HAL Id : hal-03166755 , version 1

Citer

Jean Carrive, Abdelkrim Beloued, Pascale Goetschel, Serge Heiden, Antoine Laurent, et al.. Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project. Digital Humanities Quarterly, 2021, Special Issue on AudioVisual Data in DH, 15 (1). ⟨hal-03166755⟩
938 Consultations
111 Téléchargements

Partager

Gmail Facebook X LinkedIn More