Nzeuhang Yannick Yomie ; Yonta Paulin Melatagia ; Lecouteux Benjamin - Application of the multilingual acoustic representation model XLSR for the transcription of Ewondo

arima:13621 - Revue Africaine de Recherche en Informatique et Mathématiques Appliquées, October 28, 2024, Volume 42 - Special issue CRI 2023 - 2024 - https://doi.org/10.46298/arima.13621
Application of the multilingual acoustic representation model XLSR for the transcription of Ewondo

Authors: Nzeuhang Yannick Yomie 1; Yonta Paulin Melatagia 1; Lecouteux Benjamin 2

Recently popularized self-supervised models appear to offer a solution to the problem of low data availability through parsimonious transfer learning. We investigate the effectiveness of these multilingual acoustic models, in this case wav2vec 2.0 XLSR-53 and wav2vec 2.0 XLSR-128, for the transcription of the Ewondo language (spoken in Cameroon). The experiments were conducted on 11 minutes of speech built from 103 read sentences. Despite the strong generalization capacity of multilingual acoustic models, preliminary results show that the distance between the languages embedded in XLSR (English, French, Spanish, German, Mandarin, ...) and Ewondo strongly impacts the performance of the transcription model. The best results obtained are a word error rate (WER) of around 69% and a character error rate (CER) of 28.1%. These preliminary results are analyzed and interpreted in order to propose effective avenues for improvement.
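For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how WER and CER are commonly computed: the Levenshtein edit distance between reference and hypothesis, taken over words or characters, divided by the reference length. It is not the authors' evaluation code. The function names and the English example strings are illustrative assumptions, and no text normalization (casing, punctuation, tone marks) is applied.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (illustrative helper)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    # Made-up English strings, used only to exercise the functions.
    reference = "the cat sat on the mat"
    hypothesis = "the cat sit on mat"
    print(f"WER = {wer(reference, hypothesis):.3f}")
    print(f"CER = {cer(reference, hypothesis):.3f}")
```

Because a single wrong character makes the whole word count as an error, WER is typically much higher than CER on the same output, which is consistent with the gap between the two figures reported above.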


Volume: Volume 42 - Special issue CRI 2023 - 2024
Published on: October 28, 2024
Accepted on: September 9, 2024
Submitted on: May 20, 2024
Keywords: Low resource language, Self-supervised model, XLSR, Transcription, Ewondo, [INFO]Computer Science [cs], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Funding:
    Source : HAL
  • European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 101007666; Funder: European Commission
