Florentin Flambeau Jiechieu Kameni ; Norbert Tsopze - Approche hiérarchique d’extraction des compétences dans des CVs en format PDF

arima:4964 - Revue Africaine de Recherche en Informatique et Mathématiques Appliquées, October 3, 2019, Volume 32 - 2019 - 2021 - https://doi.org/10.46298/arima.4964
Approche hiérarchique d’extraction des compétences dans des CVs en format PDF

Authors: Florentin Flambeau Jiechieu Kameni ; Norbert Tsopze

    The aim of this work is to use a hybrid approach to extract CVs' competences. The extraction approach for competences is made of two phases: a segmentation into sections phase within which the terms representing the competences are extracted from a CV; and a prediction phase that consists from the features previously extracted, to foretell a set of competences that would have been deduced and that would not have been necessary to mention in the resume of that expert. The main contributions of the work are two folds : the use of the approach of the hierarchical clustering of a résume in section before extracting the competences; the use of the multi-label learning model based on SVMs so as to foretell among a set of skills, those that we deduce during the reading of a CV. Experimentation carried out on a set of CVs collected from an internet source have shown that, more than 10% improvement in the identification of blocs compared to a model of the start of the art. The multi-label competences model of prediction allows finding the list of competences with a precision and a reminder respectively in an order of 90.5 % and 92.3 %. .

    Volume: Volume 32 - 2019 - 2021
    Published on: October 3, 2019
    Accepted on: October 3, 2019
    Submitted on: November 12, 2018
    Keywords: Skill Gap,Resume,Skills Extraction,Multi-label classification,Classification Multi-label,Skill Gap,CV,Extraction des Compétences,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]


    Consultation statistics

    This page has been seen 565 times.
    This article's PDF has been downloaded 613 times.