Sébastien Le Maguer - Professional webpage

1 Short bio

After completing a Master of Science in Complex Systems and Algorithms at Université de Lille 1 (Lille, France) in 2008 and a PhD in text-to-speech (TTS) synthesis, I am currently a postdoctoral researcher at Saarland University, in Ingmar Steiner’s MMCI group. My work focuses on TTS (annotation, corpus building, descriptive features, statistical speech synthesis and corpus analysis) and information retrieval.

I am also an official maintainer of MaryTTS and Roots.

Keywords: TTS, HTS, Descriptive features, Corpus annotation, Evaluation

2 Research

2.1 Research experience

Period | Description | Team/group | Place | Supervisors
10/14 | Post-Doc | IS - Saarland University | Saarbrücken - Germany | Ingmar Steiner
02/14 - 09/14 | Post-Doc | LINKMEDIA - INRIA | Rennes - France | Vincent Claveau
09/13 - 12/13 | Post-Doc | CORDIAL - IRISA | Lannion - France | Damien Lolive / Nelly Barbot
09/11 - 08/13 | Research Assistant | CORDIAL - IRISA | Lannion - France | Olivier Boëffard / Nelly Barbot
10/08 - 07/13 | PhD | CORDIAL - IRISA | Lannion - France | Olivier Boëffard / Nelly Barbot

2.2 PhD

  • Title: Experimental evaluation of a statistical speech synthesis system, HTS, for French
  • Supervisors: Olivier Boëffard, Nelly Barbot
  • Defended on 2 July 2013
  • Prix de l’innovation du Trégor

The work presented in this thesis concerns text-to-speech (TTS) synthesis and, more particularly, statistical speech synthesis for French. We present an analysis of the impact of linguistic contextual factors on the speech synthesized by the HTS statistical speech synthesis system. To conduct the experiments, two objective evaluation protocols are proposed. The first one uses Gaussian mixture models (GMMs) to represent the acoustic space produced by HTS for a given contextual feature set. By using a constant reference set of natural speech stimuli, the GMMs can be compared with one another, and consequently so can the acoustic spaces generated by HTS. The second objective evaluation is based on pairwise distances between natural speech and synthetic speech generated by HTS. Results obtained with both protocols, and confirmed by subjective evaluations, show that using a large set of contextual factors does not necessarily improve the modeling and can be counterproductive for speech quality.

Keywords: Computer science, Speech processing, Text-to-Speech synthesis, HTS
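
As an illustration only, the two objective protocols summarized in the abstract above can be sketched in Python. This is a toy sketch and not the thesis implementation: the diagonal-covariance GMM representation, the use of the mean log-likelihood on the reference set as the comparison score, the Euclidean distance between time-aligned frames, and all function names and numeric values are assumptions made for the example.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as [(weight, mean, var), ...]."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    top = max(terms)  # log-sum-exp trick for numerical stability
    return top + math.log(sum(math.exp(t - top) for t in terms))


def mean_log_likelihood(reference, gmm):
    """Protocol 1 (assumed score): average log-likelihood of a fixed
    reference set of natural-speech vectors under one GMM."""
    return sum(gmm_log_likelihood(x, gmm) for x in reference) / len(reference)


def mean_frame_distance(natural, synthetic):
    """Protocol 2 (assumed metric): mean Euclidean distance between
    time-aligned frames of a natural and a synthetic utterance."""
    if len(natural) != len(synthetic) or not natural:
        raise ValueError("utterances must be non-empty and time-aligned")
    return sum(math.dist(n, s) for n, s in zip(natural, synthetic)) / len(natural)


# Toy values only; these are not experimental data.
reference = [[0.0, 0.0], [0.1, -0.1], [-0.2, 0.1]]
gmm_a = [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [0.2, 0.0], [1.0, 1.0])]
gmm_b = [(0.5, [3.0, 3.0], [1.0, 1.0]), (0.5, [4.0, 3.0], [1.0, 1.0])]

# Because both models are scored on the same reference set,
# the two scores are directly comparable.
score_a = mean_log_likelihood(reference, gmm_a)
score_b = mean_log_likelihood(reference, gmm_b)

distance = mean_frame_distance([[0.0, 0.0], [1.0, 1.0]], [[3.0, 4.0], [1.0, 1.0]])
```

With these toy values, the mixture centered near the reference vectors (`gmm_a`) scores higher than the distant one (`gmm_b`). A real experiment would replace the toy vectors with acoustic parameters extracted from natural speech and from HTS output.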

3 Teaching

3.1 Current (2016-2017)

Title | Description | Attachments
FLST | Presentation of the research group to coli students | slides
Statistical speech synthesis | | slides (to appear)

3.2 Previous

3.2.1 2015-2016

Title | Description | Attachments
Statistical speech synthesis | | slides

3.2.2 2014-2015

Title | Description | Attachments
Statistical speech synthesis | | slides

3.2.3 2012-2013

Title | Description | Attachments
Unix Programming | | (to appear)
Unix Usage | | (to appear)
Distributed Algorithms | | (to appear)

3.2.4 2011-2012

Title | Description | Attachments
Unix Programming | | CM1 CM2 TP1 TP2 TP3 TP3.2 TP4 TP5 TP6 TP7 TP8
Unix Usage | | CM TP1 TP2
Distributed Algorithms | | CM TP

4 Education

Year | Description
2008 - 2013 | Doctorate Degree in Computer Science, ENSSAT/Université de Rennes 1 (22, France)
2006 - 2008 | Master of Science - Complex Systems and Algorithms, Université de Lille 1 (59, France)
2005 - 2006 | Bachelor’s Degree in Computer Science - AI and Robotics, U.B.O. (29, France)
2003 - 2005 | DUT in Computer Science, IUT de Lannion (22, France)

5 Technical skills

  • Programming:
    • skilled: Perl, C, C++, Java, Python, PHP, Shell, MATLAB
    • fundamentals: Scheme/Lisp, Caml
  • Systems: Linux, Windows
  • Languages: English (written and spoken), French (native)

6 Publications

References

[19] Sébastien Le Maguer and Ingmar Steiner. The MaryTTS entry for the Blizzard Challenge 2016. In Blizzard Challenge proceedings, 2016.
[18] Sébastien Le Maguer, Bernd Möbius, and Ingmar Steiner. Toward the use of information density based descriptive features in HMM based speech synthesis. In Proceedings of the Speech Prosody conference, 2016.
[17] Sébastien Le Maguer, Bernd Möbius, and Ingmar Steiner. De l'utilisation de descripteurs issus de la linguistique computationnelle dans le cadre de la synthèse par HMM. In Proceedings of Journée d'Étude de la Parole (JEP), 2016.
[16] Jonathan Chevelu, Damien Lolive, Sébastien Le Maguer, and David Guennec. Se concentrer sur les différences : une méthode d'évaluation subjective efficace pour la comparaison de systèmes de synthèse. In Proceedings of Journée d'Étude de la Parole (JEP), 2016.
[15] Sébastien Le Maguer, Ingmar Steiner, and Bernd Möbius. Toward a speech synthesis guided by the modeling of unexpected events. In Proceedings of the workshop "Modeling variability in speech", 2015.
[14] Jonathan Chevelu, Damien Lolive, Sébastien Le Maguer, and David Guennec. How to compare TTS systems: A new subjective evaluation methodology focused on differences. In Proceedings of the International Conference on Speech Communication and Technology (Interspeech), 2015.
[13] Vincent Claveau, Thierry Hamon, Sébastien Le Maguer, and Natalia Grabar. Health consumer-oriented information retrieval. In Proceedings of the Medical Informatics Europe conference (MIE), 2015.
[12] Sébastien Le Maguer, Thierry Hamon, Natalia Grabar, and Vincent Claveau. Recherche d’information médicale pour le patient : Impact de ressources terminologiques. In Proceedings of COnférence en Recherche d’Information et Applications (CORIA), 2015.
[11] Vincent Claveau, Thierry Hamon, Natalia Grabar, and Sébastien Le Maguer. Repali participation to CLEF eHealth IR challenge 2014: leveraging term variation. In Proceedings of The ShARe/CLEF eHealth Challenge, 2014.
[10] Sébastien Le Maguer, Elisabeth Delais-Roussarie, Nelly Barbot, Mathieu Avanzi, Olivier Rosec, and Damien Lolive. Prosodic chunking algorithm for dictation with the use of speech synthesis. In Proceedings of the Speech Prosody conference, Dublin (Ireland), 2014.
[9] Hiyon Yoo, Sébastien Le Maguer, Elisabeth Delais-Roussarie, Nelly Barbot, and Damien Lolive. Évaluation d'un algorithme de chunking appliqué à la dictée. In Proceedings of Journée d'Étude de la Parole (JEP), Le Mans, 2014.
[8] Sébastien Le Maguer, Elisabeth Delais-Roussarie, Nelly Barbot, Mathieu Avanzi, Olivier Rosec, and Damien Lolive. Algorithme de découpage en groupes prosodiques pour la dictée par l'usage de synthèse vocale. In Proceedings of Journée d'Étude de la Parole (JEP), Le Mans, 2014.
[7] Sébastien Le Maguer, Nelly Barbot, and Olivier Boeffard. Evaluation of contextual descriptors for HMM-based speech synthesis in French. In Proceedings of the Speech Synthesis Workshop (SSW), Barcelona (Spain), 2013.
[6] The Roots toolkit, 2013. (APP submission).
[5] Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer, Damien Lolive, and Gaelle Vidal. Towards fully automatic annotation of audiobooks for TTS. In Proceedings of Language Resources and Evaluation Conference (LREC), Istanbul (Turkey), 2012.
[4] Sébastien Le Maguer, Nelly Barbot, and Olivier Boeffard. Évaluation segmentale du système de synthèse HTS pour le français. In Proceedings of Journée d'Étude de la Parole (JEP), Grenoble (France), 2012.
[3] Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer, Damien Lolive, and Gaelle Vidal. Vers une annotation automatique de corpus audio pour la synthèse de parole. In Proceedings of Journée d'Étude de la Parole (JEP), Grenoble (France), 2012.
[2] Nelly Barbot, Vincent Barreaud, Olivier Boeffard, Laure Charonnat, Arnaud Delhay, Sébastien Le Maguer, and Damien Lolive. Towards a versatile multi-layered description of speech corpora using algebraic relations. In Proceedings of the International Conference on Speech Communication and Technology (Interspeech), Florence (Italy), 2011.
[1] Pierre Alain, Nelly Barbot, Vincent Barreaud, Laurent Blin, Olivier Boeffard, Laure Charonnat, Ali Choumane, Arnaud Delhay, Sébastien Le Maguer, Damien Lolive, and Thierry Moudenc. A Multi-Agent Platform for Multimodal Pervasive Applications. In Proceedings of the Conference on the Networked and Electronic Media, Saint Malo (France), 2009.

Author: Sébastien Le Maguer

Created: 2016-10-29 Sat 18:36
