Le Coz, Maxime and Pinquier, Julien and André-Obrecht, Régine Superposed Speech Localisation using Frequency Tracking. (2013) In: 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), 25 August 2013 - 29 August 2013 (Lyon, France).
![]() |
(Document in English)
PDF (Publisher's version) - Depositor and staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader 2MB |
Official URL: http://www.isca-speech.org/archive/interspeech_2013/i13_0714.html
Abstract
On this paper we present a new approach for the localisation of superposed speech areas. The system is based on the frequency tracking of speech segments following the evolution of the main amplitude frequencies and uses no learning of acoustic or prosodic models. The set of trackings of the frequencies are then grouped together using a distance based on the harmonicity, each group being the production of a single speaker. The co-occurrence of different harmonic groups is then used as a consequence of the presence of multiple speakers. Our method has been evaluated on the data of the French ANR evaluation campaign ETAPE, showing the usability of this approach.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Audience (conference): | International conference proceedings |
Uncontrolled Keywords: | |
Institution: | Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE) French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE) Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE) Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE) Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE) |
Laboratory name: | |
Statistics: | download |
Deposited On: | 16 Oct 2015 08:20 |
Repository Staff Only: item control page