LANGUAGE IDENTIFICATION FROM PROSODY WITHOUT EXPLICIT FEATURES

Fred Cummins, Felix Gers, Jürgen Schmidhuber

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Scopus citations

Abstract

Most current language identification (LID) systems make little or no use of prosodic information, despite the importance of prosody in LID by humans. The greatest obstacle has been that of finding an appropriate feature set which captures linguistically relevant prosodic information. The only system to attempt LID entirely on the basis of prosodic variables uses a set of over 200 features which are selected and combined in a task-specific manner [12]. We apply a novel recurrent neural network model to the task of pairwise discrimination among languages. Network inputs are limited to delta-Fo and the first difference of the band limited amplitude envelope. Initial results are based on all pairwise combinations of English, German, Japanese, Mandarin and Spanish, with 90 speakers per language.
Original languageEnglish (US)
Title of host publication6th European Conference on Speech Communication and Technology, EUROSPEECH 1999
PublisherThe International Society for Computers and Their Applications (ISCA)
Pages371-374
Number of pages4
StatePublished - Jan 1 1999
Externally publishedYes

Fingerprint

Dive into the research topics of 'LANGUAGE IDENTIFICATION FROM PROSODY WITHOUT EXPLICIT FEATURES'. Together they form a unique fingerprint.

Cite this