A simple and computationally efficient algorithm for real-time blind source separation of speech mixtures

Tarig Ballal*, Nedelko Grbic, Abbas Mohammed

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

1 Scopus citation

Abstract

In this paper we exploit the amplitude diversity provided by two sensors to achieve blind separation of two speech sources. We propose a simple and highly computationally efficient method for separating sources that are W-disjoint orthogonal (W-DO), that is, sources whose time-frequency representations are disjoint sets. The Degenerate Unmixing and Estimation Technique (DUET), a powerful and efficient method that exploits the W-disjoint orthogonality property, requires extensive computations for maximum likelihood parameter learning. Our proposed method avoids all the computations required for parameter estimation by assuming that the sources are "cross high-low diverse (CH-LD)", an assumption that is explained later and that can be satisfied by exploiting the sensor settings/directions. With this assumption and the W-disjoint orthogonality property, two binary time-frequency masks that extract the original sources from one of the two mixtures can be constructed directly from the amplitude ratios of the time-frequency points of the two mixtures. The method works very well when tested with both artificial and real mixtures. Its performance is comparable to that of DUET, and it requires only 2% of the computations required by DUET. Moreover, it is free of the convergence problems that lead to poor signal-to-interference ratios (SIR) in the first parts of the signals. As with all binary masking approaches, the method suffers from artifacts that appear in the output signals.
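
The mask construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not define CH-LD, so the sketch assumes it means that one source is stronger at sensor 1 and the other is stronger at sensor 2. Under that assumption, and given W-disjoint orthogonality, comparing the magnitude ratio of the two mixtures against a threshold of 1 assigns each time-frequency point to one source. The function names, the threshold value, and the `eps` guard are hypothetical choices made for illustration, and the inputs are assumed to be complex time-frequency arrays (for example, STFTs) of the two mixtures.

```python
import numpy as np


def amplitude_ratio_masks(X1, X2, threshold=1.0, eps=1e-12):
    """Return two complementary binary masks from mixture magnitudes.

    X1, X2: complex time-frequency arrays of the two sensor mixtures.
    threshold=1.0 is an assumption: a point is assigned to source A
    when it is louder at sensor 1 than at sensor 2.
    """
    ratio = np.abs(X1) / (np.abs(X2) + eps)  # eps avoids division by zero
    mask_a = ratio > threshold
    mask_b = ~mask_a
    return mask_a, mask_b


def separate_from_first_mixture(X1, X2, threshold=1.0):
    """Apply both masks to the first mixture only, as the abstract describes."""
    mask_a, mask_b = amplitude_ratio_masks(X1, X2, threshold)
    return mask_a * X1, mask_b * X1


# Toy data: two synthetic sources that never share a time-frequency point.
rng = np.random.default_rng(0)
shape = (8, 6)  # (frequency bins, time frames), arbitrary
owner = rng.random(shape) < 0.5
S = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
S1 = np.where(owner, S, 0)
S2 = np.where(~owner, S, 0)

# Assumed mixing gains: each source is attenuated by 0.5 at the far sensor.
X1 = S1 + 0.5 * S2
X2 = 0.5 * S1 + S2

est_a, est_b = separate_from_first_mixture(X1, X2)
print(np.allclose(est_a, S1), np.allclose(est_b, 0.5 * S2))
```

On this idealized toy mixture the first estimate recovers source 1 and the second recovers source 2 as it appears in the first mixture (scaled by the assumed gain of 0.5). Real speech is only approximately W-disjoint orthogonal, which is consistent with the masking artifacts the abstract mentions.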

Original language: English (US)
Title of host publication: SIGMAP 2006 - International Conference on Signal Processing and Multimedia Applications, Proceedings
Pages: 105-109
Number of pages: 5
State: Published - 2006
Externally published: Yes
Event: International Conference on Signal Processing and Multimedia Applications, SIGMAP 2006 - Setubal, Portugal
Duration: Aug 7 2006 to Aug 10 2006

Publication series

Name: SIGMAP 2006 - International Conference on Signal Processing and Multimedia Applications, Proceedings

Other

Other: International Conference on Signal Processing and Multimedia Applications, SIGMAP 2006
Country/Territory: Portugal
City: Setubal
Period: 08/7/06 to 08/10/06

Keywords

  • BSS
  • Blind source separation
  • Speech analysis
  • Speech enhancement
  • Speech synthesis

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
