Blind Audio Source Separation (Bass): An Unsuperwised Approach

Page 1

Int. Journal of Electrical & Electronics Engg.

Vol. 2, Spl. Issue 1 (2015)

e-ISSN: 1694-2310 | p-ISSN: 1694-2426

Blind Audio Source Separation (Bass): An Unsuperwised Approach 1

2

Naveen Dubey1, Rajesh Mehra2

ME Scholar, Dept. Of Electronics, NITTTR, Chandigarh, India Associate Professor, Dept. Of Electronics, NITTTR, Chandigarh, India 1 naveen_elex@rediffmail.com

ABSTRACT: Audio processing is an area where signal separation is considered as a fascinating works, potentially offering a vivid range of new scope and experience in professional and personal context. The objective of Blind Audio Source Separation is to separate audio signals from multiple independent sources in an unknown mixing environment. This paper addresses the key challenges in BASS and unsupervised approaches to counter these challenges. Comparative performance analysis of Fast-ICA algorithm and Convex Divergence ICA for Blind Source Separation is presented with the help of experimental result. Result reflects Convex Divergence ICA with α=-1 gives more accurate estimate in comparison of Fast ICA . In this paper algorithms are considered for ideal mixing situation where no noise component taken in to account.

Fig.1 BASS System Diagram

Index Terms: BASS, ICA, Fast-ICA, SIR, Convex Divergence, Entropy, Unsupervised Learning.

And the model for un-mixing using BASS

I. INTRODUCTION Blind separation of at a time active audio sources is very interesting area for researchers and is a popular task in field of audio signal processing motivated by many emerging applications , like distant-talking speech communication, human-machine applications, in intelligence for national security in call interception, handfree and so on[1]. The key objective of BASS is to retrieve ‘p’ audio source from a convolutive mixture of audio signals captured by ‘m’ microphone sensors, can be mathematically represented as.

Where:  ʘ denotes matrix convolution  t is the sample index  S(t)= [S1(t). . . . .Sp(t)]T is the vector of ‘p’ sources.  X(t)= [X1(t). . . . Xm(t)]T is observed signal from ‘m’ microphones.

p Mij 1

xi ( n )  

h

j 0 k 0

ij

(k ) s j n  k , i  1,....., m(1)

Where: Xi(n) : ‘m’ recorded audio (observed) signals Sj(n) : ‘p’ original (audio) signals. The original signals Sj(n) are unknown in “blind” scenario. In actual sense, the mixing system is a multi-input multioutput (MIMO) linear filter with source microphone impulse response hij, each of length Mij,[2]. The BASS system can be understood by another mathematical model of matrix convolution [3]. As the model for mixing X(t) = A(t) ʘ S(t) (2)

29

Sˆ (t )  W (t ) ʘ X(t)

(3)

Sˆ (t ) =[ Sˆ1(t ) . . . . Sˆp(t ) ]T is the output of

reconstructed sources.  A(t) is the M X P X L mixing array,  W(t) is the P X M X L un mixing array, A(t) and W(t) can also be considered as M X P and P X M matrices, where each element is an FIR filter of length L, [4]. Previously discussed model is a an ideal representation of BASS model where number of audio sources is equal to number of microphone sensors, termed as complete model or critically determined model. The modelling can be more complex for more practicability of application, as if number of microphone sensors more than number of audio source (m > p), termed as overdetermined or over complete model. If number of sources are greater than number of microphone sensors (p > m) , named as underdertermined or under complete model [5,6]. Inclusion of noise component and delay between microphones, echo makes BASS problem more complex.ICA is a dominant algorithm for blind source separation problem and based on metrics of likelihood function, negentropy, kurtosis and NITTTR, Chandigarh EDIT-2015


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
Blind Audio Source Separation (Bass): An Unsuperwised Approach by IJEEE (Elixir Publications) - Issuu