Electerical Engineering, University of Mazandaran
Electrical and Computer Engineering, Babol Noshirvani University of Technology
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet Packet Transform (WPT). Our algorithm utilizes the differences between spectral distribution of human speech (voice) and general noise. First, the algorithm performs wavelet transform on the signal resulting in its decomposition into subbands using coefficients of WPT, and then it detects the voice within the signal by comparing the subband energy of components between detail and approximation coefficients. Computer simulation results are given to illustrate the effectiveness of our new VAD algorithms.