Affordable Access

Access to the full text

An adaptive a priori SNR estimator for perceptual speech enhancement

Authors
  • Nahma, Lara
  • Yong, Pei Chee
  • Dam, Hai Huyen
  • Nordholm, Sven
Type
Published Article
Journal
EURASIP Journal on Audio, Speech, and Music Processing
Publisher
Springer International Publishing
Publication Date
Jun 07, 2019
Volume
2019
Issue
1
Identifiers
DOI: 10.1186/s13636-019-0150-3
Source
Springer Nature
Keywords
License
Green

Abstract

In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed weighting with the value close to one, which makes it slow in following the onsets of speech utterances. The proposed SNR estimator provides a means to solve this issue by employing an adaptive weighting factor. This allows an improved tracking of onset changes in the speech signal. As a consequence, it results in better preservation of speech components. This adaptive technique ensures that the weighting between the modified decision-directed a priori estimate and the maximum likelihood a priori estimate is a function of the speech absence probability. The estimate of the speech absence probability is modeled by a sigmoid function. Furthermore, a critical band mapping for the short-time Fourier transform analysis-synthesis system is utilized in the speech enhancement to achieve less musical noise. In addition, to evaluate the ability of the a priori SNR estimation method in preserving speech components, we proposed a modified objective measurement known as modified hamming distance. Evaluations are performed by utilizing both objective and subjective measurements. The experimental results show that the proposed method improves the speech quality under different noise conditions. Moreover, it maintains the advantage of the DD approach in eliminating the musical noise under different SNR conditions. The objective results are supported by subjective listening tests using 10 subjects (5 males and 5 females).

Report this publication

Statistics

Seen <100 times