This documentation page applies to an outdated AMT version (1.3.0). Click here for the most recent page.

JOERGENSEN2011 - Speech-based envelope power spectrum (EPSM)

Usage

output = joergensen2011(x,y,fs_input,IO_param)

Input parameters

`x`	noisy speech mixture
`y`	noise alone
`fs`	sample rate in Hz
`IO_param`	(optional) vector with parameters for the ideal observer that converts the SNRenv to probability of correct, assuming a given speech material. It contains four parameters of the ideal observer formatted as [k q m sigma_s].

Output parameters

output

struct containing the SNRenv and the probability of correct given the SNRenv. This field is only included if IO_param is specified. Its calculation requires the Statistics ToolBox.

Description

output = joergensen2011(x,y,fs_input,IO_param) returns the output of signal-to-noise envelope-power (SNRenv) ratio using the multi-resolution speech-based envelope spectrum model (mr-sEPSM) described in Joergensen et al. (2013)

The model consists of the following stages:

1 A gammatone bandpass filterbank to simulate the auditory filters

2 An envelope extraction stage via the Hilbert Transform

3 A modulation filterbank

4 Computation of the long-term envelope power (output.SNRenv)

5 A decision mechanism based on a statistically ideal observer (output.P_correct)

References: