This documentation page applies to an outdated AMT version (1.2.0). Click here for the most recent page.

HAUTH2020_SRMR - Computes the speech-to-reverberation modulation energy ratio of the given signal

Usage

[ratio, energy] = hauth2020_srmr(s, fs, 'fast', 0, 'norm', 0, 'minCF', 4, 'maxCF', 128)

Input parameters

`s`	either the path to a WAV file or an array containing a single-channel speech sentence.
`fs`	sampling rate of the data in `s`. If `s` is the path to a WAV file, this parameter has to be omitted.

Output parameters

`ratio`	the SRMR score.
`energy`	a 3D matrix with the per-frame modulation spectrum extracted from the input.

Description

hauth2020_srmr calculates the speech-to-reverberation modulation energy ratio using the modulation filterbank described by Ewert and Dau in "Characterizing frequency selectivity for envelope fluctuations" (2000).

The code has been derived from the SRMR toolbox where it has been published under the MIT license.

Optional parameters:

`'fast',F`	flag to activate (`F = 1`)/deactivate (`F = 0`) the fast implementation. The default is `'fast', 0` (this can be omitted).
`'norm',N`	flag to activate (`N = 1`)/deactivate (`N = 0`) the normalization step in the modulation spectrum representation, used for variability reduction. The default is `'norm', 0`.
`'minCF',cf1`	value of the center frequency of the first filter in the modulation filterbank. The default value is 4 Hz.
`'maxCF',cf8`	value of the center frequency of the first filter in the modulation filterbank. The default value is 128 Hz if the normalization is off and 30 Hz if normalization is on.

THE AUDITORY MODELING TOOLBOX

HAUTH2020_SRMR - Computes the speech-to-reverberation modulation energy ratio of the given signal

Usage

Input parameters

Output parameters

Description