In order to describe the quality of the models available in the AMT, we rate the implementation of every model by considering its source code and documentation. We also rate the models in terms of their verification, i.e., we rate the results of the implementation versus the results shown in the corresponding publication. The comparison is done within the experiments implemented in the exp_ functions. In the best case, the experiments produce the same results as in the publication - up to some minor layout issues in the graphical representations.
The following table provides an overview of the available models, their documentation, code, and verification status.
Peripheral models | Function | Doc | Code | Verification |
Gammatone filterbank | gammatone | |||
Linear filtering for monaural masking (basic) | dau1996 | |||
Linear filtering for monaural masking (improved) | dau1997 | |||
Invertible Gammatone filterbank | hohmann2002 | |||
Dual-resonance nonlinear filterbank (DRNL) | lopezpoveda2001 | |||
Fast acting compression (CARFAC) model | lyon2011 | |||
Cochlear transmission-line model (basic) | verhulst2012 | |||
Cochlear transmission-line model (improved) | verhulst2015 | |||
Cochlear transmission-line model (improved, incl. brainstem) | verhulst2018 | |||
Auditory-nerve spike generation | decheveigne2023 | |||
Auditory-nerve filterbank (basic) | zilany2007 | |||
Auditory-nerve filterbank (improved) | zilany2014 | |||
Auditory nerve filterbank (improved, ready for brainstem) | bruce2018 | |||
Compression in the simultaneous masker phase effect | tabuchi2016 | |||
Temporal-modulation sensitivity | Function | Doc | Code | Verification |
Brainstem processing (CN and IC) | carney2015 | |||
Auditory brainstem responses | roenne2012 | |||
Modulation filterbank (based on EPSM) | ewert2000 | |||
Modulation filterbank (based on nonlinear processing) | king2019 | |||
Modulation filterbank (based on DRNL) | relanoiborra2019 | |||
Modulation (leaky-integrator model) | viemeister1979 | |||
Non-linear adapation network | karjalainen1996 | |||
Binaural processing | Function | Doc | Code | Verification |
Binaural processing | smalt2014 | |||
Binaural masking level difference | culling2004 | |||
Binaural masking level difference (dynamic sources) | bischof2023 | |||
Binaural activity (based on cross-correlation) | lindemann1986 | |||
Binaural signal detection | breebaart2001 | |||
Binaural detection model based on interaural coherence | eurich2022 | |||
ITDs of hearing-aid users | pausch2022 | |||
Binaural activity map | takanen2013 | |||
Monaural speech perception | Function | Doc | Code | Verification |
Intelligibility in noise | joergensen2011 | |||
Intelligibility in noise | joergensen2013 | |||
Intelligibility with harmonic-cancellation | prudhomme2020 | |||
Short-time objective intelligibility | taal2011 | |||
Binaural speech perception | Function | Doc | Code | Verification |
Blind equalization-cancellation model | hauth2020 | |||
Binaural intelligibility in stationary noise (from BRIRs) | jelfs2011 | |||
Binaural intelligibility in stationary noise | lavandier2022 | |||
Binaural intelligibility of a reverberated speech target | leclere2015 | |||
Binaural intelligibility in non-stationary noise considering audibility | vicente2020 | |||
Binaural intelligibility in non-stationary noise (NH listeners only) | vicente2020nh | |||
Perceptual similarity | Function | Doc | Code | Verification |
Monaural perceptual similarity | osses2021 | |||
Binaural perceptual similarity | mckenzie2022 | |||
Binaural perceptual similarity | llado2022 | |||
Loudness models | Function | Doc | Code | Verification |
Stationary sounds | moore1997 | |||
Time-varying sounds | glasberg2002 | |||
Binaural hearing impaired | chen2011 | |||
Binaural loudness | moore2016 | |||
Spatial models | Function | Doc | Code | Verification |
Sound lateral direction | dietz2011 | |||
Lateralization, supervised training | may2011 | |||
´Lateralization in cochlear-implant listeners | kelvasa2015 | |||
Contextual lateralization based on interaural level differences | laback2023 | |||
Median-plane localization | langendijk2002 | |||
Vertical-plane localization (simple) | zakarauskas1993 | |||
Sagittal-plane localization (simple) | baumgartner2013 | |||
Sagittal-plane localization (robust) | baumgartner2014 | |||
Sagittal-plane localization (nonlinear, for hearing impairements) | baumgartner2016 | |||
Sound externalization (ILD based) | hassager2016 | |||
Sound externalization (multi-cue) | baumgartner2021 | |||
Sound externalization (reverberant spaces) | li2020 | |||
Distance perception | georganti2013 | |||
Bayesian spherical sound localization (basic) | reijniers2014 | |||
Bayesian spherical sound localization (multi-feature) | barumerli2023 | |||
Bayesian sound localization (dynamic, ITD-based) | mclachlan2021 | |||
Lateralization in sound reproduction systems | wierstorf2013 | |||
Directional time-of-arrival (on-axis only) | ziegelwanger2013 | |||
Directional time-of-arrival in HRTFs (off-axis, robust) | ziegelwanger2014 | |||
Data from various publications | Function | Doc | Code | Verification |
HRTFs and listener-specific sensitivities from Baumgartner et al. (2013) | data_baumgartner2013 | |||
HRTFs and listener-specific sensitivities from Baumgartner et al. (2014) | data_baumgartner2014 | |||
HRTFs and listener-specific sensitivities from Baumgartner et al. (2016) | data_baumgartner2016 | |||
Localization errors and SCCs from Best et al. (2005) | data_best2005 | |||
Externalization ratings from Boyd et al. (2012) | data_best2012 | |||
Reverberant harmonic complex tone from Bischof et al. (2023) | data_bischof2023 | |||
BMLD thresholds from Breebaart et al. (2001) | data_breebaart2001 | |||
ABR wave V data from Elberling et al. (2010) | data_elberling2010 | |||
Notched-noise masking thresholds for the ERB scale | data_glasberg1990 | |||
Stapes footplate diplacement from Goode et al. (1994) | data_goode1994 | |||
Localization performance in sagittal planes from Goupell et al. (2013) | data_goupell2013 | |||
Tone burst stimuli from Harte et al. (2009) | data_harte2009 | |||
Externalization ratings from Hartmann and Wittenberg (1996) | data_hartmann1996 | |||
Externalization ratings from Hassager et al. (2016) | data_hassager2016 | |||
SRTs tested by Joergensen and Dau (2011) | data_joergensen2011 | |||
Localization performance and HRTFs from Langendijk et al. (2002) | data_langendijk2002 | |||
Data from Lindemann (1986a) | data_lindenmann1986 | |||
Outer and middle ear filter data | data_lopezpoveda2001 | |||
Localization polar error rates from Macpherson et al. (2003) | data_macpherson2003 | |||
Localization performance from Majdak et al. (2010) | data_majdak2010 | |||
Localization training performance from Majdak et al. (2013) | data_majdak2013 | |||
Localization performance (CTC condition) from Majdak et al. (2013b) | data_majdak2013ctc | |||
Localization performance (non-individualized) from Middlebrooks (1999) | data_middlebrooks1999 | |||
ABR wave V data as functon of level and sweeping rate from Neely et al. (1988) | data_neely1988 | |||
Responses to amplitude panning in median plane from Pulkki (2001) | data_pulkki2001 | |||
"Unity responses" from Roenne (2012) | data_roenne2012 | |||
Localization response gains from Sabin et al. (2005) | data_sabin2005 | |||
Data involved in the modeling process of Takanen et al. (2005) | data_takanen2005 | |||
Masking threshold (binaural/monaural) from van der Par and Kohlrausch (1999) | data_vandepar1999 | |||
Data involved in the modeling process of Wierstorf et al. (2013) | data_wierstorf2013 | |||
HRTFs and other data involved in Ziegelwanger et al. (2013) | data_ziegelwanger2013 | |||
HRTFs and other data involved in Ziegelwanger et al. (2014) | data_ziegelwanger2014 | |||
Bark scale according to Zwicker (1961) | data_zwicker1961 | |||
Data from sound externalization experiments Baumgartner et al. (2017) | data_baumgartner2017looming | |||
HRTFs and data of a sound externalization model Baumgartner et al. (2017b) | data_baumgartner2017 | |||
Outer- and middle-ear data from Glasberg and Moore (2002) | data_glasberg2002 |