Acoustic Modeling

2,405 papers

43 followers

About this topic

Acoustic modeling is the process of creating mathematical representations of sound production and propagation in various environments. It involves analyzing sound waves, their interactions with materials, and the effects of different acoustic conditions to improve applications in fields such as speech recognition, audio engineering, and environmental acoustics.

Key research themes

1. How can machine learning improve acoustic modeling for robust feature extraction and surrogate modeling?

This research theme explores the integration of machine learning (ML), including deep learning, to enhance acoustic modeling by learning robust representations from raw or frequency-domain acoustic data. It focuses on improving the generalization of acoustic models across varied environments, as well as creating surrogate models that efficiently approximate complex vibroacoustic simulations. Such approaches aim to overcome the limitations of traditional handcrafted features and expensive computational methods, enabling better performance in speech recognition, sound transmission loss predictions, and environmental noise conditions.
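To make the surrogate-modeling idea concrete, here is a minimal sketch of a Gaussian radial basis function surrogate. It is not taken from any of the papers listed below. The "expensive simulation" is replaced by a toy target (an approximate field-incidence mass law for sound transmission loss), and the function names, grid size, input ranges, and length scale are all assumptions chosen for illustration.

```python
import numpy as np

def toy_stl_db(log10_freq, log10_mass):
    """Toy target (assumption): approximate field-incidence mass law, in dB."""
    return 20.0 * (log10_freq + log10_mass) - 47.0

def gaussian_kernel(a, b, length_scale):
    """Gaussian RBF kernel between two sets of points (rows are points)."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dist / (2.0 * length_scale ** 2))

def fit_rbf(x, y, length_scale, ridge=1e-10):
    """Solve for RBF weights; the small ridge term stabilizes the solve."""
    kernel = gaussian_kernel(x, x, length_scale)
    return np.linalg.solve(kernel + ridge * np.eye(len(x)), y)

def predict_rbf(x_new, x, weights, length_scale):
    """Evaluate the fitted surrogate at new points."""
    return gaussian_kernel(x_new, x, length_scale) @ weights

def to_physical(unit_points):
    """Map unit-square inputs to log10 frequency and log10 surface density."""
    log10_freq = 2.0 + 1.6 * unit_points[:, 0]  # 100 Hz to about 4 kHz
    log10_mass = 0.7 + 1.0 * unit_points[:, 1]  # about 5 to 50 kg/m^2
    return log10_freq, log10_mass

# Training set: an 8 x 8 grid of "simulation" runs on the unit square.
grid = np.linspace(0.0, 1.0, 8)
x_train = np.array([[u, v] for u in grid for v in grid])
y_train = toy_stl_db(*to_physical(x_train))

length_scale = 1.0 / 7.0  # equal to the grid spacing (illustrative choice)
weights = fit_rbf(x_train, y_train, length_scale)

# Compare the surrogate with the toy target at random held-out points.
rng = np.random.default_rng(0)
x_test = rng.uniform(0.0, 1.0, size=(200, 2))
y_pred = predict_rbf(x_test, x_train, weights, length_scale)
max_error_db = np.max(np.abs(y_pred - toy_stl_db(*to_physical(x_test))))
print(f"max held-out error: {max_error_db:.3f} dB")
```

In practice the training targets would come from a vibroacoustic solver rather than a closed-form expression, and the length scale would be tuned, for example by cross-validation.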

Machine learning in acoustics: Theory and applications

by Sharon Gannot

2023, The Journal of the Acoustical Society of America

Key finding: The paper surveys the transformative impact of ML in diverse acoustics applications, establishing that data-driven representation learning can discover complex acoustic phenomena such as human speech and reverberation...

Towards Robust Waveform-Based Acoustic Models

by Peter Sollich

2021

Key finding: This study proposes a vicinal risk minimization framework for learning robust acoustic models directly from raw waveforms, addressing significant mismatches between training and test environments. By modeling local...

Acoustic Modeling from Frequency Domain Representations of Speech

by Hossein Hadian

2021, Interspeech 2018

Key finding: The paper introduces a frequency-domain feature learning layer that integrates a Fourier transform inside the network to enable acoustic model training directly from raw waveforms. By incorporating a novel normalization layer...

On machine learning-driven surrogates for sound transmission loss simulations

by Malek Zine

2023, The Journal of the Acoustical Society of America

Key finding: This work investigates multiple ML methods, including Gaussian Process Regression, Radial Basis Functions, and Neural Networks, to create surrogate models approximating sound transmission loss (STL) simulations, which are...

2. What numerical and physics-informed modeling approaches enable efficient and accurate simulation of acoustic wave propagation and wave-based systems?

This theme covers advanced modeling methods for acoustic wave propagation that balance computational efficiency with physical accuracy, especially in complex and large-scale acoustic domains like rooms, resonators, and coupled subsystems. It includes the development of wave-based multipole models, state-space approaches for networked acoustic elements, digital filter design for reflections and air absorption, and reduced-order models for visco-thermal losses. These methods provide practical frameworks for sound propagation simulations, offering causal, compact representations of boundary conditions and subsystem interconnections that are essential for accurate acoustic predictions and real-time applications.
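As a rough illustration of how state-space subsystems can be interconnected, the sketch below cascades two linear time-invariant systems using the standard series-connection formulas. It is a generic textbook construction, not the framework of any paper listed here. The `resonator` helper is a hypothetical second-order element, and its parameters are arbitrary.

```python
import numpy as np

def resonator(f0_hz, damping, feedthrough=0.0):
    """Hypothetical second-order resonator as a toy acoustic element."""
    w0 = 2.0 * np.pi * f0_hz
    a = np.array([[0.0, 1.0], [-w0 ** 2, -2.0 * damping * w0]])
    b = np.array([[0.0], [w0 ** 2]])
    c = np.array([[1.0, 0.0]])
    d = np.array([[feedthrough]])
    return a, b, c, d

def series(sys1, sys2):
    """Cascade two state-space systems: the output of sys1 drives sys2."""
    a1, b1, c1, d1 = sys1
    a2, b2, c2, d2 = sys2
    n1, n2 = a1.shape[0], a2.shape[0]
    a = np.block([[a1, np.zeros((n1, n2))], [b2 @ c1, a2]])
    b = np.vstack([b1, b2 @ d1])
    c = np.hstack([d2 @ c1, c2])
    d = d2 @ d1
    return a, b, c, d

def transfer(system, s):
    """Evaluate the transfer function C (sI - A)^-1 B + D at complex s."""
    a, b, c, d = system
    return c @ np.linalg.solve(s * np.eye(a.shape[0]) - a, b) + d

element_1 = resonator(200.0, 0.05, feedthrough=0.1)
element_2 = resonator(450.0, 0.02, feedthrough=0.3)
cascade = series(element_1, element_2)

# The cascade's frequency response should equal the product of the parts.
s = 2j * np.pi * 300.0
combined = transfer(cascade, s)
product = transfer(element_2, s) @ transfer(element_1, s)
print(np.allclose(combined, product))
```

Keeping every subsystem in the same (A, B, C, D) form is what lets models of different origin be combined with one set of interconnection rules.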

Boundary admittance estimation for wave-based acoustic simulations using Bayesian inference

by Ning Xiang

2023, JASA Express Letters

Key finding: The paper presents a Bayesian inference framework to estimate both the order and parameters of multipole acoustic admittance models from experimentally measured frequency-dependent admittance data. By incorporating maximum...

Linear State Space Interconnect Modeling of Acoustic Systems

by Max Meindl

2024, Acta Acustica united with Acustica

Key finding: The study introduces a generalized linear state space framework to model interconnected acoustic subsystems, combining low-order 1D models, 3D linearized perturbation-based models, and data-driven models into a unified...

Modeling of reflections and air absorption in acoustical spaces: a digital filter design approach

by Lauri Savioja

2023, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics

Key finding: This paper proposes low-order minimum-phase digital filter design techniques to model acoustic reflection and air absorption effects based on measured absorption coefficients and impedance data. The method addresses the...

Sound absorption prediction of linear damped acoustic resonators using a lightweight hybrid model

by Emmanuel Redon

2025, Applied Acoustics

Key finding: The authors develop a computationally lightweight hybrid model combining lossless Helmholtz equations with viscous and thermal boundary layer perturbation theory to predict sound absorption in resonators with large...

All papers in Acoustic Modeling

Free software toolkit for Japanese large vocabulary continuous speech recognition

by Akinori Ito

2026, 6th International Conference on Spoken Language Processing (ICSLP 2000)

A sharable software repository for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) is introduced. It is designed as a baseline platform for research and developed by researchers of different academic institutes under a...

A Lightweight Real-Time Multilingual Video Communication System using On-Device Speech Translation

by Dr. AVIJIT MONDAL

2026, IEEE

Real-time multilingual interaction during mobile video calls is still difficult to achieve due to strict latency requirements, fluctuating network conditions, and the limited resource capacity of handheld devices. Although recent speech translation...

On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition

by Bogdan Vlasenko

2026, Lecture Notes in Computer Science

Acoustic Modeling in today's emotion recognition engines employs general models independent of the spoken phonetic content. This seems to work well enough given sufficient instances to cover for a broad variety of phonetic structures and...

Language Modeling of Nonverbal Vocalizations in Spontaneous Speech

by Bogdan Vlasenko

2026, Lecture Notes in Computer Science

Nonverbal vocalizations are one of the characteristics of spontaneous speech distinguishing it from written text. These phenomena are sometimes regarded as a problem in language and acoustic modeling. However, vocalizations such as filled...

Comparing one and two-stage acoustic modeling in the recognition of emotion in speech

by Bogdan Vlasenko

2026, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

In the search for a standard unit for use in recognition of emotion in speech, a whole turn, that is the full section of speech by one person in a conversation, is common. Within applications such turns often seem favorable. Yet, high...

Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition

by Bogdan Vlasenko

2026, 2008 IEEE International Conference on Multimedia and Expo

Recognition of emotion in speech usually uses acoustic models that ignore the spoken content. Likewise one general model per emotion is trained independent of the phonetic structure. Given sufficient data, this approach seemingly works...

Developing Tigrinya Speech Recognizer Using Amharic and Tigrinya Data

by fekadu deressa

2026

This study introduces the design of a Hidden Markov Model based LVCSR system for a new target language, built on a different source language and without the need for large speech databases in the target language. The Tigrinya LVCSR was...

Obliquity-correction imaging condition for reverse time migration

by Francisco de Assis Silva Neto

2026

The quality of seismic images obtained by reverse time migration (RTM) strongly depends on the imaging condition. We propose a new imaging condition that is motivated by stationary phase analysis of the classical crosscorrelation imaging...

Recognition of phoneme strings using TRAP technique

by Petr Schwarz

2026, Conference of the International Speech Communication Association

We investigate and compare several techniques for automatic recognition of unconstrained context-independent phoneme strings from TIMIT and NTIMIT databases. Among the compared techniques, the technique based on TempoRAl Patterns (TRAP)...

Phonotactic language identification using high quality phoneme recognition

by Petr Schwarz

2026, Conference of the International Speech Communication Association

Phoneme Recognizers followed by Language Modeling (PRLM) have consistently yielded top performance in the language identification (LID) task. Parallel ordering of PRLMs (PPRLM) improves performance even more. Since the tokenizer is the most...

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

by Nagendra Kumar Goel

2026

Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approach has been to use some kind of "universal phone set" that...

Subspace Gaussian Mixture Models for speech recognition

by Nagendra Kumar Goel

2026

We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian...

The Kaldi Speech Recognition Toolkit

by Nagendra Kumar Goel

2026, IEEE Automatic Speech Recognition and Understanding Workshop

We describe the design of Kaldi, a free, open-source toolkit for speech recognition research. Kaldi provides a speech recognition system based on finite-state transducers (using the freely available OpenFst), together with detailed...

AMADEUS - The Acoustic Neutrino Detection Test System of the ANTARES Deep-Sea Neutrino Telescope

by C. Bigongiari

2026, Nuclear Instruments and Methods in Physics Research

The AMADEUS (ANTARES Modules for the Acoustic Detection Under the Sea) system which is described in this article aims at the investigation of techniques for acoustic detection of neutrinos in the deep sea. It is integrated into the...

Acoustic models and Kalman filtering strategies for active binaural sound localization

by Patrick Danès

2026, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems

This paper deals with binaural sound localization. An active strategy is proposed, relying on a precise model of the dynamic changes induced by motion on the auditive perception. The proposed framework allows motions of both the sound...

Analysis of Leveraging Fastspeech 2 and Hifi-Gan Models for Speech Synthesis Adapted for Nigerian Languages

by Ikechukwu B Igbokwe

2026

The aim of this research is to develop a speech synthesis model tailored towards Nigerian languages by leveraging natural language processing tools such as FastSpeech 2 and meta-tts for high-quality, non-autoregressive text-to-speech (TTS)...

CNN-Based Accent Similarity Detection Using Masked Spectrogram Reconstruction

by AMINA SALIFU

2026
