Cepstral Analysis

description63 papers

group17 followers

lightbulbAbout this topic

Cepstral analysis is a signal processing technique used to analyze the frequency spectrum of signals by transforming them into the cepstral domain. It involves taking the inverse Fourier transform of the logarithm of the power spectrum, allowing for the separation of different signal components, such as pitch and timbre, in various applications including speech and audio processing.

lightbulbAbout this topic

Key research themes

1. How can cepstral analysis be optimized for accurate voice disorder detection and characterization?

This research area focuses on evaluating and enhancing cepstral-based voice measures, particularly Cepstral Peak Prominence Smoothed (CPPS) and related parameters, for objective assessment and differentiation of voice quality in pathological and healthy voices. It matters because voice disorders can be subtle, and reliable quantitative tools are crucial for diagnosis, treatment monitoring, and differentiating conditions such as spasmodic dysphonia, resonant voice training effects, or endocrine-related voice changes.

Voice analysis in adductor spasmodic dysphonia: Objective diagnosis and response to botulinum toxin

by Giovanni Saggio

2022, Parkinsonism & Related Disorders

Key finding: This study demonstrated that cepstral analysis via CPP values, enhanced by machine-learning algorithms, more accurately distinguished patients with adductor-type spasmodic dysphonia (ASD) from healthy subjects than... Read more

articleView Paper downloadDownload

Cepstral Measures of Voice in Women with Polycystic Ovarian Syndrome

by Research and Statistics Center

2020, Asia Pacific Journal of Multidisciplinary Research

Key finding: The study found that women with polycystic ovarian syndrome (PCOS) exhibited significantly lower Cepstral Peak Prominence (CPP) and Smoothed Cepstral Peak Prominence (CPPS) values than healthy controls, indicating that... Read more

articleView Paper downloadDownload

Cepstral and Entropy Analyses in Vowels Excerpted from Continuous Speech of Dysphonic and Control Speakers

by Giampiero Salvi

2024, Interspeech 2017

Key finding: This study empirically established that CPPS had higher diagnostic precision (AUC = 0.85) than Sample Entropy (AUC = 0.72) in differentiating dysphonic from normal voices using vowel excerpts from continuous speech. The... Read more

articleView Paper downloadDownload

Is the Cepstral Analysis Sensitive Enough to Detect Untrained/Trained Resonant Voice in Healthy Subjects? A Preliminary Study

by Prof.Dr. Esra Ozcebe

2023

Key finding: Findings revealed that cepstral analysis (CPP and CPP standard deviation) was sensitive in detecting subtle voice quality changes pre- and post-resonant voice training in healthy subjects, particularly with voiced-weighted... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. How is Mel Frequency Cepstral Coefficient (MFCC) feature extraction utilized and adapted across diverse signal processing applications beyond traditional acoustic speech recognition?

This research theme investigates the computation, adaptation, and application of MFCC features in various domains—not limited to speech and speaker recognition but including biomedical signal classification (e.g., EEG, ECG), fault detection, and even non-acoustic signals. Understanding MFCC's applicability, parameter tuning, and its integration with machine and deep learning models informs its generalized utility and guides improvements for specific tasks.

Mel Frequency Cepstral Coefficient and its Applications: A Review

by Abdulbasit al-Talabani

2023, IEEE Access

Key finding: This comprehensive review synthesizes MFCC computation steps, challenges, and adaptations across diverse application fields, highlighting key considerations such as parameter tuning, combination with other features, and... Read more

articleView Paper downloadDownload

Computer-assisted analysis of routine EEG to identify hidden biomarkers of epilepsy: protocol for a systematic review

by Denahin Toffa

2023

Key finding: Although primarily focused on EEG analysis for epilepsy diagnosis, this work underscores the emerging role of quantitative EEG feature extraction—including cepstral-based metrics—as potential biomarkers. It situates cepstral... Read more

articleView Paper downloadDownload

Unveiling Multi-Dimensional Factors of Consumer Switching Intention Towards Electric Vehicles

2026, Journal of Human Earth and Future

Key finding: The research advances the classification of non-acoustic geophysical signals using cepstral features extracted from empirical mode decomposition (EMD) components of multichannel seismic data. Integrating cepstral attributes... Read more

articleView Paper

Emotion Detection from Voice Based Classified Frame-Energy Signal Using K-Means Clustering

by Rifat Hossain

2025, Zenodo (CERN European Organization for Nuclear Research)

Key finding: Utilizing Cepstral Coefficient features from significant short-time frames combined with k-means clustering, this paper shows enhanced emotion detection accuracy (happy, angry, sad) compared to prior methods. It also... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. What advancements and evaluation exist in automated cephalometric measurement accuracy using AI-driven digital tools versus conventional manual methods?

This theme centers on assessing the precision, reliability, and clinical feasibility of automated cephalometric analysis systems powered by artificial intelligence (AI) compared to traditional manual cephalometric tracing. Given cephalometry’s critical role in orthodontic diagnosis and treatment planning, improving automation accuracy reduces errors, time, and costs while ensuring reproducibility and standardization.

Evaluation of fully automated cephalometric measurements obtained from web-based artificial intelligence driven platform

by sanjeev luintel

2023, BMC Oral Health

Key finding: The study validated the accuracy and reliability of cephalometric measurements from the fully automated AI platform "WebCeph"™ by comparing its linear and angular measurements against manual tracings performed by an... Read more

articleView Paper downloadDownload

Comparison of Semi and Fully Automated Artificial Intelligence Driven Softwares and Manual System for Cephalometric Analysis

by Zahra Khalid

2023

Key finding: By analyzing 54 patients' cephalograms using manual, semi-automatic, and fully automatic AI-driven software (WebCeph and CephX), this study found no significant overall difference in the accuracy of cephalometric parameters... Read more

articleView Paper downloadDownload

The Reliability of Two- and Three-Dimensional Cephalometric Measurements: A CBCT Study

by Nipul Tanna

2023, Diagnostics

Key finding: This research discussed the advantages and limitations of 2D versus 3D cephalometric measurements using cone-beam computed tomography (CBCT), noting errors in 2D modalities like magnification and distortion, which automated... Read more

articleView Paper downloadDownload

Steiner cephalometric analysis discrepancies between conventional and digital methods using Cephninja® application software

by Gita Gayatri

2023, Padjadjaran Journal of Dentistry

Key finding: This experimental study compared conventional manual Steiner cephalometric analysis to digital analysis with CephNinja® application on 32 cephalograms, finding no significant discrepancies. This result supports the clinical... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Cepstral Analysis

Feature selection using genetics-based algorithm and its application to speaker identification

by Ali Haydar

2026

This paper introduces the use of genetics-based algorithm in the reduction of 24 parameter set (i.e the base set) to a 5,6,7,8 or 10 parameter set, for each speaker in text-independent speaker identification. The feature selection is done... more

descriptionView Paper arrow_downwardDownload

Applying improved spectral modeling for High Quality voice conversion

by Axel Roebel

2026, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this work, accurate spectral envelope estimation is applied to Voice Conversion in order to achieve High-Quality timbre conversion. True-Envelope based estimators allow model order selection leading to an adaptation of the spectral... more

descriptionView Paper arrow_downwardDownload

Moving Shadow Detection in Video Using Cepstrum

by Ahmet Enis Cetin

2025, International Journal of Advanced Robotic Systems

Moving shadows constitute problems in various applications such as image segmentation and object tracking. The main cause of these problems is the misclassification of the shadow pixels as target pixels. Therefore, the use of an accurate... more

descriptionView Paper arrow_downwardDownload

Upgrading FPGA Implementation of Isolated Word Recognition System for a Real-Time Operation

by Dalius Navakauskas

2025, Electronics and Electrical Engineering

The article reports on the upgrading of the FPGA based isolated word recognition system for real-time tasks. All recognition system components (except some feature calculation steps) were implemented using VHDL. Some high precision calculations were implemented on soft core processor. The employed Dynamic time warping algorithm was speeded-up 2.8 times by restricting the calculated error matrix size. This enabled us to reduce the average word recognition time to 12.81 ms. Linear predictive coding, linear predictive coding cepstral and linear frequency cepstral coefficients feature analyses were investigated for 100 Lithuanian word recognition. In speaker dependent experiments linear predictive coding cepstral analysis gave the highest average recognition rate of 95 % and the highest robustness to white noise in speech. 15 dB noise level lowered average recognition rate to 86.2 %. Index Terms-Cepstral analysis, dynamic time warping, field programmable gate array, intellectual property core, isolated word recognition, linear predictive coefficients. Despite of recent software-based Lithuanian speech recognition [1] and synthesis [2] implementations on personal computers and servers there is an unaddressed need of embedded systems for mobile and stand-alone devices, interactive voice controlled systems, disabled person equipment, etc. Embedded systems bring in their specific requirements for speech recognizers: the limited speed of processing and of memory, the low power consumption. The recognition of large vocabulary and continuous speech requires complicated algorithms with huge amounts of calculations, large quantities of memory [3], [4]. This can result in enlarged power consumption, longer recognition time and higher recognition error rate. Many automatic speech recognition systems for the languages of minor use are now developed. Presented in Croatian speech recognizer uses acoustic models based on context-dependent triphone hidden Markov models (HMM) and phonetic rules. Experimentally it is shown that the system can be used for speech recognition with word error rate below 5 %. In [6] a speaker independent speech Manuscript

descriptionView Paper arrow_downwardDownload

Automatic Multichannel Volcano-Seismic Classification Using Machine Learning and EMD

by Mauro Dalla Mura

2025, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

This article proposes the design of an automatic classifier using the empirical mode decomposition (EMD) along with machine learning techniques for identifying the five most important types of events of the Ubinas volcano, the most active... more

descriptionView Paper arrow_downwardDownload

Emotion Detection from Voice Based Classified Frame-Energy Signal Using K-Means Clustering

by Rifat Hossain

2025, Zenodo (CERN European Organization for Nuclear Research)

Emotion detection is a new research era in health informatics and forensic technology. Besides having some challenges, voice based emotion recognition is getting popular, as the situation where the facial image is not available, the voice... more

descriptionView Paper arrow_downwardDownload

Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification

by Patrick Flandrin

2025, IEEE Signal Processing Letters

Usually the mel-frequency cepstral coefficients are estimated either from a periodogram or from a windowed periodogram. We state a general estimator which also includes multitaper estimators. We propose approximations of the variance and... more

descriptionView Paper arrow_downwardDownload

A Tutorial on Text-Independent Speaker Verification

by Douglas Reynolds

2025, EURASIP Journal on Advances in Signal Processing

This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly... more

descriptionView Paper arrow_downwardDownload

Zastosowanie algorytmu pogoni za dopasowaniem do oceny emisji niestacjonarnego pola magnetycznego

by Beata Palczynska

2024, Zeszyty Naukowe Akademii Morskiej w Gdyni

W artykule przedstawiono sposób wyznaczania wskaźnika ekspozycji na niestacjonarne pola magnetyczne na podstawie adaptacyjnej analizy czasowo-częstotliwościowej, zarejestrowanych przebiegów czasowych indukcji pola magnetycznego B. Metodę... more

descriptionView Paper arrow_downwardDownload

Automatic Multichannel Volcano-Seismic Classification Using Machine Learning and EMD

by Jean-Philippe Metaxian

2024, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

descriptionView Paper arrow_downwardDownload

Channel distortion compensation based on the measurement of handset's frequency responses

by Man-wai Mak

2024

A new cepstrum-based channel compensation technique is proposed for speaker veri cation. Under this approach, channel cepstra are derived from the direct measurements of the frequency responses of telephone handsets. Speci cally, they are... more

descriptionView Paper arrow_downwardDownload

preprint of the article "Standardization of noisy volcano-seismic waveforms as a key step towards station-independent, robust automatic recognition" published in Seismological Research Letters-2019

by Roberto Carniel

2024

Preprint of the article: <strong><em>"Standardization of noisy volcano-seismic waveforms as a key step towards station-independent, robust automatic recognition" </em></strong> publshed in Seismological... more

descriptionView Paper arrow_downwardDownload

Dysphonia assessment using automatic classification system

by Alain Ghio

2024, HAL (Le Centre pour la Communication Scientifique Directe)

There is a paucity of quantitative data on the vestibular folds (VsF). Renewed interest in the VsF relates to their contribution to dysphonia. This study provides quantitative data to characterize developmentalinvolutional VsF changes and... more

descriptionView Paper arrow_downwardDownload

Multi Lingual Speaker Identification on Foreign Languages using Artificial Neural Network

by Prateek Agrawal

2024

Based on the Back Propagation Algorithm, this paper portrait a method for speaker identification in multiple foreign languages. In order to identify speaker, the complete process goes through recording of the speech utterances of... more

descriptionView Paper arrow_downwardDownload

Emotion Detection from Voice Based Classified Frame-Energy Signal Using K-Means Clustering

by Nazia Briti

Cepstral Analysis

Key research themes

1. How can cepstral analysis be optimized for accurate voice disorder detection and characterization?

2. How is Mel Frequency Cepstral Coefficient (MFCC) feature extraction utilized and adapted across diverse signal processing applications beyond traditional acoustic speech recognition?

3. What advancements and evaluation exist in automated cephalometric measurement accuracy using AI-driven digital tools versus conventional manual methods?

Related Topics

All papers in Cepstral Analysis