Academia.eduAcademia.edu

Cepstral Analysis

description63 papers
group17 followers
lightbulbAbout this topic
Cepstral analysis is a signal processing technique used to analyze the frequency spectrum of signals by transforming them into the cepstral domain. It involves taking the inverse Fourier transform of the logarithm of the power spectrum, allowing for the separation of different signal components, such as pitch and timbre, in various applications including speech and audio processing.
lightbulbAbout this topic
Cepstral analysis is a signal processing technique used to analyze the frequency spectrum of signals by transforming them into the cepstral domain. It involves taking the inverse Fourier transform of the logarithm of the power spectrum, allowing for the separation of different signal components, such as pitch and timbre, in various applications including speech and audio processing.

Key research themes

1. How can cepstral analysis be optimized for accurate voice disorder detection and characterization?

This research area focuses on evaluating and enhancing cepstral-based voice measures, particularly Cepstral Peak Prominence Smoothed (CPPS) and related parameters, for objective assessment and differentiation of voice quality in pathological and healthy voices. It matters because voice disorders can be subtle, and reliable quantitative tools are crucial for diagnosis, treatment monitoring, and differentiating conditions such as spasmodic dysphonia, resonant voice training effects, or endocrine-related voice changes.

Key finding: This study demonstrated that cepstral analysis via CPP values, enhanced by machine-learning algorithms, more accurately distinguished patients with adductor-type spasmodic dysphonia (ASD) from healthy subjects than... Read more
Key finding: The study found that women with polycystic ovarian syndrome (PCOS) exhibited significantly lower Cepstral Peak Prominence (CPP) and Smoothed Cepstral Peak Prominence (CPPS) values than healthy controls, indicating that... Read more
Key finding: This study empirically established that CPPS had higher diagnostic precision (AUC = 0.85) than Sample Entropy (AUC = 0.72) in differentiating dysphonic from normal voices using vowel excerpts from continuous speech. The... Read more
Key finding: Findings revealed that cepstral analysis (CPP and CPP standard deviation) was sensitive in detecting subtle voice quality changes pre- and post-resonant voice training in healthy subjects, particularly with voiced-weighted... Read more

2. How is Mel Frequency Cepstral Coefficient (MFCC) feature extraction utilized and adapted across diverse signal processing applications beyond traditional acoustic speech recognition?

This research theme investigates the computation, adaptation, and application of MFCC features in various domains—not limited to speech and speaker recognition but including biomedical signal classification (e.g., EEG, ECG), fault detection, and even non-acoustic signals. Understanding MFCC's applicability, parameter tuning, and its integration with machine and deep learning models informs its generalized utility and guides improvements for specific tasks.

Key finding: This comprehensive review synthesizes MFCC computation steps, challenges, and adaptations across diverse application fields, highlighting key considerations such as parameter tuning, combination with other features, and... Read more
Key finding: Although primarily focused on EEG analysis for epilepsy diagnosis, this work underscores the emerging role of quantitative EEG feature extraction—including cepstral-based metrics—as potential biomarkers. It situates cepstral... Read more
Key finding: The research advances the classification of non-acoustic geophysical signals using cepstral features extracted from empirical mode decomposition (EMD) components of multichannel seismic data. Integrating cepstral attributes... Read more
Key finding: Utilizing Cepstral Coefficient features from significant short-time frames combined with k-means clustering, this paper shows enhanced emotion detection accuracy (happy, angry, sad) compared to prior methods. It also... Read more

3. What advancements and evaluation exist in automated cephalometric measurement accuracy using AI-driven digital tools versus conventional manual methods?

This theme centers on assessing the precision, reliability, and clinical feasibility of automated cephalometric analysis systems powered by artificial intelligence (AI) compared to traditional manual cephalometric tracing. Given cephalometry’s critical role in orthodontic diagnosis and treatment planning, improving automation accuracy reduces errors, time, and costs while ensuring reproducibility and standardization.

Key finding: The study validated the accuracy and reliability of cephalometric measurements from the fully automated AI platform "WebCeph"™ by comparing its linear and angular measurements against manual tracings performed by an... Read more
Key finding: By analyzing 54 patients' cephalograms using manual, semi-automatic, and fully automatic AI-driven software (WebCeph and CephX), this study found no significant overall difference in the accuracy of cephalometric parameters... Read more
Key finding: This research discussed the advantages and limitations of 2D versus 3D cephalometric measurements using cone-beam computed tomography (CBCT), noting errors in 2D modalities like magnification and distortion, which automated... Read more
Key finding: This experimental study compared conventional manual Steiner cephalometric analysis to digital analysis with CephNinja® application on 32 cephalograms, finding no significant discrepancies. This result supports the clinical... Read more

All papers in Cepstral Analysis

This paper introduces the use of genetics-based algorithm in the reduction of 24 parameter set (i.e the base set) to a 5,6,7,8 or 10 parameter set, for each speaker in text-independent speaker identification. The feature selection is done... more
In this work, accurate spectral envelope estimation is applied to Voice Conversion in order to achieve High-Quality timbre conversion. True-Envelope based estimators allow model order selection leading to an adaptation of the spectral... more
Moving shadows constitute problems in various applications such as image segmentation and object tracking. The main cause of these problems is the misclassification of the shadow pixels as target pixels. Therefore, the use of an accurate... more
The article reports on the upgrading of the FPGA based isolated word recognition system for real-time tasks. All recognition system components (except some feature calculation steps) were implemented using VHDL. Some high precision... more
This article proposes the design of an automatic classifier using the empirical mode decomposition (EMD) along with machine learning techniques for identifying the five most important types of events of the Ubinas volcano, the most active... more
Emotion detection is a new research era in health informatics and forensic technology. Besides having some challenges, voice based emotion recognition is getting popular, as the situation where the facial image is not available, the voice... more
Usually the mel-frequency cepstral coefficients are estimated either from a periodogram or from a windowed periodogram. We state a general estimator which also includes multitaper estimators. We propose approximations of the variance and... more
This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly... more
W artykule przedstawiono sposób wyznaczania wskaźnika ekspozycji na niestacjonarne pola magnetyczne na podstawie adaptacyjnej analizy czasowo-częstotliwościowej, zarejestrowanych przebiegów czasowych indukcji pola magnetycznego B. Metodę... more
This article proposes the design of an automatic classifier using the empirical mode decomposition (EMD) along with machine learning techniques for identifying the five most important types of events of the Ubinas volcano, the most active... more
A new cepstrum-based channel compensation technique is proposed for speaker veri cation. Under this approach, channel cepstra are derived from the direct measurements of the frequency responses of telephone handsets. Speci cally, they are... more
Preprint of the article: <strong><em>"Standardization of noisy volcano-seismic waveforms as a key step towards station-independent, robust automatic recognition" </em></strong> publshed in Seismological... more
There is a paucity of quantitative data on the vestibular folds (VsF). Renewed interest in the VsF relates to their contribution to dysphonia. This study provides quantitative data to characterize developmentalinvolutional VsF changes and... more
Based on the Back Propagation Algorithm, this paper portrait a method for speaker identification in multiple foreign languages. In order to identify speaker, the complete process goes through recording of the speech utterances of... more