Age Recognition Based on GMM and DBN with Using Wavelet Packet Mel-Frequency Cepstrum Coefficient

Xian-Na Du, Yi-Biao YU

Abstract


A new multi-resolution feature extraction algorithm is proposed for the speaker and text independent age recognition. The new feature called Wavelet Packet Mel-Frequency Cepstrum Coefficient (WPMFC) takes more speakers personality characteristics like age compared with common feature such as MFCC. The speaker age is divided into four age groups such as children, youths, adult and older, and totally eight Gaussian Mixture Models (GMM) and Deep Belief Network (DBN) models are trained for each age group and gender. The performance of the age recognition systems with using WPMFC as the feature parameter are analyzed based on GMM and DBN. The experimental results show that the performance of the age recognition system based on GMM is more effective, the average recognition rate of outset speaker age reaches 65.17% which is superior to DBN approach for the same speech corpus.

Keywords


Speaker age recognition, MFCC, Wavelet packet transform, GMM, DBN

Publication Date


2016-11-30 00:00:00


DOI
10.12783/dtetr/ssme-ist2016/3974

Refbacks

  • There are currently no refbacks.