Static and Dynamic Lip Feature Analysis for Speaker Verification

File Size Format
53757_1.pdf 242Kb Adobe PDF View
Title Static and Dynamic Lip Feature Analysis for Speaker Verification
Author Wang, S.L.; Liew, Alan Wee-Chung
Publication Title Proceedings of the International Conference on Auditory-Visual Speech Processing 2008
Editor Simon Lucey, Roland Gocke, Patrick Lucey
Year Published 2008
Place of publication Australia
Publisher International Speech Communication Association, ISCA
Abstract As we all known, various speakers have their own talking styles. Hence, lip shape and its movement can be used as a new biometrics and infer the speaker’s identity. Compared with the traditional biometrics such as human face and fingerprint, person verification based on the lip feature has the advantage of containing both static and dynamic information. Many researchers have demonstrated that incorporating dynamic information such as lip movement help improve the verification performance. However, which is more discriminative, the static features or the dynamic features remained unsolved. In this paper, the discriminative power analysis of the static and dynamic lip features is performed. For the static lip features, a new kind of feature representation including the geometric features, contour descriptors and texture features is proposed and the Gaussian Mixture Model (GMM) is employed as the classifier. For the dynamic features, Hidden Markov Model (HMM) is employed as the classifier for its superiority in dealing with time-series data. Experiments are carried out on a database containing 40 speakers in our lab. Detailed evaluation for various static/dynamic lip feature representation is made along with a corresponding discussion on the discriminative ability. The experimental results disclose that the dynamic lip shape information and the static lip texture information contain much identity-relevant information.
Peer Reviewed Yes
Published Yes
Publisher URI
Copyright Statement Copyright 2008 ISCA and the Authors. The attached file is reproduced here in accordance with the copyright policy of the publisher. For information about this conference please refer to the conference’s website or contact the authors.
ISBN 978-0-646-49504-0
Conference name International Conference on Auditory-Visual Speech Processing, AVSP2008
Location Moreton Island, Australia
Date From 2008-09-26
Date To 2008-09-29
Date Accessioned 2009-03-02
Language en_US
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Science, Environment, Engineering and Technology
Subject Pattern Recognition and Data Mining
Publication Type Conference Publications (Full Written Paper - Refereed)
Publication Type Code e1

Show simple item record

Griffith University copyright notice