Show simple item record

dc.contributor.authorWang, SL
dc.contributor.authorLau, WH
dc.contributor.authorLiew, AWC
dc.contributor.authorLeung, SH
dc.contributor.editorTang, YY
dc.contributor.editorWang, SP
dc.contributor.editorLorette, G
dc.contributor.editorYeung, DS
dc.contributor.editorYan, H
dc.date.accessioned2017-05-03T15:20:38Z
dc.date.available2017-05-03T15:20:38Z
dc.date.issued2006
dc.date.modified2010-10-27T08:26:51Z
dc.identifier.isbn9780769525211
dc.identifier.issn1051-4651
dc.identifier.doi10.1109/ICPR.2006.301
dc.identifier.urihttp://hdl.handle.net/10072/24387
dc.description.abstractSpeech recognition solely based on visual information such as the lip shape and its movement is referred to as lipreading. This paper presents an automatic lipreading technique for speaker dependent (SD) and speaker independent (SI) speech recognition tasks. Since the visual features are derived according to the frame rate of the video sequence, spline representation is then employed to translate the discrete-time sampled visual features into continuous domain. The spline coefficients in the same word class are constrained to have similar expression and can be estimated from the training data by the EM algorithm. In addition, an adaptive multi-model approach is proposed to overcome the variation caused by different speaking style in speaker-independent recognition task. The experiments are carried out to recognize the ten English digits and an accuracy of 96% for speaker dependent recognition and 88% for speaker independent recognition have been achieved, which shows the superiority of our approach compared with other classifiers investigated.
dc.description.peerreviewedYes
dc.description.publicationstatusYes
dc.format.extent120510 bytes
dc.format.extent20497 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypetext/plain
dc.languageEnglish
dc.language.isoeng
dc.publisherIEEE Computer Society
dc.publisher.placeUSA
dc.relation.ispartofstudentpublicationN
dc.relation.ispartofconferencename18th International Conference on Pattern Recognition (ICPR 2006)
dc.relation.ispartofconferencetitle18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS
dc.relation.ispartofdatefrom2006-08-20
dc.relation.ispartofdateto2006-08-24
dc.relation.ispartoflocationHong Kong, PEOPLES R CHINA
dc.relation.ispartofpagefrom881
dc.relation.ispartofpageto+
dc.relation.ispartofvolume3
dc.rights.retentionY
dc.subject.fieldofresearchcode280203
dc.subject.fieldofresearchcode280208
dc.titleAutomatic Lipreading with Limited Training Data
dc.typeConference output
dc.type.descriptionE1 - Conferences
dc.type.codeE - Conference Publications
gro.rights.copyright© 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
gro.date.issued2006
gro.hasfulltextFull Text
gro.griffith.authorLiew, Alan Wee-Chung


Files in this item

This item appears in the following Collection(s)

  • Conference outputs
    Contains papers delivered by Griffith authors at national and international conferences.

Show simple item record