Dugelay Temporal normalization of videos using visual speech MiFor'09 : 1st ACM Workshop on Multimedia in Forensics, 2009. ,
Jean-Luc Dugelay and Mohamed Jedra Impostor detection using facial stereoscopic images Eusipco, 17th European Signal Processing Conference, 2009. ,
Caroline Mallauran and Jean-Luc Dugelay Facial gender recognition using multiple sources of visual information MMSP, 10th IEEE International Workshop on MultiMedia Signal Processing, 2008. ,
Dugelay Facial video based response registration system Eusipco, 16th European Signal Processing Conference, 2008. ,
Serafeim Perdikis Albert Ali Salah, Dimitrios Tzovaras and Athanasios Vogiannou Activity-related biometric authentication eNTERFACE, 2008. ,
Ionut Petre, Usman Saeed and Jerome Urbain Multimodal services for remote communications eNTERFACE, 2007. ,
Dugelay Person recognition from video using facial mimics ICASSP, 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007. ,
Dugelay Person recognition based on head and mouth dynamics MMSP, IEEE International Workshop on Multimedia Signal Processing, 2006. ,
Video person recognition strategies using head motion and facial appearance, 2008. ,
Facial recognition vendor test 2002: evaluation report, 2003. ,
An Improved Word-Detection Algorithm for Telephone-Quality Speech Incorporating Both Syntactic and Semantic Constraints, AT&T Bell Laboratories Technical Journal, vol.63, issue.3, pp.479-498, 1984. ,
DOI : 10.1002/j.1538-7305.1984.tb00016.x
An Algorithm for Determining the Endpoints of Isolated Utterances, Bell System Technical Journal, vol.54, issue.2, pp.297-315, 1975. ,
DOI : 10.1002/j.1538-7305.1975.tb02840.x
Robust speech detection and segmentation for real-time ASR applications, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.432-435, 2003. ,
DOI : 10.1109/ICASSP.2003.1198810
Signal modeling techniques in speech recognition, Proceedings of the IEEE, vol.81, issue.9, pp.1215-1247, 1993. ,
DOI : 10.1109/5.237532
Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990. ,
DOI : 10.1121/1.399423
Robust audio segmentation, 2004. ,
Voice activity detection for conversational analysis, 1994. ,
A hybrid HMM-MLP speaker verification algorithm for telephone speech, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing, pp.153-156, 1994. ,
DOI : 10.1109/ICASSP.1994.389332
Digital representations of speech signals, Proceedings of the IEEE, vol.63, issue.4, pp.662-677, 1975. ,
DOI : 10.1109/PROC.1975.9799
Line spectrum representation of linear predictive coefficients, Trans. Committee Speech Research Acoustical Soc, vol.75, p.34, 1975. ,
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980. ,
DOI : 10.1109/TASSP.1980.1163420
On the use of instantaneous and transitional spectral information in speaker recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.36, issue.6, pp.871-879, 1988. ,
DOI : 10.1109/29.1598
An Overview of Speaker Recognition Technology, Proc. ESCA Workshop on Automatic Speaker Recognition, Identification, and Verification, pp.1-9, 1994. ,
DOI : 10.1007/978-1-4613-1367-0_2
A vector quantization approach to speaker recognition, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.14-26, 1987. ,
DOI : 10.1109/ICASSP.1985.1168412
Dynamic Programming Algorithm Optimization for Spoken Word Recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, issue.1, pp.43-49, 1978. ,
YOHO Speaker Verification, Speech Research Symposium, 1990. ,
Voice identification using nearest-neighbor distance measure, IEEE International Conference on Acoustics Speech and Signal Processing, pp.75-378, 1993. ,
DOI : 10.1109/ICASSP.1993.319317
Neural models for speaker recognition, 1991. ,
Speaker recognition using neural networks and conventional classifiers, IEEE Transactions on Speech and Audio Processing, vol.2, issue.1, pp.194-205, 1994. ,
DOI : 10.1109/89.260362
Neural net approaches to speaker verification: comparison with second order statistic measures, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.353-356, 1995. ,
DOI : 10.1109/ICASSP.1995.479594
A connectionist approach for automatic speaker identification, International Conference on Acoustics, Speech, and Signal Processing, pp.265-268, 1990. ,
DOI : 10.1109/ICASSP.1990.115619
Probabilistic cooperation of connectionist expert modules: Validation on a speaker identification task, Proc. IEEE ICASSP, pp.541-544, 1993. ,
Text???independent talker identification using recurrent neural networks, The Journal of the Acoustical Society of America, vol.87, issue.S1, 1990. ,
DOI : 10.1121/1.2027796
Text-independent speaker identification, IEEE Signal Processing Magazine, vol.11, issue.4, pp.18-32, 1994. ,
DOI : 10.1109/79.317924
Robust text-independent speaker identification using Gaussian mixture speaker models, IEEE Transactions on Speech and Audio Processing, vol.3, issue.1, pp.72-83, 1995. ,
DOI : 10.1109/89.365379
Audio-visual speech modeling for continuous speech recognition, IEEE Transactions on Multimedia, vol.2, issue.3, pp.141-151, 2000. ,
DOI : 10.1109/6046.865479
A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, pp.257-286, 1989. ,
Fundamentals of Speech Recognition, Signal Processing A. Oppenheim. Englewood, vol.Cliffs, 1993. ,
Mathematical Techniques in Multisensor Data Fusion, 1992. ,
Multisensor data fusion Handbook of Multisensor Data Fusion, pp.1-10, 2001. ,
Advances in Distributed Sensor Technology, 1995. ,
Adaptive bimodal sensor fusion for automatic speechreading, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.833-836, 1996. ,
DOI : 10.1109/ICASSP.1996.543250
Feature-level data fusion for bimodal person recognition, 6th International Conference on Image Processing and its Applications, pp.399-403, 1997. ,
DOI : 10.1049/cp:19970924
Multimodal Authentication Using Asynchronous HMMs, Proc. 4th International Conf. Audio-and Video-based Biometric Person Authentication, pp.770-777, 2003. ,
DOI : 10.1007/3-540-44887-X_89
Mixed memory Markov models: Decomposing complex stochastic processes as mixtures of simpler ones, Mach. Learn, vol.37, issue.1, pp.75-87, 1999. ,
Coupled hidden Markov models for complex action recognition, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.994-999, 1997. ,
DOI : 10.1109/CVPR.1997.609450
An approach to speaker identification using multiple classifiers, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1135-1138, 1997. ,
DOI : 10.1109/ICASSP.1997.596142
Introduction Multisensor Integration and Fusion for Intelligent Machines and Systems, pp.1-26, 1995. ,
On combining classifiers using sum and product rules, Pattern Recognition Letters, vol.22, issue.12, pp.1283-1289, 2001. ,
DOI : 10.1016/S0167-8655(01)00073-3
Multimodal decision-level fusion for person authentication, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.29, issue.6, pp.674-680, 1999. ,
DOI : 10.1109/3468.798073
Fusion of face and speech data for person identity verification, IEEE Transactions on Neural Networks, vol.10, issue.5, pp.1065-1074, 1999. ,
DOI : 10.1109/72.788647
Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004. ,
DOI : 10.1109/ICIP.2004.1418816
An approach to statistical lip modelling for speaker identification via chromatic feature extraction, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), pp.123-125, 1998. ,
DOI : 10.1109/ICPR.1998.711095
Biometric Identification Using Motion History Images of a Speaker's Lip Movements, Machine Vision and Image Processing Conference, pp.83-88, 2008. ,
Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-65, 1996. ,
DOI : 10.1109/ICSLP.1996.607030
An Evaluation of Visual Speech Features for the Tasks of Speech and Speaker Recognition, International Conference of Audio-and Video-Based Person Authentication, pp.260-267, 2003. ,
DOI : 10.1007/3-540-44887-X_31
Motion Features from Lip Movement for Person Authentication, 18th International Conference on Pattern Recognition (ICPR'06), pp.1059-1062, 2006. ,
DOI : 10.1109/ICPR.2006.814
Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006. ,
DOI : 10.1109/TIP.2006.877528
Multi-sensorial inputs for the identification of persons with synergetic computers, Proceedings of 1st International Conference on Image Processing, pp.287-291, 1994. ,
DOI : 10.1109/ICIP.1994.413577
A new approach to integrate audio and visual features of speech, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532), pp.1093-1096, 2000. ,
DOI : 10.1109/ICME.2000.871551
Automatic speechreading with application to speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002. ,
Acoustic-labial speaker verification, Proceedings of the First international Conference on Audio-and Video-Based Biometric Person Authentication, 1997. ,
DOI : 10.1016/S0167-8655(97)00070-6
Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features, Proc. 4th International Conference on Audio and Video Based Biometric Person Authentication, 2003. ,
DOI : 10.1007/3-540-44887-X_86
The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.2389-2392, 2000. ,
DOI : 10.1109/ICASSP.2000.859322
Joint audio-video processing for biometric speaker identification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.377-80, 2003. ,
Multimodal Biometrics of Lip Movements and Voice using Kernel Fisher Discriminant Analysis, 2006 9th International Conference on Control, Automation, Robotics and Vision, pp.1-6, 2006. ,
DOI : 10.1109/ICARCV.2006.345473
Synergy of Lip-Motion and Acoustic Features in Biometric Speech and Speaker Recognition, IEEE Transactions on Computers, vol.56, issue.9, pp.1169-1175, 2007. ,
DOI : 10.1109/TC.2007.1074
Audiovisual speech processing, IEEE Signal Processing Mag, 2001. ,
AVICAR: Audio-visual speech corpus in a car environment, Conf. Spoken Language, 2004. ,
The M2VTS multimodal face database 1 st Int. Conf. Audio-and Video-Based Biometric Person Authentication, 1997. ,
XM2VTSDB: The extended M2VTS database, nd Int. Conf. Audio-and Video-Based Biometric Person Authentication, 1999. ,
The realistic multi-modal VALID database and visual speaker identification comparison experiments, th International Conference on Audio-and Video-Based Biometric Person Authentication, 2005. ,
A video database of moving faces and people, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.5, pp.812-816, 2005. ,
DOI : 10.1109/TPAMI.2005.90
MyIdea -Multimodal Biometrics Database, Description of Acquisition Protocols, proc. of Third COST 275 Workshop, pp.59-62, 2005. ,
BIOMET: A Multimodal Person Authentication Database Including Face, Voice, Fingerprint, Hand and Signature Modalities, Audio-and Video-Based Biometric Person Authentication, p.1056, 2003. ,
DOI : 10.1007/3-540-44887-X_98
A segment-based audio-visual speech recognizer, Proceedings of the 6th international conference on Multimodal interfaces , ICMI '04, 2004. ,
DOI : 10.1145/1027933.1027972
Noise compensation in a person verification system using face and multiple speech features, Pattern Recognition, vol.36, issue.2, pp.293-302, 2003. ,
DOI : 10.1016/S0031-3203(02)00031-6
Computer analysis and classification of photographs of human faces, Proc. First USA?Japan Computer Conference, pp.2-7, 1972. ,
Facial components segmentation for extracting facial feature, Proceedings Second International Conference on Audio-and Videobased Biometric Person Authentication, 1999. ,
Human Face Segmentation and Identification, 1993. ,
Detection of human faces using decision trees, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, 1996. ,
DOI : 10.1109/AFGR.1996.557272
Edge and keypoint detection in facial regions, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp.212-217, 1996. ,
DOI : 10.1109/AFGR.1996.557266
Face Tracking and Pose Representation, Procedings of the British Machine Vision Conference 1996, 1996. ,
DOI : 10.5244/C.10.31
A real-time face tracker, IEEE Proc. of the 3 rd Workshop on Applications of Computer Vision, 1996. ,
Multi-model tracking of faces for video communications, IEEE Proc. of Int, Conf. on Computer Vision and Pattern Recognition, 1997. ,
Multi-modal system for locating heads and faces, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp.277-282, 1996. ,
DOI : 10.1109/AFGR.1996.557248
Automatic human face location in a complex background using motion and color information, Pattern Recognition, vol.29, issue.11, pp.1877-1889, 1996. ,
DOI : 10.1016/0031-3203(96)00036-2
Face-texture model based on sgld and its application Pattern Recog, pp.1007-1017, 1996. ,
Lip motion automatic detection, Scandinavian Conference on Image Analysis, 1997. ,
Visual pattern recognition by moment invariants, IRE Transactions on Information Theory, vol.8, pp.179-187, 1962. ,
Tracking Colour Objects Using Adaptive Mixture Models, Image and Vision Computing, vol.174, issue.3, pp.223-229, 1998. ,
Semantic segmentation of videophone image sequences, Proc. of SPIE Int. Conf. on Visual Communications and Image Processing, pp.1182-1193, 1992. ,
Eigenfaces for Recognition, Journal of Cognitive Neuroscience, vol.10, issue.9, pp.71-86, 1991. ,
DOI : 10.1007/BF00239352
Real-time tracking for an integrated face recognition system, 2nd Workshop on Parallel Modelling of Neural Operators, 1995. ,
Statistical chromaticity models for lip tracking with b-splines, Int. Conf. on Audio-and Video-Based Biometric Person Authentication, 1997. ,
Detection and tracking of facial features by using a facial feature model and deformable circular template, IEICE Trans. Inform. Systems, pp.1195-1207, 1995. ,
Facial feature detection using geometrical face model: An efficient approach, Pattern Recognition, vol.31, issue.3, 1998. ,
DOI : 10.1016/S0031-3203(97)00048-4
Saccadic search with Gabor features applied to eye detection and real-time head tracking, Image and Vision Computing, vol.18, issue.4, pp.323-329 ,
DOI : 10.1016/S0262-8856(99)00080-3
Face localization via shape statistics, Int. Workshop on Automatic Face and Gesture Recognition, 1995. ,
A robust approach to face and eyes detection from images with cluttered background, Proc. of International Conference on Pattern Recognition, 1998. ,
Real-time face location on gray-scale static images Pattern Recog, pp.1525-1539, 2000. ,
Human Face Image Recognition: An Evidence Aggregation Approach, Computer Vision and Image Understanding, vol.71, issue.2, 1998. ,
DOI : 10.1006/cviu.1998.0710
Snakes: Active contour models, Proc. of 1 st Int Conf. on Computer Vision, 1987. ,
DOI : 10.1007/BF00133570
A dual active contour for head and boundary extraction, IEE Colloquium on Image Processing for Biometric Measurement, 1994. ,
Human facial feature extraction for face interpretation and recognition, Pattern Recognition, vol.25, issue.12, pp.1435-1444, 1992. ,
DOI : 10.1016/0031-3203(92)90118-3
Facial contour extraction model, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998. ,
DOI : 10.1109/AFGR.1998.670957
Feature extraction from faces using deformable templates, International Journal of Computer Vision, vol.26, issue.6, pp.99-111, 1992. ,
DOI : 10.1007/BF00127169
Towards a system for automatic facial feature detection, Pattern Recognition, vol.26, issue.12, pp.1739-1755, 1993. ,
DOI : 10.1016/0031-3203(93)90173-T
EYE DETECTION USING OPTIMAL WAVELET PACKETS AND RADIAL BASIS FUNCTIONS (RBFs), International Journal of Pattern Recognition and Artificial Intelligence, vol.13, issue.07, 1999. ,
DOI : 10.1142/S0218001499000562
Classification of facial features for recognition, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.573-579, 1991. ,
DOI : 10.1109/CVPR.1991.139756
Automatic tracking, coding and reconstruction of human faces, using flexible appearance models, Electronics Letters, vol.30, issue.19, pp.578-1579, 1994. ,
DOI : 10.1049/el:19941110
Locating facial features using genetics algorithms, Proc. of Int. Conf. on Digital Signal Processing, pp.520-525, 1995. ,
Low-dimensional procedure for the characterization of human faces, Journal of the Optical Society of America A, vol.4, issue.3, pp.519-524, 1987. ,
DOI : 10.1364/JOSAA.4.000519
Two subspace methods to discriminate faces and clutters, Proceedings of the 2000 International Conference on Image Processing, 2000. ,
Face detection using mixtures of linear subspaces, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition, 2000. ,
A feature space for face image processing, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, 2000. ,
DOI : 10.1109/ICPR.2000.906025
Dynamic attention map by Ising model for human face detection, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), 1998. ,
DOI : 10.1109/ICPR.1998.711870
Probabilistic modeling of local appearance and spatial relationships for object recognition, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.45-51, 1998. ,
DOI : 10.1109/CVPR.1998.698586
A statistical method for 3D object detection applied to faces and cars, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.746-751, 2000. ,
DOI : 10.1109/CVPR.2000.855895
A cluster-based statistical model for object detection, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1046-1053, 1999. ,
DOI : 10.1109/ICCV.1999.790386
Object detection using hierarchical MRF and MAP estimation, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.186-192, 1997. ,
DOI : 10.1109/CVPR.1997.609318
Artificial neural network architecture for human face detection, Intell. Eng. Systems Artificial Neural Networks, vol.2, pp.535-540, 1992. ,
Neural network-based face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.1, pp.23-38, 1998. ,
DOI : 10.1109/34.655647
Rotation invariant neural network-based face detection, Proc. IEEE Intl. Conf. on Computer Vision and Pattern Recognition, pp.38-44, 1998. ,
Original approach for the localisation of objects in images, IEE Proc. Vision, Image and Signal Processing, pp.245-250, 1994. ,
DOI : 10.1049/ip-vis:19941301
A constrained generative model applied to face detection, Neural Process. Lett, vol.5, pp.73-81, 1997. ,
Face recognition/detection by probabilistic decision-based neural network, IEEE Trans. Neural Networks, vol.8, pp.114-132, 1997. ,
The SNoW Learning Architecture, 1999. ,
Training support vector machines: an application to face detection, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.130-136, 1997. ,
DOI : 10.1109/CVPR.1997.609310
A Trainable System for Object Recognition, International Journal of Computer Vision, vol.38, issue.1, pp.15-33, 2000. ,
DOI : 10.1023/A:1008162616689
HMM-based architecture for face identification, Image and Vision Computing, vol.12, issue.8, pp.537-583, 1994. ,
DOI : 10.1016/0262-8856(94)90007-8
Face Recognition Using Hidden Markov Models, 1994. ,
Face detection and recognition using hidden Markov models, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.141-145, 1998. ,
DOI : 10.1109/ICIP.1998.723445
Synthesizing a color algorithm from examples, Science, vol.239, issue.4839, pp.482-485, 1998. ,
DOI : 10.1126/science.3340834
Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of IAPR Workshop, pp.318-321, 2002. ,
Lip Image Segmentation Using Fuzzy Clustering Incorporating an Elliptic Shape Function, IEEE Transactions on Image Processing, vol.13, issue.1, pp.51-62, 2004. ,
DOI : 10.1109/TIP.2003.818116
Adaptive mouth segmentation using chromatic features, Pattern Recognition Letters, vol.23, issue.11, pp.1293-1302, 2002. ,
DOI : 10.1016/S0167-8655(02)00078-8
Lip feature extraction towards an automatic speechreading system, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101), pp.226-229, 2000. ,
DOI : 10.1109/ICIP.2000.899336
Initialised eigenlip estimator for fast lip tracking using linear regression, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, pp.178-181, 2000. ,
DOI : 10.1109/ICPR.2000.903514
A couple HMM for audio-visual speech recognition, Proc. ICASSP, pp.2013-2016, 2002. ,
Nonlinear manifold learning for visual speech recognition, Proceedings of IEEE International Conference on Computer Vision, pp.494-499, 1995. ,
DOI : 10.1109/ICCV.1995.466899
Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008. ,
DOI : 10.1049/iet-cvi:20070061
Modelling and segmentation of lip area in face images, IEE Proceedings Vision, Image and Signal Processing, pp.179-184, 2002. ,
DOI : 10.1049/ip-vis:20020378
Unsupervised lip segmentation under natural conditions, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3065-3068, 1999. ,
DOI : 10.1109/ICASSP.1999.757488
URL : https://hal.archives-ouvertes.fr/hal-00961012
Robust lip tracking by combining shape, color and motion, Proc. ACCV, pp.1040-1045, 2000. ,
Real-time lip tracking for audio-visual speech recognition applications, Proceedings of the 4th European Conference on Computer Vision, 1996. ,
DOI : 10.1007/3-540-61123-1_154
2D deformable models for visual speech analysis, " NATO Advanced Study Institute: Speech reading by Man and Machine, pp.391-398, 1995. ,
Using deformable templates to infer visual speech dynamics, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, pp.578-582, 1994. ,
DOI : 10.1109/ACSSC.1994.471518
Audiovisual speech recognition using MPEG-4 compliant visual features, EURASIP J. Appl. Signal Processing, pp.1213-1227, 2002. ,
Accurate and Quasi-Automatic Lip Tracking, IEEE Transactions on Circuits and Systems for Video Technology, pp.706-715, 2004. ,
DOI : 10.1109/TCSVT.2004.826754
Statistical Models of Appearance for Computer Vision, 2004. ,
Texture-Constrained Shape Prediction for Mouth Contour Extraction and its State Estimation, 18th International Conference on Pattern Recognition (ICPR'06), pp.88-91, 2006. ,
DOI : 10.1109/ICPR.2006.1114
Facial Expression Recognition Using Model-Based Feature Extraction and Action Parameters Classification, Journal of Visual Communication and Image Representation, pp.278-290, 1997. ,
DOI : 10.1006/jvci.1997.0359
Colour and Geometric based Model for Lip Localisation: Application for Lip-reading System, 14th International Conference on Image Analysis and Processing (ICIAP 2007), pp.9-14, 2007. ,
DOI : 10.1109/ICIAP.2007.4362750
Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004. ,
DOI : 10.1109/ICIP.2004.1418816
Automatic lip model extraction for constrained contour-based tracking, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348), pp.848-851, 1999. ,
DOI : 10.1109/ICIP.1999.823017
Unsupervised Lips Segmentation Based on ROI Optimisation and Parametric Model, 2007 IEEE International Conference on Image Processing, pp.301-304, 2007. ,
DOI : 10.1109/ICIP.2007.4380014
URL : https://hal.archives-ouvertes.fr/hal-00372142
SNAKES, Proc. International Journal of Computer Vision, pp.259-268, 1987. ,
DOI : 10.1016/B978-141600119-5.50010-X
Lip Localization and Viseme Recognition from Video Sequences, Fourteenth National Conference on Communications, 2008. ,
Robust Facial Feature Tracking, Procedings of the British Machine Vision Conference 2000, pp.232-241, 2000. ,
DOI : 10.5244/C.14.24
Practical feature subset selection for machine learning, Proceedings of the 21 st Australian Computer Science Conference, pp.181-191, 1998. ,
Wrappers for feature subset selection, Artificial Intelligence, vol.97, issue.1-2, pp.273-324, 1997. ,
DOI : 10.1016/S0004-3702(97)00043-X
An image transform approach for HMM based automatic lipreading, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.173-177, 1998. ,
DOI : 10.1109/ICIP.1998.999008
Visual Speech Recognition Using Motion Features and Hidden Markov Models, Proc. of International Conference on Computer Analysis of Images and Patterns, pp.832-839, 2007. ,
DOI : 10.1007/978-3-540-74272-2_103
Minimum redundancy feature selection from microarray gene expression data, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003, pp.523-528, 2003. ,
DOI : 10.1109/CSB.2003.1227396
Head Gesture Recognition Based on Bayesian Network, Proceedings of Iberian Conference on Pattern Recognition and Image Analysis, p.492, 2005. ,
DOI : 10.1007/11492429_60
Head gestures recognition, Proceedings of International Conference on Image Processing, pp.266-269, 2001. ,
Head nods analysis: interpretation of non verbal communication gestures, IEEE International Conference on Image Processing 2005, pp.425-433, 2005. ,
DOI : 10.1109/ICIP.2005.1530419
Look, Ma--No Hands! Hands free cursor control with real-time 3D face tracking, Proceedings of Workshop on Perceptual User Interface, 1998. ,
Real-time detection of nodding and head-shaking by directly detecting and tracking the "between-eyes", Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), pp.40-45, 2000. ,
DOI : 10.1109/AFGR.2000.840610
A real-time head nod and shake detector, Proceedings of the 2001 workshop on Percetive user interfaces , PUI '01, 2001. ,
DOI : 10.1145/971478.971509
Face and feature tracking for cursor control, Proceedings of 12th Scandinavian Conference on Image Analysis, 2001. ,
Natural Mouse-a novel human computer interface, Proceedings of International Conference on Image Processing, pp.653-656, 1999. ,
Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pp.511-518, 2001. ,
DOI : 10.1109/CVPR.2001.990517
A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, pp.147-151, 1988. ,
DOI : 10.5244/C.2.23
An iterative image registration technique with an application to stereo vision, Proceedings of DARPA Image Understanding Workshop, pp.121-130, 1981. ,
A practical guide to support vector classification, 2003. ,
Tomofaces: Eigenfaces extended to videos of speakers, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1793-1796, 2008. ,
DOI : 10.1109/ICASSP.2008.4517979
Face recognition based on fitting a 3D morphable model, PAMI, pp.1063-1074, 2003. ,
DOI : 10.1109/TPAMI.2003.1227983
Online learning of probabilistic appearance manifolds for video-based recognition and tracking, Proc of CVPR, pp.852-859, 2005. ,
Illumination cones for recognition under variable lighting: faces, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.52-59, 1998. ,
DOI : 10.1109/CVPR.1998.698587
Kernel-based Subspace Analysis for Face Recognition, 2007 International Joint Conference on Neural Networks, pp.1127-1132, 2007. ,
DOI : 10.1109/IJCNN.2007.4371116
A Method for Converting a Smiling Face to a Neutral Face with Applications to Face Recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.977-980, 2005. ,
DOI : 10.1109/ICASSP.2005.1415570
Motion compensated frame rate conversion using normalized motion estimation, Proc. IEEE Workshop on Signal Processing Systems Design and Implementation, pp.663-668, 2005. ,
Recent advances in image morphing, Proceedings of CG International '96, 1996. ,
DOI : 10.1109/CGI.1996.511788
Video tomography, Proceedings of the second ACM international conference on Multimedia , MULTIMEDIA '94, pp.349-356, 1994. ,
DOI : 10.1145/192593.192697
A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intell, vol.8, pp.679-698, 1986. ,
Sex-Net: a neural network identifies sex from human faces, Proceedings of Advances in neural information processing systems, pp.572-577, 1990. ,
Genetic feature subset selection for gender classification: a comparison study, IEEE Proceedings on Applications of Computer Vision, pp.165-170, 2002. ,
Mixture of experts for classification of gender, ethnic origin, and pose of human faces, IEEE Transactions on Neural Networks, pp.948-960, 2000. ,
DOI : 10.1109/72.857774
Age and gender classification from face images using neural networks, Signal and Image Processing, 2004. ,
Multimodal Facial Gender and Ethnicity Identification, Advances in Biometrics, pp.554-561, 2005. ,
DOI : 10.1007/11608288_74
Gender classification with support vector machines, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), pp.306-311, 2000. ,
DOI : 10.1109/AFGR.2000.840651
Cascaded Classification of Gender and Facial Expression using Active Appearance Models, 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pp.393-398, 2006. ,
DOI : 10.1109/FGR.2006.29
Appearance-based gender classification with Gaussian processes, Pattern Recognition Letters, vol.27, issue.6, pp.618-626, 2006. ,
DOI : 10.1016/j.patrec.2005.09.027
An Experimental Study on Automatic Face Gender Classification, 18th International Conference on Pattern Recognition (ICPR'06), pp.1099-1102, 2006. ,
DOI : 10.1109/ICPR.2006.247
Boosting Sex Identification Performance, International Journal of Computer Vision, vol.20, issue.1, pp.111-119, 2007. ,
DOI : 10.1007/s11263-006-8910-9
Learning gender from human gaits and faces, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, pp.505-510, 2007. ,
DOI : 10.1109/AVSS.2007.4425362
Improved connected letter recognition by lipreading, Proc. IEEE ICASSP, pp.557-560, 1993. ,
See me, hear me: Integrating automatic speech recognition and lip-reading, Proc. ICSLP, pp.547-550, 1994. ,
Audio-visual speaker recognition for video broadcast news: some fusion techniques, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451), pp.161-167, 1999. ,
DOI : 10.1109/MMSP.1999.793814
Robust speaker verification via fusion of speech and lip modalities, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3061-3064, 1999. ,
DOI : 10.1109/ICASSP.1999.757487
Exploiting Visual Information in Automatic Speech Processing, Handbook of Image and Video Processing, pp.1263-1289, 2005. ,
DOI : 10.1016/B978-012119792-6/50134-0
Experiments in automatic visual speech recognition, Proc. 7th FASE Symp, pp.1163-1170, 1988. ,
Speech recognition enhancement by lip information, Proc. CHI, pp.198-204, 1986. ,
Audiovisual speech recognition using multiscale nonlinear image decomposition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.38-41, 1996. ,
DOI : 10.1109/ICSLP.1996.607019
A comparison of model and transform-based visual features for audio-visual LVCSR, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001., pp.22-25, 2001. ,
DOI : 10.1109/ICME.2001.1237849
Automatic Speechreading with Applications to Human-Computer Interfaces, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11, pp.1228-1247, 2002. ,
DOI : 10.1155/S1110865702206137
On the Production and the Perception of Audio-Visual Speech by Man and Machine, Proc. Symp. Multimedia Communications and Video Coding, pp.277-284, 1995. ,
DOI : 10.1007/978-1-4613-0403-6_34
Lipreading by neural networks: Visual preprocessing, learning, and sensory integration, Advances in Neural Information Processing Systems, pp.1027-1034, 1994. ,
Active appearance models, Proc. Eur. Conf. Computer Vision, pp.484-498, 1998. ,
Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models, IEEE Transactions on Circuits and Systems for Video Technology, vol.14, issue.5, pp.693-705, 2004. ,
DOI : 10.1109/TCSVT.2004.826773
Lip reading for robust speech recognition on embedded devices, Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp.473-476, 2005. ,
Extraction of visual features for lipreading, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.2, pp.198-213, 2002. ,
DOI : 10.1109/34.982900
Audiovisual speech recognition using MPEG-4 compliant visual features, EURASIP J. Appl. Signal Process, pp.1213-1227, 2002. ,
Biometric Identification Using Motion History Images of a Speaker's Lip Movements, International Machine Vision and Image Processing Conference, pp.83-88, 2008. ,
Lip signatures for automatic person recognition, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451), pp.457-462, 1999. ,
DOI : 10.1109/MMSP.1999.793890
Feature representation and discrimination based on Gaussian mixture model probability densities???Practices and algorithms, Pattern Recognition, vol.39, issue.7, 2005. ,
DOI : 10.1016/j.patcog.2006.01.005
Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006. ,
DOI : 10.1109/TIP.2006.877528
Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of the IAPR Workshop on Machine Vision Application, pp.318-321, 2002. ,
Audio-visual automatic speech recognition: An overview, Issues in Visual and Audio-Visual Speech Processing, 2004. ,
Pattern recognition with fuzzy objective function algorithms, 1981. ,
DOI : 10.1007/978-1-4757-0450-1
The use of cohort normalized scores for speaker verification, Proceedings of Spoken Language Processing, pp.599-602, 1992. ,
Segmentation of color lip images by spatial fuzzy clustering, IEEE Transactions on Fuzzy Systems, vol.11, issue.4, pp.542-549, 2003. ,
DOI : 10.1109/TFUZZ.2003.814843
Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008. ,
DOI : 10.1049/iet-cvi:20070061
Active contours without edges, IEEE Transactions on Image Processing, vol.10, issue.2, pp.266-277, 2001. ,
DOI : 10.1109/83.902291
Convolutional face finder: a neural architecture for fast and robust face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.11, pp.1408-1423, 2004. ,
DOI : 10.1109/TPAMI.2004.97
existe aucune base de données qui répond à nos exigences, plus particulièrement, la première exigence est la plus difficile à réaliser. Donc nous avons décidé d'utiliser deux bases de données pour nos expériences ,
se décompose en cinq sessions d'enregistrement de 106 personnes (77 hommes, 29 femmes) sur une période d'un mois. Le contenu de la base de données se compose de trois phrases par session (en anglais), 1: " <Nom complet du person> Joe took father's green shoe bench out ,
Italian TV Database " compilées par [1], a été enregistrée par la chaîne italienne RAI 1, sur une période de 21 mois ,
Video person recognition strategies using head motion and facial appearance, 2008. ,
Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004. ,
DOI : 10.1109/ICIP.2004.1418816
An approach to statistical lip modelling for speaker identification via chromatic feature extraction, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), pp.123-125, 1998. ,
DOI : 10.1109/ICPR.1998.711095
Biometric Identification Using Motion History Images of a Speaker's Lip Movements, Machine Vision and Image Processing Conference, pp.83-88, 2008. ,
Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-65, 1996. ,
DOI : 10.1109/ICSLP.1996.607030
An Evaluation of Visual Speech Features for the Tasks of Speech and Speaker Recognition, International Conference of Audio-and Video-Based Person Authentication, pp.260-267, 2003. ,
DOI : 10.1007/3-540-44887-X_31
Motion Features from Lip Movement for Person Authentication, 18th International Conference on Pattern Recognition (ICPR'06), pp.1059-1062, 2006. ,
DOI : 10.1109/ICPR.2006.814
Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006. ,
DOI : 10.1109/TIP.2006.877528
The realistic multi-modal VALID database and visual speaker identification comparison experiments, th International Conference on Audio-and Video-Based Biometric Person Authentication, 2005. ,
SNAKES, Proc. International Journal of Computer Vision, pp.259-268, 1987. ,
DOI : 10.1016/B978-141600119-5.50010-X
Lip Localization and Viseme Recognition from Video Sequences, Fourteenth National Conference on Communications, 2008. ,
An image transform approach for HMM based automatic lipreading, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.173-177, 1998. ,
DOI : 10.1109/ICIP.1998.999008
Visual Speech Recognition Using Motion Features and Hidden Markov Models, Proc. of International Conference on Computer Analysis of Images and Patterns, pp.832-839, 2007. ,
DOI : 10.1007/978-3-540-74272-2_103
Minimum redundancy feature selection from microarray gene expression data, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003, pp.523-528, 2003. ,
DOI : 10.1109/CSB.2003.1227396
Tomofaces: Eigenfaces extended to videos of speakers, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1793-1796, 2008. ,
DOI : 10.1109/ICASSP.2008.4517979
Recent advances in image morphing, Proceedings of CG International '96, 1996. ,
DOI : 10.1109/CGI.1996.511788
A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intell, vol.8, pp.679-698, 1986. ,
Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of the IAPR Workshop on Machine Vision Application, pp.318-321, 2002. ,
Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008. ,
DOI : 10.1049/iet-cvi:20070061
Active contours without edges, IEEE Transactions on Image Processing, vol.10, issue.2, pp.266-277, 2001. ,
DOI : 10.1109/83.902291