U. Saeed and J. , Dugelay Temporal normalization of videos using visual speech MiFor'09 : 1st ACM Workshop on Multimedia in Forensics, 2009.

A. Benaiss and U. Saeed, Jean-Luc Dugelay and Mohamed Jedra Impostor detection using facial stereoscopic images Eusipco, 17th European Signal Processing Conference, 2009.

F. Matta and U. Saeed, Caroline Mallauran and Jean-Luc Dugelay Facial gender recognition using multiple sources of visual information MMSP, 10th IEEE International Workshop on MultiMedia Signal Processing, 2008.

U. Saeed and J. , Dugelay Facial video based response registration system Eusipco, 16th European Signal Processing Conference, 2008.

G. Ananthakrishnan, H. Dibeklioglu, M. Lojka, and A. Lopez, Serafeim Perdikis Albert Ali Salah, Dimitrios Tzovaras and Athanasios Vogiannou Activity-related biometric authentication eNTERFACE, 2008.

J. Allasia, A. C. Andres-del-valle, and D. C. Barbu, Ionut Petre, Usman Saeed and Jerome Urbain Multimodal services for remote communications eNTERFACE, 2007.

U. Saeed and J. , Dugelay Person recognition from video using facial mimics ICASSP, 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007.

U. Saeed, F. Matta, and J. , Dugelay Person recognition based on head and mouth dynamics MMSP, IEEE International Workshop on Multimedia Signal Processing, 2006.

F. Matta, Video person recognition strategies using head motion and facial appearance, 2008.

P. J. Phillips, P. Grother, R. J. Micheals, D. M. Blackburn, E. Tabassi et al., Facial recognition vendor test 2002: evaluation report, 2003.

J. G. Wilpon, L. R. Rabiner, and T. Martin, An Improved Word-Detection Algorithm for Telephone-Quality Speech Incorporating Both Syntactic and Semantic Constraints, AT&T Bell Laboratories Technical Journal, vol.63, issue.3, pp.479-498, 1984.
DOI : 10.1002/j.1538-7305.1984.tb00016.x

L. R. Rabiner and M. R. Sambur, An Algorithm for Determining the Endpoints of Isolated Utterances, Bell System Technical Journal, vol.54, issue.2, pp.297-315, 1975.
DOI : 10.1002/j.1538-7305.1975.tb02840.x

I. Shafran and R. Rose, Robust speech detection and segmentation for real-time ASR applications, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.432-435, 2003.
DOI : 10.1109/ICASSP.2003.1198810

J. W. Picone, Signal modeling techniques in speech recognition, Proceedings of the IEEE, vol.81, issue.9, pp.1215-1247, 1993.
DOI : 10.1109/5.237532

H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990.
DOI : 10.1121/1.399423

J. Ajmera, Robust audio segmentation, 2004.

J. A. Haigh, Voice activity detection for conversational analysis, 1994.

J. M. Naik and D. M. Lubensky, A hybrid HMM-MLP speaker verification algorithm for telephone speech, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing, pp.153-156, 1994.
DOI : 10.1109/ICASSP.1994.389332

R. W. Schafer and L. R. Rabiner, Digital representations of speech signals, Proceedings of the IEEE, vol.63, issue.4, pp.662-677, 1975.
DOI : 10.1109/PROC.1975.9799

F. Itakura, Line spectrum representation of linear predictive coefficients, Trans. Committee Speech Research Acoustical Soc, vol.75, p.34, 1975.

S. B. Davis and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980.
DOI : 10.1109/TASSP.1980.1163420

F. K. Soong and A. E. Rosenberg, On the use of instantaneous and transitional spectral information in speaker recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.36, issue.6, pp.871-879, 1988.
DOI : 10.1109/29.1598

S. Furui, An Overview of Speaker Recognition Technology, Proc. ESCA Workshop on Automatic Speaker Recognition, Identification, and Verification, pp.1-9, 1994.
DOI : 10.1007/978-1-4613-1367-0_2

F. K. Soong, A. E. Rosenberg, L. R. Rabiner, and B. H. Juang, A vector quantization approach to speaker recognition, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.14-26, 1987.
DOI : 10.1109/ICASSP.1985.1168412

H. Sakoe and S. Chiba, Dynamic Programming Algorithm Optimization for Spoken Word Recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, issue.1, pp.43-49, 1978.

A. Higgins, YOHO Speaker Verification, Speech Research Symposium, 1990.

A. Higgins, L. Bahler, and J. Porter, Voice identification using nearest-neighbor distance measure, IEEE International Conference on Acoustics Speech and Signal Processing, pp.75-378, 1993.
DOI : 10.1109/ICASSP.1993.319317

J. Oglesby, Neural models for speaker recognition, 1991.

K. R. Farrell, R. J. Mammone, and K. T. Assaleh, Speaker recognition using neural networks and conventional classifiers, IEEE Transactions on Speech and Audio Processing, vol.2, issue.1, pp.194-205, 1994.
DOI : 10.1109/89.260362

M. M. Homayounpour and G. Chollet, Neural net approaches to speaker verification: comparison with second order statistic measures, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.353-356, 1995.
DOI : 10.1109/ICASSP.1995.479594

Y. Bennani, F. F. Soulie, and P. Gallinari, A connectionist approach for automatic speaker identification, International Conference on Acoustics, Speech, and Signal Processing, pp.265-268, 1990.
DOI : 10.1109/ICASSP.1990.115619

Y. Bennani, Probabilistic cooperation of connectionist expert modules: Validation on a speaker identification task, Proc. IEEE ICASSP, pp.541-544, 1993.

L. Rudasi and S. A. Zahorian, Text???independent talker identification using recurrent neural networks, The Journal of the Acoustical Society of America, vol.87, issue.S1, 1990.
DOI : 10.1121/1.2027796

H. Gish and M. Schmidt, Text-independent speaker identification, IEEE Signal Processing Magazine, vol.11, issue.4, pp.18-32, 1994.
DOI : 10.1109/79.317924

D. A. Reynolds and R. C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models, IEEE Transactions on Speech and Audio Processing, vol.3, issue.1, pp.72-83, 1995.
DOI : 10.1109/89.365379

S. Dupont and J. Luettin, Audio-visual speech modeling for continuous speech recognition, IEEE Transactions on Multimedia, vol.2, issue.3, pp.141-151, 2000.
DOI : 10.1109/6046.865479

L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, pp.257-286, 1989.

L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Signal Processing A. Oppenheim. Englewood, vol.Cliffs, 1993.

D. L. Hall, Mathematical Techniques in Multisensor Data Fusion, 1992.

D. L. Hall and J. Llinas, Multisensor data fusion Handbook of Multisensor Data Fusion, pp.1-10, 2001.

S. S. Iyengar, L. Prasad, and H. Min, Advances in Distributed Sensor Technology, 1995.

U. Meier, W. Hurst, and P. Duchnowski, Adaptive bimodal sensor fusion for automatic speechreading, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp.833-836, 1996.
DOI : 10.1109/ICASSP.1996.543250

C. C. Chibelushi, J. S. Mason, and F. Deravi, Feature-level data fusion for bimodal person recognition, 6th International Conference on Image Processing and its Applications, pp.399-403, 1997.
DOI : 10.1049/cp:19970924

S. Bengio, Multimodal Authentication Using Asynchronous HMMs, Proc. 4th International Conf. Audio-and Video-based Biometric Person Authentication, pp.770-777, 2003.
DOI : 10.1007/3-540-44887-X_89

K. S. Lawrence and I. J. Michael, Mixed memory Markov models: Decomposing complex stochastic processes as mixtures of simpler ones, Mach. Learn, vol.37, issue.1, pp.75-87, 1999.

M. Brand, N. Oliver, and A. Pentland, Coupled hidden Markov models for complex action recognition, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.994-999, 1997.
DOI : 10.1109/CVPR.1997.609450

V. Radova and J. Psutka, An approach to speaker identification using multiple classifiers, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1135-1138, 1997.
DOI : 10.1109/ICASSP.1997.596142

R. C. Luo and M. G. Kay, Introduction Multisensor Integration and Fusion for Intelligent Machines and Systems, pp.1-26, 1995.

L. A. Alexandre, A. C. Campilho, and M. Kamel, On combining classifiers using sum and product rules, Pattern Recognition Letters, vol.22, issue.12, pp.1283-1289, 2001.
DOI : 10.1016/S0167-8655(01)00073-3

V. Chatzis, A. G. Bors, and I. Pitas, Multimodal decision-level fusion for person authentication, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.29, issue.6, pp.674-680, 1999.
DOI : 10.1109/3468.798073

S. Ben-yacoub, Y. Abdeljaoued, and E. Mayoraz, Fusion of face and speech data for person identity verification, IEEE Transactions on Neural Networks, vol.10, issue.5, pp.1065-1074, 1999.
DOI : 10.1109/72.788647

L. L. Mok, W. H. Lau, S. H. Leung, S. L. Wang, and H. Yan, Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004.
DOI : 10.1109/ICIP.2004.1418816

T. Wark, S. Sridharan, and V. Chandran, An approach to statistical lip modelling for speaker identification via chromatic feature extraction, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), pp.123-125, 1998.
DOI : 10.1109/ICPR.1998.711095

A. G. De-la-cuesta, Z. Jianguo, and P. Miller, Biometric Identification Using Motion History Images of a Speaker's Lip Movements, Machine Vision and Image Processing Conference, pp.83-88, 2008.

J. Luettin, N. A. Thacker, and S. W. Beet, Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-65, 1996.
DOI : 10.1109/ICSLP.1996.607030

S. Lucey, An Evaluation of Visual Speech Features for the Tasks of Speech and Speaker Recognition, International Conference of Audio-and Video-Based Person Authentication, pp.260-267, 2003.
DOI : 10.1007/3-540-44887-X_31

M. I. Faraj and J. Bigun, Motion Features from Lip Movement for Person Authentication, 18th International Conference on Pattern Recognition (ICPR'06), pp.1059-1062, 2006.
DOI : 10.1109/ICPR.2006.814

H. E. Cetingul, Y. Yemez, E. Engin, and A. M. Tekalp, Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006.
DOI : 10.1109/TIP.2006.877528

T. Wagner and U. Dieckmann, Multi-sensorial inputs for the identification of persons with synergetic computers, Proceedings of 1st International Conference on Image Processing, pp.287-291, 1994.
DOI : 10.1109/ICIP.1994.413577

H. Pan, L. Zhi-pei, and T. S. Huang, A new approach to integrate audio and visual features of speech, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532), pp.1093-1096, 2000.
DOI : 10.1109/ICME.2000.871551

C. C. Broun, X. Zhang, R. M. Mersereau, and M. Clements, Automatic speechreading with application to speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.

P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, Acoustic-labial speaker verification, Proceedings of the First international Conference on Audio-and Video-Based Biometric Person Authentication, 1997.
DOI : 10.1016/S0167-8655(97)00070-6

N. Fox and R. B. Reilly, Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features, Proc. 4th International Conference on Audio and Video Based Biometric Person Authentication, 2003.
DOI : 10.1007/3-540-44887-X_86

T. Wark, S. Sridharan, and V. Chandran, The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.2389-2392, 2000.
DOI : 10.1109/ICASSP.2000.859322

A. Kanak, E. Erzin, Y. Yemez, and A. M. Tekalp, Joint audio-video processing for biometric speaker identification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.377-80, 2003.

M. Ichino, H. Sakano, and N. Komatsu, Multimodal Biometrics of Lip Movements and Voice using Kernel Fisher Discriminant Analysis, 2006 9th International Conference on Control, Automation, Robotics and Vision, pp.1-6, 2006.
DOI : 10.1109/ICARCV.2006.345473

M. L. Faraj and J. Bigun, Synergy of Lip-Motion and Acoustic Features in Biometric Speech and Speaker Recognition, IEEE Transactions on Computers, vol.56, issue.9, pp.1169-1175, 2007.
DOI : 10.1109/TC.2007.1074

T. Chen, Audiovisual speech processing, IEEE Signal Processing Mag, 2001.

B. Lee, M. Hasegawa-johnson, C. Goudeseune, S. Kamdar, S. Borys et al., AVICAR: Audio-visual speech corpus in a car environment, Conf. Spoken Language, 2004.

S. Pigeon and L. Vandendorpe, The M2VTS multimodal face database 1 st Int. Conf. Audio-and Video-Based Biometric Person Authentication, 1997.

K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, XM2VTSDB: The extended M2VTS database, nd Int. Conf. Audio-and Video-Based Biometric Person Authentication, 1999.

N. A. Fox, B. O-'mullane, and R. B. Reilly, The realistic multi-modal VALID database and visual speaker identification comparison experiments, th International Conference on Audio-and Video-Based Biometric Person Authentication, 2005.

A. J. O-'toole, J. Harms, S. L. Snow, D. R. Hurst, M. R. Pappas et al., A video database of moving faces and people, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.5, pp.812-816, 2005.
DOI : 10.1109/TPAMI.2005.90

B. Dumas, C. Pugin, J. Hennebert, D. Petrovska-delacrétaz, A. Humm et al., MyIdea -Multimodal Biometrics Database, Description of Acquisition Protocols, proc. of Third COST 275 Workshop, pp.59-62, 2005.

S. Garcia-salicetti, C. Beumier, G. Chollet, B. Dorizzi, J. L. Les-jardins et al., BIOMET: A Multimodal Person Authentication Database Including Face, Voice, Fingerprint, Hand and Signature Modalities, Audio-and Video-Based Biometric Person Authentication, p.1056, 2003.
DOI : 10.1007/3-540-44887-X_98

T. J. Hazen, K. Saenko, C. La, and J. Glass, A segment-based audio-visual speech recognizer, Proceedings of the 6th international conference on Multimodal interfaces , ICMI '04, 2004.
DOI : 10.1145/1027933.1027972

C. Sanderson and K. K. Paliwal, Noise compensation in a person verification system using face and multiple speech features, Pattern Recognition, vol.36, issue.2, pp.293-302, 2003.
DOI : 10.1016/S0031-3203(02)00031-6

T. Sakai, M. Nagao, and T. Kanade, Computer analysis and classification of photographs of human faces, Proc. First USA?Japan Computer Conference, pp.2-7, 1972.

J. Choi, S. Kim, and P. Rhee, Facial components segmentation for extracting facial feature, Proceedings Second International Conference on Audio-and Videobased Biometric Person Authentication, 1999.

S. A. Sirohey, Human Face Segmentation and Identification, 1993.

J. Huang, S. Gutta, and H. Wechsler, Detection of human faces using decision trees, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, 1996.
DOI : 10.1109/AFGR.1996.557272

R. Herpers, M. Michaelis, K. Lichtenauer, and G. Sommer, Edge and keypoint detection in facial regions, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp.212-217, 1996.
DOI : 10.1109/AFGR.1996.557266

S. Mckenna, S. Gong, and J. J. Collins, Face Tracking and Pose Representation, Procedings of the British Machine Vision Conference 1996, 1996.
DOI : 10.5244/C.10.31

J. Yang and A. Waibel, A real-time face tracker, IEEE Proc. of the 3 rd Workshop on Applications of Computer Vision, 1996.

J. L. Crowley and F. Berard, Multi-model tracking of faces for video communications, IEEE Proc. of Int, Conf. on Computer Vision and Pattern Recognition, 1997.

H. P. Graf, E. Cosatto, D. Gibson, E. Petajan, and M. Kocheisen, Multi-modal system for locating heads and faces, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition, pp.277-282, 1996.
DOI : 10.1109/AFGR.1996.557248

C. H. Lee, J. S. Kim, and K. H. Park, Automatic human face location in a complex background using motion and color information, Pattern Recognition, vol.29, issue.11, pp.1877-1889, 1996.
DOI : 10.1016/0031-3203(96)00036-2

Y. Dai and Y. Nakano, Face-texture model based on sgld and its application Pattern Recog, pp.1007-1017, 1996.

F. Luthon and M. Lievin, Lip motion automatic detection, Scandinavian Conference on Image Analysis, 1997.

M. K. Hu, Visual pattern recognition by moment invariants, IRE Transactions on Information Theory, vol.8, pp.179-187, 1962.

S. Mckenna, Y. Raja, and S. Gong, Tracking Colour Objects Using Adaptive Mixture Models, Image and Vision Computing, vol.174, issue.3, pp.223-229, 1998.

P. J. Van-beek, M. J. Reinders, B. Sankur, and J. C. Van-der-lubbe, Semantic segmentation of videophone image sequences, Proc. of SPIE Int. Conf. on Visual Communications and Image Processing, pp.1182-1193, 1992.

M. Turk and A. Pentland, Eigenfaces for Recognition, Journal of Cognitive Neuroscience, vol.10, issue.9, pp.71-86, 1991.
DOI : 10.1007/BF00239352

S. Mckenna, S. Gong, and H. Liddell, Real-time tracking for an integrated face recognition system, 2nd Workshop on Parallel Modelling of Neural Operators, 1995.

M. U. Ramos-sanchez, J. Matas, and J. Kittler, Statistical chromaticity models for lip tracking with b-splines, Int. Conf. on Audio-and Video-Based Biometric Person Authentication, 1997.

L. C. De-silva, K. Aizawa, and M. Hatori, Detection and tracking of facial features by using a facial feature model and deformable circular template, IEICE Trans. Inform. Systems, pp.1195-1207, 1995.

S. H. Jeng, H. Y. Liao, C. C. Han, M. Y. Chern, and Y. T. Liu, Facial feature detection using geometrical face model: An efficient approach, Pattern Recognition, vol.31, issue.3, 1998.
DOI : 10.1016/S0031-3203(97)00048-4

F. Smeraldi, O. Carmona, and J. Big¨un, Saccadic search with Gabor features applied to eye detection and real-time head tracking, Image and Vision Computing, vol.18, issue.4, pp.323-329
DOI : 10.1016/S0262-8856(99)00080-3

M. C. Burl, T. K. Leung, and P. Perona, Face localization via shape statistics, Int. Workshop on Automatic Face and Gesture Recognition, 1995.

W. Huang, Q. Sun, C. P. Lam, and J. K. Wu, A robust approach to face and eyes detection from images with cluttered background, Proc. of International Conference on Pattern Recognition, 1998.

D. Maio and D. Maltoni, Real-time face location on gray-scale static images Pattern Recog, pp.1525-1539, 2000.

A. R. Mirhosseini, H. Yan, K. Lam, and T. Pham, Human Face Image Recognition: An Evidence Aggregation Approach, Computer Vision and Image Understanding, vol.71, issue.2, 1998.
DOI : 10.1006/cviu.1998.0710

M. Kass, A. Witkin, and D. Terzopoulos, Snakes: Active contour models, Proc. of 1 st Int Conf. on Computer Vision, 1987.
DOI : 10.1007/BF00133570

S. R. Gunn and M. S. Nixon, A dual active contour for head and boundary extraction, IEE Colloquium on Image Processing for Biometric Measurement, 1994.

C. L. Huang and C. W. Chen, Human facial feature extraction for face interpretation and recognition, Pattern Recognition, vol.25, issue.12, pp.1435-1444, 1992.
DOI : 10.1016/0031-3203(92)90118-3

T. Yokoyama, Y. Yagi, and M. Yachida, Facial contour extraction model, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998.
DOI : 10.1109/AFGR.1998.670957

A. L. Yuille, P. W. Hallinan, and D. S. Cohen, Feature extraction from faces using deformable templates, International Journal of Computer Vision, vol.26, issue.6, pp.99-111, 1992.
DOI : 10.1007/BF00127169

G. Chow and X. Li, Towards a system for automatic facial feature detection, Pattern Recognition, vol.26, issue.12, pp.1739-1755, 1993.
DOI : 10.1016/0031-3203(93)90173-T

J. Huang and H. Wechsler, EYE DETECTION USING OPTIMAL WAVELET PACKETS AND RADIAL BASIS FUNCTIONS (RBFs), International Journal of Pattern Recognition and Artificial Intelligence, vol.13, issue.07, 1999.
DOI : 10.1142/S0218001499000562

A. Shackleton and W. J. Welsh, Classification of facial features for recognition, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.573-579, 1991.
DOI : 10.1109/CVPR.1991.139756

A. Lanitis, C. J. Taylor, and T. F. Cootes, Automatic tracking, coding and reconstruction of human faces, using flexible appearance models, Electronics Letters, vol.30, issue.19, pp.578-1579, 1994.
DOI : 10.1049/el:19941110

A. Lanitis, A. Hill, T. Cootes, and C. Taylor, Locating facial features using genetics algorithms, Proc. of Int. Conf. on Digital Signal Processing, pp.520-525, 1995.

L. Sirovich and M. Kirby, Low-dimensional procedure for the characterization of human faces, Journal of the Optical Society of America A, vol.4, issue.3, pp.519-524, 1987.
DOI : 10.1364/JOSAA.4.000519

L. Meng and T. Nguyen, Two subspace methods to discriminate faces and clutters, Proceedings of the 2000 International Conference on Image Processing, 2000.

M. Yang, N. Ahuja, and D. Kriegman, Face detection using mixtures of linear subspaces, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition, 2000.

Q. Song and J. Robinson, A feature space for face image processing, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, 2000.
DOI : 10.1109/ICPR.2000.906025

M. Tanaka, K. Hotta, T. Kurita, and T. Mishima, Dynamic attention map by Ising model for human face detection, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), 1998.
DOI : 10.1109/ICPR.1998.711870

H. Schneiderman and T. Kanade, Probabilistic modeling of local appearance and spatial relationships for object recognition, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.45-51, 1998.
DOI : 10.1109/CVPR.1998.698586

H. Schneiderman and T. Kanade, A statistical method for 3D object detection applied to faces and cars, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662), pp.746-751, 2000.
DOI : 10.1109/CVPR.2000.855895

T. Rikert, M. Jones, and P. Viola, A cluster-based statistical model for object detection, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1046-1053, 1999.
DOI : 10.1109/ICCV.1999.790386

R. J. Qian and T. S. Huang, Object detection using hierarchical MRF and MAP estimation, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.186-192, 1997.
DOI : 10.1109/CVPR.1997.609318

M. Propp and A. Samal, Artificial neural network architecture for human face detection, Intell. Eng. Systems Artificial Neural Networks, vol.2, pp.535-540, 1992.

H. A. Rowley, S. Baluja, and T. Kanade, Neural network-based face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.1, pp.23-38, 1998.
DOI : 10.1109/34.655647

H. A. Rowley, S. Baluja, and T. Kanade, Rotation invariant neural network-based face detection, Proc. IEEE Intl. Conf. on Computer Vision and Pattern Recognition, pp.38-44, 1998.

R. Vaillant, C. Monrocq, and Y. Le-cun, Original approach for the localisation of objects in images, IEE Proc. Vision, Image and Signal Processing, pp.245-250, 1994.
DOI : 10.1049/ip-vis:19941301

R. Feraud, O. Bernier, and D. Collobert, A constrained generative model applied to face detection, Neural Process. Lett, vol.5, pp.73-81, 1997.

S. Lin, S. Kung, and L. Lin, Face recognition/detection by probabilistic decision-based neural network, IEEE Trans. Neural Networks, vol.8, pp.114-132, 1997.

D. Roth, The SNoW Learning Architecture, 1999.

E. Osuna, R. Freund, and F. Girosi, Training support vector machines: an application to face detection, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.130-136, 1997.
DOI : 10.1109/CVPR.1997.609310

C. Papageorgiou and T. Poggio, A Trainable System for Object Recognition, International Journal of Computer Vision, vol.38, issue.1, pp.15-33, 2000.
DOI : 10.1023/A:1008162616689

F. Samaria and S. Young, HMM-based architecture for face identification, Image and Vision Computing, vol.12, issue.8, pp.537-583, 1994.
DOI : 10.1016/0262-8856(94)90007-8

F. S. Samaria, Face Recognition Using Hidden Markov Models, 1994.

A. V. Nefian and M. H. Iii, Face detection and recognition using hidden Markov models, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.141-145, 1998.
DOI : 10.1109/ICIP.1998.723445

A. Hulbert and T. Poggio, Synthesizing a color algorithm from examples, Science, vol.239, issue.4839, pp.482-485, 1998.
DOI : 10.1126/science.3340834

U. Canzler and T. Dziurzyk, Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of IAPR Workshop, pp.318-321, 2002.

S. Leung, S. Wang, and W. Lau, Lip Image Segmentation Using Fuzzy Clustering Incorporating an Elliptic Shape Function, IEEE Transactions on Image Processing, vol.13, issue.1, pp.51-62, 2004.
DOI : 10.1109/TIP.2003.818116

S. Lucey, S. Sridharan, and V. Chandran, Adaptive mouth segmentation using chromatic features, Pattern Recognition Letters, vol.23, issue.11, pp.1293-1302, 2002.
DOI : 10.1016/S0167-8655(02)00078-8

X. Zhang and R. M. Mersereau, Lip feature extraction towards an automatic speechreading system, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101), pp.226-229, 2000.
DOI : 10.1109/ICIP.2000.899336

S. Lucey, S. Sridharan, and V. Chandran, Initialised eigenlip estimator for fast lip tracking using linear regression, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000, pp.178-181, 2000.
DOI : 10.1109/ICPR.2000.903514

A. Nefian, L. Liang, X. Pi, L. Xiaoxiang, C. Mao et al., A couple HMM for audio-visual speech recognition, Proc. ICASSP, pp.2013-2016, 2002.

C. Bregler and S. M. Omohundro, Nonlinear manifold learning for visual speech recognition, Proceedings of IEEE International Conference on Computer Vision, pp.494-499, 1995.
DOI : 10.1109/ICCV.1995.466899

Y. Guan, Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008.
DOI : 10.1049/iet-cvi:20070061

M. Sadeghi, J. Kittler, and K. Messer, Modelling and segmentation of lip area in face images, IEE Proceedings Vision, Image and Signal Processing, pp.179-184, 2002.
DOI : 10.1049/ip-vis:20020378

M. Lievin and F. Luthon, Unsupervised lip segmentation under natural conditions, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3065-3068, 1999.
DOI : 10.1109/ICASSP.1999.757488

URL : https://hal.archives-ouvertes.fr/hal-00961012

Y. Tian, T. Kanade, and J. Cohn, Robust lip tracking by combining shape, color and motion, Proc. ACCV, pp.1040-1045, 2000.

R. Kaucic, B. Dalton, and A. Blake, Real-time lip tracking for audio-visual speech recognition applications, Proceedings of the 4th European Conference on Computer Vision, 1996.
DOI : 10.1007/3-540-61123-1_154

T. Coianiz, L. Torresani, and B. Caprile, 2D deformable models for visual speech analysis, " NATO Advanced Study Institute: Speech reading by Man and Machine, pp.391-398, 1995.

M. E. Hennecke, K. V. Prasad, and D. G. Stork, Using deformable templates to infer visual speech dynamics, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, pp.578-582, 1994.
DOI : 10.1109/ACSSC.1994.471518

P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, Audiovisual speech recognition using MPEG-4 compliant visual features, EURASIP J. Appl. Signal Processing, pp.1213-1227, 2002.

N. Eveno, A. Caplier, and P. Coulon, Accurate and Quasi-Automatic Lip Tracking, IEEE Transactions on Circuits and Systems for Video Technology, pp.706-715, 2004.
DOI : 10.1109/TCSVT.2004.826754

T. F. Cootes, Statistical Models of Appearance for Computer Vision, 2004.

L. Zhaorong and A. Haizhou, Texture-Constrained Shape Prediction for Mouth Contour Extraction and its State Estimation, 18th International Conference on Pattern Recognition (ICPR'06), pp.88-91, 2006.
DOI : 10.1109/ICPR.2006.1114

C. L. Huang and Y. M. Huang, Facial Expression Recognition Using Model-Based Feature Extraction and Action Parameters Classification, Journal of Visual Communication and Image Representation, pp.278-290, 1997.
DOI : 10.1006/jvci.1997.0359

S. Werda, W. Mahdi, and A. Ben-hamadou, Colour and Geometric based Model for Lip Localisation: Application for Lip-reading System, 14th International Conference on Image Analysis and Processing (ICIAP 2007), pp.9-14, 2007.
DOI : 10.1109/ICIAP.2007.4362750

L. L. Mok, W. H. Lau, S. H. Leung, S. L. Wang, and H. Yan, Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004.
DOI : 10.1109/ICIP.2004.1418816

M. Chan, Automatic lip model extraction for constrained contour-based tracking, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348), pp.848-851, 1999.
DOI : 10.1109/ICIP.1999.823017

C. Bouvier, P. Coulon, and X. Maldague, Unsupervised Lips Segmentation Based on ROI Optimisation and Parametric Model, 2007 IEEE International Conference on Image Processing, pp.301-304, 2007.
DOI : 10.1109/ICIP.2007.4380014

URL : https://hal.archives-ouvertes.fr/hal-00372142

K. Michael, W. Andrew, and T. Demetri, SNAKES, Proc. International Journal of Computer Vision, pp.259-268, 1987.
DOI : 10.1016/B978-141600119-5.50010-X

N. Thejaswi and S. Sengupta, Lip Localization and Viseme Recognition from Video Sequences, Fourteenth National Conference on Communications, 2008.

F. Bourel, C. C. Chibelushi, and A. A. Low, Robust Facial Feature Tracking, Procedings of the British Machine Vision Conference 2000, pp.232-241, 2000.
DOI : 10.5244/C.14.24

M. A. Hall and L. A. Smith, Practical feature subset selection for machine learning, Proceedings of the 21 st Australian Computer Science Conference, pp.181-191, 1998.

R. Kohavi and G. John, Wrappers for feature subset selection, Artificial Intelligence, vol.97, issue.1-2, pp.273-324, 1997.
DOI : 10.1016/S0004-3702(97)00043-X

G. Potamianos, H. P. Graf, and E. Cosatto, An image transform approach for HMM based automatic lipreading, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.173-177, 1998.
DOI : 10.1109/ICIP.1998.999008

W. C. Yau, D. K. Kumar, and H. Weghorn, Visual Speech Recognition Using Motion Features and Hidden Markov Models, Proc. of International Conference on Computer Analysis of Images and Patterns, pp.832-839, 2007.
DOI : 10.1007/978-3-540-74272-2_103

C. Ding and H. C. Peng, Minimum redundancy feature selection from microarray gene expression data, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003, pp.523-528, 2003.
DOI : 10.1109/CSB.2003.1227396

P. Lu, X. Huang, X. Zhu, and Y. Wang, Head Gesture Recognition Based on Bayesian Network, Proceedings of Iberian Conference on Pattern Recognition and Image Analysis, p.492, 2005.
DOI : 10.1007/11492429_60

P. C. Ng, L. C. De, and . Silva, Head gestures recognition, Proceedings of International Conference on Image Processing, pp.266-269, 2001.

A. Benoit and A. Caplier, Head nods analysis: interpretation of non verbal communication gestures, IEEE International Conference on Image Processing 2005, pp.425-433, 2005.
DOI : 10.1109/ICIP.2005.1530419

K. Toyama, Look, Ma--No Hands! Hands free cursor control with real-time 3D face tracking, Proceedings of Workshop on Perceptual User Interface, 1998.

S. Kawato and J. Ohya, Real-time detection of nodding and head-shaking by directly detecting and tracking the "between-eyes", Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), pp.40-45, 2000.
DOI : 10.1109/AFGR.2000.840610

A. Kapoor and R. Picard, A real-time head nod and shake detector, Proceedings of the 2001 workshop on Percetive user interfaces , PUI '01, 2001.
DOI : 10.1145/971478.971509

V. Chauhan and T. Morris, Face and feature tracking for cursor control, Proceedings of 12th Scandinavian Conference on Image Analysis, 2001.

P. Hong and T. Huang, Natural Mouse-a novel human computer interface, Proceedings of International Conference on Image Processing, pp.653-656, 1999.

P. Viola and M. Jones, Rapid object detection using a boosted cascade of simple features, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, pp.511-518, 2001.
DOI : 10.1109/CVPR.2001.990517

C. Harris and M. Stephens, A Combined Corner and Edge Detector, Procedings of the Alvey Vision Conference 1988, pp.147-151, 1988.
DOI : 10.5244/C.2.23

B. Lucas and T. Kanade, An iterative image registration technique with an application to stereo vision, Proceedings of DARPA Image Understanding Workshop, pp.121-130, 1981.

C. W. Hsu, C. C. Chang, and C. J. Lin, A practical guide to support vector classification, 2003.

F. Matta and J. Dugelay, Tomofaces: Eigenfaces extended to videos of speakers, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1793-1796, 2008.
DOI : 10.1109/ICASSP.2008.4517979

V. Blanz and T. Vetter, Face recognition based on fitting a 3D morphable model, PAMI, pp.1063-1074, 2003.
DOI : 10.1109/TPAMI.2003.1227983

K. Lee and D. Kriegman, Online learning of probabilistic appearance manifolds for video-based recognition and tracking, Proc of CVPR, pp.852-859, 2005.

A. S. Georghiades, D. J. Kriegman, and P. Belhumeur, Illumination cones for recognition under variable lighting: faces, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231), pp.52-59, 1998.
DOI : 10.1109/CVPR.1998.698587

P. Tsai, T. Jan, and T. Hintz, Kernel-based Subspace Analysis for Face Recognition, 2007 International Joint Conference on Neural Networks, pp.1127-1132, 2007.
DOI : 10.1109/IJCNN.2007.4371116

M. Ramachandran, S. K. Zhou, D. Jhalani, and R. Chellappa, A Method for Converting a Smiling Face to a Neutral Face with Applications to Face Recognition, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.977-980, 2005.
DOI : 10.1109/ICASSP.2005.1415570

K. Ugiyama, T. Aoki, and S. Hangai, Motion compensated frame rate conversion using normalized motion estimation, Proc. IEEE Workshop on Signal Processing Systems Design and Implementation, pp.663-668, 2005.

G. Wolberg, Recent advances in image morphing, Proceedings of CG International '96, 1996.
DOI : 10.1109/CGI.1996.511788

A. Akutsu and Y. Tonomura, Video tomography, Proceedings of the second ACM international conference on Multimedia , MULTIMEDIA '94, pp.349-356, 1994.
DOI : 10.1145/192593.192697

J. Canny, A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intell, vol.8, pp.679-698, 1986.

B. A. Golomb, D. T. Lawrence, and T. J. Sejnowski, Sex-Net: a neural network identifies sex from human faces, Proceedings of Advances in neural information processing systems, pp.572-577, 1990.

Z. Sun, G. Bebis, X. Yuan, and S. J. Louis, Genetic feature subset selection for gender classification: a comparison study, IEEE Proceedings on Applications of Computer Vision, pp.165-170, 2002.

S. Gutta, J. R. Huang, P. Jonathon, and H. Wechsler, Mixture of experts for classification of gender, ethnic origin, and pose of human faces, IEEE Transactions on Neural Networks, pp.948-960, 2000.
DOI : 10.1109/72.857774

M. Nakano, F. Yasukata, and M. Fukumi, Age and gender classification from face images using neural networks, Signal and Image Processing, 2004.

X. Lu, H. Chen, and A. K. Jain, Multimodal Facial Gender and Ethnicity Identification, Advances in Biometrics, pp.554-561, 2005.
DOI : 10.1007/11608288_74

B. Moghaddam and M. Yang, Gender classification with support vector machines, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), pp.306-311, 2000.
DOI : 10.1109/AFGR.2000.840651

Y. Saatci and C. Town, Cascaded Classification of Gender and Facial Expression using Active Appearance Models, 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pp.393-398, 2006.
DOI : 10.1109/FGR.2006.29

H. Kim, D. Kim, Z. Ghahramani, and S. Y. Bang, Appearance-based gender classification with Gaussian processes, Pattern Recognition Letters, vol.27, issue.6, pp.618-626, 2006.
DOI : 10.1016/j.patrec.2005.09.027

Y. Zhiguang, L. Ming, and A. Haizhou, An Experimental Study on Automatic Face Gender Classification, 18th International Conference on Pattern Recognition (ICPR'06), pp.1099-1102, 2006.
DOI : 10.1109/ICPR.2006.247

S. Baluja and H. A. Rowley, Boosting Sex Identification Performance, International Journal of Computer Vision, vol.20, issue.1, pp.111-119, 2007.
DOI : 10.1007/s11263-006-8910-9

S. Caifeng, G. Shaogang, and P. W. Mcowan, Learning gender from human gaits and faces, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, pp.505-510, 2007.
DOI : 10.1109/AVSS.2007.4425362

C. Bregler, H. Hild, S. Manke, and A. Waibel, Improved connected letter recognition by lipreading, Proc. IEEE ICASSP, pp.557-560, 1993.

P. Duchnowski, U. Meier, and A. Waibel, See me, hear me: Integrating automatic speech recognition and lip-reading, Proc. ICSLP, pp.547-550, 1994.

B. Maison, C. Neti, and A. Senior, Audio-visual speaker recognition for video broadcast news: some fusion techniques, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451), pp.161-167, 1999.
DOI : 10.1109/MMSP.1999.793814

T. Wark, S. Sridharan, and V. Chandran, Robust speaker verification via fusion of speech and lip modalities, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.3061-3064, 1999.
DOI : 10.1109/ICASSP.1999.757487

P. S. Aleksic, G. Potamianos, and A. K. Katsaggelos, Exploiting Visual Information in Automatic Speech Processing, Handbook of Image and Video Processing, pp.1263-1289, 2005.
DOI : 10.1016/B978-012119792-6/50134-0

E. D. Petajan, N. M. Brooke, B. J. Bischoff, and D. A. Boddoff, Experiments in automatic visual speech recognition, Proc. 7th FASE Symp, pp.1163-1170, 1988.

S. Nishida, Speech recognition enhancement by lip information, Proc. CHI, pp.198-204, 1986.

I. Matthews, J. A. Bangham, and S. Cox, Audiovisual speech recognition using multiscale nonlinear image decomposition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.38-41, 1996.
DOI : 10.1109/ICSLP.1996.607019

I. Matthews, G. Potamianos, C. Neti, and J. Luettin, A comparison of model and transform-based visual features for audio-visual LVCSR, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001., pp.22-25, 2001.
DOI : 10.1109/ICME.2001.1237849

X. Zhang, C. C. Broun, R. M. Mersereau, and M. Clements, Automatic Speechreading with Applications to Human-Computer Interfaces, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11, pp.1228-1247, 2002.
DOI : 10.1155/S1110865702206137

C. Benoît, On the Production and the Perception of Audio-Visual Speech by Man and Machine, Proc. Symp. Multimedia Communications and Video Coding, pp.277-284, 1995.
DOI : 10.1007/978-1-4613-0403-6_34

G. J. Wolff, K. V. Prasad, D. G. Stork, and M. Hennecke, Lipreading by neural networks: Visual preprocessing, learning, and sensory integration, Advances in Neural Information Processing Systems, pp.1027-1034, 1994.

T. F. Cootes, G. J. Edwards, and C. J. Taylor, Active appearance models, Proc. Eur. Conf. Computer Vision, pp.484-498, 1998.

S. W. Foo, Y. Lian, and L. Dong, Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models, IEEE Transactions on Circuits and Systems for Video Technology, vol.14, issue.5, pp.693-705, 2004.
DOI : 10.1109/TCSVT.2004.826773

J. F. Perez, A. F. Frangi, E. L. Solano, and K. Lukas, Lip reading for robust speech recognition on embedded devices, Proc. Int. Conf. Acoustics, Speech and Signal Processing, pp.473-476, 2005.

I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey, Extraction of visual features for lipreading, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.2, pp.198-213, 2002.
DOI : 10.1109/34.982900

P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, Audiovisual speech recognition using MPEG-4 compliant visual features, EURASIP J. Appl. Signal Process, pp.1213-1227, 2002.

A. G. De-la-cuesta, Z. Jianguo, and P. Miller, Biometric Identification Using Motion History Images of a Speaker's Lip Movements, International Machine Vision and Image Processing Conference, pp.83-88, 2008.

J. S. Mason, J. Brand, R. Auckenthaler, F. Deravi, and C. Chibelushi, Lip signatures for automatic person recognition, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451), pp.457-462, 1999.
DOI : 10.1109/MMSP.1999.793890

P. Paalanen, J. Kamarainen, J. Ilonen, and H. Kalviainen, Feature representation and discrimination based on Gaussian mixture model probability densities???Practices and algorithms, Pattern Recognition, vol.39, issue.7, 2005.
DOI : 10.1016/j.patcog.2006.01.005

H. E. Cetingul, Y. Yemez, E. Erzin, and A. M. Tekalp, Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006.
DOI : 10.1109/TIP.2006.877528

U. Canzler and T. Dziurzyk, Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of the IAPR Workshop on Machine Vision Application, pp.318-321, 2002.

G. Potamianos, C. Neti, J. Luettin, and I. Matthews, Audio-visual automatic speech recognition: An overview, Issues in Visual and Audio-Visual Speech Processing, 2004.

J. C. Bezdec, Pattern recognition with fuzzy objective function algorithms, 1981.
DOI : 10.1007/978-1-4757-0450-1

A. E. Rosenberg, J. Delong, C. Lee, B. Juang, and F. K. Soong, The use of cohort normalized scores for speaker verification, Proceedings of Spoken Language Processing, pp.599-602, 1992.

A. W. Liew, L. Shu-hung, and L. W. Hong, Segmentation of color lip images by spatial fuzzy clustering, IEEE Transactions on Fuzzy Systems, vol.11, issue.4, pp.542-549, 2003.
DOI : 10.1109/TFUZZ.2003.814843

Y. Guan, Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008.
DOI : 10.1049/iet-cvi:20070061

T. F. Chan and L. A. Vese, Active contours without edges, IEEE Transactions on Image Processing, vol.10, issue.2, pp.266-277, 2001.
DOI : 10.1109/83.902291

C. Garcia and M. Delakis, Convolutional face finder: a neural architecture for fast and robust face detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.11, pp.1408-1423, 2004.
DOI : 10.1109/TPAMI.2004.97

. Malheureusement, existe aucune base de données qui répond à nos exigences, plus particulièrement, la première exigence est la plus difficile à réaliser. Donc nous avons décidé d'utiliser deux bases de données pour nos expériences

. La-base-de and . Valid, se décompose en cinq sessions d'enregistrement de 106 personnes (77 hommes, 29 femmes) sur une période d'un mois. Le contenu de la base de données se compose de trois phrases par session (en anglais), 1: " <Nom complet du person> Joe took father's green shoe bench out

. La-base-de-données, Italian TV Database " compilées par [1], a été enregistrée par la chaîne italienne RAI 1, sur une période de 21 mois

F. Matta, Video person recognition strategies using head motion and facial appearance, 2008.

L. L. Mok, W. H. Lau, S. H. Leung, S. L. Wang, and H. Yan, Person authentication using ASM based lip shape and intensity information, 2004 International Conference on Image Processing, 2004. ICIP '04., pp.561-564, 2004.
DOI : 10.1109/ICIP.2004.1418816

T. Wark, S. Sridharan, and V. Chandran, An approach to statistical lip modelling for speaker identification via chromatic feature extraction, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), pp.123-125, 1998.
DOI : 10.1109/ICPR.1998.711095

A. G. De-la-cuesta, Z. Jianguo, and P. Miller, Biometric Identification Using Motion History Images of a Speaker's Lip Movements, Machine Vision and Image Processing Conference, pp.83-88, 2008.

J. Luettin, N. A. Thacker, and S. W. Beet, Speaker identification by lipreading, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, pp.62-65, 1996.
DOI : 10.1109/ICSLP.1996.607030

S. Lucey, An Evaluation of Visual Speech Features for the Tasks of Speech and Speaker Recognition, International Conference of Audio-and Video-Based Person Authentication, pp.260-267, 2003.
DOI : 10.1007/3-540-44887-X_31

M. I. Faraj and J. Bigun, Motion Features from Lip Movement for Person Authentication, 18th International Conference on Pattern Recognition (ICPR'06), pp.1059-1062, 2006.
DOI : 10.1109/ICPR.2006.814

H. E. Cetingul, Y. Yemez, E. Engin, and A. M. Tekalp, Discriminative Analysis of Lip Motion Features for Speaker Identification and Speech-Reading, IEEE Transactions on Image Processing, vol.15, issue.10, pp.2879-2891, 2006.
DOI : 10.1109/TIP.2006.877528

N. A. Fox, B. O-'mullane, and R. B. Reilly, The realistic multi-modal VALID database and visual speaker identification comparison experiments, th International Conference on Audio-and Video-Based Biometric Person Authentication, 2005.

K. Michael, W. Andrew, and T. Demetri, SNAKES, Proc. International Journal of Computer Vision, pp.259-268, 1987.
DOI : 10.1016/B978-141600119-5.50010-X

N. Thejaswi and S. Sengupta, Lip Localization and Viseme Recognition from Video Sequences, Fourteenth National Conference on Communications, 2008.

G. Potamianos, H. P. Graf, and E. Cosatto, An image transform approach for HMM based automatic lipreading, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269), pp.173-177, 1998.
DOI : 10.1109/ICIP.1998.999008

W. C. Yau, D. K. Kumar, and H. Weghorn, Visual Speech Recognition Using Motion Features and Hidden Markov Models, Proc. of International Conference on Computer Analysis of Images and Patterns, pp.832-839, 2007.
DOI : 10.1007/978-3-540-74272-2_103

C. Ding and H. C. Peng, Minimum redundancy feature selection from microarray gene expression data, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003, pp.523-528, 2003.
DOI : 10.1109/CSB.2003.1227396

F. Matta and J. Dugelay, Tomofaces: Eigenfaces extended to videos of speakers, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1793-1796, 2008.
DOI : 10.1109/ICASSP.2008.4517979

G. Wolberg, Recent advances in image morphing, Proceedings of CG International '96, 1996.
DOI : 10.1109/CGI.1996.511788

J. Canny, A computational approach to edge detection, IEEE Trans. Pattern Anal. Mach. Intell, vol.8, pp.679-698, 1986.

U. Canzler and T. Dziurzyk, Extraction of Non Manual Features for Videobased Sign Language Recognition, Proceedings of the IAPR Workshop on Machine Vision Application, pp.318-321, 2002.

Y. Guan, Automatic extraction of lips based on multi-scale wavelet edge detection, IET Computer Vision, vol.2, issue.1, pp.23-33, 2008.
DOI : 10.1049/iet-cvi:20070061

T. F. Chan and L. A. Vese, Active contours without edges, IEEE Transactions on Image Processing, vol.10, issue.2, pp.266-277, 2001.
DOI : 10.1109/83.902291