Abstract
A new algorithm is proposed for face recognition by a Bayesian framework. Posterior distributions are computed by Markov chain Monte Carlo (MCMC). Face features used in the paper are those used in our previous work [1][2] based on the Elastic Graph Matching method. While our previous method attempts to optimize facial feature point positions so as to maximize a similarity function between each model and face region in the input sequence, the proposed approach evaluates posterior distributions of models conditioned on the input sequence. Experimental results show a rather dramatic improvement in robustness. The proposed algorithm eliminates almost all identification errors on sequences showing individuals talking, and reduces identification errors by more than 90% on sequences showing individuals smiling although such data was not used in training.
To appear in Proceedings of ICPR2004, Cambridge, 23-26 August 2004
http://www.ee.surrey.ac.uk/icpr2004/
![]() |
(1) |
![]() |
(2) |
![]() |
(3) |
![]() |
(4) |
![]() |
(5) |
![]() |
(6) |
is a normalizing factor: ![]() |
(7) |
![]() |
(8) |
![]() |
(9) |
![]() |
(10) (11) |
![]() |
(12) |
![]() |
(13) |
| (14) |
![]() |
(15) |
![]() |
(16) |
![]() |
(17) |
![]() |
(18) |
![]() |
![]() |
(19) |
| INPUT | ORIGINAL | Bayesian MCMC |
| Neutral |
4.0%(32 / 796) | 0.0%(0 / 796) |
| Talking | 8.2%(121 / 1483) | 0.1%(1 / 1483) |
| Smiling | 37.4%(461 / 1233) | 2.6%(32 / 1233) |
| TOTAL | 17.5%(614 / 3512) | 0.9%(33 / 3512) |
| proc.time* | 3 min. | 50 min. |
| To evaluate the sensitivity of the approach to errors at the preceding face detection stage, we performed a numerical simulation of a noisy face detector. Tables 2, 3 and 4 show the recognition results when face regions were dilated |
| -4 | -2 | -1 | 0 | +1 | +2 | +4 | |
| Neutral | 49.6 | 18.8 | 6.5 | 0.1 | 0.0 | 10.2 | 30.8 |
| Talking | 31.8 | 1.3 | 0.4 | 0.7 | 0.3 | 5.8 | 29.0 |
| Smiling | 49.4 | 17.6 | 9.2 | 4.4 | 7.1 | 13.4 | 27.9 |
| TOTAL | 42.1 | 11.0 | 4.4 | 1.9 | 2.6 | 9.5 | 29.0 |
| proc. time * | 5 min. | ||||||
| -4 | -2 | -1 | 0 | +1 | +2 | +4 | |
| Neutral | 71.6 | 9.8 | 0.1 | 0.1 | 17.2 | 35.2 | 45.2 |
| Talking | 66.3 | 17.3 | 0.0 | 0.7 | 3.3 | 11.9 | 33.5 |
| Smiling | 71.7 | 47.6 | 11.6 | 4.4 | 16.4 | 16.4 | 56.6 |
| TOTAL | 69.4 | 26.5 | 4.2 | 1.9 | 7.7 | 18.7 | 44.4 |
| proc. time* | 5 min. | ||||||
| -4 | -2 | -1 | 0 | +1 | +2 | +4 | |
| Neutral | 86.3 | 18.0 | 5.3 | 0.1 | 17.2 | 65.8 | 89.2 |
| Talking | 83.0 | 13.3 | 8.2 | 0.7 | 1.3 | 31.3 | 91.1 |
| Smiling | 86.2 | 27.5 | 14.4 | 4.4 | 8.7 | 34.7 | 93.5 |
| TOTAL | 84.9 | 19.4 | 9.8 | 1.9 | 7.5 | 40.2 | 84.8 |
| proc. time* | 5 min. | ||||||
| (20) |
![]() |
Mr. Atsushi Matsui Atsushi Matsui received the B.E. and M.E. degrees in Electrical Engineering from Waseda University, Tokyo, Japan in 1994 and 1996 respectively. He joined Japan Broadcasting Corporation (NHK) in 1996. Since 1998, he has been with NHK Science and Technical Research Laboratories, engaged in research on speech recognition and face recognition. He is a member of the Institute of Electronics, Information and Communication Engineers (IEICE) and the Institute of Image Information and Television Engineers of Japan (ITE). |
![]() |
Mr. Simon Clippingdale Simon Clippingdale received the B.Sc. Honours degree in Electronic and Electrical Engineering in 1982 from the University of Birmingham, U.K. and the Ph.D. degree in Computer Science in 1988 from the University of Warwick, U.K. He was a Japanese Government Science & Technology Agency (STA) Research Fellow at NHK Science and Technical Research Laboratories in 1990-91, and after lecturing at the University of Warwick, joined NHK in 1996. Since then he has been with NHK Science and Technical Research Laboratories, pursuing research on image recognition and vision. He is currently a Senior Research Engineer, and is a member of the Institute of Electronics, Information and Communication Engineers (IEICE) and the Institute of Image Information and Television Engineers of Japan (ITE). He currently serves on the IEICE Technical Committee on Pattern Recognition and Media Understanding. |
![]() |
Mr. umiki Uzawa Fumiki Uzawa received the B.E. degree in Electrical, Electronics and Computer Engineering from Waseda University, Tokyo, Japan, in 2004. His research interests include face recognition and image processing. He joined Japan Broadcasting Corporation (NHK) in 2004. Since then he has been with NHK Okayama Broadcasting Station. He is a member of the Institute of Electronics, Information and Communication Engineers (IEICE) . |
![]() |
Mr. Takashi Matusmoto Takashi Matsumoto received a B.S. degree in Electrical Engineering from Waseda University, Tokyo, Japan, a M. Sc. degree in Applied Mathematics from Harvard University, Cambridge, MA, and a Ph.D. degree in Electrical Engineering from Waseda University. Since 1980, he has been a Professor at Waseda University, Tokyo, Japan. Currently, Dr. Matsumoto is with the Signal Processing Group, Cambridge University, U.K., on leave from Waseda University. His research interests include Monte Carlo based hierarchical Bayesian algorithms, Particle Filters, MCMC, on-line signature verification, on-line face recognition, on-line handwriting recognition, non-linear time series prediction, and bioinformatics. He is on the editorial board of Circuits, Systems, and Signal Processing. He is a coauthor of the book Bifurcations (Springer-Verlag, 1993). Dr. Matsumoto is a past Editorial Board Member, as well as a guest coeditor of the Proceedings of the IEEE. He is a member of the IEICE Biometric Person Authentication Technical Committee as well as a member of the ITE Technical Committee on Next-Generation Image Input Devices. He serves as a member of the Steering Committee of SVC 2004, Signature Verification Competition. He is a fellow of the IEEE. |
