National Technical Symposium on Advancements in Computing Technologies |
Foundation of Computer Science USA |
NTSACT - Number 5 |
August 2011 |
Authors: A.V. Brahmane |
0592acf9-cb3a-4787-8f35-65eca1eb8c3f |
A.V. Brahmane . Human Face Recognition from Video Using Probabilistic Appearance Manifolds. National Technical Symposium on Advancements in Computing Technologies. NTSACT, 5 (August 2011), 17-25.
In surveillance, information security, and access control applications, face recognition and identification from a video sequence is an important problem. The problems facing video-based face recognition are: a) poor quality of video (especially outdoors) and the large changes in illumination and pose, (b) detecting and localizing a face, which is often very small, from the background clutter, which may vary from frame to frame. This paper presents a new technique to model and recognize human faces in video sequences. Each registered person is represented by a lowdimensional appearance manifold in the ambient image space. The complex nonlinear appearance manifold expressed as a collection of subsets (named pose manifolds), and the connectivity among them. Each pose manifold is approximated by an affine plane. To construct this representation, exemplars are sampled from videos, and these exemplars are clustered with a K-means algorithm; each cluster is represented as a plane computed through principal component analysis (PCA). The connectivity between the pose manifolds encodes the transition probability between images in each of the pose manifold and is learned from a training video sequences. A maximum a posteriori formulation is presented for face recognition in test video sequences by integrating the likelihood that the input image comes from a particular pose manifold and the transition probability to this pose manifold from the previous frame. To recognize faces with partial occlusion, we introduce a weight mask into the process. Extensive experiments demonstrate that the proposed algorithm outperforms existing frame-based face recognition methods with temporal voting schemes.