Preface |
|
xv | |
Part I Background |
|
1 | (56) |
|
|
3 | (12) |
|
|
3 | (1) |
|
|
4 | (2) |
|
|
6 | (5) |
|
|
11 | (1) |
|
|
12 | (3) |
|
Perception and Representation |
|
|
15 | (16) |
|
|
16 | (2) |
|
Representation by 3D Reconstruction |
|
|
18 | (1) |
|
Two-dimensional View-based Representation |
|
|
19 | (2) |
|
Image Template-based Representation |
|
|
21 | (1) |
|
The Correspondence Problem and Alignment |
|
|
22 | (4) |
|
|
26 | (3) |
|
|
29 | (2) |
|
Learning under Uncertainty |
|
|
31 | (26) |
|
|
32 | (1) |
|
Learning as Function Approximation |
|
|
33 | (3) |
|
Bayesian Inference and MAP Classification |
|
|
36 | (1) |
|
Learning as Density Estimation |
|
|
37 | (4) |
|
|
37 | (2) |
|
|
39 | (1) |
|
|
40 | (1) |
|
Unsupervised Learning without Density Estimation |
|
|
41 | (3) |
|
|
43 | (1) |
|
|
43 | (1) |
|
Linear Classification and Regression |
|
|
44 | (4) |
|
|
45 | (1) |
|
Linear Support Vector Machines |
|
|
45 | (3) |
|
Non-linear Classification and Regression |
|
|
48 | (3) |
|
|
48 | (2) |
|
|
50 | (1) |
|
|
51 | (2) |
|
|
53 | (1) |
|
|
54 | (3) |
Part II From Sensory to Meaningful Perception |
|
57 | (104) |
|
Selective Attention: Where to Look |
|
|
59 | (22) |
|
Pre-attentive Visual Cues from Motion |
|
|
60 | (5) |
|
Measuring Temporal Change |
|
|
61 | (1) |
|
|
62 | (3) |
|
Learning Object-based Colour Cues |
|
|
65 | (6) |
|
|
66 | (2) |
|
|
68 | (3) |
|
Perceptual Grouping for Selective Attention |
|
|
71 | (2) |
|
Data Fusion for Perceptual Grouping |
|
|
73 | (3) |
|
Temporal Matching and Tracking |
|
|
76 | (1) |
|
|
77 | (1) |
|
|
78 | (3) |
|
A Face Model: What to Look For |
|
|
81 | (22) |
|
Person-independent Face Models for Detection |
|
|
82 | (4) |
|
|
82 | (1) |
|
|
83 | (2) |
|
|
85 | (1) |
|
|
86 | (3) |
|
Principal Components Analysis for a Face Model |
|
|
87 | (1) |
|
Density Estimation in Local PCA Spaces |
|
|
88 | (1) |
|
Modelling a Near-face Class |
|
|
89 | (1) |
|
Learning a Decision Boundary |
|
|
90 | (9) |
|
Face Detection in Dynamic Scenes |
|
|
91 | (2) |
|
|
93 | (2) |
|
Face Detection using Multi-layer Perceptrons |
|
|
95 | (2) |
|
Face Detection using Support Vector Machines |
|
|
97 | (2) |
|
|
99 | (2) |
|
|
101 | (1) |
|
|
102 | (1) |
|
|
103 | (22) |
|
Feature and Template-based Correspondence |
|
|
105 | (1) |
|
The Face Space across Views: Pose Manifolds |
|
|
105 | (6) |
|
The Effect of Gabor Wavelet Filters on Pose Manifolds |
|
|
111 | (2) |
|
Template Matching as Affine Transformation |
|
|
113 | (5) |
|
Similarities to Prototypes across Views |
|
|
118 | (3) |
|
Learning View-based Support Vector Machines |
|
|
121 | (2) |
|
|
123 | (1) |
|
|
123 | (2) |
|
Prediction and Adaptation |
|
|
125 | (36) |
|
|
128 | (1) |
|
Propagating First-order Markov Processes |
|
|
129 | (2) |
|
|
131 | (1) |
|
Propagating Non-Gaussian Conditional Densities |
|
|
132 | (2) |
|
Learning Priors using HMMs and EM |
|
|
132 | (1) |
|
Observation Augmented Density Propagation |
|
|
133 | (1) |
|
Tracking Attended Regions |
|
|
134 | (2) |
|
|
136 | (4) |
|
|
140 | (5) |
|
|
145 | (2) |
|
|
147 | (9) |
|
Person-specific Pose Tracking |
|
|
149 | (1) |
|
Person-independent Pose Tracking |
|
|
150 | (6) |
|
|
156 | (1) |
|
|
157 | (4) |
Part III Models of Identity |
|
161 | (66) |
|
Single-View Identification |
|
|
163 | (24) |
|
|
163 | (2) |
|
Nearest-neighbour Template Matching |
|
|
165 | (1) |
|
Representing Knowledge of Facial Appearance |
|
|
166 | (2) |
|
Statistical Knowledge of Facial Appearance |
|
|
168 | (8) |
|
Low Dimensionality: Principal Components Analysis |
|
|
168 | (5) |
|
Separability: Linear Discriminant Analysis |
|
|
173 | (3) |
|
Sparseness and Topography: Local Feature Analysis |
|
|
176 | (1) |
|
Statistical Knowledge of Identity |
|
|
176 | (4) |
|
Identification Tasks Revisited |
|
|
177 | (2) |
|
Class-conditional Densities for Modelling Identity |
|
|
179 | (1) |
|
Structural Knowledge: The Role of Correspondence |
|
|
180 | (3) |
|
Beyond Alignment: Correspondence at a Single View |
|
|
181 | (1) |
|
Combining Statistical and Structural Models |
|
|
182 | (1) |
|
|
183 | (2) |
|
|
185 | (2) |
|
Multi-View Identification |
|
|
187 | (22) |
|
|
189 | (2) |
|
The Role of Prior Knowledge |
|
|
191 | (1) |
|
View Correspondence in Identification |
|
|
191 | (8) |
|
Learning Linear Shape Models |
|
|
192 | (3) |
|
|
195 | (4) |
|
Generalisation from a Single View |
|
|
199 | (4) |
|
Identity by Linear Combination |
|
|
199 | (2) |
|
Identity by Similarity to Prototype Views |
|
|
201 | (2) |
|
Generalisation from Multiple Views |
|
|
203 | (2) |
|
|
205 | (2) |
|
|
207 | (2) |
|
|
209 | (18) |
|
|
210 | (3) |
|
Learning and Identity Constancy |
|
|
210 | (1) |
|
The Effect of Temporal Order of Pose on Learning |
|
|
210 | (1) |
|
The Effect of Motion on Familiar Face Identification |
|
|
211 | (2) |
|
Computational Theories of Temporal Identification |
|
|
213 | (2) |
|
|
213 | (1) |
|
Spatio-temporal Signatures |
|
|
214 | (1) |
|
Identification using Holistic Temporal Trajectories |
|
|
215 | (3) |
|
Identification by Continuous View Transformation |
|
|
218 | (1) |
|
|
219 | (6) |
|
|
225 | (2) |
Part IV Perception in Context |
|
227 | (38) |
|
|
229 | (24) |
|
Sensory and Model-based Vision |
|
|
230 | (1) |
|
|
231 | (5) |
|
|
236 | (6) |
|
Inference using Hidden Markov Models |
|
|
240 | (1) |
|
Closed-loop Perceptual Control |
|
|
241 | (1) |
|
Vision as Cooperating Processes |
|
|
242 | (5) |
|
Visual Attention and Grouping |
|
|
242 | (3) |
|
Face Detection, Tracking and Identification |
|
|
245 | (2) |
|
|
247 | (3) |
|
|
250 | (3) |
|
|
253 | (12) |
|
Multi-model Identification |
|
|
254 | (1) |
|
Visually Mediated Interaction |
|
|
255 | (3) |
|
Visual Surveillance and Monitoring |
|
|
258 | (2) |
|
Immersive Virtual Reality |
|
|
260 | (1) |
|
Visual Database Screening |
|
|
261 | (4) |
Part V Appendices |
|
265 | (50) |
|
|
267 | (14) |
|
A.1 Database Acquisition and Design |
|
|
267 | (1) |
|
A.1.1 Intrinsic Experimental Variables |
|
|
268 | (1) |
|
A.1.2 Extrinsic Experimental Variables |
|
|
268 | (1) |
|
A.2 Acquisition of a Pose-labelled Database |
|
|
269 | (1) |
|
A.2.1 Using Artificial Markers |
|
|
269 | (2) |
|
A.2.2 Using Sensors and a Calibrated Camera |
|
|
271 | (2) |
|
|
273 | (2) |
|
|
275 | (2) |
|
A.5 Public Domain Face Databases |
|
|
277 | (2) |
|
|
279 | (2) |
|
|
281 | (16) |
|
B.1 System Characterisation |
|
|
282 | (1) |
|
B.2 A View on the Industry |
|
|
283 | (1) |
|
B.2.1 Visionics Corporation |
|
|
284 | (2) |
|
|
286 | (1) |
|
B.2.3 VisionSpheres Technologies |
|
|
287 | (2) |
|
B.2.4 Eigenface-based Systems: Viisage Technologies, Intelligent Verification Systems and Facia Reco |
|
|
289 | (2) |
|
B.2.5 Systems based on Facial Feature Matching: Plettac Electronic Security GmbH, ZN Bochum GmbH and Eyematic Interfaces Inc. |
|
|
291 | (4) |
|
|
295 | (2) |
|
|
297 | (18) |
|
C.1 Principal Components Analysis |
|
|
297 | (1) |
|
C.2 Linear Discriminant Analysis |
|
|
298 | (2) |
|
C.3 Gaussian Mixture Estimation |
|
|
300 | (1) |
|
C.3.1 Expectation-maximisation |
|
|
300 | (1) |
|
C.3.2 Automatic Model Order Selection |
|
|
301 | (1) |
|
C.3.3 Adaptive EM for Non-stationary Distributions |
|
|
302 | (2) |
|
|
304 | (1) |
|
C.4.1 Zero-order Prediction |
|
|
304 | (1) |
|
C.4.2 First-order Prediction |
|
|
305 | (1) |
|
C.5 Bayesian Belief Networks |
|
|
306 | (3) |
|
|
309 | (3) |
|
|
312 | (3) |
Bibliography |
|
315 | (24) |
Index |
|
339 | |