Abstract
Similarity measurements between 3D objects and 2D images are useful for the tasks of object recognition and classification. We distinguish between two types of similarity metrics: metrics computed in image-space (image metrics) and metrics computed in transformation-space (transformation metrics). Existing methods typically use image metrics; namely, metrics that measure the difference in the image between the observed image and the nearest view of the object. Example for such a measure is the Euclidean distance between feature points in the image and their corresponding points in the nearest view. (This measure can be computed by solving the exterior orientation calibration problem.) In this paper we introduce a different type of metrics: transformation metrics. These metrics penalize for the deformations applied to the object to produce the observed image. In particular, we define a transformation metric that optimally penalizes for "affine deformations" under weak-perspective. A closedform solution, together with the nearest view according to this metric, are derived. The metric is shown to be equivalent to the Euclidean image metric, in the sense that they bound each other from both above and below. It therefore provides an easy-to-use closed-form approximation for the commonly-used least-squares distance between models and images. We demonstrate an image understanding application, where the true dimensions of a photographed battery charger are estimated by minimizing the transformation metric.
Original language | English |
---|---|
Pages (from-to) | 465-470 |
Number of pages | 6 |
Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
Volume | 18 |
Issue number | 4 |
DOIs | |
State | Published - 1996 |
Externally published | Yes |
Keywords
- 3d-to-2d metric
- Affine deformations
- Exterior orientation calibration
- Object recognition