Computer Vision Metrics: Survey, Taxonomy, and Analysis

ISBN-10: 1430259299
ISBN-13: 9781430259299
Category: Computers
Pages: 508
Language: English
Published: 2014-05-22
Publisher: Apress
Author: Scott Krig

Description

Computer Vision Metrics provides an extensive survey and analysis of over 100 current and historical feature description and machine vision methods, with a detailed taxonomy for local, regional and global features. This book provides necessary background to develop intuition about why interest point detectors and feature descriptors actually work, how they are designed, with observations about tuning the methods for achieving robustness and invariance targets for specific applications. The survey is broader than it is deep, with over 540 references provided to dig deeper. The taxonomy includes search methods, spectra components, descriptor representation, shape, distance functions, accuracy, efficiency, robustness and invariance attributes, and more. Rather than providing ‘how-to’ source code examples and shortcuts, this book provides a counterpoint discussion to the many fine opencv community source code resources available for hands-on practitioners. What you’ll learn Interest point & descriptor concepts (interest points, corners, ridges, blobs, contours, edges, maxima), interest point tuning and culling, interest point methods (Laplacian, LOG, Moravic, Harris, Harris-Stephens, Shi-Tomasi, Hessian, difference of Gaussians, salient regions, MSER, SUSAN, FAST, FASTER, AGHAST, local curvature, morphological regions, and more), descriptor concepts (shape, sampling pattern, spectra, gradients, binary patterns, basis features), feature descriptor families. Local binary descriptors (LBP, LTP, FREAK, ORB, BRISK, BRIEF, CENSUS, and more). Gradient descriptors (SIFT, SIFT-PCA, SIFT-SIFER, SIFT-GLOH, Root SIFT, CensureE, STAR, HOG, PHOG, DAISY, O-DAISY, CARD, RFM, RIFF-CHOG, LGP, and more). Shape descriptors (Image moments, area, perimeter, centroid, D-NETS, chain codes, Fourier descriptors, wavelets, and more) texture descriptors, structural and statistical (Harallick, SDM, extended SDM, edge metrics, Laws metrics, RILBP, and more). 3D descriptors for depth-based, volumetric, and activity recognition spatio-temporal data sets (3D HOG, HON 4D, 3D SIFT, LBP-TOP, VLBP, and more). Basis space descriptors (Zernike moments, KL, SLANT, steerable filter basis sets, sparse coding, codebooks, descriptor vocabularies, and more), HAAR methods (SURF, USURF, MUSURF, GSURF, Viola Jones, and more), descriptor-based image reconstruction. Distance functions (Euclidean, SAD, SSD, correlation, Hellinger, Manhattan, Chebyshev, EMD, Wasserstein, Mahalanobis, Bray-Curtis, Canberra, L0, Hamming, Jaccard), coordinate spaces, robustness and invariance criteria. Image formation, includes CCD and CMOS sensors for 2D and 3D imaging, sensor processing topics, with a survey identifying over fourteen (14) 3D depth sensing methods, with emphasis on stereo, MVS, and structured light. Image pre-processing methods, examples are provided targeting specific feature descriptor families (point, line and area methods, basis space methods), colorimetry (CIE, HSV, RGB, CAM02, gamut mapping, and more). Ground truth data, some best-practices and examples are provided, with a survey of real and synthetic datasets. Vision pipeline optimizations, mapping algorithms to compute resources (CPU, GPU, DSP, and more), hypothetical high-level vision pipeline examples (face recognition, object recognition, image classification, augmented reality), optimization alternatives with consideration for performance and power to make effective use of SIMD, VLIW, kernels, threads, parallel languages, memory, and more. Synthetic interest point alphabet analysis against 10 common opencv detectors to develop intuition about how different classes of detectors actually work (SIFT, SURF, BRISK, FAST, HARRIS, GFFT, MSER, ORB, STAR, SIMPLEBLOB). Source code provided online. Visual learning concepts, although not the focus of this book, a light introduction is provided to machine learning and statistical learning topics, such as convolutional networks, neural networks, classification and training, clustering and error minimization methods (SVM,’s, kernel machines, KNN, RANSAC, HMM, GMM, LM, and more). Ample references are provided to dig deeper. Who this book is for Engineers, scientists, and academic researchers in areas including media processing, computational photography, video analytics, scene understanding, machine vision, face recognition, gesture recognition, pattern recognition and general object analysis. Table of Contents Chapter 1. Image Capture and Representation Chapter 2. Image Pre-Processing Chapter 3. Global and Regional Features Chapter 4. Local Feature Design Concepts, Classification, and Learning Chapter 5. Taxonomy Of Feature Description Attributes Chapter 6. Interest Point Detector and Feature Descriptor Survey Chapter 7. Ground Truth Data, Data, Metrics, and Analysis Chapter 8. Vision Pipelines and Optimizations Appendix A. Synthetic Feature Analysis Appendix B. Survey of Ground Truth Datasets Appendix C. Imaging and Computer Vision Resources Appendix D. Extended SDM Metrics

Get the book

Similar books

Family History Digital Libraries
By William Sims Bainbridge
One named Sara and Timberlake had 11 male workers, 1 female worker, and 4 children workers, so it might have employed the Minor family.
Foundation Dreamweaver MX
By Craig Grannell, Jerome Turner, Matt Stephens
So here's what we need to do to arrive at our layout: s Create the main table to hold all the page elements. s Deal with the navigation area which is ...
Cisco CCNA Certification, 2 Volume Set: Exam 200-301
By Todd Lammle
This inclusive, two-book set provides what you need to know to succeed on the new CCNA exam. The set includes Understanding Cisco Networking Technologies: Volume 1 and the CCNA Certification Study Guide: Volume 2.
CompTIA Network+ Study Guide: Exam N10-006
By Todd Lammle
... you can use: –a –A –c –n –r –R –S –s All nbtstat switches are case sensitive. Generally speaking, lowercase switches deal with NetBIOS names of hosts, ...
CompTIA Network+ Study Guide with Online Labs: N10-007 Exam
By Todd Lammle, Jon Buhagiar
... you can use: –a –A –c –n –r –R –S –s All nbtstat switches are case sensitive. Generally speaking, lowercase switches deal with NetBIOS names of hosts, ...
CCNA: Cisco Certified Network Associate FastPass
By Todd Lammle
S The S reference point defines the point between the customer router and an ... with the letter E deal with using ISDN on the existing telephone network.
Stranger in the Chat Room
By Todd Hafer, Jedd Hafer
A sequel to In the Chat Room With God finds a group of teens contacted by a mysterious and increasingly malevolent character who claims to know about their encounters with the Almighty and challenges their beliefs. Original.
Error Correction Coding: Mathematical Methods and Algorithms
By Todd K. Moon
M M−1∑ k=0 −∞ ∞ k=0 The average energy per signal E s ∫ can be related to the ... we will deal primarily with additive white Gaussian noise (AWGN), ...
Security+ Training Guide
By Todd King
... to deal with most , but unfortunately not all , of these potential threats . ... The S / MIME standard implements encryption for message content using ...
CCDA: Cisco Certified Design Associate Study Guide: Exam 640-861
By Todd Lammle, Andy Barkl
S reference point The S reference point defines the reference point between ... with the letter E deal with using ISDN on the existing telephone network.