- The paper presents a comprehensive survey and taxonomy of 2D ear recognition methods, categorizing techniques into geometric, holistic, local, and hybrid approaches.
- The research introduces the Annotated Web Ears dataset and an open-source toolbox, enabling reproducible evaluation and benchmarking of ear recognition systems.
- Experimental results show that descriptor-based methods perform well on controlled datasets but struggle with real-world variability, highlighting critical challenges for future research.
Overview of Ear Recognition: More Than a Survey
The paper "Ear Recognition: More Than a Survey" by Žiga Emeršič, Vitomir Štruc, and Peter Peer goes beyond a conventional literature review: alongside a comprehensive examination of automatic 2D ear recognition, it contributes new research tools to the field. The paper surveys existing methodologies, outlines open challenges, and suggests directions for future research in this area.
Automatic ear recognition is attractive because it is non-intrusive and applicable in surveillance and security contexts. The ear is a valuable biometric: its structure is distinctive even between identical twins. The research area has consequently gained significant traction, yet commercial adoption remains limited; the authors argue that a set of open challenges has hindered widespread deployment.
Contributions
The paper introduces several important contributions to the field:
- Survey and Taxonomy: The authors conduct a comprehensive review of contemporary 2D ear recognition approaches, emphasizing descriptor-based methods. A taxonomy categorizes methods into geometric, holistic, local, and hybrid techniques, thereby providing clarity on the classification and characteristics of ear recognition methodologies.
- Annotated Web Ears (AWE) Dataset: A novel dataset of ear images gathered from the web under unconstrained, "in the wild" conditions, the first ear dataset of its kind. Its increased variability poses a substantially harder challenge for existing methods and pushes the boundaries of ear recognition technology.
- AWE Toolbox: An open-source toolbox for ear recognition research is presented. It implements a suite of state-of-the-art techniques and allows for reproducible experimentation, enabling researchers to benchmark and enhance performance on the AWE dataset and other datasets.
- Reproducible Evaluation Protocols: By employing consistent experimental protocols across multiple datasets using the AWE toolbox, this research enables direct comparison and independent evaluation of the state-of-the-art ear recognition methods.
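To make the descriptor-based (local) category above concrete, the sketch below computes a basic Local Binary Patterns (LBP) histogram, one of the classic local descriptors this line of work builds on. This is a minimal illustrative implementation in NumPy, not the AWE toolbox's actual code; the 4x4 cell grid and 8-pixel neighbourhood are assumptions chosen for brevity.

```python
import numpy as np

def lbp_histogram(gray, grid=(4, 4)):
    """Basic 8-neighbour Local Binary Patterns over a grayscale image,
    pooled into per-cell histograms (a common local descriptor)."""
    g = np.asarray(gray, dtype=np.int32)
    center = g[1:-1, 1:-1]
    # Offsets of the 8 neighbours, ordered clockwise from top-left.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    codes = np.zeros_like(center)
    for bit, (dy, dx) in enumerate(offsets):
        # Shifted view of the image so each pixel sees one neighbour.
        neigh = g[1 + dy:g.shape[0] - 1 + dy, 1 + dx:g.shape[1] - 1 + dx]
        codes += (neigh >= center).astype(np.int32) << bit
    # Concatenate a normalized 256-bin histogram from each spatial cell.
    hist = []
    for row in np.array_split(codes, grid[0], axis=0):
        for cell in np.array_split(row, grid[1], axis=1):
            h, _ = np.histogram(cell, bins=256, range=(0, 256))
            hist.append(h / max(cell.size, 1))
    return np.concatenate(hist)
```

Two ear images can then be compared by a distance between their histograms (e.g. chi-squared), with smaller distances indicating more similar texture.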
Numerical Results and Experimentation
Experimental results compare descriptor-based ear recognition techniques on both established datasets and the newly introduced AWE dataset. While these methods perform well on well-aligned datasets such as IITD II, performance drops markedly on AWE: both rank-1 recognition rates and equal error rates indicate significant room for improvement. This reflects the added complexity of unconstrained imagery and the need for techniques robust to occlusions, pose changes, and environmental conditions.
Implications and Future Directions
The research identifies several pertinent issues and potential research directions:
- Ear Detection and Alignment: Effective and efficient detection and alignment remain critical and unresolved challenges in developing fully automatic systems.
- Variability Handling: Addressing occlusions, pose variability, and lighting differences is paramount for real-world applicability. Techniques that model contextual information could offer solutions.
- Feature Learning: The authors see the future of ear recognition in advanced machine learning approaches, such as CNNs, which have already transformed related fields like face recognition.
- Scaling and Dataset Size: Understanding the scalability of current technologies and creating larger, more diverse datasets are essential steps toward the technology's maturity.
- Broader Modalities and Cross-Modality Recognition: Engagement with 3D data and heterogeneous recognition across modalities presents untapped opportunities that warrant exploration.
In conclusion, the paper by Emeršič et al. provides a well-rounded reference point for ear recognition research: it takes stock of the current state of the technology and charts concrete directions for future work. The AWE dataset and toolbox lower the barrier to reproducible experimentation, setting the stage for overcoming the obstacles that still separate ear recognition from commercial deployment and broader use in biometric systems.