skip to main content
10.1145/3317640.3317646acmotherconferencesArticle/Chapter ViewAbstractPublication PagesivspConference Proceedingsconference-collections
research-article

Few-shot learning for ear recognition

Published: 25 February 2019 Publication History

Abstract

Ear recognition is a popular field of research within the biometric community. It plays an important part in automatic recognition systems. The ability to capture image of the ear from a distance and perform identity recognition makes ear recognition technology an attractive choice for security application as well as other related applications. However, datasets of ear images are still limited in size, while in other biometric modal communities, like face recognition, they possess large datasets and the most of them are collected in uncontrolled condition. As a result, deep learning technology still cannot yield satisfactory result in ear recognition area. In this paper, we tackle ear recognition problem by using few-shot learning based methods. We explore different methods towards model training with limited amounts of training data and show that by using them, with the help of data augmentation, the model can be flexible and can quickly adapt to new identity to perform fast recognition. The result of our work is the first few-shot learning based work to ear recognition. With our work we are able to significantly improve the accuracy of 23% on a regular dataset, and even 21% on a challenging dataset that is collected from the web, which is comparable with state-of-the-art in ear recognition area.

References

[1]
Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, and Timothy Lillicrap. Metalearning with memory-augmented neural networks. In ICML, 2016.
[2]
Andrychowicz, Marcin, Denil, Misha, Gomez, Sergio, Hoffman, Matthew W, Pfau, David, Schaul, Tom, and de Freitas, Nando. Learning to learn by gradient descent by gradient descent. In Neural Information Processing Systems (NIPS), 2016.
[3]
A. Jain, A. Ross, K. Nandakumar, Introduction to biometrics, Springer Science & Business Media, 2011.
[4]
A. Santoro, S. Bartunov, M. Botvinick, D. Wierstra, and T. Lillicrap, "Meta-learning with memory-augmented neural networks," in International conference on machine learning, 2016, pp. 1842--1850.
[5]
Bengio, Samy, Bengio, Yoshua, Cloutier, Jocelyn, and Gecsei, Jan. On the optimization of a synaptic learning rule. In Optimality in Artificial and Biological Neural Networks, pp. 6--8, 1992.
[6]
Bengio, Yoshua, Bengio, Samy, and Cloutier, Jocelyn. Learning a synaptic learning rule. Universite' de Montre al, Departement d'informatique et de recherche operationnelle, 1990.
[7]
B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum, "Human-level concept learning through probabilistic program induction," Science, vol. 350, no. 6266, pp. 1332--1338, 2015.
[8]
Brenden M Lake, Ruslan R Salakhutdinov, and Josh Tenenbaum. One-shot learning by inverting a compositional causal process. In NIPS, pp. 2526--2534, 2013.
[9]
C. Finn, P. Abbeel, and S. Levine, "Model-agnostic meta-learning for fast adaptation of deep networks," ArXiv preprint arXiv:1703.03400, 2017.
[10]
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," in Interna- tional Conference on Learning Representations (ICLR), 2015.
[11]
E. Gonzalez-Sanchez, Biometria de la oreja, Ph.D. thesis, Universidad de Las Palmas de Gran Canaria, Spain (2008).
[12]
Ha, David, Dai, Andrew, and Le, Quoc V. Hypernetworks. International Conference on Learning Representations (ICLR), 2017.
[13]
H.Nejati,L.Zhang,T.Sim,E.Martinez-Marroquin,G.Dong, Wonder Ears: Identification of Identical Twins from Ear Images, in: Proceedings of the International Conference on Pat- tern Recognition, IEEE, 2012, pp. 1201--1204.
[14]
Hochreiter, Sepp, Younger, A Steven, and Conwell, Pe- ter R. Learning to learn using gradient descent. In International Conference on Artificial Neural Networks. Springer, 2001.
[15]
Ioffe, Sergey and Szegedy, Christian. Batch normalization: Accelerating deep network training by reducing internal covariate shift. International Conference on Machine Learning (ICML), 2015.
[16]
J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell, "Decaf: A deep convolutional activation feature for generic visual recognition." in ICML, vol. 32, 2014, pp. 647--655.
[17]
Kemelmacher-Shlizerman I, Seitz S M, Miller D, et al. The megaface benchmark: 1 million faces for recognition at scale{C}//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 4873--4882.
[18]
L. A. Schmidt, "Meaning and compositionality as statistical induction of categories and constraints," PhD thesis, Massachusetts Institute of Technology, 2009.
[19]
Li, Ke and Malik, Jitendra. Learning to optimize. International Conference on Learning Representations (ICLR), 2017.
[20]
Munkhdalai, Tsendsuren and Yu, Hong. Meta net- works. International Conference on Machine Learn- ing (ICML), 2017.
[21]
Nech A, Kemelmacher-Shlizerman I. Level playing field for million scale face recognition{C}//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2017: 3406--3415.
[22]
Nichol A, Schulman J. Reptile: a Scalable Metalearning Algorithm{J}. arXiv preprint arXiv:1803.02999, 2018.
[23]
Oriol Vinyals, Charles Blundell, Tim Lillicrap, Daan Wierstra, et al. Matching networks for one shot learning. In NIPS, 2016.
[24]
Ravi, Sachin and Larochelle, Hugo. Optimization as a model for few-shot learning. In International Confer- ence on Learning Representations (ICLR), 2017.
[25]
R. Salakhutdinov, J. Tenenbaum, and A. Torralba, "One-shot learning with a hierarchical nonparametric bayesian model," in Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 2012, pp. 195--206.
[26]
Schmidhuber, Jürgen. Learning to control fast-weight memories: An alternative to dynamic recurrent net- works. Neural Computation, 1992.
[27]
Srivastava N, Hinton G, Krizhevsky A, et al. Dropout: a simple way to prevent neural networks from overfitting{J}. The Journal of Machine Learning Research, 2014, 15(1): 1929--1958.
[28]
Vinyals, Oriol, Blundell, Charles, Lillicrap, Tim, Wierstra, Daan, et al. Matching networks for one shot learning. In Neural Information Processing Systems (NIPS), 2016.
[29]
Ž. Emeršič, V. Štruc, and P. Peer: Ear Recognition: More than a Survey«, Neurocomputing, 2017.
[30]
Ž. Emeršič, L. L. Gabriel, V. Štruc, and P. Peer: Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks, arXiv preprint: arXiv:1702.00307, 2017.
[31]
Z. Wang, T. Schaul, M. Hessel, H. Van Hasselt, M. Lanctot, and N. De Freitas, "Dueling network architectures for deep reinforcement learning," ArXiv preprint arXiv:1511.06581, 2015.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
IVSP '19: Proceedings of the 2019 International Conference on Image, Video and Signal Processing
February 2019
140 pages
ISBN:9781450361750
DOI:10.1145/3317640
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

  • Wuhan Univ.: Wuhan University, China
  • City University of Hong Kong: City University of Hong Kong

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 February 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Deep Learning
  2. Ear Recognition
  3. Few-shot Learning
  4. Fomaml
  5. Maml
  6. Meta-Learning
  7. Pretrained Network
  8. Reptile
  9. Small Data

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

IVSP 2019

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)4
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media