Insights on representational similarity in neural networks with canonical correlation

Morcos, Ari S.; Raghu, Maithra; Bengio, Samy

Statistics > Machine Learning

arXiv:1806.05759 (stat)

[Submitted on 14 Jun 2018 (v1), last revised 23 Oct 2018 (this version, v3)]

Title:Insights on representational similarity in neural networks with canonical correlation

Authors:Ari S. Morcos, Maithra Raghu, Samy Bengio

View PDF

Abstract:Comparing different neural network representations and determining how representations evolve over time remain challenging open questions in our understanding of the function of neural networks. Comparing representations in neural networks is fundamentally difficult as the structure of representations varies greatly, even across groups of networks trained on identical tasks, and over the course of training. Here, we develop projection weighted CCA (Canonical Correlation Analysis) as a tool for understanding neural networks, building off of SVCCA, a recently proposed method (Raghu et al., 2017). We first improve the core method, showing how to differentiate between signal and noise, and then apply this technique to compare across a group of CNNs, demonstrating that networks which generalize converge to more similar representations than networks which memorize, that wider networks converge to more similar solutions than narrow networks, and that trained networks with identical topology but different learning rates converge to distinct clusters with diverse representations. We also investigate the representational dynamics of RNNs, across both training and sequential timesteps, finding that RNNs converge in a bottom-up pattern over the course of training and that the hidden state is highly variable over the course of a sequence, even when accounting for linear transforms. Together, these results provide new insights into the function of CNNs and RNNs, and demonstrate the utility of using CCA to understand representations.

Comments:	NIPS 2018
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1806.05759 [stat.ML]
	(or arXiv:1806.05759v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1806.05759

Submission history

From: Ari Morcos [view email]
[v1] Thu, 14 Jun 2018 22:34:11 UTC (3,432 KB)
[v2] Thu, 21 Jun 2018 23:09:23 UTC (3,432 KB)
[v3] Tue, 23 Oct 2018 18:59:02 UTC (2,311 KB)

Statistics > Machine Learning

Title:Insights on representational similarity in neural networks with canonical correlation

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Insights on representational similarity in neural networks with canonical correlation

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators