Visualizing data using t-SNE

Laurens Van Der Maaten; Geoffrey Hinton

Journal Article

Visualizing data using t-SNE

Journal of Machine Learning Research (2008) 9 2579-2625

ISSN: 15324435

38.2kCitations

11.5kReaders

Abstract

We present a new technique called "t-SNE" that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. t-SNE is better than existing techniques at creating a single map that reveals structure at many different scales. This is particularly important for high-dimensional data that lie on several different, but related, low-dimensional manifolds, such as images of objects from multiple classes seen from multiple viewpoints. For visualizing the structure of very large data sets, we show how t-SNE can use random walks on neighborhood graphs to allow the implicit structure of all of the data to influence the way in which a subset of the data is displayed. We illustrate the performance of t-SNE on a wide variety of data sets and compare it with many other non-parametric visualization techniques, including Sammon mapping, Isomap, and Locally Linear Embedding. The visualizations produced by t-SNE are significantly better than those produced by the other techniques on almost all of the data sets.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Van Der Maaten, L., & Hinton, G. (2008). Visualizing data using t-SNE. Journal of Machine Learning Research, 9, 2579–2625.

Readers' Seniority

PhD / Post grad / Masters / Doc 4935

70%

Researcher 1460

21%

Professor / Associate Prof. 441

Lecturer / Post doc 166

Readers' Discipline

Computer Science 2804

50%

Engineering 1228

22%

Biochemistry, Genetics and Molecular Bi... 782

14%

Agricultural and Biological Sciences 771

14%

Visualizing data using t-SNE

Abstract

Author supplied keywords

References Powered by Scopus

Reducing the dimensionality of data with neural networks

Nonlinear dimensionality reduction by locally linear embedding

A global geometric framework for nonlinear dimensionality reduction

Cited by Powered by Scopus

Deep learning

Human-level control through deep reinforcement learning

STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline