Visualizing data using t-SNE

ISSN: 1532-4435
Citations: 38.2k
Readers (Mendeley): 11.5k

Abstract

We present a new technique called "t-SNE" that visualizes high-dimensional data by giving each datapoint a location in a two- or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. t-SNE is better than existing techniques at creating a single map that reveals structure at many different scales. This is particularly important for high-dimensional data that lie on several different, but related, low-dimensional manifolds, such as images of objects from multiple classes seen from multiple viewpoints. For visualizing the structure of very large data sets, we show how t-SNE can use random walks on neighborhood graphs to allow the implicit structure of all of the data to influence the way in which a subset of the data is displayed. We illustrate the performance of t-SNE on a wide variety of data sets and compare it with many other non-parametric visualization techniques, including Sammon mapping, Isomap, and Locally Linear Embedding. The visualizations produced by t-SNE are significantly better than those produced by the other techniques on almost all of the data sets.
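The abstract does not spell out how the crowding tendency is reduced. In the paper, the mechanism is a heavy-tailed Student-t kernel (one degree of freedom) for similarities between map points, with the map fitted by minimizing a Kullback-Leibler divergence between high-dimensional and map similarities. The following is a minimal sketch of those two pieces only, not the authors' implementation; the function names `student_t_affinities` and `kl_divergence` and the toy map are illustrative assumptions, and the perplexity-based high-dimensional similarities and the gradient-descent loop are omitted.

```python
import math


def student_t_affinities(points):
    """Pairwise map similarities q_ij from a Student-t kernel (1 d.o.f.).

    q_ij is (1 + ||y_i - y_j||^2)^-1, normalized over all pairs i != j.
    `points` is a list of equal-length coordinate tuples (the map).
    """
    n = len(points)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:  # the diagonal stays at zero
                sq_dist = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                weights[i][j] = 1.0 / (1.0 + sq_dist)
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]


def kl_divergence(p, q):
    """KL(P || Q) summed over entries with p_ij > 0.

    Assumes q_ij > 0 wherever p_ij > 0, which holds off the diagonal
    for the Student-t kernel above.
    """
    return sum(
        p_ij * math.log(p_ij / q_ij)
        for p_row, q_row in zip(p, q)
        for p_ij, q_ij in zip(p_row, q_row)
        if p_ij > 0.0
    )


# Hypothetical 2-D map: two nearby points and one distant point.
toy_map = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0)]
q = student_t_affinities(toy_map)
for row in q:
    print(["%.4f" % value for value in row])
```

Because the kernel decays polynomially rather than exponentially, distant map points keep a small but non-negligible similarity, which lets moderately dissimilar points sit far apart in the map instead of being pulled toward the center.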

Citation (APA)

van der Maaten, L., & Hinton, G. (2008). Visualizing data using t-SNE. Journal of Machine Learning Research, 9, 2579–2605.
