Nonlinear Dimensionality Reduction by Locally Linear Embedding

Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics.

Reference Nonlinear Dimensionality Reduction by Locally Linear Embedding Paper

Date Created:
Last Edited:
1 31

Introduction

Many areas of science depend on explanatory data analysis and visualization. The need to analyze large amounts of multivariate data raises the fundamental problem of dimensionality reduction: how to discover compact representations of high-dimensional data. This paper produces locally linear embedding (LLE). an unsupervised learning algorithm that computes low-dimensional, neighborhood preserving embeddings of high-dimensional inputs. Unlike clustering methods for local dimensionality reduction, LLE maps its inputs into a single global coordinate system of lower dimensionality, and its optimizations do not involve local minima. By exploiting the local symmetries of linear reconstructions, LLE is able to learn the global structure of nonlinear manifolds, such as those generated by images or faces or documents of text.

To compare and classify high dimensional observations depends on crucially modeling the nonlinear geometry of the low dimensional manifolds on which the data often lies. Scientistis interested in explanatory analysis or visualization of multivariate data face a similar problem in dimensionality reduction. The problem, shown below, involves mapping high-dimensional inputs into a low-dimensional ”description” space with as many coordinates as observed modes of variability.

PIC

Previous approaches to this problem, based on multidimensional scaling (MDS) have computed embeddings that attempt to preserve pairwise distances between data points. In this paper, a different approach is used called locally linear embeddings (LLE) that eliminates the need to estimate pairwise distances between widely separated data points. Unlike previous methods, LLE recovers global nonlinear structure from locally linear fits.

Notes

The LLE algorithm is based on simple geometric intuitions. Suppose the data consist of N real-values vectors Xi, each of dimensionality D, sampled from some underlying manifold. Provided there is sufficient data, we expect each data point and its neighbors to lie on or close to a locally linear patch of manifold. We characterize the local geometry of these patches by linear coefficients that reconstruct each data point from its neighbors. Reconstruction errors are measured by

ϵ(W) = i |Xi - jWijXi|

which adds up the squared distances between all the data points and their reocnstructiuons. The weights Wij summarize the contribution of the jth data point to the ith reconstruction. To compute the weights Wij, we minimize the cost function subject to two constraints: first, that each data point Xi is reconstructed only from its neighbors, enforcing Wij = 0 if Xj does not belong to the set of neighbors of Xi; second, that the rows of the weight matrix sum to one jWij = 1. The optimal weights Wij subject to these constraints are found by solving a least-squares problem.

PIC

The constrained weights that minimize these reconstruction errors obey an important symmetry: for any particular data point, they are invariant to rotations, rescalings, and translations of that data point and its neighbors. By symmetry, it follows that the reconstruction weights characterize intrinsic geometric properties of each neighborhood, as opposed to properties that depend on a particular frame of reference.

Suppose that the data lie on or near a smooth nonlinear manifold of lower dimensionality d << D. To a good approximation then, there exists a linear mapping - consisting of a translation, rotation, and rescaling - that maps the high-dimensional coordinates of each neighborhood to global internal coordinates on the manifold. By design, the construction weights Wij reflect the intrinsic geometric properties of the data that are invariant to exactly such transformations. We therefore expect their characterization of local geometry in the orginal data space to be equally valid for local patches on the manifold. In particular, the same weights Wij that reconstruct the ith data point in D dimensions should also reconstruct its embedded manifold coordinates in d dimensions.

LLE constructs a neighborhood-preserving mapping based on the above idea. In the final step of the algorithm, each high-dimensional observation Xi is mapped to a low-dimensional vector Y i represneting global internal coordinates on the manifold. This is done by choosing d-dimensional coordinates Y i to minimize cost function

Φ(Y ) = i |Y i - jWijY j| 2

The LLE algorithm only has one free parameter: the number of neighbors, K. Once neighbors are chosen the weights and coordinates are computed using standard methods in linear algebra.

PIC

PIC

You can read more about how comments are sorted in this blog post.

User Comments

Insert Math Markup

ESC
About Inserting Math Content
Display Style:

Embed News Content

ESC
About Embedding News Content

Embed Youtube Video

ESC
Embedding Youtube Videos

Embed TikTok Video

ESC
Embedding TikTok Videos

Embed X Post

ESC
Embedding X Posts

Embed Instagram Post

ESC
Embedding Instagram Posts

Insert Details Element

ESC

Example Output:

Summary Title
You will be able to insert content here after confirming the title of the <details> element.

Insert Table

ESC
Customization
Align:
Preview:

Insert Horizontal Rule

#000000

Preview:


View Content At Different Sizes

ESC

Edit Style of Block Nodes

ESC

Edit the background color, default text color, margin, padding, and border of block nodes. Editable block nodes include paragraphs, headers, and lists.

#ffffff
#000000

Edit Selected Cells

Change the background color, vertical align, and borders of the cells in the current selection.

#ffffff
Vertical Align:
Border
#000000
Border Style:

Edit Table

ESC
Customization:
Align:

Upload Files

ESC

Upload a .lexical file. If the file type matches the type of the current editor, then a preview will be shown below the file input.

Upload Jupyter Notebook

ESC

Upload a Jupyter notebook and embed the resulting HTML in the text editor.

Insert Custom HTML

ESC

Edit Image

ESC
#ffffff

Insert Columns Layout

ESC
Column Type:

Select Code Language

ESC
Select Coding Language

Upload Previous Version of Editor State

ESC