PDF

Description

Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples. Existing visualization methods which employ dimensionality reduction to two or three dimensions are often inefficient and/or ineffective for these datasets. For example, T-Distributed Neighbor Embedding (T-SNE) is a popular technique for dimensionality reduction, and visualization of high dimensional point structures, however T-SNE is an inherently slow algorithm, requiring pairwise computation between each of the points in high dimension. This thesis explores GP-GPU accelerated algorithms for approximate T-SNE, and demonstrates multiple algorithms achieving state of the art performance on, and novel visualizations of, common machine learning datasets.

Details

Files

Statistics

from
to
Export
Download Full History