67 What is dimensionality reduction? What is the difference between feature selection and extraction? 2014-05-18T06:26:15.673

33 How to do SVD and PCA with big data? 2014-09-25T08:40:59.467

27 Purpose of visualizing high dimensional data? 2015-11-26T04:28:17.827

26 Machine learning techniques for estimating users' age based on Facebook sites they like 2014-05-17T09:16:18.823

23 Improve the speed of t-sne implementation in python for huge data 2016-02-06T14:19:10.243

20 Nearest neighbors search for very high dimensional data 2014-08-14T00:50:51.103

19 Feature selection vs Feature extraction. Which to use when? 2018-03-13T05:32:15.480

18 Are t-sne dimensions meaningful? 2017-03-02T16:46:05.247

16 High-dimensional data: What are useful techniques to know? 2015-01-25T22:52:23.437

16 Dimensionality and Manifold 2015-05-05T17:48:35.433

15 Why are autoencoders for dimension reduction symmetrical? 2017-10-13T05:25:23.793

14 Can closer points be considered more similar in T-SNE visualization? 2016-03-20T16:11:45.457

14 One hot encoding alternatives for large categorical values 2017-11-14T17:20:58.253

12 Efficient dimensionality reduction for large dataset 2018-08-29T11:35:46.950

9 Reducing the dimensionality of word embeddings 2015-07-28T17:54:23.927

9 t-SNE: Why equal data values are visually not close? 2017-05-17T12:58:12.303

9 What is the difference between and Embedding Layer and an Autoencoder? 2019-06-21T15:52:36.537

8 Why does "Depth = Semantic representation" in convolutional neural networks? 2016-09-08T23:14:59.327

8 Applying dimensionality reduction on OneHotEncoded array 2018-02-19T12:51:27.787

8 What does the long curve-shape t-SNE mean? 2018-07-08T01:04:50.397

7 Projecting data from $S^n$ to $S^2$ 2015-06-05T21:52:05.920

7 Is t-SNE just for visualization? 2016-07-29T15:12:44.180

5 Completing MDS manually in R 2015-11-16T01:53:50.673

5 Is mutual information symmetric? 2016-06-25T06:53:00.070

5 Is there a particular order in which to do feature selection and sampling? 2016-08-05T09:10:52.093

5 Distributed PCA or an equivalent 2018-07-11T21:22:04.607

5 What does it mean by “t-SNE retains the structure of the data”? 2018-08-13T18:18:46.960

5 How to handle large number of features in machine learning? 2018-09-08T06:09:48.977

4 Can I use unsupervised learning followed by supervised learning? 2014-08-16T11:05:43.353

4 machine learning algorithms for 2d data? 2014-10-05T05:12:13.633

4 Reduction of multiple answers to single variable 2014-11-18T09:07:00.867

4 Scikit Learn: KMeans Clustering 3D data over a time period (dimentionality reduction?) 2014-12-30T10:33:08.240

4 Is it ok to interprete PCA plot this way? 2015-06-17T19:36:26.493

4 Illustrating the dimensionality reduction done by a classification or regression model 2015-08-27T22:06:09.470

4 How exactly dependent variable is expressed in terms of independent variables using Partial Least Square Regression Method? 2016-02-08T06:38:59.600

4 How to reduce dimensionality of audio data that comes in form of matrices and vectors? 2016-03-14T00:37:25.940

4 Can I apply Clustering algorithms to the result of Manifold Visualization Methods? 2016-03-31T12:42:21.497

4 Can I do incremental learning with the sklearn implementation of Linear Discriminant Analysis 2017-03-05T10:35:45.193

4 Which dissimilarity/similarity measure use after a dimension reduction ( PCA / AutoEncoder / ... )? 2018-02-03T15:18:33.053

4 How to create interactive plot of thousands of images as output of t-SNE? 2018-03-25T02:48:47.183

4 Are dimensionality reduction techniques useful in deep learning 2018-07-30T15:33:11.740

4 Many things behave differently in high dimensional space 2019-02-16T14:37:25.063

4 Scikit-learn pipeline with scaling, dimensionality reduction, average prediction of multiple regression models, and grid search cross validation 2019-04-29T17:48:04.467

4 What is major difference between different dimensionality reduction algorithms? 2020-08-16T10:49:58.790

4 Reducing the size of a dataset 2020-08-27T01:24:51.847

4 Elimination of features based on high covariance without affecting performance? 2021-01-07T19:01:15.947

3 Dimension reduction for logical arrays 2014-11-14T08:53:21.510

3 Various algorithms performance in a problem and what can be deduced about data and problem? 2015-05-15T13:51:50.087

3 Online/incremental unsupervised dimensionality reduction for use with classification for event prediction 2015-09-07T16:14:16.143

3 How to equalize the pairwise affinity perplexities when implementing t-SNE? 2016-11-22T16:09:21.413

3 Multi-class text classification with LSTM in Keras 2017-02-14T03:24:29.140

3 Dimensionality reduction with PCA limitations 2017-05-27T11:49:55.000

3 How to deal with disconnected components in isomap? 2017-10-03T06:35:58.003

3 Dimensionality reduction with prior knowledge of colinearity between features 2017-10-20T07:31:27.757

3 Bandwidth of the gaussian kernels in t-sne 2017-10-26T15:08:47.297

3 Non Deterministc Dimensionality reduction 2018-02-18T18:44:06.080

3 Discarding correlation among inputs in a neural network 2018-05-18T07:12:19.217

3 Do I need to fit on train data for truncated SVD and then transform the test data on fitted train data? 2018-09-14T18:25:06.083

3 When projecting data with UMAP, should I use only the samples I need projected or the entire dataset? 2019-01-01T13:24:28.533

3 What are the cases in which Isomap fails to do a good job? 2019-03-18T04:44:41.450

3 Multidimensional scaling producing different results for different seeds 2019-04-15T08:34:16.767

3 How to reduce position changes after dimensionality reduction? 2019-05-22T11:18:32.580

3 How to automate ANOVA in Python 2019-07-14T14:43:53.253

3 Why don't we use space filling curves for high-dimensional nearest neighbor search? 2019-09-26T11:34:10.500

3 High dimensional data stream summarization and processing 2019-12-27T10:38:02.837

3 Linear Discriminant Analysis (LDA) before or after k-fold cross-validation? 2020-01-21T08:26:03.360

3 find most dense neighborhood of points in high dimensional space 2020-01-23T16:33:48.357

3 Possible flaw in the MDS method for dimensionality reduction 2020-03-01T09:08:30.797

2 Deciding about dimensionality reduction, classification and clustering? 2016-01-10T14:11:47.697

2 What is a good explanation of Non Negative Matrix Factorization? 2016-02-18T04:25:38.627

2 Free/open interactive softwares/plugins for end-users' high-dimensional data visualization 2016-03-17T05:49:24.920

2 feature redundancy 2016-06-17T23:30:03.107

2 Dimension Reduction - After or Before Train-Test Split 2016-08-23T08:58:00.587

2 Multidimensional Scaling with Categorical Data 2016-09-30T08:56:03.210

2 How is dimensionality reduction achieved in Deep Belief Networks with Restricted Boltzmann Machines? 2016-11-16T16:53:01.463

2 Principal Component Analysis, Eigenvectors lying in the span of the observed data points? 2016-12-22T17:42:55.723

2 Preserving explained variance while reducing dimensionality 2017-01-02T09:15:31.383

2 Finding the relation between two dimensions in a multi-dimensional problem 2017-02-22T11:56:01.313

2 What are 2D dimensionality reduction algorithms good for? 2017-03-29T08:34:43.283

2 I have n dimensional data and I want to check integrity, can I downgrade to 2 dimensional feature space via PCA and do so? 2017-04-11T21:14:18.453

2 Tensor Decomposition in TensorFlow for multinomial time series dimensionality reduction 2017-05-23T19:11:58.487

2 How to test trained PCA used for compression? 2017-11-18T14:08:00.540

2 How to choose variables for regression 2018-02-14T02:13:20.363

2 Accuracy reduces drastically when using TruncatedSVD with hashingvector 2018-05-30T11:38:00.393

2 Dimension of the manifold on which my data sits 2018-11-27T03:19:50.693

2 How to compare Factor and Principal Component Analysis results? 2019-02-13T14:22:49.873

2 Is dimension reduction helpful to select features for a classification problem? 2019-02-20T23:02:27.873

2 Measuring distance preservation in dimensionality reduction 2019-04-21T02:23:37.980

2 Multiclass classification with high number of classes, high number of features and small sample size 2019-06-03T19:22:04.513

2 How to scale or standardize data that is mostly 0 (ranges from 0-1)? 2019-06-12T18:30:02.340

2 Feature selection or Dimension reduction in unsupervised learning 2019-06-19T15:30:15.997

2 When I should use PCA? 2019-07-08T22:59:07.600

2 Using random forest for selecting variables returns the entire dataframe 2019-08-11T02:38:59.623

2 Document embedding vs locality sensitive hashing for document clustering 2019-09-26T11:23:40.763

2 'PCA' object has no attribute 'explained_variance_' 2019-10-06T19:19:33.273

2 Keras - Autoencoder different from Encoder + Decoder 2019-10-13T21:27:30.687

2 How to leverage description data in multi-class classification (dimensionality reduction) 2019-10-31T19:08:23.767

2 What is the meaning of 2D vectors? 2020-01-08T03:59:59.453

2 Magnifying or reducing the size of input groups into a neural network 2020-01-22T20:30:15.880