Tag: feature-selection

67 What is dimensionality reduction? What is the difference between feature selection and extraction? 2014-05-18T06:26:15.673

57 Machine learning - features engineering from date/time data 2014-10-29T05:25:55.603

50 Does scikit-learn have forward selection/stepwise regression algorithm? 2014-08-07T15:33:43.793

38 Does XGBoost handle multicollinearity by itself? 2016-07-02T07:30:13.713

31 Are there any tools for feature engineering? 2015-10-03T04:09:56.397

23 Text categorization: combining different kind of features 2014-08-17T17:29:44.123

19 How to perform feature engineering on unknown features? 2016-03-10T19:39:16.190

19 Feature selection vs Feature extraction. Which to use when? 2018-03-13T05:32:15.480

18 How to combine categorical and continuous input features for neural network training 2018-03-28T08:49:04.513

17 How to choose the features for a neural network? 2014-07-10T10:07:13.523

17 Any "rules of thumb" on number of features versus number of instances? (small data sets) 2016-04-24T06:55:37.417

15 How to specify important attributes? 2014-05-19T15:55:24.983

15 What are the implications for training a Tree Ensemble with highly biased datasets? 2014-06-18T15:48:19.497

15 How to compare the performance of feature selection methods? 2016-12-06T13:31:05.340

14 What is difference between one hot encoding and leave one out encoding? 2016-03-23T03:25:53.170

14 List of feature engineering techniques 2016-07-25T18:55:53.813

13 What features are generally used from Parse trees in classification process in NLP? 2014-08-24T17:09:40.510

13 Feature selection using feature importances in random forests with scikit-learn 2015-08-04T17:44:35.277

13 Is feature selection necessary? 2017-01-04T08:46:42.270

12 What to do when testing data has less features than training data? 2015-11-18T09:12:27.920

12 Feature importance with high-cardinality categorical features for regression (numerical depdendent variable) 2017-04-05T18:23:12.657

12 When to remove correlated variables 2018-08-03T05:01:01.897

11 Feature Extraction Technique - Summarizing a Sequence of Data 2014-06-23T23:20:36.180

11 Which one first: algorithm benchmarking, feature selection, parameter tuning? 2016-03-06T17:57:54.723

11 Feature selection and classification accuracy relation 2016-10-24T13:13:44.127

11 Can GPS coordinates (latitude and longitude) be used as features in a linear model? 2017-10-09T20:19:27.787

11 How to do stepwise regression using sklearn? 2017-11-06T12:58:58.223

10 Data science projects explained step by step? 2015-07-02T14:36:59.107

10 Linear Regression and scaling of data 2018-04-14T07:54:10.320

9 Learning signal encoding 2014-06-18T03:19:07.557

9 Feature selection for Support Vector Machines 2015-07-26T12:17:09.947

9 feature importance via random forest and linear regression are different 2016-06-10T08:35:44.360

9 Improving accuracy of Text Classification 2017-05-28T12:56:36.267

9 LSTM Feature selection process 2018-02-16T07:20:51.617

9 Features reduction for the not correlated data set 2019-09-04T18:45:01.973

9 Why continuous features are more important than categorical features in decision tree models? 2020-01-15T14:55:09.140

9 Does "feature importance" depend on the model type? 2020-08-24T14:19:01.947

8 Feature selection for tracking user activity within an application 2014-06-10T15:08:54.073

8 Document classification: tf-idf prior to or after feature filtering? 2014-12-10T16:08:03.537

8 Dissmissing features based on correlation with target variable 2016-03-12T15:21:23.430

8 How to handle features which are not always available? 2019-02-12T09:09:01.343

8 Model for Differing Number of Rows per Observation 2019-04-17T16:47:56.343

7 Interpreting the results of randomized PCA in scikit-learn 2016-03-05T19:07:07.393

7 Named entity recognition (NER) features 2017-02-02T18:48:59.820

7 What is the meaning of hand crafted features in computer vision problems? 2017-09-02T00:18:25.147

7 When should I use StandardScaler and when MinMaxScaler? 2019-01-14T13:58:22.053

7 How to determine feature importance in a neural network? 2019-01-27T14:01:53.513

7 Dropping features after final evaluation on test data 2020-12-29T17:37:32.627

6 How to normalize results of Singular Value Decomposition (SVD) between 0 and 1? 2014-06-26T19:23:46.043

6 Improve a regression model and feature selection 2015-12-24T17:21:26.850

6 Which feature selection I should trust? 2017-01-10T20:35:37.160

6 What is the rationale for discretization of continuous features and when should it be done? 2017-06-17T04:23:17.953

6 Is there a model-agnostic way to determine feature importance? 2017-10-21T08:25:50.147

6 Image segmentation - handcrafted features vs DNN? 2018-02-24T03:22:37.157

6 Always drop the first column after performing One Hot Encoding? 2018-02-27T12:28:35.403

6 Instead of one-hot encoding a categorical variable, could I profile the data and use the percentile value from it's cumulative density distribution? 2018-04-04T00:31:09.753

6 Using python and machine learning to extract information from an invoice? Inital dataset? 2018-06-15T18:46:20.477

6 Number of features of the model must match the input. Model n_features is `N` and input n_features is `X`. 2018-07-02T21:51:21.983

6 Testing independence of random variables in Python 2018-07-20T12:30:15.843

6 Why would a fake feature with random numbers get selected in feature importance? 2018-11-14T11:49:16.150

6 Will unnecessary features harm the tree based model? 2019-02-06T17:37:49.263

6 Regression vs Random Forest - Combination of features 2019-03-31T14:28:26.237

6 Does feature selections matter to Decision Tree algorithms? 2019-05-08T13:17:04.510

6 Why ML model produces different results despite random_state defined? And how to set global random seed for sklearn 2020-01-12T11:34:54.883

5 Time series prediction 2014-12-28T14:58:45.320

5 Is automatic feature detection feasible? 2015-05-27T10:02:03.657

5 Predictive models with class value belonging to a set of observations 2015-09-25T23:04:36.723

5 General way to reduce features 2016-02-24T06:07:13.507

5 Importance of feature selection for boosting methods 2016-04-07T09:03:49.453

5 feature selection techniques 2016-06-24T05:34:53.880

5 Is there a particular order in which to do feature selection and sampling? 2016-08-05T09:10:52.093

5 Why is duplicating inputs bad? 2017-07-21T21:15:44.587

5 What feature engineering is necessary with tree based algorithms? 2017-08-08T15:00:47.583

5 Should boolean features be normalized and should false be -1 or 0 2017-09-06T13:44:30.387

5 Difference between RFE and SelectFromModel in Scikit-Learn 2017-10-04T15:13:46.197

5 Feature selection by overfitting a small sample size 2017-11-14T07:28:17.127

5 Unsupervised feature reduction for anomaly detection with autoencoders 2017-11-28T12:27:51.053

5 Feature Selection in Linear Regression 2018-04-30T10:19:41.153

5 LightGBM - Why Exclusive Feature Bundling (EFB)? 2018-11-30T14:36:56.090

5 Categorical vs continuous feature selection/engineering 2019-04-12T10:17:40.903

5 Feature Selection with one-hot-encoded categorical data 2019-06-01T18:05:36.153

5 What does embedding mean in machine learning? 2019-06-18T08:09:18.420

5 How can we convert time series data to supervised learning problem? 2019-12-02T19:16:35.017

5 RandomForest and tree feature importance in scikit-learn 2020-01-21T07:50:29.743

5 What can be done with highly correlated variables (>.95 and <-.95) 2020-02-07T12:56:03.360

5 Can one perform Feature Selection on a subset of training data? 2020-11-04T04:01:18.317

4 Understanding output stepAIC 2014-08-12T17:11:45.447

4 How to find the input variables for a classification problem? 2015-05-23T11:45:23.043

4 What features from sound waves to use for an AI song composer? 2015-07-14T22:39:37.943

4 Measure of correlation for term frequency 2015-10-06T21:55:44.410

4 Is there any difference between feature extraction and feature learning? 2015-11-09T22:55:49.043

4 Use forecast weather data or actual weather data for prediction? 2016-04-15T21:43:51.117

4 Which features do I select from text? 2016-05-14T19:12:02.683

4 Categorizing Customer Emails 2016-06-18T07:44:12.953

4 How does SelectKBest() perform feature selection? 2016-07-07T16:35:20.943

4 Is there a problem of over fitting in my dataset? 2016-08-08T04:32:35.520

4 Use TSFRESH-library to forecast values 2016-11-12T14:35:25.700

4 Number of features vs. number of samples : if small sample size is sufficient, why take large number of samples? 2017-05-05T18:44:39.317

4 What is representation in optical character recognition? 2017-06-06T18:12:53.580