32 strings as features in decision tree/random forest 2015-02-25T01:07:14.717

28 Gini Impurity vs Entropy 2016-02-12T22:05:41.193

10 Decision tree or logistic regression? 2015-06-09T09:37:11.537

10 Why do we need XGBoost and Random Forest? 2017-10-14T12:33:00.527

7 Can regression trees predict continuously? 2015-12-16T11:39:09.137

7 Minimum number of trees for Random Forest classifier 2016-08-09T09:28:26.697

7 Information Gain in R 2017-01-10T02:45:38.720

7 XGBoost for binary classification: choosing the right threshold 2017-03-24T22:23:56.137

6 Is decision tree algorithm a linear or nonlinear algorithm? 2015-08-13T13:59:52.603

6 How to predict probabilities in xgboost? 2015-09-08T03:14:09.230

6 Why we use information gain over accuracy as splitting criterion in decision tree? 2016-10-10T00:06:31.593

6 XGBRegressor vs. xgboost.train huge speed difference? 2017-03-01T19:15:54.660

5 Unbalanced classes -- How to minimize false negatives? 2015-11-12T16:09:57.543

5 what is the difference between "fully developed decision trees" and "shallow decision trees"? 2016-01-11T07:07:23.557

5 Decision trees: leaf-wise (best-first) and level-wise tree traverse 2018-01-16T17:04:37.950

4 Distributed Scalable Decision Trees 2014-10-20T22:22:09.660

4 How to preprocess different kinds of data (continuous, discrete, categorical) before Decision Tree learning 2015-08-07T10:43:50.747

4 How is cross validation used to prune a decision tree 2015-10-25T22:05:51.123

4 Decision tree vs. KNN 2015-12-05T22:24:29.063

4 decision trees on mix of categorical and real value parameters 2016-04-19T12:37:05.593

4 Why is the number of samples smaller than the number of values in my decision tree? 2016-06-21T12:23:37.307

4 How to normalize data for Neural Network and Decision Forest 2016-08-03T20:28:03.443

4 Why `max_features=n_features` does not make the Random Forest independent of number of trees? 2017-02-07T10:12:01.437

4 How to (better) discretize continuous data in decision trees? 2017-04-06T09:06:06.453

4 Does increasing the n_estimators parameter in decision trees always increase accuracy 2017-06-22T01:43:13.350

4 Bayesian combination of multi-dimensional experts? 2017-07-23T05:00:29.247

4 Understanding decision tree concept 2017-10-26T17:56:35.197

4 What are limitations of decision tree approaches to data analysis? 2017-12-14T12:26:06.487

3 How to explain decision tree algortihm in layman's terms? 2015-08-11T02:15:24.130

3 Making fake result in data mining using weka j48 algorithm 2016-01-30T20:37:07.527

3 How to interpret a decision tree correctly? 2016-02-11T01:47:47.487

3 How would I map categories between two similar but different sets of categories 2016-04-11T16:25:23.597

3 clustering plus linear model versus non linear (tree) model 2016-04-14T13:25:16.030

3 Decision Tree generating leaves for only one case 2016-04-28T10:50:38.520

3 First steps with Python and scikit-learn 2016-08-16T15:38:29.150

3 How to force DecisionTreeRegressor to use polyfit equation instead of mse at leaf level in python SKlearn 2016-12-07T13:06:55.723

3 Performance difference between decision trees and logistic regression when one of the features is a string 2017-01-25T01:14:59.223

3 Is it acceptable to select a random child node when using a Decision Tree (trained via ID3) to predict if an unknown attribute value is encountered 2017-02-22T18:47:11.223

3 More features hurts when underfitting? 2017-02-27T07:00:12.147

3 Why don't tree ensembles require one-hot-encoding? 2017-04-02T03:37:47.290

3 Why Decision Tree boundary forms a square shape and SVM a circular/oval one? 2017-07-19T12:13:18.233

3 Is there any way to get samples in under each leaf of a decision tree in Sklearn ? 2017-07-29T08:37:45.703

3 Should we use discrete or continuous input for decision trees 2017-09-01T15:02:30.823

3 Is decision tree regression comparable to locally weighted regression 2017-10-27T16:58:55.980

3 Making Use of the Target Values for Regression 2017-11-08T09:16:58.020

3 Bootstrapping or Randomly Dividing Dataset to reduce variance? 2018-01-11T12:21:11.653

2 Over-fitting issue in a classification problem (unbalanced data) 2015-06-17T09:45:49.593

2 Pick a model from multiple models using a decision tree 2015-10-19T11:31:55.477

2 Do I need to include a squared and linear variable in a random forest to achieve a parabolic effect? 2015-12-14T21:58:59.800

2 Pruning tree using REP 2016-02-07T04:36:41.013

2 Parameters for CART tree 2016-04-20T20:55:10.390

2 Fit Decision Tree to Gradient Boosted Trees for Interpretability 2016-08-08T20:51:42.173

2 Ordinal feature in decision tree 2016-09-15T21:45:21.687

2 Tuning Gradient Boosted Classifier's hyperparametrs and balancing it 2016-10-05T13:10:32.023

2 fix first two levels of decision tree? 2016-11-01T12:03:03.020

2 Feature Selection for K Nearest Neighbour and Decision Trees 2016-11-06T19:54:30.057

2 What is 'parameter convergence'? 2016-11-09T09:44:31.527

2 Sales Dataset to determine best model for predicting future sales 2016-12-08T15:13:26.143

2 Decision Tree Ensembling 2017-01-25T12:57:36.497

2 Interpreting Decision Tree in context of feature importances 2017-02-02T00:29:32.877

2 Preparing data, choosing algorithm 2017-03-30T16:24:43.480

2 What scale does LightGBM use for output? 2017-08-12T21:04:27.917

2 Multiclass Classification with Decision Trees: Why do we calculate a score and apply softmax? 2017-09-27T01:05:07.040

2 Can we implement random forest using fitctree in matlab? 2017-10-27T06:12:27.310

2 Decision tree classifier: possible overfitting 2017-11-02T18:30:08.770

2 How is a splitting point chosen for continuous variables in decision trees? 2017-11-03T21:45:09.203

2 How Can I Compute Information-Gain for Continuous- Valued Attributes 2017-11-18T01:19:20.370

2 Machine Learning - Same impurity values 2017-12-17T13:18:04.917

2 python sklearn decision tree classifier feature_importances_ with feature names when using continuous values 2017-12-26T09:53:26.060

2 How to get a confidence score for predictions? 2018-01-06T18:36:27.480

2 How to prevent/tell if Decision Tree is overfitting? 2018-01-18T10:02:49.040

2 Feature importance parameter in machine learning models like Naive Bayes 2018-02-06T18:20:35.000

1 Decision Tree Bayes rules / Maximax / Maximin 2015-02-19T16:23:15.093

1 Does pruning a decision tree always make it more general? 2015-03-17T04:25:34.463

1 Question on decision tree in the book Programming Collective Intelligence 2015-04-25T01:48:19.270

1 Weka class attribute suggestion 2015-07-02T13:16:43.333

1 Entropy calculation for MNIST dataset to form classification decision tree 2015-08-22T18:06:21.510

1 How to build a unique decision tree for each subset of data based on a grouping variable? 2015-09-01T13:12:35.833

1 How to choose the order in which to split a decision tree? 2015-10-01T20:22:36.540

1 Extract the "path" of a data point through a decision tree in sklearn 2015-10-15T02:06:37.107

1 What is a benchmark model? 2015-11-10T18:16:17.677

1 When to choose linear regression or Decision Tree or Random Forest regression? 2015-12-02T01:06:28.243

1 What's the best way to use binned data in a tree-based model? 2016-02-09T19:10:00.017

1 What can I do with a Decision Tree with poor ROC 2016-03-10T03:42:07.697

1 Aggregating Decision Trees 2016-04-13T22:20:00.967

1 How can decision trees be tuned for non-symmetrical loss? 2016-04-23T09:23:50.473

1 Visualizing N-way frequency table as a Decision Tree in R 2016-04-27T22:36:32.490

1 Binning of Continous Predictor and Predicted Variables 2016-05-11T22:26:32.840

1 How important is lookahead search in decision trees? 2016-05-30T14:02:14.090

1 Pruning and parameter reduction for decision trees 2016-06-24T08:43:11.603

1 Should we convert independent continous variables (features) to categorical variable before using decision tree like classifier? 2016-07-10T22:01:32.933

1 Multiple categories within a variable in decision tree 2016-07-19T09:36:23.997

1 Using Decision Tree methodology to identify Independent Variables for Multiple Regression 2016-08-07T22:05:11.457

1 Predictive modeling on big data set that can't fit into memory 2016-09-19T16:20:31.417

1 How to combine two CART decision trees learned in same type of data? 2016-10-20T07:56:19.977

1 Altered priors for classification trees 2017-01-03T09:07:19.657

1 Is the probabilistic cutoff in random forest flexible? 2017-01-05T09:44:22.833

1 Measures for choosing the best Decision Tree split? 2017-02-23T12:43:28.863

1 How does XGBoost compute the probabilities in predict_proba()? 2017-03-19T15:34:46.507