17 Any "rules of thumb" on number of features versus number of instances? (small data sets) 2016-04-24T06:55:37.417

15 How to compare the performance of feature selection methods? 2016-12-06T13:31:05.340

10 How do scientists come up with the correct Hidden Markov Model parameters and topology to use? 2015-10-09T00:02:34.463

10 Why my network needs so many epochs to learn? 2019-06-01T09:58:54.403

9 Nested cross-validation and selecting the best regression model - is this the right SKLearn process? 2016-08-04T01:28:45.307

9 What are some of the best practices for sharing data and models with colleagues? 2017-03-17T18:45:16.867

9 Adding feature leads to worse results 2017-12-07T06:46:01.720

8 Machine Learning models in production environment 2016-08-11T17:48:38.837

7 How would you describe the trade-off between model interpretability and model prediction power in layman's terms? 2018-01-11T08:56:20.023

7 TypeError: Expected binary or unicode string, got [ 2018-02-19T15:25:18.180

7 Which is first ? Tuning the parameters or selecting the model 2018-11-27T21:00:11.183

6 Why are RNN/LSTM preferred in time series analysis and not other NN? 2017-09-14T14:15:57.707

5 On coursera what exactly does Andrew Ng say in videos Lectures 60 & 61 of machine learning? 2016-02-01T12:58:14.967

5 Is the early stopping of xgboost using correct 2018-04-17T21:19:04.727

5 Is there any way to explicitly measure the complexity of a Machine Learning Model in Python 2020-08-19T19:22:23.730

4 When to use linear or logistic regression? 2015-12-13T09:26:04.977

4 Neural Network - Sparsity of collaborative based filtering and modelling the prediction problem 2017-03-15T17:20:28.607

4 ROC curve for different hyperparameters of `RandomForestClassifier`? 2017-10-09T13:42:54.810

4 How to tune weights in Voting Classifier (Sklearn) 2017-10-16T16:10:00.417

4 Choosing a model for dataset with categorical variables 2018-02-06T21:43:44.093

4 Folds in Cross validation 2018-10-29T10:29:39.493

4 Model selection: large mean and variance vs small mean and variance 2019-03-06T08:04:13.257

4 help understanding nested cross validation 2019-08-08T23:09:44.147

4 Is autocorrelation of residuals a problem in machine learning? 2020-08-28T08:28:19.690

3 From development environment to production 2016-05-14T14:20:16.333

3 Multiple models vs. Single model for prediction 2017-01-12T22:14:32.763

3 Model selection and assessment using leave-one-out cross validation 2017-05-22T04:35:15.383

3 Is Gini coefficient a good metric for measuring predictive model performance on highly imbalanced data 2017-06-15T20:15:12.750

3 Intuitive interpretation of ratios between training set scores and validation set scores 2017-06-25T20:30:43.327

3 What are alternatives to MLP, when you have rectangular, structured data? 2017-08-10T16:32:23.513

3 Adding new variable to model 2018-05-18T12:38:51.923

3 How do you pronounce ROC? 2018-12-11T22:01:57.710

3 LSTM vs ARIMA for demand prediction 2018-12-30T07:04:22.223

3 Where does the "deep learning needs big data" rule come from 2019-02-18T20:47:09.420

3 How to handle associated features in machine learning 2019-07-10T15:54:23.353

3 How to measure the stability of hyperparameter selection in a model-building procedure? 2019-07-29T09:18:46.730

3 svm.LinearSVC: larger max_iter number doesn't always increase the accuracy/precision/recall 2019-08-22T08:29:09.923

3 How best to show the best model over multiple labels? 2019-12-14T17:17:29.660

3 Improving a simple trig model 2020-01-13T22:22:48.897

3 difference between empirical risk minimization and structural risk minimization? 2020-01-20T00:31:03.403

3 Is k-means with Mahalanobis a valid option for clustering? 2020-01-20T11:04:27.037

3 How can I approach this problem? 2020-01-22T13:57:45.083

3 Transfer Learning Question: Extending the Functionality of a Multipose-Estimation Machine Learning Model? 2020-02-07T06:57:59.373

3 ML: Classification Model Comparison 2020-06-04T15:48:06.670

3 How do you, analytically, show you are not using too many features? 2020-10-26T20:51:52.200

3 At what stage are ROC curves used when building machine learning model? 2021-01-18T21:25:36.813

2 What models are used to get the Next Best Action to convert a physician from a non writer to a write? 2016-03-04T10:03:04.807

2 Difference between coefficient of determination and least squared error? 2016-03-08T21:27:18.810

2 How to perform model selection for One-Class Classification? 2016-05-19T07:29:00.693

2 Good Libraries or Software for Temporal Bayesian Network Structure Learning? 2016-07-06T08:59:59.330

2 Comparing SMOTE to down sampling the majority class in imbalanced binary classification 2016-11-11T14:42:33.790

2 How to choose inputs to maximize reward for an ordered dataset? 2017-02-17T21:10:32.637

2 Why exactly using a test set for model evaluation is a bad idea? 2017-09-25T21:43:15.577

2 How to evaluate the "betterness" of competitive good models? 2018-05-11T17:51:04.967

2 How to select the learned model using $k$-fold cross validation? 2018-06-01T09:39:33.917

2 optimal combination of hyper parameters and model selection 2018-06-10T10:26:51.343

2 Please help to identify if there is meaning of linear variables separation by features in phase space? 2018-06-26T08:01:55.933

2 can accuracy rise while precision and recall drop? 2018-07-17T12:45:37.553

2 Optimizing decision threshold on model with oversampled/imbalanced data 2018-09-21T20:36:28.127

2 Is the ultimate challenge in ML simply computational power? 2018-11-07T19:49:10.783

2 P-value mining on large number of combinations of variables 2018-11-20T18:35:02.013

2 Ranking ATM based on Utilization and Economic Data (Scoring/Rank Model) 2019-02-07T09:46:19.737

2 What metrics determine the quality of the model? 2019-02-22T21:56:02.117

2 How to Work with Imbalanced Data 2019-03-04T23:28:53.910

2 How to do vocabulary estimation based on observed writings? 2019-03-08T15:06:12.673

2 Test RMSE of polynomial regression drops when using more variables? 2019-04-03T05:19:26.360

2 Interpretation of ROC AUC score 2019-05-21T14:01:05.007

2 What regression model can handle tiny amounts of data? 2019-08-19T01:51:34.297

2 Can someone explain what batch size is doing in convolutional NNs? 2019-12-06T17:13:23.963

2 What is my training score the mean_train_score or mean_test_score? 2019-12-09T05:15:05.640

2 Machine learning solution approach to match loan repayments 2020-01-14T10:24:50.743

2 Does it make sense to use train_test_split and cross-validation when using GridSearchCV to play with hyperparameters? 2020-02-08T12:04:19.847

2 How to select the best model from validation/training/holdout accuracy score 2020-02-22T06:42:42.987

2 What supervised machine learning model can be used to generate a scorecard-like result? 2020-06-08T07:44:38.033

2 RFE vs Univariate feature selection 2020-12-07T11:56:28.170

2 Does RandomForest convergence imply I can solve a problem with a NN too? 2021-02-01T19:48:17.433

1 Variance in cross validation score / model selection 2016-11-28T14:19:30.313

1 Modeling maps using trip data 2016-12-15T08:03:51.997

1 TF-IDF Regression & Machine Learning 2017-03-02T23:50:13.623

1 Cifar10 classified using VGG16 keep showing the same output? 2017-05-25T08:57:16.483

1 Choosing the right model for predicting demand 2017-08-11T22:02:48.103

1 Binomial model in scikit-learn 2017-08-18T13:30:33.520

1 Do models without parameters exist? 2017-09-28T08:37:10.113

1 Why is it good news that the expected gradient of the loss for a mini batch is equal to the gradient for the whole set? 2017-10-05T13:47:24.393

1 After choosing top models in classification? Can I apply it on the rest of my dataset 2017-12-11T23:19:13.800

1 Python 3: Please help.. Found input variables with inconsistent numbers of samples: [3292, 3326] 2017-12-22T19:40:10.270

1 How to identify ARIMA model parameters 2017-12-24T18:10:23.917

1 Re: Missing Value 2018-01-31T18:38:34.280

1 Retrieve user features in real time from UserId for prediction 2018-02-20T00:25:18.850

1 Performance Evaluation Metrics used in Training, Validation and Testing 2018-04-14T12:39:21.350

1 Model selection for a "set of linear models"? 2018-05-02T18:53:00.760

1 Regression model for continuous dependent variable and count independent variables 2018-07-04T15:57:28.733

1 Finding the equation for a multiple and nonlinear regression model? 2018-08-01T15:10:32.563

1 Which model may be best for outcome of a surgery? 2018-09-16T23:02:02.813

1 Decent ROC, but horrible Precision-Recall curve 2018-09-27T20:03:54.553

1 Why the VC dimension to this linear hypothesis equal to 3? 2018-10-02T11:12:04.463

1 Are there any Meta Knowledge bank available? 2018-10-09T09:53:29.967

1 Nested cross-validation generalization error for multiple models 2018-11-05T13:39:13.310

1 checking model stability - Performance for different class 2019-01-19T04:45:03.503