103 US Election results 2016: What went wrong with prediction models? 2016-11-09T18:08:37.417

86 Differences between cross validation and bootstrapping to estimate the prediction error 2011-11-14T14:57:47.103

64 Practical thoughts on explanatory vs. predictive modeling 2010-08-03T20:19:57.303

54 Difference between confidence intervals and prediction intervals 2011-10-04T18:35:49.743

53 Alternatives to logistic regression in R 2010-08-31T10:02:07.947

46 Variables are often adjusted (e.g. standardised) before making a model - when is this a good idea, and when is it a bad one? 2011-12-01T16:29:35.557

46 How can I help ensure testing data does not leak into training data? 2011-12-19T22:49:14.553

42 Is adjusting p-values in a multiple regression for multiple comparisons a good idea? 2010-09-30T14:07:56.490

33 When and how to use standardized explanatory variables in linear regression 2011-02-11T23:09:54.510

31 Manually calculated $R^2$ doesn't match up with randomForest() $R^2$ for testing new data 2011-02-18T02:32:48.823

31 When is unbalanced data really a problem in Machine Learning? 2017-06-02T12:08:34.323

28 Should parsimony really still be the gold standard? 2015-07-28T14:19:01.303

28 Is this the state of art regression methodology? 2015-12-10T15:21:02.460

26 When can correlation be useful without causation? 2015-07-24T21:07:09.327

26 Does $K$-fold CV with $K=N$ (LOO) provide the MOST or LEAST variable estimates, and what is the role of "stability"? 2017-05-20T01:11:46.233

22 Sites for predictive modeling competitions 2011-05-23T02:47:10.030

22 Visualizing the calibration of predicted probability of a model 2012-03-29T14:52:38.517

22 Why are p-values misleading after performing a stepwise selection? 2015-11-03T09:04:56.567

21 Cross-validation or bootstrapping to evaluate classification performance? 2013-09-26T19:54:34.270

21 How to predict outcome with only positive cases as training? 2015-09-27T13:01:26.530

21 Explanation of what Nate Silver said about loess 2016-07-28T13:57:40.307

21 What is the root cause of the class imbalance problem? 2016-11-25T19:02:49.697

20 "Interestingness" function for StackExchange questions 2011-05-03T21:53:26.910

20 Relative variable importance for Boosting 2015-07-19T13:29:17.233

19 Is there any algorithm combining classification and regression? 2016-11-14T18:42:08.790

18 whether to rescale indicator / binary / dummy predictors for LASSO 2013-09-09T14:46:46.747

18 Data augmentation techniques for general datasets? 2015-05-23T11:52:55.977

17 Obtaining a formula for prediction limits in a linear model 2011-04-03T18:24:49.593

17 How to predict when the next event occurs, based on times of previous events? 2011-09-30T20:26:34.607

16 How to interpret the output of predict.coxph? 2012-12-02T04:45:46.950

16 Are mixed models useful as predictive models? 2016-12-07T21:51:40.303

15 Survival Model for Predicting Churn - Time-varying predictors? 2011-03-16T19:58:45.120

15 Practical thoughts on explanatory vs predictive modeling 2011-11-24T17:36:50.653

15 Predicting with both continuous and categorical features 2012-04-19T14:56:45.380

15 Are robust methods really any better? 2012-08-02T01:24:20.753

14 Determining best fitting curve fitting function out of linear, exponential, and logarithmic functions 2011-04-08T02:46:48.813

14 Predictive Modeling - Should we care about mixed modeling? 2012-02-07T21:39:53.040

14 Fastest SVM implementation 2012-02-17T13:48:24.903

14 Is it cheating to drop the outliers based on the boxplot of Mean Absolute Error to improve a regression model 2017-02-21T18:25:03.820

13 How to do cross-validation with a Cox proportional hazards model? 2012-02-02T16:39:13.637

13 Predictive performance depends more on expertise of data analyst than on method? 2013-02-07T20:27:26.357

13 When building a regression model using separate modeling/validation sets, is it appropriate to "recirculate" the validation data? 2013-06-11T14:30:28.143

13 Can Random Forest Methodology be Applied to Linear Regressions? 2014-01-16T19:09:03.513

13 Boosting: why is the learning rate called a regularization parameter? 2015-08-25T10:39:16.160

13 Is prediction the 'golden criterion' to judge the ability of statisticans? 2015-12-08T08:40:05.500

13 Why is this prediction of time series "pretty poor"? 2017-10-04T16:34:17.990

12 Generative vs discriminative models (in Bayesian context) 2010-11-18T18:16:48.990

12 Bagging with oversampling for rare event predictive models 2011-08-31T18:13:26.723

12 How can I interpret Sklearn confusion matrix 2014-04-25T17:00:53.760

12 Goodness-of-fit test in Logistic regression; which 'fit' do we want to test? 2015-08-27T07:04:11.320

11 Confidence intervals for difference in time series 2011-10-27T17:51:06.383

11 Determine accuracy of model which estimates probability of event 2012-01-03T11:49:22.580

11 Quantile regression prediction 2012-02-10T18:23:37.420

11 Domain-agnostic feature engineering that retains semantic meaning? 2012-02-08T06:05:21.183

11 How do bookmakers select their opening odds? 2012-04-13T12:08:15.183

11 Is there a problem with multicollinearity and for splines regression? 2013-09-24T23:27:26.077

11 How to interpret the results when both ridge and lasso separately perform well but produce different coefficients 2017-03-14T09:46:30.640

11 Why would Netflix switch from its five-star rating system to a like/dislike system? 2017-04-11T18:20:45.213

10 Recommend some books/articles/guides to enter predictive analytics? 2010-08-22T19:49:20.753

10 Predicting long-memory processes 2011-01-31T14:03:05.300

10 Predicting multiple targets or classes? 2012-01-28T04:51:58.973

10 Model performance in quantile modelling 2013-03-17T00:46:40.517

10 Is there overfitting in this modellng approach 2013-04-22T15:07:38.837

10 Prediction evaluation metric for panel/longitudinal data 2013-06-04T20:49:12.507

10 Fitting distribution to spatial data 2014-02-04T02:23:18.327

10 What is shrinkage? 2015-03-06T22:22:56.520

10 How to choose optimal bin width while calibrating probability models? 2015-11-03T23:26:06.580

9 SVM, variable interaction and training data fit 2012-01-18T17:27:11.277

9 Best way to combine binary and continuous response 2012-07-23T17:55:29.850

9 Does preclustering help to build a better predictive model? 2012-10-12T12:09:25.227

9 Unique (?) idea for forecasting sales 2013-06-25T15:18:26.877

9 Forecasting hourly time series with daily, weekly & annual periodicity 2013-08-08T11:07:22.463

9 How fair is it to use the word "predict" for (logistic) regression? 2015-01-18T17:57:56.530

9 What problem does oversampling, undersampling, and SMOTE solve? 2017-06-14T01:33:25.767

8 General approaches to model car traffic in a parking garage 2011-02-13T21:04:03.320

8 How can I generate predictions from the randomSurvivalForest package in R? 2011-08-30T15:41:30.607

8 Statistics for online dating sites 2011-11-16T00:06:56.790

8 Best way to handle unbalanced multiclass dataset with SVM 2012-01-11T23:21:39.240

8 Why is a zero-intercept linear regression model predicts better than a model with an intercept? 2012-01-26T01:30:54.620

8 Support vector regression on skewed/high kurtosis data 2012-04-05T19:48:12.037

8 Analogues of sensitivity and specificity for continuous outcomes 2013-01-31T20:41:26.080

8 Classification vs. regression for prediction of the sign of a continuous response variable 2013-02-13T21:58:57.940

8 Mean absolute percentage error (MAPE) in Scikit-learn 2013-05-07T16:52:36.177

8 How to predict new data with spline/smooth regression 2013-06-23T17:40:10.420

8 Negative values in predictions for an always-positive response variable in linear regression 2013-10-13T09:41:29.343

8 Coupling time series information from sources with multiple spatial resolutions/scales 2013-12-20T17:59:45.153

8 Failing at linear regression / prediction on a real data set 2014-01-27T17:53:21.100

8 How do we predict rare events? 2014-04-12T05:13:42.790

8 Deciding between a linear regression model or non-linear regression model 2015-02-06T10:41:01.880

8 Clarifications regarding reading a nomogram 2015-06-04T05:04:22.083

8 Statistical model to predict the next move on network only using movement history 2015-08-27T17:25:06.233

8 What is the intuition behind the expected transaction value for a customer in the gamma-gamma model? 2015-11-02T17:07:37.513

8 The Myth of Long-Horizon Predictability 2018-01-31T04:17:21.887

7 Predicting daily electricity load - fitting time series 2011-01-25T08:17:54.077

7 Predicting from a simple linear model with lags in R 2011-01-25T19:56:10.017

7 Machine learning techniques for time series estimation - forecasting price 2011-07-13T06:58:12.073

7 Building background for machine learning for CS student 2011-12-04T03:12:11.190

7 Sample size and cross-validation methods for Cox regression predictive models 2012-01-31T15:06:05.793

7 What do Lift and Gain Charts state in the context of an employee turnover model 2012-07-27T02:21:52.767