17 When should one use L1 or L2 regularization instead of a dropout layer, given that both serve the same purpose of reducing overfitting? 2018-08-23T15:46:54.923

15 Why does adding a dropout layer improve deep/machine learning performance, given that dropout suppresses some neurons from the model? 2018-08-16T12:18:54.423

12 Why use L1 regularization over L2? 2017-10-12T19:54:52.020

10 Choosing regularization method in neural networks 2016-05-25T05:08:13.180

10 L1 & L2 Regularization in LightGBM 2019-08-08T17:08:44.763

9 Are there studies which examine dropout vs other regularizations? 2015-12-03T21:30:57.907

7 Understanding regularization 2016-02-17T06:12:09.650

7 Regularization practice with ANNs 2016-11-17T10:58:19.760

7 Convolutional Neural Network overfitting 2016-12-05T22:08:51.903

6 Weight decay in neural network 2018-02-12T07:33:11.487

6 Dropout vs weight decay 2018-04-20T13:46:49.837

5 Good explanation for why regularisation works 2016-08-18T08:11:23.383

5 Which regularization in convolution layers (conv2D) 2018-11-19T18:24:54.777

5 Why use regularization instead of decreasing the model 2019-08-08T19:45:23.657

5 Understanding XGBoost Training (Multi-class classification) 2020-04-04T10:45:18.163

5 Problem with basic understanding of polynomial regression 2020-05-09T09:17:39.230

5 Difference between L1 and L2 regularization 2020-05-17T10:07:03.357

4 Recommendations and Missing Data in Deep Learning 2016-07-04T15:27:06.597

4 Why does dropout ruin my accuracy in CNN? 2017-02-26T15:05:30.823

4 SVM regularization - minimizing margin? 2017-04-08T10:08:35.800

4 Shouldn't L2 regularization be normalized for the number of nodes in a layer? 2017-07-13T16:13:53.097

4 Regularization in simple math explained 2018-10-12T23:10:21.520

4 Why does Lasso behave "erratically" when the number of features is greater than the number of training instances? 2019-07-17T12:03:36.480

3 Should I use regularization every time? 2016-07-27T15:42:23.953

3 What's the best way to tune the regularization parameter in neural nets 2016-11-15T15:49:28.457

3 Can the 'bin size' in a histogram be thought of as a regularity constraint? 2018-04-07T07:06:46.060

3 GANs and grayscale imagery colorization 2018-05-22T08:40:09.823

3 Is regularization only for regression? 2018-09-16T09:56:52.920

3 Implemented early stopping but came across the error SGDClassifier: Not fitted error in sklearn 2018-10-01T14:27:39.640

3 SVM behavior when regularization parameter equals 0 2019-06-22T10:32:55.637

3 LightGBM Regressor, L1 & L2 Regularization and Feature Importances 2019-08-08T09:35:27.620

3 Does ridge regression always reduce coefficients by equal proportions? 2020-03-07T10:14:17.507

3 On simple 1D dataset, LogisticRegressionCV selects terrible hyperparameters, resulting scores are nonsensical 2020-04-11T23:43:17.230

3 What's the difference between hessian regularisation (min_child_weight) and loss regularisation (gamma)? When to use one over another? 2020-09-14T18:46:25.240

3 When I add regularization like L1 or L2, do I need more epochs to properly train my model? 2020-09-15T11:09:27.527

3 Relatively high regularization parameters for XGBoost model are the only way to prevent overfitting 2020-09-25T11:27:44.663

2 Speed decay proof for L2 regularization and non-normalized weight initialization 2016-11-09T07:21:55.643

2 Is LASSO regression implemented in Statsmodels? 2017-04-17T07:56:16.363

2 How to think about prediction error that is not convex in hyperparameter, or over the course of training 2017-12-15T14:27:11.183

2 Dropout in other machine learning models 2018-04-20T14:30:24.127

2 Why don't we want Autoencoders to perfectly represent their training data? 2018-07-27T17:15:25.163

2 Problems with Graphical Lasso 2018-08-06T10:54:42.287

2 Is regularization included in loss history Keras returns? 2018-08-12T14:58:49.387

2 Does Orange scale the data automatically for the linear regression with Ridge regularization 2018-10-01T08:39:35.323

2 Regularization term in Matrix Factorization 2019-01-04T20:39:46.577

2 Regularization in Embedding models? 2019-01-16T05:05:29.823

2 Loss and Regularization inference 2019-01-18T06:19:47.767

2 R package clogitL1 no longer available? 2019-01-23T21:19:38.263

2 Square Root Regularization and High Loss 2019-04-09T00:29:12.117

2 Multiclass classification with high number of classes, high number of features and small sample size 2019-06-03T19:22:04.513

2 Version of Perceptron 2019-07-01T15:02:25.877

2 Why do we divide the regularization term by the number of examples in regularized logistic regression? 2019-08-08T20:57:06.813

2 Why do we determine the values of λ in regularization as ln λ, such as ln λ=-18 instead of for example λ=0.3? 2019-08-10T13:40:05.513

2 Quadratic approximation of L1 regularized cost function 2019-08-13T12:53:29.363

2 Can ridge regression be used for feature selection? 2019-08-15T15:33:42.377

2 If my model is overfitting the training dataset, does adding noise to the training dataset help regularize the machine learning model? 2019-11-09T20:36:06.107

2 How to build an overfitted network in order to increase performance 2019-12-05T19:16:41.623

2 Overfitting and association with regularization 2020-03-15T06:43:02.393

2 Regularization for intercept parameter 2020-05-04T23:31:30.150

2 Should you turn off label smoothing when validating? 2020-06-23T01:41:30.217

2 How to handle overfitting 2020-06-28T18:58:05.273

2 Confusion with L2 Regularization in Back-propagation 2020-07-04T07:32:29.057

2 What is the intuition behind decreasing the slope when using regularization? 2020-07-26T16:08:05.493

2 Is it better to use regularization methods separately for neural networks (L2/L1 & Dropout)? 2020-10-29T16:19:36.497

2 Entropy-regularized RL (G-learning) vs. IRL (Inverse Reinforcement Learning) 2020-11-08T10:16:23.067

2 Approximation of long sequence of layers by one layer 2021-02-06T15:28:02.777

1 L1 regularization in pybrain 2016-06-15T01:43:41.887

1 Feature selection with L1 regularization on sklearn's LogisticRegression 2016-09-24T17:42:04.227

1 How can I fix this "convex" problem? Is it just a matter of overfitting? 2016-12-09T07:10:44.187

1 L2 regularization in caffe 2017-01-10T16:33:08.953

1 Selection of correlated variables for ridge regression 2017-05-16T09:57:14.963

1 Lasso implementation in Python 2017-06-17T09:36:09.793

1 Should I set higher dropout prob if there are plenty of data? 2017-07-10T20:20:18.740

1 Support vector machine margin term, norm or norm squared? 2017-10-16T07:32:49.070

1 Dropout without the averaging 2017-12-01T13:48:34.893

1 Custom regularisation for logistic regression 2018-02-25T05:28:09.720

1 Concrete Dropout for Recurrent Neural Networks (Keras) 2018-03-13T10:58:25.813

1 Using L1 penalty in XGBoost 2018-04-02T17:38:24.867

1 Trying to decrease overfitting with regularisation in CNN 2018-04-06T09:53:48.620

1 Point of dropping weights in mini batch for purpose of regularization 2018-04-12T14:00:40.450

1 Should I update my L1 and L2 regularisation parameters in an online setting? 2018-04-24T12:41:16.470

1 Loss for CNN decreases and settles but training accuracy does not improve 2018-05-29T20:09:18.263

1 What is the intuition behind Ridge Regression and Adapting Gradient Descent algorithms? 2018-07-13T15:55:18.273

1 Should highly correlated features be omitted before applying Lasso? 2018-08-20T09:55:35.747

1 Make embedding more Gaussian-like 2019-01-21T11:53:15.800

1 Keras regularizers (kernel, bias and activity) vs tf.contrib.layers.apply_regularization 2019-03-14T21:10:29.853

1 Importing Excel format data into R/R Studio and using glmnet package? 2019-03-17T03:02:48.303

1 Neural network error function: is the global minimum desirable? 2019-04-02T17:28:50.197

1 What are best activation and regularization method for LSTM? 2019-04-11T07:57:59.973

1 What is the point of getting rid of overfitting? 2019-05-17T23:11:13.677

1 Correct ML approach 2019-05-29T22:00:19.187

1 Difference between LASSO penalty in neural network and just LASSO regression 2019-06-25T16:01:20.433

1 Why does non-differentiable regularization lead to setting coefficients to 0? 2019-07-01T06:14:43.550

1 Improving Accuracy of the Deep Learning Model 2019-07-02T10:02:17.847

1 High Variance on CNN 2019-08-12T21:01:14.513

1 Do we need to divide our gradients by batch size, or do we use the sum (mini-batch SGD plus L2 regularization)? 2019-09-14T23:28:38.347

1 How can I regularize the output of a layer from scratch (without using Keras)? 2019-11-17T22:23:30.113

1 Best way to regularize a gradient boosting regressor? 2019-11-18T00:01:40.167

1 How does regularization help to get rid of outliers? 2019-11-28T06:35:49.553