Tag: class-imbalance

51 Should I go for a 'balanced' dataset or a 'representative' dataset? 2014-07-22T12:29:10.050

34 Quick guide into training highly imbalanced data sets 2014-09-12T15:20:51.767

27 Unbalanced multiclass data with XGBoost 2017-01-16T12:53:12.653

19 How do you apply SMOTE on text classification? 2018-02-10T11:18:25.340

15 What are the implications for training a Tree Ensemble with highly biased datasets? 2014-06-18T15:48:19.497

15 Macro- or micro-average for imbalanced class problems 2018-08-13T09:57:37.077

14 Train/Test Split after perform SMOTE 2016-12-09T00:19:45.343

14 why we need to handle data imbalance? 2017-11-06T06:15:29.570

12 When should we consider a dataset as imbalanced? 2016-05-16T11:36:14.850

11 Unbalanced classes -- How to minimize false negatives? 2015-11-12T16:09:57.543

11 When do we say that the dataset is not classifiable? 2017-12-05T12:09:52.173

10 macro average and weighted average meaning in classification_report 2020-01-04T10:38:34.497

9 How can I perform stratified sampling for multi-label multi-class classification? 2018-06-13T11:18:12.543

9 Cross validation for highly imbalanced data with undersampling 2019-02-04T16:32:21.823

8 Unbalanced class: class_weight for ML algorithms in Spark MLLib 2016-12-07T00:08:48.120

8 Categorization of approaches to deal with imbalanced classes 2018-06-08T05:10:45.573

8 What is the best performance metric used in balancing dataset using SMOTE technique 2018-07-31T23:23:50.440

8 CNN - imbalanced classes, class weights vs data augmentation 2019-03-16T15:50:39.950

8 Which classification algorithms are negatively affected by class imbalances? 2019-07-03T19:45:48.660

7 Training and testing AdaBoost for low probability classification 2015-06-12T00:35:25.937

7 How does class_weights work in RandomForestClassifier 2016-05-03T13:23:35.380

7 How to fix class imbalance in training sample? 2018-02-27T15:48:53.610

7 weighted cross entropy for imbalanced dataset - multiclass classification 2018-05-15T12:35:13.917

7 Using class weights in Keras with multiple binary outputs which are not simply one-hot-encoded 2018-08-03T19:55:38.100

7 Why doesn't class weight resolve the imbalanced classification problem? 2019-01-29T07:21:40.867

7 Deep network not able to learn imbalanced data beyond the dominant class 2019-02-01T00:02:15.120

6 Kappa near to 60% in unbalanced (1:10) data set 2014-09-12T16:26:15.827

6 Overfitting for minority class after SMOTE w/ random forests 2016-05-09T14:18:45.320

6 Balanced Train set to predict Imbalanced Prediction set 2016-09-01T07:36:40.657

6 Bad classification performance of logistic regression on imbalanced data in testing as compared to training 2017-03-27T18:48:43.817

6 Class weighting during validation in Keras 2017-09-04T12:09:46.973

6 Why will the accuracy of a highly unbalanced dataset reduce after oversampling? 2018-02-23T08:51:12.860

6 How does exactly class_weight in Keras work? 2018-10-24T02:39:25.507

6 using sklearn class weight to increase number of positive guesses in extremely unbalanced data set? 2018-11-19T02:39:50.403

6 How to compare two unsupervised anomaly detection algorithms on the same data-set? 2019-03-20T09:52:00.290

6 Why real-world output of my classifier has similar label ratio to training data? 2019-04-07T12:10:33.087

6 Imbalanced classes (balance of train, validation, and test) 2019-04-26T17:50:12.493

6 Can we specify the number of data generated(minority class) using SMOTE? 2019-08-20T06:44:54.047

6 Weighted Binary Cross Entropy Loss -- Keras Implementation 2019-09-05T14:27:44.380

6 What is the best metric to evaluate highly imbalanced binary classification? (such as fraud detection in credit card) 2020-01-05T04:04:36.810

5 Is there a particular order in which to do feature selection and sampling? 2016-08-05T09:10:52.093

5 How to do imbalanced classification in deep learning (tensorflow, RNN)? 2017-02-27T11:21:46.193

5 Necessity of balancing positive/negative examples in binary classification machine learning? 2017-04-22T00:37:19.153

5 Setting class weights for categorical labels in Keras using generator 2017-08-11T08:27:57.187

5 When should you balance a time series dataset? 2018-02-22T18:10:43.190

5 How to avoid resampling part of pipeline on test data (imblearn package, SMOTE) 2018-05-18T21:37:12.507

5 Why does balancing the test dataset improve precision-recall curve? 2018-10-29T15:35:24.967

5 issue with early-stopping on f1 score with imbalanced data 2018-11-17T02:45:14.457

5 Why class weight is outperforming oversampling? 2019-05-26T01:09:55.883

5 Differences between class_weight and scale_pos weight in LightGBM 2019-06-18T20:36:31.660

5 convert predict_proba results using class_weight in training 2019-07-02T16:43:04.367

5 Difference between sklearn make_pipeline and imblearn make_pipeline 2019-08-21T06:45:04.380

5 Is There a Way to Re-Calibrate Predicted Probabilities After Using Class Weights? 2019-09-03T20:36:30.333

5 Why you shouldn't upsample before cross validation 2020-09-22T11:40:24.587

5 Is an $F_1$ score of 0.1 always bad? 2020-11-02T02:52:13.353

5 Metric for label imbalance 2021-02-25T03:27:35.167

4 oversampling plus down sampling using smote not working on random forests 2015-11-14T05:58:56.993

4 How to choose best classifier for Low positive to negative class ratio in data (training, validation and real time)? 2016-02-26T13:54:54.900

4 Imbalanced dataset in MLP classifier in python 2017-06-18T08:14:53.820

4 SMOTE and multi class oversampling 2017-11-11T23:20:19.680

4 Logic behind SMOTE-NC? 2018-01-07T09:54:03.007

4 What is the best way to deal with imbalanced data for XGBoost? 2018-02-25T14:02:12.957

4 How to handle "unknown" category in machine learning classification problems? 2018-09-02T09:08:00.073

4 Oversampling before Cross-Validation, is it a problem? 2019-01-21T12:02:00.250

4 Overfitting - how to detect it and reduce it? 2019-03-04T15:09:30.270

4 Train classifier on balanced dataset and apply on imbalanced dataset? 2019-03-05T16:10:02.510

4 How does class_weight work in Decision Tree 2019-07-23T14:29:08.027

4 Combining 'class_weight' with SMOTE 2019-08-30T17:55:55.513

4 Why did sampling boost the performance of my model? 2019-09-25T17:00:21.353

4 How to balance class weights correct for a CNN in Keras, given an unbalanced data set? 2020-01-10T15:40:44.420

4 AUC ROC metric on a Kaggle competition 2020-02-28T00:37:20.400

4 Why removing rows with NA values from the majority class improves model performance 2021-01-22T16:08:43.497

3 Modeling when the response variable has too many 0's and few continuous values? 2014-11-12T09:21:19.490

3 How to learn a classifier from a dataset with high imbalance 2015-01-25T23:09:22.863

3 What are the possible ways to handle class unbalance in a large scale image recognition problem with Deep Neural Nets? 2015-02-17T22:55:26.300

3 Ratio of positive to negative sample in data set for best classification 2015-08-29T10:27:02.597

3 Which accuracy metric of a ML classifier can maximize map@K of a recommender system for an unbalanced dataset? 2015-09-02T11:40:08.637

3 Balanced Linear SVM wins every class except One vs All 2016-03-14T17:18:16.080

3 Outlier detection for unbalanced classes 2016-05-05T12:54:28.863

3 How to deal with classification problem where labels are non uniformly distributed? 2016-09-11T17:38:19.650

3 unbalanced data classification 2016-11-10T14:59:31.553

3 Imbalanced dataset: how to deal with test data? 2017-03-26T06:32:14.227

3 Improving classifier performances in R for imbalanced dataset 2017-04-12T09:58:27.700

3 Is Gini coefficient a good metric for measuring predictive model performance on highly imbalanced data 2017-06-15T20:15:12.750

3 Restrictions on my skewed validation data 2017-11-03T15:05:54.343

3 What is the allowable limit of oversampling? 2018-02-08T10:49:20.787

3 Changing multiple models into 1 model 2018-06-07T11:45:19.017

3 Overfitted model produces similar AUC on test set, so which model do I go with? 2018-06-27T22:20:08.607

3 Deep Learning: Does starting the training on a smaller subset of the data make sense? 2018-08-17T05:26:26.403

3 In a binary classification, should the test dataset be balanced? 2018-11-29T11:30:46.227

3 imbalanced dataset in text classification 2019-02-06T12:31:40.717

3 Dealing with biased binary classifier 2019-04-03T15:47:41.140

3 Adjust class weights due to class imbalance and class importance Multi class classification XGBoost 2019-04-17T08:39:49.597

3 How to explain a Calibration Plot for many models? 2019-04-25T10:11:06.363

3 Large no of categorical variables with large no of categories 2019-06-04T11:23:27.523

3 How to apply oversampling when doing Leave-One-Group-Out cross validation? 2019-07-10T06:59:30.590

3 SMOTE on training data 2019-07-12T08:36:22.710

3 Machine Learning: Balanced training set but highly unbalanced prediction set? How to adjust? 2019-07-21T15:52:33.183

3 Poor performance of regression model for imbalanced data 2019-07-26T14:01:14.483