Tag: kaggle

24 Why do we convert skewed data into a normal distribution 2017-07-07T11:35:05.640

19 How to perform feature engineering on unknown features? 2016-03-10T19:39:16.190

12 Hashing Trick - what actually happens 2014-10-10T03:48:54.660

10 Why does Gradient Boosting regression predict negative values when there are no negative y-values in my training set? 2014-06-24T19:43:24.643

6 How can I fill NaN values in a Pandas DataFrame in Python? 2016-12-25T22:29:59.157

4 Can you recommend a machine learning challenge that is suitable for novices? 2017-01-17T09:50:27.347

4 Import data from google drive to Kaggle Kernel 2019-06-01T00:26:32.460

4 How to automate the encoding process? 2019-06-12T08:57:30.523

4 Should I perform cross validation only on the training set? 2019-08-17T05:16:24.687

4 AUC ROC metric on a Kaggle competition 2020-02-28T00:37:20.400

3 Sklearn StratifiedKFold code explanation 2016-08-01T14:29:32.783

3 Owen Zhang's slides: what does the "time" mean? 2017-03-12T02:16:29.307

3 How will a rotation matrix affect contestants in machine learning contests? 2017-05-08T18:56:04.020

3 Avoid hardware limitation while competing in Kaggle? 2017-10-12T17:03:35.873

3 Hyperparameter tuning for stacked models 2018-11-16T22:37:25.000

3 Pandas throwing "Error tokenizing data. C error" while loading data sets from URL 2019-05-09T09:30:21.670

3 one-hot-encoding categorical data gives error 2019-06-10T09:36:50.037

3 How to automate ANOVA in Python 2019-07-14T14:43:53.253

3 How to load numerous files from google drive into colab 2019-11-27T12:54:54.477

3 Kaggle Titanic submission score is higher than local accuracy score 2019-12-20T17:08:38.197

2 Finding parameters with extreme values (classification with scikit-learn) 2015-04-21T09:47:05.917

2 What's cooking Kaggle - Improve model 2015-11-23T09:40:01.580

2 "concat" mode can only merge layers with matching output shapes except for the concat axis 2018-01-23T03:40:58.220

2 When should ordinal data be represented catigorically and when as integer? 2018-08-18T16:35:07.987

2 Dummy variable for Categorical values 2018-08-28T16:01:39.673

2 What is the difference between public test and private test on Kaggle 2018-12-21T09:59:43.823

2 kaggle Titanic what is GP? 2019-02-10T23:09:41.753

2 merge 2 dataframe with Memory Error 2019-02-14T02:26:19.030

2 How to use Kaggle Api in Google Colab for directly using dataset? 2019-04-17T20:49:35.160

2 Logistic Regression doesn't predict for the entire test set 2019-06-06T10:52:01.857

2 Importing .ipnyb file from Kaggle into local Jupyter 2019-12-21T17:04:52.853

2 Extract features from Decision tree leaf nodes 2020-03-13T08:48:56.677

2 Sklearn Pipeline for mixed features: numerical and (skewed) categorical 2020-03-18T23:31:57.643

2 How to divide a dataset for training and testing when the features and targets are in two different files? 2020-06-12T13:39:42.747

2 How do I read the cord_19_embeddings_2020-07-16.csv from the COVID-19 Open Research Dataset Challenge (CORD-19) on Kaggle? 2020-07-19T19:13:57.183

1 non-linear optimization for a linear classifier? (scikit-learn) 2015-04-23T12:50:52.173

1 Multivariate linear regression accounting for threshold / data cleaning 2016-10-18T18:49:57.537

1 How to use ensemble of models in FM or FFM? 2017-01-13T13:01:01.133

1 HIgher Order Interaction Variables. How to use them in model? 2017-03-05T16:35:44.127

1 Is it possible to detect which field does a rotated "kaggle" contest data come from? 2017-07-03T12:12:43.060

1 xgboost with tree_method = 'hist' in R 2017-10-11T08:36:18.057

1 How decision trees work in Python 2018-09-11T11:26:34.320

1 Comparing XGBR with CatBoost performance 2018-09-25T20:09:35.843

1 Can I make kaggle kernels read directly from my computer? 2018-10-21T05:25:09.480

1 Issues with pandas chunk merge 2018-12-01T19:46:00.673

1 How to do feature analyzing : pandas groupby(). mean 2018-12-30T14:51:00.253

1 requesting password while git push in jupyter notebook 2019-01-29T15:23:35.417

1 Kaggle - Kernel dies continuously 2019-01-31T15:45:41.403

1 xgboost GridSearchCV take too long or does not goes to the next step 2019-04-15T02:36:29.347

1 Modelling regressor of historic data on basic features test set 2019-04-27T10:01:57.560

1 Is numpy.corrcoef() enough to find correlation? 2019-05-14T09:45:32.480

1 Printing all files' names from a folder (Kaggle kernel) 2019-05-18T18:58:18.843

1 Binary encoding and its interpretation in Python 2019-06-13T05:27:02.577

1 Face Landmark Detection Solution 2019-06-15T15:52:19.427

1 How to use Random Forest to reduce dimensions 2019-07-20T22:03:38.200

1 What is wrong with the below code? 2019-08-01T06:05:12.567

1 How do I handle division by 0 in Kaggle? 2019-08-07T14:49:00.610

1 Scalable Data Science Pipeline 2019-08-25T23:56:20.517

1 What to do when feature engineering and parameter tuning don't add to the base model performance 2020-04-05T23:22:42.943

1 How to train a simple Machine learning model in batches? 2020-05-03T18:55:21.600

1 Is this over-fitting or something else? 2020-05-13T00:17:37.477

1 How to find correlation between categorical data and continuous data 2020-07-14T13:34:53.890

1 Kaggle notebook Vs Google Colab 2020-07-16T12:19:59.767

1 Dropping missing rows in two dataframes 2020-09-18T06:44:31.197

1 How to find appropliate algorithm to bulid a model for natural language based two data 2020-11-19T05:53:16.270

0 Titanic Disaster 2017-01-05T19:15:16.787

0 Why there is two output in Titanic case in tflearn quickstart? 2017-12-24T09:41:33.190

0 Changing categorical data to binary data is not reflected on the dataset 2019-06-05T04:32:47.217

0 Python Script using pandas to plot histograms between the features 2019-06-15T10:26:36.650

0 Why n-split is not possible for a dataframe with KFold? 2019-06-19T15:43:06.113

0 Dataset columns throwing KeyError 2019-07-02T07:18:18.210

0 How to use the fillna method in a for loop 2019-07-03T06:18:58.273

0 How to handle missing date data? 2019-07-05T08:21:24.277

0 How to find the mean of a column relative to another column? 2019-07-06T02:29:42.350

0 What should be the criteria to select features using correlation factors between features? 2019-08-17T08:11:00.553

0 Is there any kaggle competition for finding the feature for affecting revenue? 2019-08-24T05:00:00.030

0 Python can't take input while using functions 2019-09-05T07:54:36.370

0 KNN scoring low compared to Logistic regression in MNIST challenge 2019-10-13T11:18:37.387

0 How to define the adequate cash prize sizing for hosting a Kaggle or similar compeition? 2019-12-12T09:25:07.303

0 Python and Titanic competition how to get the median of specific range of values where class is 3 2020-02-15T11:44:54.203

0 Multiple choice gap-fill question (with distractors) dataset for evaluating NLP algorithms 2020-03-23T13:40:03.480

0 Using Kendall's Tau for association between dichotomous nominal and ordinal features 2020-04-22T20:27:39.637

0 How do you determine cut-off values for correlation when choosing features to keep? 2020-04-22T21:07:29.667

0 How strong would you rate the association between this nominal predictor variable and continuous response variable? 2020-05-04T21:19:52.230

0 Using word embeddings for kaggle? 2020-05-27T15:46:07.680

0 Dropping attributes leads to better classifier accuracy? (Titanic Set) 2020-07-02T17:19:12.187

0 Is there a certain threshold over which to accept or reject predictors based on correlation values with the target variable? 2020-08-12T04:46:26.247

0 Kaggle API does not download the entire dataset 2020-11-24T10:42:46.357

0 Searchable list of Kaggle challenges 2020-12-10T15:04:50.697

0 Size drastically increased when images converted to hdf5 format 2021-01-07T07:50:07.410

0 Trying to run a kaggle notebook 2021-01-12T13:12:18.903

0 ValueError: array length 13996092 does not match index length 214200 for Kaggle submission 2021-01-19T07:43:26.530

0 Parse documents to obtain subjective sentiment 2021-02-02T01:13:24.493

-1 Meaning of 'hue" in seaborn 2020-02-09T12:29:57.870

-2 create a company to make money by winning Kaggle competition 2016-01-07T22:39:43.080

-2 Feature engineering using XGBoost 2016-12-11T11:43:55.263