Tag: python

200 What's the difference between fit and fit_transform in scikit-learn models? 2016-06-21T10:05:08.587

146 Best python library for neural networks 2014-07-07T19:17:04.973

146 Difference between isna() and isnull() in pandas 2018-09-06T10:14:01.593

119 Python vs R for machine learning 2014-06-12T06:04:48.243

114 Why do people prefer Pandas to SQL? 2018-07-12T09:25:51.067

101 SVM using scikit learn runs endlessly and never completes execution 2014-08-18T10:46:57.360

100 Training an RNN with examples of different lengths in Keras 2018-01-06T23:41:20.297

78 ValueError: Input contains NaN, infinity or a value too large for dtype('float32') 2016-05-26T04:13:04.033

76 strings as features in decision tree/random forest 2015-02-25T01:07:14.717

69 Open source Anomaly Detection in Python 2015-07-22T14:26:58.660

62 Clustering geo location coordinates (lat,long pairs) 2014-07-17T09:50:41.437

61 Tools and protocol for reproducible data science using Python 2014-07-16T20:09:08.640

58 Neural networks: which cost function to use? 2016-01-19T11:48:29.337

51 How to clone Python working environment on another machine? 2017-10-26T12:36:27.727

43 What is the use of torch.no_grad in pytorch? 2018-06-05T08:21:46.997

41 Opening a 20GB file for analysis with pandas 2018-02-13T14:03:39.623

40 How to force weights to be non-negative in Linear regression 2017-04-11T03:02:54.080

39 Multi GPU in Keras 2017-10-18T20:30:52.027

38 train_test_split() error: Found input variables with inconsistent numbers of samples 2017-07-06T05:17:55.947

37 Calculation and Visualization of Correlation Matrix with Pandas 2016-03-01T05:56:37.497

37 Merging two different models in Keras 2017-12-29T08:12:48.523

33 Calculating KL Divergence in Python 2015-12-08T10:37:44.050

32 What is the best Keras model for multi-class classification? 2016-02-01T15:18:33.907

31 Best practices to store Python machine learning models 2017-06-18T09:03:57.243

30 Hypertuning XGBoost parameters 2015-12-13T14:19:54.510

29 Is it necessary to standardize your data before clustering? 2015-08-06T20:58:57.380

28 Difference between OrdinalEncoder and LabelEncoder 2018-10-07T18:55:40.833

27 Scikit-learn: Getting SGDClassifier to predict as well as a Logistic Regression 2015-08-04T08:11:30.990

27 Merging multiple data frames row-wise in PySpark 2016-04-22T04:27:45.507

26 Machine learning techniques for estimating users' age based on Facebook sites they like 2014-05-17T09:16:18.823

26 Is there a straightforward way to run pandas.DataFrame.isin in parallel? 2014-05-19T23:59:58.070

26 Word2Vec for Named Entity Recognition 2014-06-19T19:29:57.797

26 VM image for data science projects 2015-01-22T21:34:57.443

26 How to count the number of missing values in each row in Pandas dataframe? 2016-07-07T10:26:23.330

26 Ways to deal with longitude/latitude feature 2016-08-20T06:51:26.563

26 What is the benefit of splitting tfrecord file into shards? 2017-01-14T08:59:36.870

26 PyTorch vs. Tensorflow Fold 2017-02-08T10:26:16.887

26 Is pandas now faster than data.table? 2017-10-25T02:43:49.793

26 Keras vs. tf.keras 2019-03-21T20:20:04.660

25 How to sum values grouped by two columns in pandas 2017-07-10T15:47:32.287

25 Is Python a viable language to do statistical analysis in? 2020-06-29T03:59:04.197

23 Improve the speed of t-sne implementation in python for huge data 2016-02-06T14:19:10.243

23 Sentence similarity prediction 2017-10-22T07:36:15.920

23 Keras Callback example for saving a model after every epoch? 2018-02-22T21:32:37.800

23 What is the reason behind taking log transformation of few continuous variables? 2018-10-23T13:08:02.707

22 Is there any data tidying tool for python/pandas similar to R tidyr tool? 2016-03-02T08:54:10.503

22 How to use LeakyRelu as activation function in sequence DNN in keras? When it performs better than Relu? 2018-10-02T04:06:47.510

21 Gradients for bias terms in backpropagation 2017-07-03T17:03:24.397

21 GraphViz not working when imported inside PydotPlus (`GraphViz's executables not found`) 2018-08-25T17:48:17.437

20 Python library for segmented regression (a.k.a. piecewise regression) 2015-10-16T04:07:42.020

20 Python implementation of cost function in logistic regression: why dot multiplication in one expression but element-wise multiplication in another 2017-08-22T09:31:03.423

20 What does the output of model.predict function from Keras mean? 2018-07-31T03:48:32.293

19 Feature extraction of images in Python 2015-11-15T00:45:03.647

19 Hyperparameter search for LSTM-RNN using Keras (Python) 2016-01-17T18:26:54.320

19 How to initialize a new word2vec model with pre-trained model weights? 2016-03-14T09:47:28.813

19 How to get predictions with predict_generator on streaming test data in Keras? 2016-09-07T15:14:56.833

19 Looking for a good package for anomaly detection in time series 2018-05-24T18:19:22.807

19 What are kernel initializers and what is their significance? 2018-08-24T04:30:57.397

18 How to calculate the fold number (k-fold) in cross validation? 2018-02-22T05:23:43.347

17 One-Class discriminatory classification with imbalanced, heterogenous Negative background? 2014-06-11T10:11:59.397

17 Binary classification model for unbalanced data 2014-06-23T07:03:15.643

17 Recommending movies with additional features using collaborative filtering 2014-07-25T00:58:12.253

17 Python library to implement Hidden Markov Models 2015-10-16T06:45:35.140

17 Multivariate linear regression in Python 2015-10-28T02:21:19.663

17 Visualization of multiple Markov models 2016-09-29T15:16:07.503

17 Good "frequent sequence mining" packages in Python? 2016-11-08T12:33:03.713

17 XGBRegressor vs. xgboost.train huge speed difference? 2017-03-01T19:15:54.660

16 Where in the workflow should we deal with missing data? 2014-05-27T21:07:48.973

16 How does SelectKBest work? 2016-03-18T10:34:45.107

15 Is stratified sampling necessary (random forest, Python)? 2017-01-12T00:58:27.320

15 Do modern R and/or Python libraries make SQL obsolete? 2017-02-24T19:33:34.840

15 Why are variables of train and test data defined using the capital letter (in Python)? 2017-03-15T07:36:40.437

15 Multi-dimensional and multivariate Time-Series forecast (RNN/LSTM) Keras 2018-02-07T14:49:20.597

15 Train Accuracy vs Test Accuracy vs Confusion matrix 2018-02-28T21:07:32.770

15 When to use Standard Scaler and when Normalizer? 2019-02-20T16:38:05.920

14 Is Python suitable for big data 2014-07-18T22:34:48.080

14 Feature importance with scikit-learn Random Forest shows very high Standard Deviation 2016-08-05T07:39:52.900

14 Convert a pandas column of int to timestamp datatype 2016-10-19T21:22:43.257

14 Heatmap on a map in Python 2016-10-27T00:18:08.207

14 How to train model to predict events 30 minutes prior, from multi-dimensional timeseries 2017-04-20T13:24:46.320

14 How to plot two columns of single DataFrame on Y axis 2017-12-12T13:04:59.383

14 Train, test split of unbalanced dataset classification 2018-06-08T09:49:13.647

14 scikit-learn n_jobs parameter on CPU usage & memory 2018-07-13T10:06:59.987

14 Are there any good out-of-the-box language models for python? 2018-09-20T13:34:22.520

13 How do I make an interactive PCA scatterplot in Python? 2016-05-28T00:55:13.630

13 Validation loss and accuracy remain constant 2016-08-23T06:19:59.023

13 Replace all numeric values in a pyspark dataframe by a constant value 2016-10-19T23:22:22.527

13 Extract information from sentence 2016-11-02T08:15:53.687

13 seaborn heatmap not displaying correctly 2019-08-08T15:33:39.847

13 What do Python's pandas/matplotlib/seaborn bring to the table that Tableau does not? 2020-03-29T12:00:41.113

12 Help regarding NER in NLTK 2015-01-29T12:13:01.677

12 How to scrape imdb webpage? 2015-04-15T23:53:13.957

12 How to convert categorical data to numerical data in Pyspark 2015-06-29T22:55:28.100

12 Issue with IPython/Jupyter on Spark (Unrecognized alias) 2015-07-23T03:45:36.867

12 Reshaping of data for deep learning using Keras 2016-05-12T13:41:11.543

12 Tensorflow neural network TypeError: Fetch argument has invalid type 2016-10-09T13:23:56.780

12 Train on batches in Tensorflow 2016-11-23T16:23:03.733

12 Using TF-IDF with other features in SKLearn 2017-09-04T11:30:19.893

12 Improve Pandas dataframe filtering speed 2017-09-24T10:50:17.553