106 Best python library for neural networks 2014-07-07T19:17:04.973

72 Python vs R for machine learning 2014-06-12T06:04:48.243

68 Choosing a learning rate 2014-06-16T18:08:38.623

66 The cross-entropy error function in neural networks 2015-12-10T06:22:48.927

59 What is the "dying ReLU" problem in neural networks? 2015-05-07T04:11:56.600

43 Why Is Overfitting Bad in Machine Learning? 2014-05-14T18:09:01.940

41 Open source Anomaly Detection in Python 2015-07-22T14:26:58.660

40 Why do cost functions use the square error? 2016-02-10T21:52:30.730

35 Is there any domain where Bayesian Networks outperform neural networks? 2016-01-17T13:04:57.100

34 Can machine learning algorithms predict sports scores or plays? 2014-06-10T10:58:58.447

32 When to use what - Machine Learning 2015-01-20T15:27:38.160

32 strings as features in decision tree/random forest 2015-02-25T01:07:14.717

32 Data Science in C (or C++) 2015-03-20T14:56:23.420

32 Neural networks: which cost function to use? 2016-01-19T11:48:29.337

32 Why are Machine Learning models called black boxes? 2017-08-17T11:53:15.637

31 Clustering geo location coordinates (lat,long pairs) 2014-07-17T09:50:41.437

31 Data scientist vs machine learning engineer 2018-02-20T06:15:04.687

30 Advantages of AUC vs standard accuracy 2014-07-22T03:43:20.327

30 Should I go for a 'balanced' dataset or a 'representative' dataset? 2014-07-22T12:29:10.050

30 RNN vs CNN at a high level 2016-05-06T14:36:20.190

28 Gini Impurity vs Entropy 2016-02-12T22:05:41.193

25 Machine learning techniques for estimating users' age based on Facebook sites they like 2014-05-17T09:16:18.823

25 Machine learning - feature engineering from date/time data 2014-10-29T05:25:55.603

24 What are some standard ways of computing the distance between documents? 2014-07-05T16:10:21.580

23 The difference between `Dense` and `TimeDistributedDense` of `Keras` 2016-03-22T20:04:23.467

22 What algorithms should I use to perform job classification based on resume data? 2014-07-03T16:11:22.637

22 Data Science Project Ideas 2014-07-25T18:36:31.340

21 Why are NLP and Machine Learning communities interested in deep learning? 2014-10-11T10:24:01.393

21 Difference between AlphaGo's policy network and value network 2016-03-28T16:40:25.020

21 How do you visualize neural network architectures? 2016-07-18T17:08:17.237

20 Word2Vec for Named Entity Recognition 2014-06-19T19:29:57.797

20 Quick guide into training highly imbalanced data sets 2014-09-12T15:20:51.767

20 Deep Learning vs gradient boosting: When to use what? 2014-11-20T06:49:00.357

20 Deep learning basics 2014-12-08T22:37:32.777

20 Adding Features To Time Series Model LSTM 2017-02-21T22:17:40.000

18 GBM vs XGBOOST? Key differences? 2017-02-11T20:03:23.843

17 Use liblinear on big data for semantic analysis 2014-05-14T01:57:56.880

17 What statistical model should I use to analyze the likelihood that a single event influenced longitudinal data 2014-06-20T03:18:59.477

17 Purpose of visualizing high dimensional data? 2015-11-26T04:28:17.827

16 How can I predict traffic based on previous time series data? 2014-06-16T15:49:55.673

16 Detecting cats visually by means of anomaly detection 2014-06-24T12:28:10.990

16 How to perform feature engineering on unknown features? 2016-03-10T19:39:16.190

15 Where in the workflow should we deal with missing data? 2014-05-27T21:07:48.973

15 Meaning of latent features? 2014-07-16T09:24:51.780

15 How to increase accuracy of classifiers? 2014-07-16T09:49:15.933

15 Do Random Forests overfit? 2014-08-23T16:54:06.380

14 How to choose the features for a neural network? 2014-07-10T10:07:13.523

14 How to draw Deep learning network architecture diagrams? 2016-11-03T03:10:24.893

13 Looking for example infrastructure stacks/workflows/pipelines 2014-06-17T10:37:22.987

13 Text categorization: combining different kind of features 2014-08-17T17:29:44.123

13 Bagging vs Dropout in Deep Neural Networks 2015-11-16T14:41:08.553

13 What kinds of learning problems are suitable for Support Vector Machines? 2016-01-11T07:16:58.747

13 R: machine learning on GPU 2016-01-25T15:57:55.647

13 Merging sparse and dense data in machine learning to improve the performance 2016-04-06T05:14:11.457

13 What is Hellinger Distance and when to use it? 2017-08-31T02:11:38.127

13 In supervised learning, why is it bad to have correlated features? 2017-11-07T14:37:41.993

12 How to specify important attributes? 2014-05-19T15:55:24.983

12 Binary classification model for sparse / biased data 2014-06-23T07:03:15.643

12 Machine learning libraries for Ruby 2014-09-08T21:25:26.183

12 Item based and user based recommendation difference in Mahout 2014-12-04T05:18:03.720

12 General approach to extract key text from sentence (nlp) 2015-03-13T16:41:29.280

12 How to generate synthetic dataset using machine learning model learnt with original dataset? 2015-04-01T15:23:17.997

12 What is the difference between model hyperparameters and model parameters? 2016-09-24T11:24:50.800

11 What are some easy to learn machine-learning applications? 2014-06-10T11:05:47.273

11 One-Class discriminatory classification with imbalanced, heterogeneous Negative background? 2014-06-11T10:11:59.397

11 What are the implications for training a Tree Ensemble with highly biased datasets? 2014-06-18T15:48:19.497

11 Predicting next medical condition from past conditions in claims data 2014-07-30T11:45:08.313

11 Sentiment data for Emoji 2014-08-12T07:57:03.283

11 Nearest neighbors search for very high dimensional data 2014-08-14T00:50:51.103

11 What features are generally used from Parse trees in classification process in NLP? 2014-08-24T17:09:40.510

11 Studying machine learning algorithms: depth of understanding vs. number of algorithms 2014-09-19T09:08:55.180

11 When to use Random Forest over SVM and vice versa? 2015-08-20T04:16:43.303

11 How to determine if character sequence is English word or noise 2016-04-28T17:20:13.760

11 Should a model be re-trained if new observations are available? 2016-07-13T11:03:54.740

11 Why is a mini-batch better than one single "batch" with all training data? 2017-02-07T12:40:25.200

11 Why should the data be shuffled for machine learning tasks? 2017-11-09T07:42:15.517

10 Is GLM a statistical or machine learning model? 2014-06-19T18:02:24.650

10 Fisher Scoring v/s Coordinate Descent for MLE in R 2014-07-03T17:11:01.770

10 Solutions for Continuous Online Cluster Identification? 2014-08-14T19:09:29.523

10 implementing temporal difference in chess 2014-08-23T13:56:43.813

10 Neural net for server monitoring 2014-09-10T14:50:13.720

10 Hashing Trick - what actually happens 2014-10-10T03:48:54.660

10 How to scale an array of signed integers to range from 0 to 1? 2015-05-24T02:11:36.640

10 (Why) do activation functions have to be monotonic? 2015-12-06T11:41:30.750

10 applying word2vec on small text files 2016-01-10T10:49:20.447

10 Doc2Vec - How to label the paragraphs (gensim) 2016-02-12T02:22:01.940

10 How do "intent recognisers" work? 2016-04-05T09:03:15.113

10 Why are ensembles so unreasonably effective? 2016-05-25T13:08:06.693

10 Feature Transformation on Input data 2017-07-24T03:44:17.010

10 Why do we need XGBoost and Random Forest? 2017-10-14T12:33:00.527

10 When do we say that the dataset is not classifiable? 2017-12-05T12:09:52.173

10 Find optimal P(X|Y) given I have a model that has good performance when trained on P(Y|X) 2017-12-26T13:23:28.223

9 Algorithm for generating classification rules 2014-05-22T21:47:26.980

9 Debugging Neural Networks 2014-06-11T18:22:36.267

9 Statistics + Computer Science = Data Science? 2014-07-22T08:39:33.810

9 Solving a system of equations with sparse data 2014-08-05T20:45:01.383

9 Unstructured text classification 2014-09-05T12:08:11.347

9 Field Aware Factorization Machines 2014-10-21T00:09:40.597

9 Visualizing deep neural network training 2014-12-10T10:15:00.940