Tag: machine-learning

154 What is the "dying ReLU" problem in neural networks? 2015-05-07T04:11:56.600

146 Best python library for neural networks 2014-07-07T19:17:04.973

140 How to draw Deep learning network architecture diagrams? 2016-11-03T03:10:24.893

138 The cross-entropy error function in neural networks 2015-12-10T06:22:48.927

126 How do you visualize neural network architectures? 2016-07-18T17:08:17.237

119 Python vs R for machine learning 2014-06-12T06:04:48.243

111 Train/Test/Validation Set Splitting in Sklearn 2016-11-15T14:55:04.130

101 Choosing a learning rate 2014-06-16T18:08:38.623

98 Why do cost functions use the square error? 2016-02-10T21:52:30.730

89 When should I use Gini Impurity as opposed to Information Gain (Entropy)? 2016-02-12T22:05:41.193

86 Advantages of AUC vs standard accuracy 2014-07-22T03:43:20.327

77 Data scientist vs machine learning engineer 2018-02-20T06:15:04.687

76 strings as features in decision tree/random forest 2015-02-25T01:07:14.717

69 Open source Anomaly Detection in Python 2015-07-22T14:26:58.660

62 Clustering geo location coordinates (lat,long pairs) 2014-07-17T09:50:41.437

62 In supervised learning, why is it bad to have correlated features? 2017-11-07T14:37:41.993

61 Adding Features To Time Series Model LSTM 2017-02-21T22:17:40.000

59 RNN vs CNN at a high level 2016-05-06T14:36:20.190

58 Neural networks: which cost function to use? 2016-01-19T11:48:29.337

57 Machine learning - features engineering from date/time data 2014-10-29T05:25:55.603

56 Why mini batch size is better than one single "batch" with all training data? 2017-02-07T12:40:25.200

55 GBM vs XGBOOST? Key differences? 2017-02-11T20:03:23.843

55 Cross-entropy loss explanation 2017-07-10T10:26:39.450

53 Why Is Overfitting Bad in Machine Learning? 2014-05-14T18:09:01.940

53 Is there any domain where Bayesian Networks outperform neural networks? 2016-01-17T13:04:57.100

51 Should I go for a 'balanced' dataset or a 'representative' dataset? 2014-07-22T12:29:10.050

51 What is the difference between Gradient Descent and Stochastic Gradient Descent? 2018-08-04T06:36:04.657

50 How to set batch_size, steps_per epoch and validation steps 2018-03-30T06:53:30.373

46 Should a model be re-trained if new observations are available? 2016-07-13T11:03:54.740

45 Data Science in C (or C++) 2015-03-20T14:56:23.420

45 What is the Q function and what is the V function in reinforcement learning? 2016-01-18T13:51:25.520

45 Why should the data be shuffled for machine learning tasks 2017-11-09T07:42:15.517

44 How to interpret the output of XGBoost importance? 2016-06-21T06:02:19.990

44 In softmax classifier, why use exp function to do normalization? 2017-09-20T05:53:18.477

42 Deep Learning vs gradient boosting: When to use what? 2014-11-20T06:49:00.357

41 Can machine learning algorithms predict sports scores or plays? 2014-06-10T10:58:58.447

41 How to get accuracy, F1, precision and recall, for a keras model? 2019-02-06T13:29:24.533

40 What is Ground Truth 2017-03-24T12:09:14.510

40 Data science related funny quotes 2018-12-14T14:37:31.253

39 When to use what - Machine Learning 2015-01-20T15:27:38.160

39 Why are Machine Learning models called black boxes? 2017-08-17T11:53:15.637

38 The difference between `Dense` and `TimeDistributedDense` of `Keras` 2016-03-22T20:04:23.467

38 What is the difference between model hyperparameters and model parameters? 2016-09-24T11:24:50.800

38 Encoding features like month and hour as categorial or numeric? 2017-03-22T07:43:57.223

37 What are some standard ways of computing the distance between documents? 2014-07-05T16:10:21.580

37 Merging two different models in Keras 2017-12-29T08:12:48.523

35 Is it always better to use the whole dataset to train the final model? 2018-06-12T09:54:16.347

34 Quick guide into training highly imbalanced data sets 2014-09-12T15:20:51.767

34 Does gradient descent always converge to an optimum? 2017-11-09T16:41:20.940

34 How to set the number of neurons and layers in neural networks 2018-01-13T15:26:31.233

33 Why do we need XGBoost and Random Forest? 2017-10-14T12:33:00.527

33 Are there free cloud services to train machine learning models? 2017-11-03T12:41:54.203

33 Why is ReLU used as an activation function? 2018-01-10T13:07:47.997

32 Do Random Forest overfit? 2014-08-23T16:54:06.380

32 When to use Random Forest over SVM and vice versa? 2015-08-20T04:16:43.303

32 What is the advantage of keeping batch size a power of 2? 2017-07-05T05:43:20.287

32 StandardScaler before and after splitting data 2018-09-18T02:35:36.337

31 Meaning of latent features? 2014-07-16T09:24:51.780

30 RNN's with multiple features 2017-02-16T19:35:30.860

30 How to use the output of GridSearch? 2017-08-01T13:20:51.307

30 How to calculate mAP for detection task for the PASCAL VOC Challenge? 2017-11-26T19:32:57.543

30 When is precision more important over recall? 2018-04-26T14:31:01.753

29 What algorithms should I use to perform job classification based on resume data? 2014-07-03T16:11:22.637

29 General approach to extract key text from sentence (nlp) 2015-03-13T16:41:29.280

28 Are decision tree algorithms linear or nonlinear 2015-08-13T13:59:52.603

28 Why do convolutional neural networks work? 2016-12-23T12:43:47.203

28 Why use both validation set and test set? 2017-04-13T19:33:53.090

28 Difference between OrdinalEncoder and LabelEncoder 2018-10-07T18:55:40.833

28 Why is it wrong to train and test a model on the same dataset? 2020-12-13T14:11:58.530

27 Purpose of visualizing high dimensional data? 2015-11-26T04:28:17.827

27 Can machine learning learn a function like finding maximum from a list? 2019-07-31T11:06:16.047

26 Machine learning techniques for estimating users' age based on Facebook sites they like 2014-05-17T09:16:18.823

26 Word2Vec for Named Entity Recognition 2014-06-19T19:29:57.797

26 Why are NLP and Machine Learning communities interested in deep learning? 2014-10-11T10:24:01.393

26 Ways to deal with longitude/latitude feature 2016-08-20T06:51:26.563

26 Should we apply normalization to test data as well? 2018-02-08T16:53:24.653

25 Data Science Project Ideas 2014-07-25T18:36:31.340

25 Difference between AlphaGo's policy network and value network 2016-03-28T16:40:25.020

25 How to deal with string labels in multi-class classification with keras? 2017-03-11T13:42:10.793

25 What is weight and bias in deep learning? 2017-05-20T21:40:08.787

25 When would one use Manhattan distance as opposed to Euclidean distance? 2017-06-30T06:28:15.000

25 Role derivative of sigmoid function in neural networks 2018-04-23T09:38:48.060

25 Is Python a viable language to do statistical analysis in? 2020-06-29T03:59:04.197

24 Deep learning basics 2014-12-08T22:37:32.777

24 Word2Vec vs. Sentence2Vec vs. Doc2Vec 2017-06-30T07:05:33.707

24 How to decide neural network architecture? 2017-07-06T19:05:44.447

24 What is Hellinger Distance and when to use it? 2017-08-31T02:11:38.127

24 Keras difference beetween val_loss and loss during training 2017-11-30T19:33:23.220

24 What does Logits in machine learning mean? 2018-04-30T14:55:54.370

23 Text categorization: combining different kind of features 2014-08-17T17:29:44.123

23 Feature Transformation on Input data 2017-07-24T03:44:17.010

23 What is the reason behind taking log transformation of few continuous variables? 2018-10-23T13:08:02.707

22 How to predict probabilities in xgboost? 2015-09-08T03:14:09.230

22 Early stopping on validation loss or on accuracy? 2018-08-20T12:22:25.053

21 What statistical model should I use to analyze the likelihood that a single event influenced longitudinal data 2014-06-20T03:18:59.477

21 Doc2Vec - How to label the paragraphs (gensim) 2016-02-12T02:22:01.940

21 local minima vs saddle points in deep learning 2017-09-05T19:14:30.057

21 back propagation in CNN 2018-02-06T05:38:31.077

21 How can I check the correlation between features and target variable? 2018-10-03T18:43:27.863