Tag: neural-network

242 What are deconvolutional layers? 2015-06-13T09:56:45.397

154 What is the "dying ReLU" problem in neural networks? 2015-05-07T04:11:56.600

146 Best python library for neural networks 2014-07-07T19:17:04.973

143 When to use GRU over LSTM? 2016-10-17T11:47:45.340

140 How to draw Deep learning network architecture diagrams? 2016-11-03T03:10:24.893

126 How do you visualize neural network architectures? 2016-07-18T17:08:17.237

101 Choosing a learning rate 2014-06-16T18:08:38.623

89 Backprop Through Max-Pooling Layers? 2016-05-12T08:38:12.740

77 How are 1x1 convolutions the same as a fully connected layer? 2016-07-17T13:23:22.600

75 When to use (He or Glorot) normal initialization over uniform init? And what are its effects with Batch Normalization? 2016-07-28T17:12:29.933

61 Adding Features To Time Series Model LSTM 2017-02-21T22:17:40.000

59 RNN vs CNN at a high level 2016-05-06T14:36:20.190

58 Neural networks: which cost function to use? 2016-01-19T11:48:29.337

58 What is the difference between "equivariant to translation" and "invariant to translation" 2017-01-04T08:41:15.700

55 How to fight underfitting in a deep neural net 2014-07-13T09:04:39.703

55 Cross-entropy loss explanation 2017-07-10T10:26:39.450

52 What is the difference between LeakyReLU and PReLU? 2017-04-25T11:58:13.553

51 What is the difference between Gradient Descent and Stochastic Gradient Descent? 2018-08-04T06:36:04.657

45 Why should the data be shuffled for machine learning tasks 2017-11-09T07:42:15.517

42 How do subsequent convolution layers work? 2015-12-02T21:53:17.183

41 How to prepare/augment images for neural network? 2015-02-24T11:59:36.033

41 Sparse_categorical_crossentropy vs categorical_crossentropy (keras, accuracy) 2018-12-01T06:28:06.650

41 How to get accuracy, F1, precision and recall, for a keras model? 2019-02-06T13:29:24.533

40 What is Ground Truth 2017-03-24T12:09:14.510

40 Data science related funny quotes 2018-12-14T14:37:31.253

39 Choosing between CPU and GPU for training a neural network 2017-05-25T23:48:26.343

38 The difference between `Dense` and `TimeDistributedDense` of `Keras` 2016-03-22T20:04:23.467

35 Why not always use the ADAM optimization technique? 2018-04-15T16:55:34.020

34 How does Keras calculate accuracy? 2016-10-07T08:10:51.287

34 Does gradient descent always converge to an optimum? 2017-11-09T16:41:20.940

34 How to set the number of neurons and layers in neural networks 2018-01-13T15:26:31.233

33 Are there free cloud services to train machine learning models? 2017-11-03T12:41:54.203

33 Why is ReLU used as an activation function? 2018-01-10T13:07:47.997

32 What is the best Keras model for multi-class classification? 2016-02-01T15:18:33.907

31 Neural Network parse string data? 2014-07-30T16:27:45.177

30 RNN's with multiple features 2017-02-16T19:35:30.860

30 Are there any rules for choosing the size of a mini-batch? 2017-04-17T16:18:22.793

30 How to calculate mAP for detection task for the PASCAL VOC Challenge? 2017-11-26T19:32:57.543

29 What loss function to use for imbalanced classes (using PyTorch)? 2019-04-01T19:00:04.877

28 Guidelines for selecting an optimizer for training neural networks 2016-03-04T09:32:17.287

28 What is the meaning of "The number of units in the LSTM cell"? 2016-07-24T10:17:35.023

28 Why do convolutional neural networks work? 2016-12-23T12:43:47.203

28 Why use both validation set and test set? 2017-04-13T19:33:53.090

28 Why is it wrong to train and test a model on the same dataset? 2020-12-13T14:11:58.530

27 Neural Network for Multiple Output Regression 2017-02-10T23:17:41.920

26 Word2Vec for Named Entity Recognition 2014-06-19T19:29:57.797

26 Why are NLP and Machine Learning communities interested in deep learning? 2014-10-11T10:24:01.393

26 Should we apply normalization to test data as well? 2018-02-08T16:53:24.653

25 Role derivative of sigmoid function in neural networks 2018-04-23T09:38:48.060

24 How to decide neural network architecture? 2017-07-06T19:05:44.447

23 Extra output layer in a neural network (Decimal to binary) 2015-07-31T00:25:36.347

22 Choosing between TensorFlow or Theano as backend for Keras 2015-12-07T16:42:04.107

22 Keyword/phrase extraction from Text using Deep Learning libraries 2016-02-03T10:56:51.447

22 Early stopping on validation loss or on accuracy? 2018-08-20T12:22:25.053

21 Deep Neural Network - Backpropogation with ReLU 2017-05-28T09:06:20.633

21 Gradients for bias terms in backpropagation 2017-07-03T17:03:24.397

21 Convolutional neural network overfitting. Dropout not helping 2017-08-22T23:52:26.863

20 Uploading images folder from my system into Google Colab 2018-03-23T18:52:28.867

20 How to add non-image features along side images as the input of CNNs 2018-05-08T12:13:16.647

20 What does the output of model.predict function from Keras mean? 2018-07-31T03:48:32.293

19 Hyperparameter search for LSTM-RNN using Keras (Python) 2016-01-17T18:26:54.320

19 How to add a new category to a deep learning model? 2016-12-10T01:43:09.343

19 What are kernel initializers and what is their significance? 2018-08-24T04:30:57.397

18 Difference of Activation Functions in Neural Networks in general 2016-10-04T11:05:24.647

18 How to combine categorical and continuous input features for neural network training 2018-03-28T08:49:04.513

18 Validation loss is not decreasing 2018-12-27T08:23:06.633

17 How to choose the features for a neural network? 2014-07-10T10:07:13.523

17 Bagging vs Dropout in Deep Neural Networks 2015-11-16T14:41:08.553

17 What is the difference between word-based and char-based text generation RNNs? 2016-08-01T22:38:16.490

17 Why ReLU is better than the other activation functions 2017-10-03T14:17:09.163

17 Is there away to change the metric used by the Early Stopping callback in Keras? 2018-01-19T15:53:48.463

17 How Do I Learn Neural Networks? 2018-12-26T06:45:31.940

16 Why do activation functions have to be monotonic? 2015-12-06T11:41:30.750

16 How can I get prediction for only one instance in Keras? 2016-08-16T14:38:24.193

16 How should the bias be initialized and regularized? 2017-03-30T04:40:43.763

16 Advantages of stacking LSTMs? 2017-08-29T16:48:40.890

16 Parameterization regression of rotation angle 2017-11-21T15:33:00.287

16 Updating the weights of the filters in a CNN 2017-12-17T21:51:57.997

15 Modelling Unevenly Spaced Time Series 2014-11-03T16:51:47.467

15 How to scale an array of signed integers to range from 0 to 1? 2015-05-24T02:11:36.640

15 Why are autoencoders for dimension reduction symmetrical? 2017-10-13T05:25:23.793

15 Multi task learning in Keras 2018-02-05T19:56:47.897

15 Can the number of epochs influence overfitting? 2018-02-07T13:34:05.250

15 Can BERT do the next-word-predict task? 2019-02-28T08:37:42.190

14 Visualizing deep neural network training 2014-12-10T10:15:00.940

14 How many images per class are sufficient for training a CNN 2016-08-03T22:03:14.930

14 Back-propagation through max pooling layers 2016-08-21T09:44:54.333

14 Why convolute if Max Pooling is just going to downsample the image anyway? 2016-09-21T07:55:38.103

14 How are deep-learning NNs different now (2016) from the ones I studied just 4 years ago (2012)? 2016-10-04T13:13:15.930

14 Is there a thumb-rule for designing neural-networks? 2018-02-12T11:29:41.277

14 How to maximize recall? 2018-03-09T15:36:05.657

14 What is the relationship between the accuracy and the loss in deep learning? 2018-12-14T09:08:14.053

14 Can a neural network compute $y = x^2$? 2019-03-22T13:02:40.397

14 Gumbel-Softmax trick vs Softmax with temperature 2019-08-29T10:30:50.857

14 How can you include information not present in an image for neural networks? 2020-02-21T09:08:22.290

13 Best Julia library for neural networks 2015-11-19T06:04:53.053

13 Do neural networks have explainability like decision trees do? 2017-05-22T10:29:21.267

13 Forget Layer in a Recurrent Neural Network (RNN) - 2017-05-25T08:51:13.597

13 So what's the catch with LSTM? 2018-02-02T15:45:12.373