22 What is the time complexity for training a neural network using back-propagation? 2018-03-18T11:26:55.320

18 Are these two versions of back-propagation equivalent? 2016-09-04T09:45:02.357

9 How do evolutionary algorithms have advantages over the conventional backpropagation methods? 2016-08-11T09:39:32.893

9 Is the mean-squared error always convex in the context of neural networks? 2017-08-22T14:26:51.633

8 What is "backprop"? 2016-08-02T15:39:14.947

8 What are the learning limitations of neural networks trained with backpropagation? 2016-08-03T18:05:24.997

7 How do I know if my backpropagation is implemented correctly? 2017-09-03T12:22:57.440

6 How is the gradient calculated for the middle layer's weights? 2018-03-08T16:22:55.040

6 CNN backpropagation with stride>1 2018-03-22T10:48:09.347

6 Why not teach to a NN not only what is true, but also what is not true? 2018-07-13T09:47:49.763

6 Why do very deep non resnet architectures perform worse compared to shallower ones for the same iteration? Shouldn't they just train slower? 2019-09-28T18:25:39.067

5 How to avoid falling into the "local minima" trap? 2016-08-05T10:39:31.520

5 What kind of algorithm is the Levenbergâ€“Marquardt algorithm? 2016-12-21T09:53:06.757

5 What would an implementation of this Neural Network look like? 2017-09-04T20:05:21.753

5 What makes learned feature detectors specialize in CNN? 2017-10-01T20:18:43.517

5 How to design 4D Deep Recurrent Neural Networks using Tensorflow? 2017-10-31T00:44:22.317

5 How to train a CNN 2018-08-12T14:43:10.920

5 What is the actual learning algorithm: back-propagation or gradient descent? 2018-11-13T23:49:24.553

5 Are on-line backpropagation iterations perpendicular to the constraint? 2019-03-23T16:03:49.737

5 Is the gradient at a layer independent of the activations of the previous layers? 2019-10-11T05:27:08.410

4 How to test if my implementation of back propagation neural Network is correct 2017-01-26T10:51:55.857

4 Are Dreams a Form of Backpropagation? 2017-03-18T04:32:47.590

4 Can a neural network learn to avoid wrong decisions using backpropagation? 2017-08-21T16:53:00.563

4 Finding an optimum back propagation algorithm 2017-10-06T09:16:12.273

4 How to combine backpropagation in neural nets and reinforcement learning? 2017-12-04T23:12:58.733

4 What are some concrete steps to deal with the vanishing gradient problem? 2018-01-25T18:31:10.407

4 How to deal with back-propagation when dealing with invalid moves in Reinforced Learning? 2018-03-01T13:26:29.477

4 A few doubts on back propagation 2018-03-09T19:41:52.757

4 Do we know what the units of neural networks will do before we train them? 2018-07-05T05:25:27.123

4 Backpropagation With Medium-sized Neural Networks 2018-08-11T23:41:14.547

4 Am I able to visualize the differentiation in backprop as follows? 2018-11-26T19:41:12.317

4 How are filters weights updated for a CNN? 2019-05-20T01:40:49.383

4 How can I use one neural network for both players in Alpha Zero (Connect 4)? 2019-07-24T19:25:13.597

4 Backpropagation equation for a variant on the usual Linear Neuron architecture 2019-08-04T00:28:40.533

4 Which linear algebra book should I read to understand vectorized operations? 2019-10-28T15:52:31.180

4 How can a DQN backpropagate its loss? 2020-01-14T17:51:38.797

4 What is the purpose of the batch size in neural networks? 2020-03-01T21:42:15.223

4 How do weights changes handles during back-propagation when there are unknown labels 2020-03-27T21:56:43.117

4 How to perform back propagation with different sized layers? 2020-04-06T20:20:41.007

4 What should I do with the flatten layer during back-propagation? 2020-07-20T09:53:50.560

3 Does training happen during NEAT? 2018-03-08T23:48:42.083

3 How to calculate gradient of filter in convolution network 2018-04-13T05:50:47.720

3 Why coupling coefficients in capsule neural networks can't be learned by back-propagation? 2018-08-07T18:07:35.443

3 How do I calculate the gradient of the hinge loss function? 2018-10-06T11:49:18.957

3 How to perform neural network with output constraint? 2018-10-19T11:55:43.517

3 How does backpropagation with unbounded activation functions such as ReLU work? 2019-02-16T00:21:40.397

3 Could error surface shape be useful to detect which local minima is better for generalization? 2019-03-01T20:46:51.720

3 How does adding a small change to an neuron's weighted input affect the overall cost? 2019-06-04T13:42:58.250

3 What weights should I use while back-propagating? 2019-07-29T12:13:34.150

3 How is REINFORCE used instead of Backpropagation? 2019-08-04T22:33:07.360

3 Structure discrepancy of an LSTM? 2019-09-08T03:20:14.860

3 What kind of data structures are needed to efficiently do back-propagation in a feedforward neural network? 2019-11-10T02:54:14.940

3 How does the neural-network know how to tweak weights for a specific neuron? 2019-11-22T23:15:14.340

3 Regression using neural network 2019-12-11T19:12:22.267

3 What's the function that SGD takes to calculate the gradient? 2020-01-14T22:02:19.673

3 How can I formulate a nonogram problem as a constraint satisfaction problem? 2020-03-27T19:39:40.970

3 How can I use a Hidden Markov Model to recognize images? 2020-04-05T17:00:55.370

3 How do I calculate the partial derivative with respect to $x$? 2020-05-16T12:48:21.730

3 How does backpropagation work in LSTMs? 2020-05-23T06:00:48.420

2 Backpropagation in Decoupled Neural Interfaces 2016-12-30T01:42:39.703

2 Recommendations on which architecture to use to guess appointment 2017-10-03T16:03:04.327

2 Data prepared to linear regression. Can I use it with backpropagation? 2018-02-17T16:39:27.940

2 Hand computing feed forward and back propagation of neural network 2018-03-12T01:49:01.093

2 How to determine the size of biases? 2018-03-20T17:21:26.060

2 What is the best XOR neural network configuration out there in terms of low error? 2018-04-25T12:31:48.297

2 How do I implement softmax forward propagation and backpropagation to replace sigmoid in a neural network? 2018-05-10T18:43:12.950

2 How to change the backward pass for an LSTM layer that outputs to another LSTM layer? 2018-07-09T14:10:33.937

2 Should the weights of a neural network be updated after each example or at the end of the batch? 2018-10-19T19:53:17.767

2 Using features extracted from a CNN as convolutional filter 2018-10-29T15:02:46.970

2 Update of weights in Recurrent Neural Network through back propagation 2018-11-11T11:04:13.450

2 Use of backpropagation for weight updates in a combination of 2 neural networks 2018-11-12T05:21:00.323

2 What is the use of the $\epsilon$ term in this back-propagation equation? 2018-12-03T18:50:01.203

2 Which neuron represents which part of the input? 2019-02-05T14:23:05.253

2 What is the difference between backpropagation and predictive coding? 2019-03-11T16:02:04.947

2 Is back propagation applied for each data point or for a batch of data points? 2019-04-05T08:34:13.667

2 How is a neural network where the majority of inputs are 0 trained? 2019-07-22T03:23:57.923

2 How does a Bidirectional RNN work? 2019-10-10T07:34:57.290

2 When and how to use a mix of loss functions for back-propagation? 2019-10-15T11:34:25.603

2 How is gradient being calculated in Andrej Karpathy's pong code? 2019-11-07T14:48:22.037

2 Calculation of Neural network biases in backpropagation 2019-11-20T19:29:52.787

2 yolo output and how to define labels for backpropogation on it 2019-11-21T15:24:55.533

2 How can I train a neural network for another input set, without losing the learning of the previous input set? 2019-11-27T11:16:37.257

2 What would be the implications of mistakenly adding bias after the activation function? 2019-12-10T16:13:51.140

2 What is the neuron-level math behind backpropagation for a neural network? 2020-01-06T21:08:59.913

2 How can I implement derivative of softmax function for matrices in Python? 2020-01-25T09:30:22.750

2 How is the gradient with respect to weights derived in batch normalization? 2020-02-27T10:44:16.340

2 Why gradients are so small in deep learning? 2020-02-29T08:43:50.950

2 How to determine the target value when using ReLU as activation function? 2020-03-10T17:16:24.273

2 Why is my derivation of the back-propagation equations inconsistent with Andrew Ng's slides from Coursera? 2020-03-18T21:17:09.720

2 Different methods of calculating gradients of cost function(loss function) 2020-03-31T11:59:46.407

2 How can I train a neural network if I don't have enough data? 2020-04-03T15:01:23.627

2 Why is it called back-propagation? 2020-04-06T14:50:38.077

2 What is symbol-to-number differentiation? 2020-04-07T01:29:39.640

2 Why do we update all layers simultaneously while training a neural network? 2020-04-16T06:37:29.933

2 Does net with ReLU not learn when output < 0? 2020-04-27T15:17:48.300

2 Should I compute the gradients with respect to the flatten layer in a convolutional neural network? 2020-04-30T18:18:59.843

2 Would a different learning rate for every neuron and layer mitigate or solve the vanishing gradient problem? 2020-08-06T08:30:44.903

2 Doing backpropagation in an Tensorflow.js Neural Network 2020-08-14T06:59:09.167

1 What is the order of execution of steps in back propagation algorithm in a neural network? 2017-06-15T14:06:25.873