7 Understanding notation of Goodfellow's GAN objective function 2019-04-12T20:53:48.807

5 Understanding the equation of TD(0) in the paper "Learning to predict by the methods of temporal differences" 2019-06-01T14:41:50.517

4 How is the policy gradient calculated in REINFORCE? 2019-04-21T19:23:33.580

4 What is the meaning of the square brackets in ant colony optimization? 2019-11-01T12:59:24.073

4 What do the subscripts mean in $N_{t,n,\sigma,L}$? 2019-11-13T08:03:14.713

4 Why are the value functions sometimes written with capital letters and other times with lower-case letters? 2020-06-10T02:46:22.760

4 What does the term $|\mathcal{A}(s)|$ mean in the $\epsilon$-greedy policy? 2020-07-14T20:11:35.197

3 What does the argmax of the expectation of the log likelihood mean? 2018-01-28T11:15:09.723

3 Being confused of distribution notations in Deep Learning book 2019-05-25T12:02:33.703

3 Sutton & Barto's notation $V_{t+n}$ in Chapter 7: $n$-step Bootstrapping 2019-11-07T01:02:53.640

3 What does the notation sup dist mean in distributional RL? 2020-01-06T18:56:44.160

3 What is the difference between the notations $\|x\|_1, \|x\|_2$ and $|x|$? 2020-01-26T08:13:10.457

2 What does the notation $\nabla_\theta \mathcal{L}$ mean? 2018-07-06T21:26:06.797

2 Understanding the notation in the definition of the expected reward 2018-10-18T10:38:39.577

2 What is the use of the $\epsilon$ term in this back-propagation equation? 2018-12-03T18:50:01.203

2 What does the formula $1-\sum_i(e_i-a_i)^2$ mean in this NEAT Python API? 2019-04-19T04:37:06.437

2 Understanding the equation of the empirical error 2019-10-11T02:47:47.147

2 What does the notation ${s'\sim T(s,a,\cdot)}$ mean? 2020-03-29T15:33:37.707

2 What does the notation $\mathcal{N}(z; \mu, \sigma)$ stand for in statistics? 2020-08-23T17:49:31.347

1 What is $I$ in the noise described in the paper "Parameter Space Noise for Exploration"? 2017-10-19T10:47:53.943

1 Why is exp used in encoder of VAE instead of using the value of standard deviation alone? 2020-02-06T06:53:36.177

1 What does equation in the "related work" section of the GAN paper mean? 2020-04-04T10:19:30.697

1 What does the notation $\partial \theta_{\pi}$ mean in this actor-critic update rule? 2020-04-24T15:20:41.283

1 What do the notations $\sim$ and $\Delta (A) $ mean in the paper "Fairness Through Awareness"? 2020-06-19T22:19:03.553

0 How to understand the average l2 loss? 2020-01-27T11:37:17.533

0 What is the purpose of the arrow $\leftarrow$ in this formula? 2020-02-16T17:40:08.207

0 What does the notation "for t=T to 1,−1 do" in terms of time steps, in deep recurrent q network? 2020-07-03T12:17:31.510