52 Who created the first standard normal table? 2016-09-04T23:16:05.260

44 Best PCA algorithm for huge number of features (>10K)? 2010-09-18T02:08:24.590

43 What is a good algorithm for estimating the median of a huge read-once data set? 2010-07-20T19:21:16.220

43 Efficient online linear regression 2011-02-05T18:25:52.210

43 Measuring entropy/information/patterns of a 2D binary matrix 2011-10-17T12:39:47.917

42 Period detection of a generic time series 2010-08-04T00:32:13.360

42 Optimized implementations of the Random Forest algorithm 2011-04-26T18:39:04.007

39 What algorithm is used in linear regression? 2010-08-18T13:30:31.750

34 What are the differences between hidden Markov models and neural networks? 2011-12-31T21:03:35.660

32 Data mining: How should I go about finding the functional form? 2011-05-05T16:26:00.037

31 Approximate $e$ using Monte Carlo Simulation 2016-02-04T12:13:35.317

26 What is the difference between the forward-backward and Viterbi algorithms? 2012-07-06T03:46:48.220

22 Best bandit algorithm? 2012-02-16T23:06:00.480

21 Algorithm to dynamically monitor quantiles 2011-03-07T15:53:27.493

21 Examples of hidden Markov model problems? 2011-12-29T07:03:12.780

20 Difference between standard and spherical k-means algorithms 2013-07-07T12:57:39.273

19 How does random forest generate the random forest? 2010-07-22T08:58:36.800

19 Is it possible to accumulate a set of statistics that describes a large number of samples such that I can then produce a boxplot? 2010-10-06T21:16:06.060

19 How to define the termination condition for gradient descent? 2012-07-26T21:13:16.833

18 Pairwise Mahalanobis distances 2013-07-26T22:51:34.437

18 Why PCA of data by means of SVD of the data? 2013-12-09T11:03:49.170

17 Algorithms to compute the running median? 2010-07-19T21:32:38.523

17 Simulating time-series given power and cross spectral densities 2012-07-13T15:17:37.717

16 Online algorithm for mean absolute deviation and large data set 2010-10-07T03:26:23.003

16 Compute approximate quantiles for a stream of integers using moments? 2011-05-09T05:22:38.153

16 Speed, computational expenses of PCA, LASSO, elastic net 2015-10-19T14:52:01.160

16 Which optimization algorithm is used in glm function in R? 2015-10-24T17:20:33.620

15 What are the pros and cons of learning about a distribution algorithmically (simulations) versus mathematically? 2011-05-17T18:24:27.733

15 In what kind of real-life situations can we use a multi-armed bandit algorithm? 2016-08-18T15:22:12.460

14 What is a 'message passing method'? 2011-06-23T16:23:30.200

14 How should decision tree splits be implemented when predicting continuous variables? 2011-12-31T11:27:10.903

14 Updating SVD decomposition after adding one new row to the matrix 2015-10-15T03:56:45.827

13 Testing random variate generation algorithms 2010-07-19T19:28:34.220

13 Generating values from a multivariate Gaussian distribution 2011-07-12T22:03:46.910

13 How does extreme random forest differ from random forest? 2013-01-10T13:19:37.363

13 What are efficient algorithms to compute singular value decomposition (SVD)? 2013-07-30T16:30:47.563

13 What are some important uses of random number generation in computational statistics? 2018-01-31T15:03:30.250

12 Can someone please explain the back-propagation algorithm? 2010-07-19T19:42:57.990

12 Why do we use k-means instead of other algorithms? 2013-05-13T12:49:21.223

12 Is automated machine learning a dream? 2015-07-06T14:59:43.520

11 How can I (numerically) approximate values for a beta distribution with large alpha & beta 2010-08-24T09:48:08.093

11 Run-time analysis of common machine learning algorithms 2011-12-06T04:54:04.620

11 Mathematics base for data mining and artificial intelligence algorithms 2012-08-17T07:27:49.003

11 Why doesn't runif generate the same result every time? 2014-10-16T21:54:15.587

10 Pseudo-random number generation algorithms 2010-07-19T19:32:47.750

10 How to apply LASSO to IRLS (logistic regression)? 2010-10-12T09:01:26.290

10 How do you test an implementation of k-means? 2010-11-26T08:54:39.863

10 From an email address to a quasi-random number 2011-02-16T08:54:55.157

10 What's the forward stagewise regression algorithm? 2012-07-08T20:14:01.580

10 What is the most efficient way of training data using least memory? 2012-07-09T16:35:19.170

10 Algebra of LDA. Fisher discrimination power of a variable and Linear Discriminant Analysis 2013-01-29T15:09:14.127

10 Anomaly detection: what algorithm to use? 2014-05-16T14:09:52.883

10 Algorithm: Binary search when values are uncertain 2015-12-29T02:57:12.580

9 Space-efficient clustering 2011-04-20T12:44:27.060

9 Apriori algorithm in plain English? 2011-10-04T07:01:46.337

9 Do Random Forests exhibit prediction bias? 2012-01-22T23:58:51.627

9 Log-likelihood Parameter Estimation for Linear Gaussian Kalman Filter 2014-02-06T15:45:35.457

9 What is machine learning all about in real practice? 2014-05-21T00:38:04.463

9 Steps done in factor analysis compared to steps done in PCA 2014-06-10T19:15:07.613

9 How to sample when you don't know the distribution 2014-10-24T12:10:18.877

9 Is large scale PCA even possible? 2015-07-31T15:00:58.537

8 Forcing a set of numbers to a Gaussian bell-curve 2010-12-31T05:49:57.543

8 Defining quantiles over a weighted sample 2011-07-18T00:08:18.250

8 Difference in using normalized gradient and gradient 2012-02-10T05:35:17.150

8 Why AdaBoost with Decision Trees? 2014-11-19T04:00:01.853

8 Find close pairs in very high dimensional space with sparse vectors 2015-09-10T20:58:16.727

8 Policy and value iteration algorithm convergence conditions 2017-04-03T14:34:47.960

7 FA: Choosing Rotation matrix, based on "Simple Structure Criteria" 2010-08-18T07:36:59.733

7 Cycling in k-means algorithm 2011-04-27T11:13:57.890

7 Is there an R optimization package that can handle integer constraints and non-linear objective functions? 2011-07-14T05:18:38.310

7 How to calculate standard errors in OLS without inverting the X'X matrix? 2012-01-03T11:14:54.943

7 Calculating VC-dimension of a neural network 2012-04-06T03:15:26.353

7 How to calculate ratings/rankings from Paired comparison / Pairwise comparison of large data-sets? 2014-01-22T11:55:58.790

7 Why is VC dimension important? 2015-11-20T02:53:58.187

7 Interpolating binned data such that bin average is preserved 2016-04-26T10:04:36.057

7 How to sample a truncated multinomial distribution? 2016-06-27T21:12:34.090

7 Stopping criterion for Nelder-Mead 2017-09-25T07:36:52.810

7 Classification algorithm based on average distances from a test point to the points in each class 2017-11-22T00:36:42.677

6 What are good references for dynamic pricing? 2010-09-03T00:34:44.053

6 How to diagonalize a large sparse symmetric matrix, to get the eigenvalues and eigenvectors? 2010-11-28T04:38:02.957

6 When is a deviation statistically significant? 2010-12-26T11:41:21.370

6 Centroid matching problem 2011-04-25T09:04:15.967

6 Big-O Scaling of R's cmdscale() 2011-05-17T14:16:14.383

6 Literature on the algorithm for optimal splitting in the growing of classification trees 2011-10-27T10:35:27.447

6 Algorithms for clustering documents by similar words and phrases 2012-03-16T22:43:20.997

6 Collaborative filtering and implicit ratings; normalization? 2012-10-15T11:59:06.277

6 Can sub-optimality of various hierarchical clustering methods be assessed or ranked? 2012-10-22T09:02:48.100

6 A simpler way to calculate Exponentially Weighted Moving Average? 2012-11-28T23:53:23.533

6 Machine learning algorithm for ranking 2012-12-31T10:22:45.607

6 Lesser-known but powerful probabilistic inference algorithms 2013-08-02T07:59:50.960

6 Outlier detection in very small sets 2013-12-05T02:58:37.710

6 Machine learning classifiers big-O or complexity 2014-05-09T00:08:40.473

6 What's the algorithm for finding sequences used by TraMineR? 2014-05-14T06:29:32.320

6 Benefits of CART over ID3 algorithm 2014-05-21T17:17:56.877

6 Gradient descent based minimization algorithm that doesn't require initial guess to be near the global optimum 2014-05-26T02:14:01.523

6 Growing number of Gaussians in a mixture 2014-10-10T13:12:19.737

6 What fast algorithms exist for computing truncated SVD? 2015-06-30T15:41:24.733

6 Books for statistical computing course? 2015-10-21T03:10:32.927

6 How can I quickly detect cheating variables in large data? 2017-05-05T16:24:27.283