9 How to check whether the distributions of the training set and testing set are similar 2019-04-18T11:22:01.990

7 Which outlier detection can detect these outliers? 2017-04-19T10:17:18.920

6 Plotting different values in pandas histogram with different colors 2016-11-10T12:09:42.713

6 xgboost: Is there a way to perform regression on rates/percentages data? 2019-08-06T01:25:13.453

6 Regression: How to deal with positive skewness in continuous target variable 2019-12-26T13:22:33.397

5 How to estimate the mutual information numerically? 2016-09-23T08:39:58.347

4 Transform a skewed distribution into a Gaussian distribution 2016-09-25T21:06:24.960

4 Working with Data which is not Normal/Gaussian 2017-11-04T13:49:05.097

4 Standard Deviation for Z-scores 2018-11-16T13:55:42.847

4 How to predict whether or not a customer will renew 2019-02-07T23:23:03.363

4 How can I plot/display a dataset or an image distribution? 2019-02-17T04:41:27.497

4 Is it possible to train probabilistic model to return several distributions? 2019-08-26T17:10:17.570

3 Shifted feature distribution across different datasets 2016-09-19T12:21:42.723

3 Plotting Weibull distribution on Wind Speed 2019-06-04T18:50:38.887

3 Why do seaborn.dist and pyplot.hist generate two different looking histograms on the same data? 2019-07-30T09:53:31.517

3 evaluation metrics for multiple values per session 2019-12-30T06:34:07.003

3 Confidence value for face recognition 2020-01-17T15:45:58.270

2 Methods / Algorithms for rank scales based on cumulative scoring 2016-11-16T03:25:47.093

2 Can machine learning learn to work well on a future data distribution? 2017-04-25T04:19:37.280

2 Stratified Sampling Variable Choice 2017-11-08T01:11:35.013

2 Boxplots or violinplots? 2018-02-20T16:57:10.453

2 Can I say that my variable is "approximately" normally distributed? 2018-03-27T08:06:21.537

2 Does classification of a balanced data-set lead to any problem? 2018-03-27T12:46:00.947

2 Wasserstein distance between Gaussian and the empirical distribution 2018-05-01T19:10:04.023

2 How to visualize change in a distribution with a few outliers that account for a very large percent of the total? 2019-03-19T19:31:06.767

2 Normal distribution instead of Logistic distribution for classification 2019-03-27T07:40:48.387

2 How to create a prediction interval given that the residuals follow a specific distribution (in Python) 2019-04-03T12:08:47.030

2 How to scale or standardize data that is mostly 0 (ranges from 0-1)? 2019-06-12T18:30:02.340

2 How was this visualisation made? 2019-08-15T15:19:33.510

2 Fitting a distribution to data 2019-08-22T09:41:00.380

2 difference between normal skewed distribution and skewed distribution 2019-09-19T01:08:39.233

2 Co-joining multi-peak histograms 2019-10-13T22:39:41.370

2 Why do train, test, validation datasets need to have the same distribution? 2019-10-24T22:45:51.037

2 When to use t-distribution instead of normal distribution? 2019-11-10T17:12:30.670

2 Generating random numbers from best probability distributions? 2019-12-05T05:06:05.000

2 How to estimate the marginal distribution of a class with respect to one predictor in a classification task? 2019-12-05T17:25:30.500

2 Testing for gender composition between groups 2020-01-03T12:37:36.970

2 Synthetic time series generation according to some distribution 2020-01-16T16:52:06.557

2 Normal distribution and Random Forest 2020-10-29T13:51:23.950

2 Non-Gaussian like distributions - Classifier of source data fails on target data 2020-12-30T00:31:46.397

1 Generalization error for simple linear regression 2017-01-25T23:29:28.693

1 Does SQL Server support the Poisson distribution? 2017-03-10T17:16:15.423

1 Maximum likelihood Estimation of three-parameter log-logistic distribution in R 2017-03-17T14:00:19.510

1 Number of events estimation 2018-02-24T20:17:50.240

1 Algorithms that would benefit from variable transformations? 2018-03-07T18:12:53.917

1 On Noise Contrastive Estimation, replace noise distribution with difficult examples 2018-05-11T09:29:25.140

1 How to get maintenance interval from maintenance outcomes? 2018-08-06T06:13:49.130

1 How often do we see normally distributed data 2018-08-21T07:54:02.370

1 Calculate whether datapoints are part of a larger distribution 2018-09-01T16:30:33.167

1 Generating a set of different scenarios based on some initial observations 2018-10-12T13:54:32.587

1 How to check the similarity between two transition matrixes 2018-12-06T05:02:58.273

1 Probability Distribution of the Process Duration Time based on the Process Status Measurements 2019-02-14T11:36:39.907

1 Two-sample Kolmogorov–Smirnov (K–S) test in Python 2019-02-15T22:11:00.063

1 How to elegantly calculate probability distribution parameters for a particular random variable given some observed data? 2019-02-22T12:20:21.693

1 F-test for comparing the mean of two groups 2019-06-12T18:06:17.250

1 A2C Continuous for Pendulum-v0 working implementation, negation for loss and entropy calculation 2019-06-23T01:25:13.443

1 How to combine data having similar distribution? 2019-08-31T03:17:50.943

1 Use of PYMC distribution.dist() 2019-11-11T00:06:16.303

1 Univariate Outlier Detection 2019-12-13T18:53:55.530

1 Select the right distribution 2020-01-08T09:05:51.307

1 SQL Oracle - Excel's BETAINV function (Inverse cumulative function of a beta distribution) in SQL Oracle 2020-01-15T13:49:09.953

1 Calculate marginal probability distributions of a dataset 2020-01-19T14:13:45.583

1 How to resample one dataset to conform to the distribution of another dataset? 2020-02-06T13:24:53.370

1 Is there a cost associated with converting Koalas dataframe to Spark dataframe? 2020-02-17T10:11:15.727

1 How is the E(X) of a Poisson distribution lambda? 2020-03-08T12:49:11.260

1 Compare the variances of two categorical distributions in a repeated measure design 2020-03-27T19:12:39.660

1 Change distribution of a vector 2020-04-03T13:39:19.227

1 Professionals appear to interpret sample correlation (e.g. Karl Pearson) as if it represents linear correlation. Is it the correct interpretation? 2020-04-05T15:04:19.457

1 How to draw a sample from data set with respect to a given categorical or numerical variable based on given freely chosen distribution? (Python) 2020-05-22T08:56:15.307

1 Understand how to simulate a statistic 2020-05-25T05:03:46.730

1 A good machine learning approach for distribution of a whole? 2020-06-14T12:27:57.640

1 Can the extent of variability within a dataset be reflected through clustering? 2020-07-07T20:06:16.653

1 How to generate a random sample and distribute values based on a probability distribution? 2020-10-09T13:39:15.250

1 How to find the best fitting parametric distribution for an empirical dataset (stock returns)? 2020-11-03T09:50:52.390

1 What is a distribution-wise asymmetric measure? 2020-11-07T23:53:34.447

1 A/B Testing (Binomial Distribution vs Random Distribution) 2020-11-10T08:04:22.750

1 distribution difference between image and text 2020-11-23T19:39:48.023

1 How to bin distribution data reported with different frequencies (salaries), showing mixed linearity and non-linearity? 2021-01-17T15:11:31.810

1 Normalizing variables with logarithmic shape 2021-01-25T12:16:50.303

0 Estimate the normal distribution of the mean of a normal distribution given a set of samples? 2017-03-25T18:31:07.617

0 Detecting Different Distributions in data 2017-07-26T16:43:05.053

0 Constructing graph of crypto financial instruments 2016-2017 2018-01-28T10:59:45.270

0 Feature has a pattern in relation to class but does not enhance classifier to predict class 2018-03-17T04:03:19.480

0 Supervised learning for variable length feature-less data 2018-03-30T07:39:23.170

0 Binomial family in logistic regression 2018-07-17T12:51:27.933

0 What Does the Normalization Factor Mean in the AdaBoost Algorithm? 2018-09-09T01:06:50.417

0 Maximum likelihood estimation vs calculating distribution parameters "manually" 2018-10-04T18:45:55.833

0 KL divergence in VAE 2018-12-10T12:21:10.047

0 Clustering by Distributions of Groups of observations 2019-03-07T17:56:57.477

0 Distribution of error values in linear regression vs logistic regression 2019-04-10T20:17:19.487

0 How do I fix mis-rendered matplotlib? 2019-07-01T09:34:28.410

0 Effect of skewness in data 2019-07-02T05:51:21.173

0 Wikipedia says CUDA 8.0 is compatible with my GeForce GTX 670M, but TensorFlow raises an error: GTX 670M's Compute Capability is < 3.0 2019-08-01T20:16:17.103

0 How to generate 12 independent random weights which all add up to one 2019-08-01T21:48:54.297

0 Predicted and true values distributions comparison 2019-09-25T11:38:24.400

0 Can continuous dataset have negative values? 2019-11-14T18:49:46.823

0 The distributions of the train and test datasets are different; how to fix this? 2019-11-18T17:48:01.933

0 How does $\chi^2$ feature selection work? 2019-11-26T15:20:27.900

0 How to set and check normal distribution on a data set? 2019-12-02T08:48:46.633