Tag: categorical-data

108 Can principal component analysis be applied to datasets containing a mix of continuous and categorical variables? 2010-12-28T03:47:52.190

98 Correlations with unordered categorical variables 2014-07-15T12:18:27.247

49 Does it ever make sense to treat categorical data as continuous? 2010-07-23T06:17:10.517

39 Graph for relationship between two ordinal variables 2013-04-17T00:31:44.987

38 What is a contrast matrix (a term, pertaining to an analysis with categorical predictors)? 2013-12-02T21:19:40.847

33 Principled way of collapsing categorical variables with many categories 2015-04-17T13:31:28.447

30 Improve classification with many categorical variables 2014-04-25T17:14:28.573

29 Multinomial logistic regression vs one-vs-rest binary logistic regression 2013-03-13T14:31:41.380

28 Correlations between continuous and categorical (nominal) variables 2014-06-10T08:13:27.793

27 Warning in R - Chi-squared approximation may be incorrect 2014-01-07T12:00:20.197

23 How to visualize an enormous sparse contingency table? 2012-07-18T15:19:55.117

23 One-hot vs dummy encoding in Scikit-learn 2016-07-16T04:26:40.817

22 Doing principal component analysis or factor analysis on binary data 2011-10-01T18:39:16.607

21 Interpreting interaction terms in logit regression with categorical variables 2013-04-24T05:32:11.010

19 How can you visualize the relationship between 3 categorical variables? 2015-04-27T18:00:32.403

19 Is hour of day a categorical variable? 2016-11-14T16:54:44.177

18 What is the best way to visualize relationship between discrete and continuous variables? 2013-06-04T15:01:41.857

18 Regression with only categorical variables 2013-07-28T14:09:30.513

18 Non-transitivity of correlation: correlations between gender and brain size and between brain size and IQ, but no correlation between gender and IQ 2015-01-03T10:54:30.553

16 Is it possible to create "parallel sets" plot using R? 2011-06-17T11:14:03.430

16 Is building a multiclass classifier better than several binary ones? 2012-06-18T15:12:49.837

16 How to test the statistical significance for categorical variable in linear regression? 2012-07-05T15:15:31.523

16 What summary statistics to use with categorical or qualitative variables? 2012-07-23T07:51:37.487

16 Alternative to sieve / mosaic plots for contingency tables 2012-10-01T04:46:16.423

16 How to deal with an SVM with categorical attributes 2013-03-21T11:59:29.290

16 Significance of categorical predictor in logistic regression 2013-06-04T07:21:51.857

15 Predicting with both continuous and categorical features 2012-04-19T14:56:45.380

15 Why do we need to dummy code categorical variables 2014-09-10T23:33:22.820

13 How to summarize categorical data? 2010-08-19T00:31:44.013

13 Methods for merging / reducing categories in ordinal or nominal data? 2011-02-27T21:02:23.573

13 Qualitative variable coding in regression leads to "singularities" 2013-09-22T15:07:00.270

13 Why are mixed data a problem for euclidean-based clustering algorithms? 2014-10-29T13:02:18.520

13 "Dummy variable" versus "indicator variable" for nominal/categorical data 2014-11-26T18:09:39.437

13 How to transform categorical variable into numerical variable when using SVM or Neural Network 2015-02-25T02:37:00.897

12 Appropriate way to deal with a 3-level contingency table 2011-03-16T04:49:19.827

12 How to transform ordinal data from questionnaire into proper interval data? 2012-05-07T04:43:02.443

12 What are the different types of codings available for categorical variables (in R) and when would you use them? 2012-05-10T10:27:12.743

12 How do I study the "correlation" between a continuous variable and a categorical variable? 2012-05-30T15:22:59.033

12 Is it OK to mix categorical and continuous data for SVM (Support Vector Machines)? 2013-02-21T00:56:54.157

12 Anomaly Detection with Dummy Features (and other Discrete/Categorical Features) 2013-06-19T06:06:02.833

12 Berry inversion 2014-02-15T23:53:42.113

12 Maximum likelihood estimator of joint distribution given only marginal counts 2014-11-29T08:33:35.027

12 Train a Neural Network to distinguish between even and odd numbers 2015-07-13T12:52:16.340

12 Negative binomial distribution vs binomial distribution 2015-10-08T10:53:57.727

12 How to treat categorical predictors in LASSO 2016-04-24T05:56:54.663

12 With categorical data, can there be clusters without the variables being related? 2016-06-13T02:05:25.757

11 How to find summary statistics for all unique combinations of factors in a data.frame in R? 2010-08-16T13:23:52.747

11 Can I use multiple regression when I have mixed categorical and continuous predictors? 2011-01-18T20:04:35.030

11 Should I run separate regressions for every community, or can community simply be a controlling variable in an aggregated model? 2011-10-17T12:46:54.120

11 Regression based for example on days of week 2012-01-18T11:50:50.123

11 Random forest: how to handle new factor levels in test set? 2012-05-29T23:42:24.083

11 Is the Mundlak fixed effects procedure applicable for logistic regression with dummies? 2013-04-06T14:08:34.067

11 Why does it take R a long time to fit a model with a many-level factor? 2014-09-23T00:32:31.037

11 Interpretation of betas when there are multiple categorical variables 2014-10-14T19:04:10.240

10 Quickly evaluate (visually) correlations between ordered categorical data in R? 2010-08-25T03:16:23.030

10 How to do regression with effect coding instead of dummy coding in R? 2013-03-13T18:58:04.363

10 How to handle categorical predictors with too many levels? 2013-08-21T05:06:07.010

10 glmnet: How to make sense of multinomial parameterization? 2014-10-23T10:51:40.887

10 Can we use categorical independent variable in discriminant analysis? 2015-06-26T10:51:25.477

10 Ordinal logistic regression in Python 2015-08-21T19:39:14.830

10 Why is correlation not very useful when one of the variables is categorical? 2017-01-15T13:34:26.973

9 Multiple Chi-Squared Tests 2010-08-02T19:19:42.860

9 How to deal with non-binary categorical variables in logistic regression (SPSS) 2010-10-07T14:48:10.447

9 Dummy variable trap issues 2011-03-10T16:33:50.490

9 Best practices when treating range data as continuous 2011-12-08T11:50:52.137

9 How to implement dummy variable using n-1 variables? 2011-12-22T16:28:47.673

9 Is multicollinearity implicit in categorical variables? 2012-08-31T21:04:31.477

9 Interpreting coefficients of an interaction between categorical and continuous variable 2012-10-24T16:05:39.907

9 Multinomial-Dirichlet model with hyperprior distribution on the concentration parameters 2012-11-21T20:37:16.510

9 Penalized methods for categorical data: combining levels in a factor 2013-05-27T02:36:09.353

9 How do you plot an interaction between a factor and a continous covariate? 2014-01-14T13:07:27.697

9 Mixing continuous and binary data with linear SVM? 2014-01-21T16:42:07.537

9 How to interpret Cochran-Mantel-Haenszel test? 2014-02-28T20:07:10.633

9 Capturing seasonality in multiple regression for daily data 2014-07-22T15:56:30.547

9 Survey Method on Personal Isues 2014-09-18T00:42:01.140

9 Can glmnet logistic regression directly handle factor (categorical) variables without needing dummy variables? 2015-02-03T03:34:22.557

9 Develop a statistical test to distinguish two products 2015-03-22T19:09:41.890

9 What are dangers of calculating Pearson correlations (instead of tetrachoric ones) for binary variables in factor analysis? 2015-12-10T03:13:27.110

8 How to handle count data (categorical data), when it has been converted to a rate? 2010-08-02T04:40:22.673

8 Is it possible to directly read CSV columns as categorical data? 2010-08-09T22:25:11.207

8 Collinearity between categorical variables 2011-11-08T17:22:07.053

8 R linear regression categorical variable "hidden" value 2012-04-15T18:51:02.763

8 How to fit Bradley–Terry–Luce model in R, without complicated formula? 2012-04-24T01:29:41.913

8 How can I test the same categorical variable across two populations? 2013-03-07T16:18:13.510

8 Correlation among categories between categorical nominal variables 2013-11-06T02:00:54.243

8 Understanding dummy (manual or automated) variable creation in GLM 2014-04-16T15:27:32.753

8 Clustering data that has mixture of continuous and categorical variables 2014-05-25T11:52:46.003

8 What is this diagram called 2015-07-23T13:58:15.707

8 Ranking of categorical variables in logistic regression 2015-08-25T16:04:38.023

7 How can I use optimal scaling to scale an ordinal categorical variable? 2010-07-23T10:51:52.960

7 Yates continuity correction for 2 x 2 contingency tables 2010-11-16T01:27:37.710

7 Testing paired frequencies for independence 2010-12-05T23:43:10.043

7 Is there a version of multivariate multinomial logit? 2011-04-01T05:06:28.050

7 What to do with almost-continuous variable in regression? 2011-05-07T14:13:51.813

7 Measures of autocorrelation in categorical values of a Markov Chain? 2011-05-14T07:58:36.470

7 If a factor variable is to be dropped in model selection, should all levels be dropped simultaneously? If so, why? 2011-11-21T22:38:33.410

7 How to measure correlation between categorical variable? 2012-05-07T07:38:53.253

7 Is it acceptable to use Cronbach's alpha to assess reliability of questionnaire composed of categorical and conditional items? 2012-06-18T14:41:39.873

7 Alternatives to multinomial logistic regression 2012-08-16T12:33:05.650