Tag: r

119 Python vs R for machine learning 2014-06-12T06:04:48.243

98 How to get correlation between two categorical variables and between a categorical variable and a continuous variable? 2014-08-03T13:07:24.143

51 Is the R language suitable for Big Data 2014-05-14T11:15:40.907

51 IDE alternatives for R programming (RStudio, IntelliJ IDEA, Eclipse, Visual Studio) 2015-03-18T11:39:40.353

35 Organized processes to clean data 2014-05-14T15:25:21.700

30 Hypertuning XGBoost parameters 2015-12-13T14:19:54.510

27 Any Online R console? 2014-10-13T21:13:48.447

26 VM image for data science projects 2015-01-22T21:34:57.443

26 Is pandas now faster than data.table? 2017-10-25T02:43:49.793

25 Is Python a viable language to do statistical analysis in? 2020-06-29T03:59:04.197

22 How to predict probabilities in xgboost? 2015-09-08T03:14:09.230

22 Is there any data tidying tool for python/pandas similar to R tidyr tool? 2016-03-02T08:54:10.503

19 What do you use to generate a dashboard in R? 2014-08-04T19:21:45.067

19 removing strings after a certain character in a given text 2015-11-19T12:59:40.403

17 Recommending movies with additional features using collaborative filtering 2014-07-25T00:58:12.253

17 Visualization of multiple Markov models 2016-09-29T15:16:07.503

15 R: machine learning on GPU 2016-01-25T15:57:55.647

15 Do modern R and/or Python libraries make SQL obsolete? 2017-02-24T19:33:34.840

13 What is the difference in xgboost binary:logistic and reg:logistic 2016-01-15T11:00:41.637

12 Predicting next medical condition from past conditions in claims data 2014-07-30T11:45:08.313

12 Is a 100% model accuracy on out-of-sample data overfitting? 2018-02-08T09:13:24.217

11 Fisher Scoring vs. Coordinate Descent for MLE in R 2014-07-03T17:11:01.770

11 Data visualization for pattern analysis (language-independent, but R preferred) 2014-07-19T05:27:22.773

11 What regression to use to calculate the result of election in a multiparty system? 2014-11-29T16:05:08.810

11 Do you have to normalize data when building decision trees using R? 2015-03-04T08:05:45.003

11 How to avoid overfitting in random forest? 2015-07-07T18:05:23.903

11 Which of the 180 algorithms in R's caret package are feasible? 2016-03-16T18:47:46.380

11 GPU Accelerated Data Processing for R in Windows 2017-06-05T19:32:57.950

10 What are R's memory constraints? 2014-05-14T17:48:21.240

10 Learning ordinal regression in R? 2014-06-19T03:43:23.853

10 Libraries for (label propagation algorithms/frequent subgraph mining) for graphs in R 2014-08-27T13:01:14.643

10 LSTM or other RNN package for R 2015-08-31T20:58:15.070

10 Software Testing for Data Science in R 2016-01-01T00:30:14.717

10 Convergence in Hartigan-Wong k-means method and other algorithms 2016-01-19T20:59:28.040

10 ggvis vs. ggplot2+Shiny; which one to choose for interactive visualization? 2016-01-21T14:47:08.167

10 Visualizing items frequently purchased together 2016-10-06T21:27:28.460

10 R, keras: How to get output of a hidden layer? 2017-05-04T18:53:44.157

9 R random forest on Amazon EC2 Error: cannot allocate vector of size 5.4 Gb 2014-12-19T16:02:48.693

9 R - Interpreting neural networks plot 2015-07-08T12:05:49.663

9 Features reduction for the not correlated data set 2019-09-04T18:45:01.973

8 How does SQL Server Analysis Services compare to R? 2015-03-27T08:41:13.680

8 visualize a horizontal box plot in R 2015-06-11T15:40:45.990

8 Best way to store large data set using R from Twitter? 2015-06-18T18:23:07.763

8 How to find similarity between different factors in a dataset 2015-06-26T20:48:12.417

8 Are there any machine learning techniques to identify points on plots/images? 2015-09-06T02:04:05.053

8 Convolutional Neural Networks in R 2016-05-25T13:30:36.233

8 Classifying Email in R 2016-05-26T18:06:06.083

8 Information Gain in R 2017-01-10T02:45:38.720

8 How far can one go with Excel? 2018-09-10T22:48:12.997

7 Identifying “clusters” or “groups” in a matrix 2014-06-27T15:58:19.340

7 Linear Regression in R MapReduce (RHadoop) 2014-07-03T10:49:50.993

7 R error using package tm (text-mining) 2014-07-30T18:45:13.790

7 Forecasting Foreign Exchange with Neural Network - Lag in Prediction 2014-08-29T06:00:53.420

7 Why does logistic regression in Spark and R return different models for the same data? 2015-05-07T13:23:47.440

7 Identifying templates with parameters in text fragments 2015-09-20T12:40:10.170

7 Image clustering by similarity measurement (CW-SSIM) 2016-01-10T19:44:59.887

7 Time series data: How do I measure the influence of new product sales on existing product sales (statistically)? 2016-01-22T05:10:26.040

7 Time Series Machine Learning Feature Selection Problem 2016-08-05T00:41:17.617

7 Reproducing randomForest Proximity Matrix from R package in Python 2017-03-16T09:31:07.317

7 Recommender system based on purchase history, not ratings 2017-06-07T07:39:26.060

7 R's mice imputation alternative in Python 2017-06-19T18:57:28.623

7 AdaBoost implementation and tuning for high dimensional feature space in R 2017-08-04T14:25:26.683

7 How do we obfuscate or de-identify data to make it anonymous and share it publicly? 2017-08-05T14:40:09.643

7 Layman's Interpretation of XGBoost Importance 2018-01-07T19:59:02.353

7 Exploratory Data Analysis with Image Dataset 2018-03-18T17:27:08.967

6 Kappa near 60% in unbalanced (1:10) data set 2014-09-12T16:26:15.827

6 Is the GA R package the best Genetic Algorithm package? 2015-02-03T10:14:53.543

6 Python or R for implementing machine learning algorithms for fraud detection 2015-02-20T14:09:23.483

6 Gerrymandering - Geospatial optimization to maximize votes in R 2015-05-21T15:25:35.377

6 Is it advisable to rerun LASSO multiple (2) times? 2015-12-16T21:20:08.390

6 Classifying survey response text SVM 2016-01-21T16:08:59.477

6 Estimating the battery capacity using current power consumption and battery percentage 2016-01-27T14:02:57.430

6 How to find a confidence level given the z value 2016-02-05T13:52:08.660

6 Represent time-series data in a much more compact form 2016-02-08T06:03:16.453

6 Dummy coding a column in R with multiple levels 2016-05-02T11:27:55.597

6 Logistic regression on biased data 2016-06-16T12:54:34.747

6 Gibbs sampling in R 2016-06-20T13:51:41.460

6 options available for visualizing a matrix type data frame in R 2017-03-05T14:18:29.963

6 Is there an R implementation of isolation forest for anomaly detection? 2017-03-31T06:36:54.547

6 Naming conventions for dataframes 2017-05-15T23:33:44.637

6 Anomaly detection in time series 2017-08-30T10:03:49.240

6 A clear visualization of a two-way ANOVA 2017-10-02T17:34:08.457

6 Julia programming language 2018-09-15T21:54:03.183

6 Reticulate vs Python 2019-08-07T01:47:28.503

5 Running an R script programmatically 2014-05-14T14:26:54.313

5 Stochastic gradient descent in logistic regression 2014-07-07T11:43:48.430

5 Determine highly correlated segments 2015-01-14T15:42:48.013

5 Best or recommended R package for logit and probit regression 2015-05-13T01:27:32.653

5 How to subset rows from a data frame with comparison operators in R 2015-05-16T17:37:53.163

5 Multivariate - Time series data pattern changes 2015-06-10T04:57:02.477

5 Finding aggregated information of data 2015-06-23T13:54:03.990

5 Testing bimodality of data 2015-07-31T17:20:10.873

5 How to count observations per ID in R? 2015-08-12T14:01:23.257

5 Categorical Clustering of Users' Reading Habits 2016-03-23T19:04:23.127

5 Error with negative bins while melting dataframe 2016-03-31T17:06:02.793

5 k-means in R, usage of nstart parameter? 2016-04-28T15:44:44.497

5 Classification technique for unsupervised data? 2016-06-14T07:46:25.090

5 Skills that school doesn't teach you 2016-08-17T19:08:17.143

5 ICD-10 codes in Machine Learning 2017-06-07T20:41:44.300