Tag: data

7 How to handle the CEO expectations from a company that's new to data science? 2016-07-21T06:41:18.743

6 How much data are sufficient to train my machine learning model? 2017-06-26T21:26:04.680

6 How to perform Logistic Regression with a large number of features? 2017-07-28T09:32:13.880

5 General way to reduce features 2016-02-24T06:07:13.507

5 IID violation in machine learning 2016-03-07T23:03:06.100

5 Do modern R and/or Python libraries make SQL obsolete? 2017-02-24T19:33:34.840

5 How to generate bulk graphics using R 2017-02-25T13:40:25.847

5 Missing Values in Data 2017-08-31T10:08:51.103

5 Decision Tree used for Calculating Precision, Accuracy, and Recall, class breakdown question 2018-01-28T05:12:10.113

4 Finding aggregated information of data 2015-06-23T13:54:03.990

4 How to preprocess different kinds of data (continuous, discrete, categorical) before Decision Tree learning 2015-08-07T10:43:50.747

4 Recommendations for storing time series data 2015-08-20T22:21:12.060

4 How to deal with analyzing optional survey data 2016-01-04T04:49:51.893

4 What are the methods to ensure that the population split for A/B test is random? 2016-02-26T13:27:48.553

4 Why aren't languages like C, C++ used for data analytics instead of R, Python? 2016-04-07T18:41:02.600

4 Algorithm or formula to measure happiness? 2016-04-27T19:21:19.720

4 What is the most used format to save data with type information 2016-08-25T10:57:42.487

4 How to generate training data for OCR 2016-11-28T15:29:51.543

3 What kind of research can be done with genomic data? 2015-06-22T02:13:40.850

3 python - Will this data mining approach work? Is it a good idea? 2015-07-01T15:52:15.467

3 Advise on making predictions given collection of dimensions and corresponding probabilities 2015-08-11T19:28:29.793

3 Domain-specific data science programs 2015-09-25T06:40:34.843

3 Establishing data science programs as an independent discipline 2015-10-01T18:57:42.820

3 Tool to Generate 2D Data via Mouse Clicking 2015-10-27T17:16:08.807

3 How to create US state heatmap 2016-01-04T18:35:57.540

3 Mathematics major for data science 2016-01-07T19:11:01.153

3 About data cleansing, to what extent should we do our work? 2016-04-29T04:19:50.950

3 Skills that school doesn't teach you 2016-08-17T19:08:17.143

3 The best way to calculate variations between 2 datasets? 2016-12-07T16:50:08.260

3 How is a single element of the training set called? 2017-01-19T16:28:21.157

3 Which is better for Data Science, a double major in Math & CS or Physics & CS? 2017-03-01T19:31:22.463

3 What are some of the best practices for sharing data and models with colleagues? 2017-03-17T18:45:16.867

3 What can i do after a PCA with the results? 2017-05-05T16:58:43.873

3 Can I scrape data from government websites if there is no mention about commercial usage? 2017-12-12T19:40:07.250

3 What is the appropriate name for this dataset? 2018-01-30T13:30:44.470

3 Neural Network for Multiple Float Output 2018-02-13T20:12:58.267

2 Spatial clustering based on response to inputs and building a reduced model 2015-07-05T01:07:42.470

2 How to store complex tables and structures? 2015-07-21T15:52:30.010

2 What proxies could be used to assess economic value of Stackoverflow for its users? 2015-08-02T09:22:41.890

2 Weighted k nearest neighbor search 2015-08-13T13:52:11.463

2 Do I have to standardize my new polynomial features? 2015-11-25T11:11:25.923

2 Central Probability Interval 2015-12-11T23:01:37.850

2 Data structure design for supporting arbitrary number of columns in table or database 2016-02-01T17:36:16.720

2 PCA on acceleration time series data 2016-02-07T15:34:18.730

2 Tag categorizer 2016-02-11T19:21:28.653

2 Data Scientists | Sectors , Departments and Profiles 2016-02-11T22:21:42.730

2 Free/open interactive softwares/plugins for end-users' high-dimensional data visualization 2016-03-17T05:49:24.920

2 Core components of data literacy for working professionals 2016-04-01T01:09:04.640

2 Retail Store Testing 2016-04-26T19:12:58.750

2 Finding outliers in multiple dimensions 2016-04-30T06:11:50.040

2 How can I find company descriptions for a long list of companies? 2016-05-09T07:50:09.217

2 Data sources problem 2016-06-22T13:46:43.247

2 Difficulties of getting raw data 2016-08-05T03:01:04.613

2 How to quantitatively compare two or more complex data sets 2016-09-06T15:24:00.467

2 How many people can use a single Hadoop cluster at one time? 2016-11-13T20:25:56.503

2 What is "noise" in observed data? 2016-12-25T06:28:38.973

2 Interpreting Decision Tree in context of feature importances 2017-02-02T00:29:32.877

2 Generating data that look alike my original data 2017-02-09T15:49:58.403

2 Dataset: language audio clips and country labels 2017-03-03T23:31:52.123

2 What is preferred upsampling vs. zero padding? 2017-03-16T22:37:36.680

2 How clustering is used in data management? 2017-05-02T13:48:48.113

2 What are the most suitable machine learning algorithms according to type of data? 2017-06-23T02:09:35.357

2 Histogram alternatives for two sets of data combined 2017-10-09T08:57:20.657

2 How is a splitting point chosen for continuous variables in decision trees? 2017-11-03T21:45:09.203

2 Count the number of false positives with respect to the first class 2017-11-07T22:40:29.373

2 How to deal with large data sets 2017-11-21T17:11:49.800

2 When visualizing data that has <1 or <5 ppm how do you display this? 2017-12-05T13:35:11.837

2 Columns with no (or nearly no) differences between rows worth keeping? 2017-12-17T12:56:58.843

2 Small data set in machine learning 2017-12-30T14:59:26.090

2 Purpose of weights in neural networks 2018-03-07T10:11:34.480

1 Data frame mutation in R 2015-08-20T09:49:51.230

1 Data Science education curriculum design and guidelines in Computer Science and other Disciplines 2015-09-07T15:41:26.427

1 How to generate bootstrapping samples in R? 2015-09-22T20:24:47.313

1 Generate a set of abstract search terms 2015-09-25T17:42:53.690

1 Deploying machine learning modules 2015-09-30T13:53:34.730

1 What kind of research I can do with my data set? 2015-11-23T05:47:01.023

1 How to select a bunch of optimized data from a larger data set? 2015-11-23T16:49:32.403

1 Creating queries on the fly and general manipulation for dataset of half a million data records 2016-03-24T18:26:46.147

1 First steps when analyzing a company's data 2016-06-16T02:06:04.567

1 Which one is better performer on wrangling big data, R or Python? 2016-07-26T21:09:57.130

1 Merging large CSV files in Pandas 2016-07-28T15:15:45.510

1 Multi-touch Attribution Model 2016-08-06T19:02:05.787

1 Target variable problem :- Classifier 2016-09-02T11:15:19.617

1 Dealing with outliers and z-scores 2016-09-23T23:27:29.433

1 Directly proportional Trend in training and cross validation curves 2016-09-26T10:10:34.397

1 Postitive event but unsure when it occurred in time 2016-09-30T04:07:20.720

1 Econometrics Thesis methodology Suggestions? 2016-10-06T13:26:57.993

1 What are recommended ways\tools for processing large data from Excel Files? 2016-12-31T01:29:25.177

1 Instead of one-hot encoding, can I store the same information in one column using a single value? 2017-01-04T23:12:54.290

1 What is the difference between data-driven methods and machine learning? 2017-01-27T17:45:17.133

1 Uploading huge dataset 2017-02-18T15:52:10.020

1 Cook's distance, altering diagnostic plot in R,? 2017-03-07T17:59:21.197

1 Which graph will be appropriate for the visualization task? 2017-03-08T18:53:45.300

1 R Code randomizing selection 2017-04-03T20:44:58.267

1 How to choose an appropriate Machine Learning algorithm? 2017-05-26T04:47:19.253

1 System to provide guide to students about getting admissions to universities of their choice or some specific courses 2017-06-17T21:36:01.840

1 Best Programming Language for Data Science 2017-06-21T21:26:09.167

1 E-Learning related terminology 2017-06-27T02:14:16.700

1 Find irregular bounding shape of 3D particle distribution 2017-07-08T09:49:53.300