Tag: preprocessing

27 How to prepare/augment images for neural network? 2015-02-24T11:59:36.033

6 Dealing with training set of questionable quality 2015-11-16T10:57:53.013

6 How to approach the numer.ai competition with anonymous scaled numerical predictors? 2016-06-29T16:11:34.107

6 Extracting individual emails from an email thread 2017-06-01T13:02:23.683

5 Do you apply outlier detection of numerical data in practical applications? 2016-07-04T15:19:21.973

5 Nested features with one to many relationships 2017-05-24T19:46:03.913

4 How to preprocess different kinds of data (continuous, discrete, categorical) before Decision Tree learning 2015-08-07T10:43:50.747

4 How to choose best classifier for Low positive to negative class ratio in data (training, validation and real time)? 2016-02-26T13:54:54.900

4 How would one separate digits for number recognition? 2016-06-15T15:44:45.850

4 Different Test Set and Training Set Distribution 2018-02-26T20:29:22.630

3 How to define a distance measure between two IP addresses? 2015-11-09T09:40:19.857

3 Is it common to preprocess image data before sending it through a deep net? 2015-12-17T01:42:57.980

3 Machine Learning or Survival Analysis? 2016-07-20T21:08:35.813

3 How to use a dataset where attribute names are changed? 2017-06-11T02:39:27.017

3 Should I standardize first or generate polynomials first? 2017-07-18T15:45:06.017

3 How to get spike values from a value sequence? 2018-01-25T11:09:14.883

2 Denoising Autoencoders with Variable Length Input 2015-08-21T14:26:10.297

2 How should clickstream data be prepared before user segmentation can be performed? 2015-10-04T14:57:05.073

2 Sampling for multi categorical variable 2015-10-11T20:58:19.907

2 How to visualize data of a multidimensional dataset (TIMIT) 2015-10-24T13:24:09.340

2 Does it make sense to apply feature scaling on timestamp 2015-11-05T15:02:35.457

2 How can I preprocess multi-page image inputs in a theano/lasagne network? 2015-12-11T16:59:33.750

2 User activity representation for Prediction/ML 2016-02-07T22:27:42.207

2 What are some methods for pre-processing data in OCR? 2016-09-28T05:57:38.303

2 How to implement global contrast normalization in python? 2016-11-14T17:35:28.093

2 Real time noise removal using Savitzky-Golay Method 2017-02-13T05:12:42.300

2 Transformation of Dependent and Independent Variables 2017-03-12T21:29:31.383

2 What pre processing should I use on data to feed into a CNN? 2017-08-06T07:52:00.267

2 How to preprocess Acoustic Data 2017-08-31T07:59:02.007

2 Keras loading images in incorrect format 2017-09-13T16:08:13.013

2 How Box cox and other transformations convert data into Normal Distributions? 2017-10-25T12:23:52.413

2 Dealing with a dataset where a subset of points live in a higher dimensional space 2017-12-04T06:30:36.457

2 One hot encoding of target space 2018-01-12T19:04:18.553

2 Preprocess list data 2018-02-06T19:58:15.913

2 Why is input preprocessing in VGG16 in Keras not 1/255.0 2018-02-24T04:14:29.187

1 Preprocessing in Data mining? 2015-08-26T17:37:49.187

1 What is a benchmark model? 2015-11-10T18:16:17.677

1 Data preprocessing : Aggregation, feature creation, or else? 2015-12-10T00:53:03.687

1 What metrics must I use in my data (unstructured) preprocessing research? 2016-02-20T10:11:17.397

1 Apply function on every four rows 2016-04-03T15:15:55.967

1 How to best represent rate or proportion as a feature? 2016-05-23T16:51:53.530

1 Redundancy - is it a big problem? 2016-06-05T10:02:27.790

1 Should we convert independent continuous variables (features) to categorical variables before using decision tree like classifier? 2016-07-10T22:01:32.933

1 How do I convert a series of timestamps in seconds to milliseconds in order to distribute them smoothly 2016-08-10T08:31:02.127

1 Binary classification: best ways to pre-process the data 2016-09-19T09:00:04.510

1 Preprocessing to-be-predicted data in ML with R - "learn" and "apply" features 2016-10-12T14:07:28.050

1 What are recommended ways/tools for processing large data from Excel Files? 2016-12-31T01:29:25.177

1 Convert exponential to normal distribution 2017-05-12T22:40:59.083

1 how is countvectorizer used in real production environment? 2017-07-12T01:37:41.657

1 How to use the same scale with new data? - scikit learn 2017-10-12T20:00:18.570

1 Chunker/shallow parser for spoken language 2017-10-12T20:57:37.467

1 Are annotated audio datasets augmented with mutated versions the way image datasets are? 2017-10-19T17:00:50.637

1 Is my normalization off? 2017-10-24T12:33:54.673

1 How to Replace an object in Pandas array using replace with dictionary from excel file? 2017-12-27T03:09:02.637

1 Should I apply PCA on the entire dataset or just the nominal values? 2018-01-06T20:08:22.303

1 Data preprocessing: Should we normalise images pixel-wise? 2018-01-21T11:56:56.290

1 Exploratory Data Analysis and selecting good predictor variables ? 2018-02-27T17:01:52.420

0 Problem in constructing co-occurrence matrix 2016-02-28T07:05:01.517

0 What does normalizing and mean centering data do? 2016-07-15T17:36:20.600

0 Clustering bitcoin addresses with k-means - how would one prepare input 2016-08-18T20:35:24.683

0 Choice of replacing missing values based on the data distribution 2016-11-22T20:07:43.093

0 Outliers Approach 2016-12-09T13:53:54.977

0 Data preprocessing, relative scale problems in features of same type 2017-02-11T10:05:52.190

0 Is pre-processing always necessary? 2017-06-08T11:15:58.780

0 Tool for analyzing a Python matrix and generating a report on the contents (column types, NaN counts, means, etc.) 2017-06-09T14:06:43.857

0 Is it advisable to remove similar attributes in data before clustering (without doing a PCA)? 2017-06-10T10:24:00.393

0 Document (scanned image) Classification pre-processing: binary or color? 2017-06-12T15:00:16.947

0 Preprocessing;Discretization;Data mining 2017-06-24T15:42:51.197

0 Phonetic Matching Algorithms 2017-09-18T16:49:10.757

0 How to perform efficient binning for continuous features 2017-09-29T17:02:20.040

0 Improving Accuracy in Neural Network Multi class classification 2017-10-14T09:40:41.747

0 issue with oneHotEncoding 2017-10-18T19:40:56.623

0 Normalizing time data 2017-10-24T13:37:18.193

0 Pre-process data images before training OneClassSVM and decrease number of features 2017-11-04T20:37:03.373

0 pre-process data for document classification when the words are short-cut in R? 2017-11-29T23:08:02.720

0 Need a Work-around for OneHotEncoder Issue in SKLearn Preprocessing 2017-11-30T18:28:09.573

0 Better input for Doc2Vec 2017-12-04T12:56:11.733

0 What is the best way to normalize histogram vectors to get distribution? 2017-12-04T18:06:33.343

0 Converting nominal data to numeric - is using dictionaries the right approach? 2018-01-05T21:02:46.697

0 Should I Impute target values? 2018-01-12T21:08:05.720

0 Preparing data for a live prediction engine 2018-01-17T13:00:03.880

0 Comparing coordinates likeliness 2018-02-21T16:47:39.687

0 Binning which variables? 2018-02-28T23:57:25.473

0 preprocessing data: mixing categorical, numerical and ordinal data? 2018-03-01T13:50:24.987

0 Data binning - Why we need to transform Categorical Variables? 2018-03-04T18:28:12.100

0 How to treat the datasets with unreliable labels 2018-03-06T01:26:21.337

0 Why is it necessary to clean/preprocess data if the unseen real data is bound to contain missing values, outliers etc.? 2018-03-10T17:14:12.530

-1 Image Classification, Convolution Network and Gamma Correction for images 2017-04-13T11:23:12.667

-1 Creating a dataset for benchmarking of timeseries preprocessing capabilities 2017-05-31T08:38:30.790

-1 How to train Matlab on a range of IP addresses? 2017-06-03T17:08:58.647

-1 Do I need equal number of bins for all attributes? 2017-06-23T19:54:18.457

-1 Best way to tokenize tweet 2017-08-16T09:18:06.850

-1 Tableau: How to change multiple field names 2017-09-18T14:35:13.653

-1 What is the best way to handle missing values 2017-10-08T10:15:40.980

-1 How to preprocess data? 2017-10-28T13:45:50.603

-1 Dataset processing question 2017-10-30T13:29:38.933