Tag: data-preprocessing

5 What is "conditioning" on a feature? 2019-11-14T11:57:21.740

4 Would this relatively small dataset be enough to train a CNN? 2017-08-31T05:32:04.277

4 How should I deal with variable input sizes for a neural network classifier? 2020-03-13T20:25:28.780

4 How should I deal with variable-length inputs for neural networks? 2020-03-28T07:59:44.080

3 Why do we normalize data in a deep neural network? 2019-07-17T20:22:22.167

3 How to fill missing values in a dataset where some properties can be inputs and outputs? 2020-05-14T20:56:39.477

2 Preprocessing of training dataset for machine learning 2017-06-06T16:38:48.323

2 How should I encode the input which are 5 cards from a deck of 52 cards? 2018-06-28T16:25:57.687

2 Which of these two numerical methods for z-score normalisation is preferable, in multivariate linear regression? 2018-11-21T05:00:25.957

2 How to define the "Pre-Processing" in machine learning? 2019-12-15T15:53:46.303

2 Suggestion for finding the stable regions in spiral galaxy data? 2020-01-13T19:53:54.220

2 Language Learning feedback with AI 2020-02-22T21:31:55.750

2 Can the addition of low-quality images to the training dataset increase the network performance? 2020-03-17T22:29:19.470

2 What is the difference between training a model with RGB images and using only the color channels separately? 2020-03-31T11:00:37.823

2 Do I have to downsample the input and upsample the output of the neural network when implementing the NICE algorithm? 2020-04-01T06:55:19.237

2 Do I need to denormalise results in linear regression? 2020-04-04T12:56:04.047

2 How should I deal with multi-dimensional tensors for nodes in a graph convolution network? 2020-04-07T08:24:26.160

2 How to feed key-value features (aggregated data) to LSTM? 2020-05-17T23:00:52.337

2 How does sampling works in case of imbalanced image datasets? 2020-05-18T21:18:18.600

2 How to take the optimal batch_size for training a model? 2020-08-19T06:38:39.940

2 When to convert data to word embeddings in NLP 2020-08-21T05:37:41.317

1 When should I use feature learning as opposed to feature engineering? 2018-11-17T06:03:54.540

1 What is the impact of scaling the features on the performance of the model? 2019-01-15T04:16:38.597

1 Is there an LSTM-based unsupervised learning algorithm to label a dataset of curves? 2019-04-24T14:53:35.293

1 Are there tools to help clean a large dataset so that it only contains faces? 2019-05-20T09:00:52.617

1 Do I need to use a pre-processed dataset to classify comments? 2019-06-12T12:04:21.097

1 How to rescale data to its original range after MinMaxScaler? 2019-07-18T17:10:25.233

1 How can I merge two datasets? 2019-11-15T11:47:27.703

1 Focal loss for imbalanced multi class classification in Pytorch 2019-11-17T12:04:59.223

1 learning object recognition of primitive shapes through transfer learning problem 2019-11-22T10:26:57.503

1 Are there free and easy-to-use annotation tools for 3D bounding boxes? 2019-12-04T13:51:13.570

1 Hashing images detects false duplicates 2020-01-15T12:53:27.420

1 EEG and Accelerometer Neural Network 2020-01-21T18:28:15.360

1 Acoustic Input Data: Decibel or Pascals 2020-02-04T22:20:03.473

1 Choosing Data Augmentation smartly for different application 2020-02-10T18:06:17.707

1 Using three image datasets with different image sizes to train a CNN 2020-03-01T07:14:41.703

1 Is it recommended to remove stop words before named entity recognition? 2020-03-04T14:59:59.170

1 Is text preprocessing really all that necessary for NLP? 2020-03-28T18:26:49.073

1 How can raw data from a motion sensor (like an IMU) reduced to the main points of the data 2020-05-06T13:31:27.317

1 Are there any general guidelines for dealing with imbalanced data through upsampling or downsampling? 2020-05-08T01:25:22.773

1 What pre-processing of the image is needed before feeding it into the convolutional neural network? 2020-05-12T10:11:16.553

1 Are my steps correct for a proper classification of a sick brain? 2020-05-14T07:55:24.707

1 Can I find a mapping that minimizes the maximum distance ratio of certain vectors? 2020-05-19T05:29:00.697

1 How to train a neural network with a data set that in which the target is a mix of 0-1 label and numeric real value label? 2020-05-23T00:28:46.417

1 How can I predict the label given a partial feature vector? 2020-06-04T17:05:41.123

1 How is the performance of a CNN trained with monochrome images on image recognition tasks degraded? 2020-06-24T08:00:51.617

1 Do I need to rotate the masks, if I also rotate the images and the masks are generated from the input? 2020-07-09T20:50:03.967

1 Can I resize my images after labeling them? 2020-08-06T08:09:14.450

0 Is there any reason to believe a ml pipeline that works on dataset A will work on dataset B where both have similar meta features? 2019-09-09T22:03:08.243

0 How can I remove the noise from an EEG signal? 2019-10-21T15:29:02.173

0 What's the correct approach to standardise data from a time-series used for LSTM neural network predictions? 2019-10-26T22:03:13.200

0 How to automatically detect and correct false information in columnar data? 2020-02-10T16:49:58.210

0 How can I group the entries of the network traffic by their similarity? 2020-04-24T04:50:44.007

0 Do we scale our target feature in regression problems? 2020-05-12T07:06:17.510

0 Is it necessary to standardise the expected output 2020-07-23T07:10:58.130

0 Multilabel stratified split for images/object detection 2020-07-30T06:06:19.590