Tag: audio-recognition

12 Deep Learning with Spectrograms for sound recognition 2016-01-29T15:39:26.277

9 How does a convolutional ply differ from an ordinary convolutional network? 2017-02-17T12:30:03.880

8 CNN for phoneme recognition 2017-04-29T01:58:47.287

7 Neural network with flexible number of inputs? 2016-10-20T18:46:47.740

7 Audio Analysis : Segment audio based on speaker recognition 2018-06-18T00:50:57.867

6 Optimizing CNN network 2017-03-16T02:52:28.617

6 Detecting voice in a noisy environment 2018-02-03T08:47:26.227

5 Tool for labeling audio 2019-07-12T10:23:18.730

4 How to reduce dimensionality of audio data that comes in form of matrices and vectors? 2016-03-14T00:37:25.940

4 Training a CNN with limited weight sharing 2017-03-01T16:30:20.760

4 python - What is the format of the WAV file for a Text to Speech Neural Network? 2017-03-27T19:53:17.807

4 Keyword localization in audio file 2020-01-30T00:27:28.280

3 Using dhmm_em to form the hmm of mfccs' from song clips 2016-07-23T21:02:17.010

3 How to get a feature from sound/audio data learning using machine learning supervised classification? 2018-04-24T14:22:29.517

3 How to create speech commands data set 2018-07-25T07:07:03.553

3 LIME visualization outputs padded regions as important - Mel-spectrogram (audio analysis) 2018-10-12T19:26:30.457

3 looking for databases of audio with labelled 'true' and 'deceptive' sections 2019-02-13T15:34:10.667

3 How to represent audio data in a format that can be used for preprocessing and modelling? 2019-09-07T15:00:38.247

2 Real-time classification of audio - thousands of classes 2016-07-01T20:07:31.860

2 The effect of a linear layer? 2017-01-26T13:06:08.353

2 Dataset: language audio clips and country labels 2017-03-03T23:31:52.123

2 Input and output feature shapes in CNN for speech recognition 2017-03-28T17:31:48.233

2 (CNN+)RNN-HMM hybrid for learning phonemes from a spectrogram 2017-07-06T18:19:04.370

2 Deep Learning models with top-down transfer 2017-09-15T09:54:39.927

2 Training an AI to play Starcraft 2 with superhuman level of performance? 2018-02-15T12:46:47.083

2 Why is pre and post silence important when collecting speech data? 2019-03-01T23:01:26.063

2 Process melspectrograms with convolutional neural network 2019-03-08T12:14:30.883

2 Machine learning in audio? 2019-03-27T13:52:52.597

2 Why encode pitch as one-hot encoding instead of ordinal encoder? 2020-03-12T19:35:56.807

2 Machine learning on classifying speech 2020-05-09T12:24:54.953

2 What's the best way to validate a rare event detection model during training? 2020-05-31T15:14:07.397

1 Audio Spectrum Normalization for NeuralNetwork Classification 2017-06-28T15:50:58.170

1 Are annotated audio datasets augmented with mutated versions the way image datasets are? 2017-10-19T17:00:50.637

1 speech accent recognition data augmentation and training 2017-12-05T21:16:20.347

1 ESC-50 Audio data for binary classifier 2018-03-08T01:17:52.977

1 Working with audio data with different sample rates in Tensorflow 2018-04-08T18:06:24.233

1 Is machine learning a viable tool to map accent from speech onto text/syllables? 2018-05-11T15:55:38.513

1 How to check if audio samples have only noise or are silent? 2018-05-28T22:40:27.153

1 What is a good method for detection of rare occurrences of speech in noisy audio data? 2018-09-20T21:14:53.087

1 Buy key word audio data set 2018-10-03T09:15:09.003

1 Audio signal processing for background noise reduction or removal 2018-10-18T11:27:04.810

1 Wave form analysis ML algorithm 2018-11-30T12:13:01.013

1 Audio classification data balance 2019-01-02T12:20:38.840

1 Labeling audio dataset 2019-05-30T15:21:31.293

1 Classification of non-stationary acoustic signals 2019-07-10T07:21:40.643

1 Transfer learning VGGish (AudioSet). Impact of zero padding to fit the input size 2019-07-11T08:05:28.343

1 Feature engineering for time series (audio signals) 2019-07-31T09:03:06.740

1 What technique to use in order to identify what position an audio sample is at in a longer audio sample? 2019-09-17T18:41:01.387

1 Conv Net Model is overfitting 2019-12-02T00:10:20.250

1 Dataset for musical Instruments recognition 2020-01-02T15:53:13.313

1 How to begin understanding of audio and music analysis 2020-02-19T13:32:35.057

1 How to deal with different audio formats for audio classification? 2020-06-29T19:58:26.297

1 What AI model should I build to find out how similar are the 2 audio files of musical instruments like piano, guitar, synthesizer? 2020-07-06T04:15:10.817

1 Normalisation of features extracted from audio files 2020-09-02T11:08:24.180

1 Machine learning classification with time-domain signals how to ignore signal arrival time? 2021-01-27T17:23:35.203

1 Looking for research for separating conversational audio files 2021-02-11T08:52:10.560

0 How to search collection of podcasts (.mp3 files)? 2016-10-11T11:47:35.340

0 Downsampling audio files for use in Machine Learning 2018-02-12T11:19:31.880

0 LSTM/RNN model fails on new test data - TFLearn 2018-06-06T10:56:35.603

0 What is the effect of highly correlated data on a Convolutional Neural Network? 2018-11-04T20:04:45.237

0 Why normalization kills my accuracy 2019-01-06T14:56:02.073

0 Accuracy keeps changing when changing randomState of classifier 2019-01-11T15:30:23.237

0 Segment 5-7 min audio into sentence wise audio clips for creating speech recognition dataset 2019-03-31T13:46:33.313

0 Curation of a dataset of audio files in different formats 2019-08-25T01:01:29.990

0 Recognition of numbers from audio 2019-09-27T06:55:00.200

0 Training a sound localization neural network 2019-10-17T23:09:26.873

0 Audio dataset preprocessing to perform cry detection 2019-11-19T08:51:09.783

0 CNN filter for music spectrograms 2020-01-06T11:54:31.257

0 How to build wake-word detection dataset from keyword pronunciations 2020-03-10T15:00:09.450

0 What is standard in Data Science for using Audio Features and Text Sentiment to predict affective annotations 2020-05-06T16:04:51.083

0 Natural Language Processing: Identifying Words That Are Out of Place? 2020-05-18T01:28:59.640

0 Selecting audio pre-processing parameters for ASR 2020-07-28T02:52:29.583

0 Getting error while performing late fusion of audio and video feature vectors. Please help me resolve this 2020-08-11T12:02:51.087

0 Spoken utterance classification on RAVDESS using MFCC 2020-08-17T07:43:11.217

0 Voice command classification - On Prem 2020-08-19T14:09:17.447

0 How to extract audio features for each video frame using pyAudioAnalysis 2020-08-24T16:48:32.960

0 What is the suggested way to create features (Mel-Spectrograms) from speech signal for classification with ResNet? 2020-08-26T15:35:29.557

0 IIoT Sound Classification with Little Data 2020-09-04T15:13:20.547

0 Emotion detection on audio 2020-10-31T17:48:42.100

0 How do you determine the correct dimension of Mel Spectrogram Feature Extraction for NN 2020-12-16T20:57:09.063

0 Sound Classification for Multiple Classes for English Letters 2021-02-07T09:05:18.193

-1 Does the input data representation matter while training CNN for speech recognition? 2017-04-24T15:23:01.070

-1 could not broadcast input array from shape (13,160) into shape (13) while using sklearn normaliser 2020-03-19T10:58:32.943