Tag: text-classification

11 What is the purpose of the [CLS] token and why is its encoding output important? 2020-01-09T17:20:10.963

5 How to use the NDCG metric for binary relevance 2020-01-10T20:03:50.507

4 Text classification based on n-grams and similarity 2020-05-21T07:57:29.167

4 How to include categorical fields to enhance a text classification 2020-08-30T12:30:56.487

4 How to preprocess with NLP a big dataset for text classification 2021-02-01T20:38:58.513

3 Classifying one particular class of documents from the rest 2020-01-31T11:49:12.633

3 Doubt on scope of text classification problem 2020-03-06T20:09:37.637

3 Bag of words: Prediction on new (out-of-sample) data 2020-06-28T11:24:36.517

3 Predictive output with your own model built 2020-10-08T14:01:56.163

3 Over-sampling: is my model over-fitting? 2020-11-30T04:43:00.897

3 Effect of Stop-Word Removal on Transformers for Text Classification 2020-12-03T20:24:23.693

2 Which kind of model is better for keyword-set classification? 2019-12-26T06:00:27.503

2 Using Trainable=True in Keras Embedding obtained better performance 2020-02-10T08:10:12.737

2 Text vectorizer that captures feature offset in the text? 2020-03-19T14:39:39.517

2 Detecting low-quality, user-created text content 2020-04-13T21:56:39.150

2 How to use ontologies for text classification? 2020-05-28T14:06:58.577

2 Classify text as logical/ not logical 2020-06-19T20:19:40.883

2 Text Classification : Classifying N classes vs rest of the classes 2020-06-23T16:59:38.740

2 How to identify text similarity based on training data? 2020-07-07T11:09:18.720

2 Is there any way to plot ROC curve for Ensemble hard voting classifier? 2020-07-07T17:56:53.977

2 Any useful tips on transfer learning for a text classification task 2020-07-21T08:07:40.663

2 Interpreting confidence interval results for datasets 2020-08-18T23:43:34.800

2 Is there a way to classify an alphanumeric string? 2020-10-01T15:11:43.140

2 Does Keras allow using an independent classifier 2020-10-14T16:28:57.833

2 Bug in sentiment analysis and classification for unlabeled text 2020-10-27T20:37:38.867

1 Can SMOTE be applied over sequence of words (sentences)? 2017-06-08T08:28:21.097

1 How to classify unseen text data? 2020-01-16T06:43:12.950

1 Difference between SVM and GD/SGD? 2020-02-06T10:53:43.553

1 Does it make sense to use a tfidf matrix for a model which expects to see new text? 2020-03-03T03:08:34.327

1 Taking huge time to execute pipeline text classification model using sklearn? 2020-03-26T06:11:51.747

1 TF Keras Text Processing - Classification Model 2020-03-30T17:08:39.363

1 Text classification into thousands of classes 2020-04-01T17:14:29.620

1 Thematic clustering of text 2020-04-05T17:24:12.710

1 approach to classify text with natural language processing methods 2020-04-13T17:50:45.440

1 What are some of the available methods for handling multi-label classification for longer sequences of text 2020-04-28T15:04:49.513

1 Classification - get some label value to check how close to another class (Python) 2020-04-29T10:21:22.087

1 Building a text classification model from scratch 2020-05-05T12:03:28.483

1 How to deal with imbalanced text data 2020-05-06T03:19:33.233

1 How to classify very short text for spend analytics? 2020-05-21T18:26:28.363

1 Text classification: accuracy 2020-05-23T22:24:56.497

1 Which insights a data scientist could derive from text-analysis? 2020-05-30T16:23:34.387

1 Confusion with using different classes in neural networks (training vs testing) 2020-06-20T04:54:15.083

1 How many layers should I replace in transfer learning CNN 2020-06-21T00:28:53.410

1 Identifying templates from SMS text 2020-07-12T17:49:15.067

1 Text classification with Word2Vec on a larger corpus 2020-07-15T14:25:43.773

1 Creating a valid dataset for obtaining results 2020-08-01T00:39:37.953

1 Problem of continuous training - Supervised learning 2020-08-06T10:07:55.177

1 Sampling in Text Classification: can the results be considered 'reliable'? 2020-08-09T16:18:05.460

1 Trying to use CBOW for tweet classification 2020-09-07T10:10:05.227

1 How can I make a better unsupervised text classifier model? Is POS tagging part of Machine Learning and Data science? 2020-10-04T13:31:25.897

1 Grouping profiles strings having the same words, but occurring out of order 2020-10-12T19:25:25.057

1 Understanding the generality of the NER problem 2020-10-27T11:56:03.153

1 Discovering important topics in corpus of text using metadata and text content 2020-10-29T01:28:48.777

1 Public dataset for news articles with their associated categories for multilabel data classification 2020-10-29T13:30:30.373

1 SVM on BERT-Embeddings with very small dataset does not converge 2020-10-29T17:52:03.753

1 Understanding the XLNet model for a concrete case 2020-12-08T14:03:11.337

1 Sklearn - multiclass text classification 2020-12-19T14:42:09.557

1 Matching Data Text of Two Place with Exception 2020-12-21T03:44:33.120

1 Classify tweets by topic 2020-12-26T13:50:15.663

1 auto updating text comparison model 2020-12-27T03:50:13.527

1 Steps to perform multi label word classification 2020-12-30T11:56:07.427

1 Multi-class classification with extremely small dataset 2021-01-05T00:22:34.897

1 Top-K vs AUC - communicating results and next steps 2021-01-13T11:12:30.090

1 What is a good approach for embedding both textual and spatial features for document classification? 2021-01-15T17:09:23.877

1 Extracting layer output from Classification model of SimpleTransformer 2021-02-10T07:15:33.307

0 Text detection on English and Chinese languages 2020-01-30T11:42:10.260

0 How to use text classification where the training source are txt files in categorized folders? 2020-02-01T21:56:53.683

0 How to identify new job descriptions/postings from a set of documents when I have a set of already labeled job descriptions/postings 2020-03-09T20:33:42.400

0 Training a cosine similarity matrix for similar text recommendation 2020-04-14T22:40:34.573

0 Can text analysis approaches using machine learning be used on financial statement reports? 2020-04-19T14:59:06.940

0 Training a classifier with text and numerical features - what is the state of the art? 2020-04-22T16:35:40.320

0 Overfitting with text classification using Transformers 2020-04-23T12:43:10.007

0 SciKit-Learn: predict values sometimes different from top predict_proba entries? 2020-04-25T06:02:03.857

0 NLP/Text Analytics : How to extract relevant features from a given text so as to predict the future probability of the label assigned? 2020-04-27T13:44:34.967

0 Machine Learning - Multilabel Text Classification 2020-04-28T08:06:08.123

0 Text Mapping - Medicine Names 2020-05-01T04:57:50.697

0 Text classification and predictive model 2020-05-07T10:05:18.943

0 How to best handle imbalanced text classification with Keras? 2020-05-11T10:32:00.820

0 Text classification analysis based on similarity 2020-05-11T13:39:18.337

0 Correlation between words, then texts 2020-05-11T23:58:44.943

0 How to handle such a large class imbalance in text data? 2020-05-12T07:07:00.670

0 text classification : comparing classification reports 2020-05-23T14:38:20.767

0 Spam/ham classification 2020-05-25T15:16:26.317

0 Calculating accuracy in Extractive Summarization 2020-05-25T18:15:41.490

0 Dirichlet smoothing as an IDF component 2020-05-26T11:43:20.400

0 Text Classification One-shot learning (1sample/class ~1000 classes) 2020-05-28T21:06:30.227

0 How to properly vectorise when I have several text features? 2020-06-04T08:06:52.717

0 Clusters: how to improve results for text classification 2020-06-06T12:20:34.753

0 Text Classification on a very small data set with a lot of classes 2020-06-12T06:25:49.783

0 Need for Dense layer in Text Classification 2020-06-17T07:40:11.100

0 k-means and LDA for text classification: how to test accuracy? 2020-06-22T09:48:23.723

0 K-means and LDA for text classification 2020-06-22T13:02:14.393

0 use genetic algorithm as a feature selection for text classification 2020-07-05T18:07:03.103

0 Algorithm for rectifying incorrect text from OCR 2020-07-06T23:21:54.377

0 Sklearn SVM question classification 2020-07-11T16:37:24.493

0 (Serious) Dataset of paedophilic Youtube comments (or similar)? 2020-07-20T19:41:11.710

0 How to approach a text parsing problem 2020-07-30T22:55:38.827

0 How to create a Document Categorization Classifier for different contexts of Documents 2020-08-07T16:59:47.900

0 Classifiers and accuracy 2020-08-14T14:20:55.233