Tag: nlp

58 Latent Dirichlet Allocation vs Hierarchical Dirichlet Process 2014-05-18T06:10:52.543

44 What is the positional encoding in the transformer model? 2019-04-28T14:43:17.090

37 What are some standard ways of computing the distance between documents? 2014-07-05T16:10:21.580

29 What algorithms should I use to perform job classification based on resume data? 2014-07-03T16:11:22.637

29 General approach to extract key text from sentence (nlp) 2015-03-13T16:41:29.280

29 How do I load FastText pretrained model with Gensim? 2017-06-30T02:14:39.717

27 What is a better input for Word2Vec? 2015-11-08T04:17:56.450

26 Word2Vec for Named Entity Recognition 2014-06-19T19:29:57.797

26 Why are NLP and Machine Learning communities interested in deep learning? 2014-10-11T10:24:01.393

25 How can I get a measure of the semantic similarity of words? 2016-07-19T21:54:34.713

24 Word2Vec vs. Sentence2Vec vs. Doc2Vec 2017-06-30T07:05:33.707

23 Predicting a word using Word2vec model 2016-01-14T07:13:45.810

23 Improve the speed of t-sne implementation in python for huge data 2016-02-06T14:19:10.243

23 NLP - why is "not" a stop word? 2016-12-15T22:20:16.537

23 Sentence similarity prediction 2017-10-22T07:36:15.920

22 Best practical algorithm for sentence similarity 2017-11-23T14:40:25.603

21 How to annotate text documents with meta-data? 2014-05-29T20:11:16.327

21 How to grow a list of related words based on initial keywords? 2014-06-17T06:05:39.653

21 How to get sentence embedding using BERT? 2019-11-04T15:22:32.240

20 NLP - Is Gazetteer a cheat? 2016-01-25T18:41:24.083

20 Natural Language to SQL query 2018-05-14T04:23:08.600

20 What is the bleu score of professional human translators? 2020-02-23T17:08:50.137

19 Dataset for Named Entity Recognition on Informal Text 2014-06-30T21:02:05.053

19 How to initialize a new word2vec model with pre-trained model weights? 2016-03-14T09:47:28.813

17 Extract most informative parts of text from documents 2014-12-08T14:51:27.613

17 Similarity between two words 2016-07-04T06:00:34.093

17 What is the difference between word-based and char-based text generation RNNs? 2016-08-01T22:38:16.490

16 What is the difference between a hashing vectorizer and a tfidf vectorizer 2017-08-14T16:42:07.040

15 What is a 1D Convolutional Layer in Deep Learning? 2017-02-28T08:12:08.210

15 What is the difference between CountVectorizer token counts and TfidfTransformer with use_idf set to False? 2017-12-11T22:51:57.513

15 When to use cosine simlarity over Euclidean similarity 2018-02-12T13:31:46.740

14 How word2vec can be used to identify unseen words and relate them to already trained data 2015-12-26T03:47:48.800

14 Are there any good out-of-the-box language models for python? 2018-09-20T13:34:22.520

13 What features are generally used from Parse trees in classification process in NLP? 2014-08-24T17:09:40.510

13 Extract information from sentence 2016-11-02T08:15:53.687

13 Alternatives to TF-IDF and Cosine Similarity when comparing documents of differing formats 2017-01-02T20:41:13.493

13 So what's the catch with LSTM? 2018-02-02T15:45:12.373

12 How to process natural language queries? 2014-06-14T20:32:06.143

12 Efficient database model for storing data indexed by n-grams 2014-07-21T23:53:11.120

12 Unsupervised feature learning for NER 2014-07-28T07:19:49.877

12 Help regarding NER in NLTK 2015-01-29T12:13:01.677

12 ngram and RNN prediction rate wrt word index 2015-10-27T09:55:31.540

12 How does attention mechanism learn? 2020-01-23T06:05:27.383

11 applying word2vec on small text files 2016-01-10T10:49:20.447

11 How do "intent recognisers" work? 2016-04-05T09:03:15.113

11 How to determine if character sequence is English word or noise 2016-04-28T17:20:13.760

11 Word2Vec embeddings with TF-IDF 2018-03-04T12:07:33.313

11 What is purpose of the [CLS] token and why is its encoding output important? 2020-01-09T17:20:10.963

10 What is generative and discriminative model? How are they used in Natural Language Processing? 2014-05-18T06:17:37.587

10 Extract canonical string from a list of noisy strings 2014-08-22T15:59:07.097

10 How to create a good list of stopwords 2015-05-24T21:45:02.207

10 What is the difference between NLP and text mining? 2016-01-20T06:33:54.923

10 Are Word2Vec and Doc2Vec both distributional representation or distributed representation? 2016-03-20T19:18:18.573

10 Calculate cosine similarity in Apache Spark 2016-08-10T05:43:41.613

10 How to determine the complexity of an English sentence? 2017-06-03T20:12:19.593

10 Variable input/output length for Transformer 2019-02-13T03:43:48.647

10 Word2Vec how to choose the embedding size parameter 2019-05-04T23:29:32.420

10 In a Transformer model, why does one sum positional encoding to the embedding rather than concatenate it? 2019-07-18T08:34:46.710

9 Using NLP to automate the categorization of user description 2014-12-09T20:49:37.093

9 Using Vowpal Wabbit for NER 2015-06-06T07:00:13.363

9 Reducing the dimensionality of word embeddings 2015-07-28T17:54:23.927

9 Improving accuracy of Text Classification 2017-05-28T12:56:36.267

9 Public dataset for news articles with their associated categories 2017-09-26T08:56:30.433

9 How to get the number of syllables in a word? 2017-09-28T06:04:25.437

9 What is the difference between and Embedding Layer and an Autoencoder? 2019-06-21T15:52:36.537

8 What are some standard ways of computing the distance between individual search queries? 2014-07-05T16:20:17.963

8 Coreference Resolution for German Texts 2014-08-11T12:25:47.700

8 Which classification algorithms to try for classifying text data into 300 categories 2015-05-07T08:52:40.293

8 Complex Chunking with NLTK 2015-05-16T00:15:37.807

8 Shall I use the Euclidean Distance or the Cosine Similarity to compute the semantic similarity of two words? 2015-07-20T04:48:17.547

8 Is there an alternative to nltk in golang? 2016-06-03T16:38:52.037

8 What's an LSTM-LM formulation? 2016-08-04T08:25:33.450

8 NLP: What are some popular packages for multi-word tokenization? 2017-03-02T07:04:41.123

8 Text extraction from documents using NLP or Deep Learning 2018-06-19T16:09:57.667

8 Increasing SpaCy max NLP limit 2018-09-24T23:33:27.883

8 Bert Fine Tuning with additional features 2019-03-05T02:57:48.780

8 Fuzzy name and nickname match 2019-03-19T13:36:15.100

8 Preprocessing for Text Classification in Transformer Models (BERT variants) 2019-11-08T06:28:48.750

8 NLP : variations of a text without modifying it's meaning 2020-01-04T16:53:52.467

7 Dealing with diverse text data 2014-06-20T14:58:09.320

7 What are the main types of NLP annotators? 2014-06-25T17:37:23.380

7 Attributes extraction from unstructured product descriptions 2014-12-02T16:09:35.333

7 How word2vec can handle unseen / new words to bypass this for new classifications? 2015-08-11T05:04:52.300

7 How does word2vec handle the input word being in the context? 2015-09-17T21:02:33.367

7 Identifying templates with parameters in text fragments 2015-09-20T12:40:10.170

7 Best way to search for a similar document given the ngram 2015-11-17T03:06:51.357

7 Twitter Sentiment Analysis: Detecting neutral tweets despite training on only Positive and Negative Classes 2016-03-25T06:04:10.987

7 How to handle negative words in word2vec? 2016-12-17T11:36:12.127

7 Algorithms and techniques for spell checking 2017-01-06T06:05:56.000

7 Named entity recognition (NER) features 2017-02-02T18:48:59.820

7 Understanding of naive bayes: computing the conditional probabilities 2018-01-24T20:01:23.673

7 K-means clustering of word embedding gives strange results 2018-04-27T00:38:13.767

7 Mastering NLP: Reading List 2018-08-04T13:06:10.910

7 Under what circumstance is lemmatization not an advisble step when working with text data? 2018-08-08T22:26:50.310

7 On a multi lingual sentiment corpus 2018-11-18T17:41:42.027

7 How to use a one-hot encoded nominal feature in a classifier in Scikit Learn? 2019-03-25T20:33:12.757

7 Lemmatization Vs Stemming 2019-04-22T10:41:28.540

7 what is the first input to the decoder in a transformer model? 2019-05-11T08:36:07.907

7 Why is word prediction an obsession in Natural Language Processing? 2019-10-16T14:52:38.063