Tag: information-retrieval

23 Text categorization: combining different kind of features 2014-08-17T17:29:44.123

18 Does click frequency account for relevance? 2014-05-15T14:41:24.020

10 Extract canonical string from a list of noisy strings 2014-08-22T15:59:07.097

10 How to create a good list of stopwords 2015-05-24T21:45:02.207

9 How can you build a model that extracts data out from receipts? 2019-12-28T10:20:57.243

8 How can you build a model that reads out receipts and invoices? 2018-05-19T10:19:09.777

7 Best way to search for a similar document given the ngram 2015-11-17T03:06:51.357

6 Can we quantify how position within search results is related to click-through probability? 2014-10-10T03:45:09.343

6 Why do popular search engines not follow the usual AND, OR logic for queries? 2017-01-11T05:55:13.463

5 semi-structured text parsing using machine learning 2015-01-26T17:45:02.203

5 Can we compare a word2vec vector with a doc2vec vector? 2016-01-18T05:37:40.897

5 Difference between paragraph2vec and doc2vec 2017-02-23T13:28:07.500

5 In what data science applications has the stack exchange dump been used? 2017-09-12T08:59:14.717

5 Evaluating the performance of a machine learned recommendation system 2019-12-06T22:27:22.837

4 metric learning and information retrieval 2015-06-11T07:53:26.120

4 Why are there currently no content-based evaluation metrics for information retrieval? 2015-11-16T15:14:37.750

4 Information Extraction from Free-form text to create Transactions 2016-01-05T04:02:38.443

4 Why keep vocabulary and posting list separate in a search engine 2016-04-06T15:53:52.853

4 How can conclusions be drawn from recommendation systems evaluation? 2016-09-22T15:33:14.370

4 Extract 2 pieces of information from a string - what to use? 2016-09-26T09:52:12.410

4 TS-SS and Cosine similarity among text documents using TF-IDF in Python 2019-10-23T23:30:00.493

4 How is "relevance" defined in information retrieval outside the context of systems with user feedback? 2019-12-07T15:43:47.513

4 Visualizing F-score differences in information extraction 2020-01-29T23:54:12.587

3 Data sets for evaluating text retrieval quality 2014-09-05T14:47:52.127

3 non query-based document ranking 2014-09-07T22:51:27.490

3 Finding the top K most similar sets 2015-11-17T18:56:02.027

3 Tokenizing words of length 1, what would happen if I do topic modeling? 2015-12-30T02:46:00.233

3 Document similarity: Vector embedding versus BoW performance? 2017-03-07T21:24:26.520

3 Sparse IR with user feedback 2018-07-06T16:50:20.430

3 Best way to combine two similar document 2019-05-20T12:52:56.100

3 Evaluating a IR system (Precision and Recall) 2019-12-31T13:25:01.350

3 Building a tag-based recommendation engine given a set of user tags? 2020-03-08T02:41:29.277

3 Evaluation metric for Information retrieval system 2020-12-07T12:12:47.010

2 Approaches to Bag-Of-Words Information Retrieval 2015-01-09T05:42:54.587

2 How to rank documents using Bag of words approach 2015-08-28T02:15:14.793

2 What metrics must i use in my data(unstructured) preprocessing research? 2016-02-20T10:11:17.397

2 how to evaluate top n recommendation system with movie lens dataset? 2016-10-02T13:13:15.027

2 extract names in a list of names 2016-11-28T12:03:57.997

2 What model to use for matching two datasets 2017-09-14T05:04:24.750

2 Confusion with cosine similarity 2017-11-13T17:14:40.903

2 Peformance evaluation of ranking algorithms 2018-01-04T12:56:29.543

2 Extracting date, relation and noun phrase from text 2018-04-26T13:31:47.240

2 How to combine heterogeneous image features extracted with different algorithms for similar image retrieval? 2018-05-18T03:39:58.810

2 Two definitions of DCG measure 2018-08-10T13:29:26.913

2 Using ontology to infer labels for process model 2018-09-12T21:26:13.300

2 Capturing movement importance - logistic regression output 2018-09-22T12:38:54.000

2 What is NLP technique to generalize manually created rules in text? 2018-10-31T10:09:30.163

2 Detect sensitive data from unstructured text documents 2019-03-18T17:10:39.817

2 How to implement Semantic Search in R or Python 2019-04-08T09:30:30.293

2 Text Mining with Pubmed Widget Orange 2019-05-31T00:01:55.987

1 Extracting list of locations from text using R 2015-10-20T09:24:56.487

1 What algorithm to use for a specific 'Named Entity Recognition'/'Information extraction' problem 2016-03-10T15:52:55.427

1 Real time topic identification of news article 2016-06-21T17:38:56.907

1 Chance Curve in Accuracy-vs-Rank Plots in matlab 2016-09-14T11:55:24.817

1 Doc2Vec or Word2vec for word embedding 2017-04-03T19:19:49.480

1 Analyzing Web page structure 2017-06-08T21:16:55.867

1 ADHoc Information Retrieval 2017-08-16T14:13:30.537

1 Learning to rank: construct absolute ranking using pair-wise ranking approach 2017-09-14T07:23:50.240

1 Cosine similarity between query and document confusion 2017-11-05T14:02:42.483

1 Extracting NER from a Spanish language text file 2017-11-17T20:23:07.440

1 How to evaluate multi label image retrieval model 2017-12-01T10:37:27.800

1 ElasticSearch for data scientists 2018-01-18T02:22:39.100

1 I have 50 videos. I ask a customer 10 questions. Based on their answers, I send them a set of videos. How do I do it? 2018-01-20T01:29:01.677

1 Is recall more important than precision for mass mailings? 2018-03-27T12:47:48.753

1 How can I train a model to modify a vector by rewarding the model based on the modified vectors nearest neighbors? 2018-07-02T18:16:41.810

1 Industrial application(s) of LDA (latent Dirichlet allocation)? 2018-07-06T07:26:20.567

1 mathematical accurate definition of the binary independence model 2018-08-27T21:19:58.047

1 Understanfing OpenIE 5 output 2018-08-30T05:25:08.497

1 Efficient search for a Triples data 2018-09-11T12:04:39.767

1 Reconciling time-based data when data source clock drifts 2018-09-25T18:51:25.203

1 How to approach training a machine to read a form 2018-12-20T08:03:47.387

1 Algorithm for document retrieval in QA system 2019-01-13T11:23:15.010

1 Word embeddings for Information Retrieval - Document search? 2019-01-18T15:08:49.937

1 Can macro F1 score be greater than micro F1 score? 2019-02-18T17:29:31.703

1 Doc2vec most similar document to a query string 2019-03-30T19:42:03.250

1 Find specific topics with topic modelling 2019-06-25T13:06:41.400

1 Intuition behind the entropy definition 2019-07-29T23:36:47.603

1 Why TREC set two task: document ranking and passage ranking 2019-08-06T03:29:50.480

1 Getting answers to bullets (numbered items) from text via NLP 2019-09-04T11:31:20.747

1 Connecting to IEX with Pandas Datareader 2019-10-24T12:01:39.640

1 Populating Knowledge Base - Stanford DeepMind Alternatives 2019-11-21T15:50:47.177

1 Meaningful Information retrieval and question answering for unstructured data - Is it even possible? 2020-01-10T10:11:42.977

1 Domain scoring based on ranking 2020-02-28T20:40:04.877

1 Using transformers for information extraction 2020-03-18T21:06:34.640

1 Effecient Feature Searching 2020-04-12T17:34:14.587

1 Merging (intersecting) more than two posting list in linear time 2020-04-14T00:09:06.080

1 Using BM25 to rank words 2020-06-11T17:03:57.507

1 Evaluation of recommendation systems 2020-12-10T15:26:47.607

1 What is the difference between Okapi bm25 and NMSLIB? 2021-02-16T08:45:51.803

0 What approaches are there to not display a search result that a user has no permission to see? 2015-03-20T15:29:12.540

0 Typing error handling n-gram character index and vector space model 2015-11-06T09:28:56.003

0 Plotting Precision Recall Curve 2016-11-23T13:25:52.420

0 State of the art approaches for Information retrieval tasks based on deep learning 2016-12-02T06:14:31.097

0 Text Mining of Research Paper Abstracts 2017-01-04T15:14:30.100

0 Information retrieval / slot filling / NLP 2018-06-23T07:47:48.657

0 Does recall has different interpretation when comes to classification and information retrieval 2018-07-05T23:00:44.293

0 Classifying whether a comment or review is a complaint or appreciation of product and extracting the Topic? 2018-08-24T21:22:28.660

0 TextInformationRetrieval content based 2018-10-25T10:40:24.920

0 Dissimilarity Matrix of non-metric proximity data 2019-04-30T21:17:05.250