13 Alternatives to TF-IDF and Cosine Similarity when comparing documents of differing formats 2017-01-02T20:41:13.493

10 Calculate cosine similarity in Apache Spark 2016-08-10T05:43:41.613

10 Can I use cosine similarity as a distance metric in a KNN algorithm 2018-01-09T16:05:11.893

8 What should be the value of non-rated field when finding cosine similarity 2016-06-12T17:27:19.537

8 cosine_similarity returns matrix instead of single value 2018-01-15T13:22:44.650

7 Cosine Distance > 1 in scipy 2015-10-13T22:23:39.020

5 Evaluating the performance of a machine learned recommendation system 2019-12-06T22:27:22.837

4 How should I evaluate writing quality to compare two articles(which article is better suited/written for a topic ) according to their content? 2017-06-07T05:33:53.713

4 Calculating cosine similarity between 3D arrays using Python 2019-06-18T10:36:27.590

3 How to find similarity/distance matrix with mixed Continuous and Categorical data? 2015-12-07T15:40:19.207

3 Word analysis in Python 2016-04-03T08:11:14.220

3 How to find similar time series? 2018-03-19T20:16:25.640

3 clustering 2-dimensional euclidean vectors - appropriate dissimilarity measure 2018-07-09T13:50:22.060

3 word2vec word embeddings creates very distant vectors, closest similarity is still very far 2019-05-31T10:35:45.530

2 Is it possible to use Jaccard similarity instead of Cosine similarity in gensim document similarity? 2016-12-20T19:22:03.547

2 Recommendation matrix as a product of User Similarity and Ratings 2017-02-26T15:54:07.707

2 Confusion with cosine similarity 2017-11-13T17:14:40.903

2 Clustering with cosine similarity with specific threshold (in Python) 2018-05-04T19:24:15.783

2 cosine similarity between items (purchase data) and normalisation 2018-11-19T10:20:15.130

2 When I would use a specific similarity coefficient over another? 2019-02-03T12:10:15.303

2 Match a two items from two different receipts 2019-02-28T06:55:48.700

2 Cosine similarity with arrays contaning NaN 2019-04-27T13:15:57.080

2 memory error in matrix cosine_similarity 2019-07-21T11:35:52.807

2 Approach to semantic similarity between documents 2020-01-08T13:39:33.053

1 How to normalized term vector for document clustering? 2015-10-15T20:05:36.923

1 Is Vector in Cosine Similarity the same as vector in Physics? 2015-11-04T04:25:31.113

1 Compute angle of vector in word2vec models 2016-04-02T11:03:48.257

1 What techniques should I use to compare the similarity between a bunch of texts? 2017-08-08T09:45:48.753

1 Computing Item-to-Item Similarity Using Cosine 2017-09-12T12:42:34.673

1 Create similarity matrix 2017-09-15T12:24:56.750

1 Cosine similarity between query and document confusion 2017-11-05T14:02:42.483

1 Can I sum up feature vectors of a userâ€˜s collection? 2018-09-14T21:33:53.233

1 Best way to find dissimilarity in a 6x2 DataFrame? 2018-11-06T04:14:35.510

1 Hierarchical clustering with precomputed cosine similarity matrix using scikit learn produces error 2019-05-14T20:21:28.957

1 Checking TF-IDF Results 2019-06-16T13:01:32.270

1 Elbow method for cosine distance 2019-06-23T11:43:33.547

1 Cosine similarity vs The Levenshtein distance 2019-11-18T08:52:54.033

1 Understanding cosine distance with word vectors 2020-01-19T19:12:05.830

1 NearestNeighbors testing 2020-02-27T16:19:51.767

1 Fastest way for 1 vs all lookup on embeddings 2020-03-15T15:29:51.433

1 Should I create a tfidf on a subset of a data set or use the whole corpus? 2020-04-13T15:33:00.353

1 How to get the probability/closeness of a sample belonging to a specific cluster? 2020-06-13T19:51:42.800

1 Is summing a cosine similarity matrix a good way to determine overall similarity? 2020-08-26T23:06:50.137

1 Why is the cosine distance used to measure the similatiry between word embeddings? 2020-09-03T12:45:51.933

1 Question about BERT embeddings with high cosine similarity 2020-09-10T15:13:03.027

1 Is the magnitude of a word vector correlated with the frequency of the word in a text? 2020-09-30T09:48:20.940

1 If i use use BERT embeddings for if cosine(sent1,sent2) > 0.9, then is it fair to assume s1 and s2 are similar 2020-10-12T13:16:08.563

1 Evaluate document similarity / content-based recommender system 2020-11-23T15:15:24.780

1 Cosine Similarity but with weighting for vector indexes 2020-12-12T14:24:17.887

1 pairwise_distances with Cosine and weighting 2021-02-01T22:37:51.893

0 how to create word2vec for phrases and then calculate cosine similarity 2019-04-05T07:49:22.843

0 Document matching with more priority to certain features than others 2019-06-20T12:23:09.853

0 counter vector fit transform cosine similarity memory error 2019-07-22T14:35:26.043

0 Siamese networks vs Semantic similarity (may be gensim) 2019-10-23T04:57:51.147

0 TensorFlow: CosineDifference ObjFunc Constant throughout training 2019-11-24T12:09:22.237

0 How fit_transform, transform and TfidfVectorizer works 2020-03-11T23:50:44.143

0 How to compare cosine distances across two groups of words? 2020-07-28T08:36:54.127

0 Semantic similarity between two or more sentences 2020-09-09T00:16:56.927

0 Item-to-Item recommendation using DNN 2020-10-20T16:04:29.597

0 finding similarity of a new datapoint 2020-11-02T19:35:32.943

0 Is normalizing term weight necessary when cosine similarity is used in retrieval? 2020-12-30T17:44:12.033

0 Calculate the similarity between pairs of time series data 2021-01-07T01:47:31.433

0 Is there a way to calculate the cosine distance between 2 time series? 2021-01-21T14:22:27.620

0 how to calculate the cosine similarity between two files? 2021-02-24T18:29:27.810

-1 Matrix of pairwise cosine similarities from matrix of vectors 2020-07-24T12:40:47.117