Skip to main content
0 votes
1 answer
97 views

I have two distinct shapefiles that have a high degree of overlap, but aren't the same. I want to make a comparison and one of the things I would like to generate is the Jaccard Index of regions ...
BLP92's user avatar
  • 345
0 votes
0 answers
48 views

Initially, I performed kmeans clustering and obtained some meaningful clusters. To refine these clusters, I ran Fuzzy C Means on the Kmeans center using "e1071" package. Are there any ...
Mary's user avatar
  • 221
0 votes
0 answers
58 views

I have a huge PySpark dataframe that contains 250 million rows, with columns ItemA and ItemB. I'm trying to calculate the Jaccard Similarity M_ij that can run efficiently and takes a short amount of ...
Rayne's user avatar
  • 15.2k
2 votes
0 answers
59 views

Context I have this code for my attempt to create a "similarity mapping" between consonants (or consonant clusters), to the same set of consonants/clusters (basically a cross product mapping)...
Lance Pollard's user avatar
0 votes
3 answers
88 views

Please consider the reprex at the end of the post. I have two lists of dataframes. Each dataframe has a $keyword column, which is a vector of text. I am looking for a computationally efficient way to ...
larry77's user avatar
  • 1,543