After running the MB Affinity tool, I have a 6000x6000 matrix of products and their cosine similarity to each other (similar to a correlation matrix).
I'm trying to transform it into a less unwieldy form that lists, for each product, its 10 most similar products and their scores. The fields might look like:
[Product] - [#1 Most Similar Product Name] - [#1 Most Similar Product Score] - [#2 Most Similar Product Name] - ... - [#10 Most Similar Product Score]
...
N = 6,000
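
To make the target shape concrete, here is a minimal sketch of the reshaping in Python with pandas and NumPy. It is only an illustration under assumptions: the function name `top_k_similar` and the file name `affinity_matrix.csv` are hypothetical, the CSV is assumed to have product names in the first column and in the header row, in the same order, and each product's similarity to itself (the diagonal) is excluded from its own top 10.

```python
import numpy as np
import pandas as pd


def top_k_similar(sim: pd.DataFrame, k: int = 10) -> pd.DataFrame:
    """Return one row per product with its k most similar products and scores.

    Assumes `sim` is square, with rows and columns in the same product order.
    """
    row_names = sim.index.to_numpy()
    col_names = sim.columns.to_numpy()
    scores = sim.to_numpy(dtype=float, copy=True)

    # Exclude each product's similarity to itself (the diagonal).
    np.fill_diagonal(scores, -np.inf)
    k = min(k, scores.shape[1] - 1)

    # Column positions of the k largest scores in each row, highest first.
    order = np.argsort(-scores, axis=1)[:, :k]
    top_scores = np.take_along_axis(scores, order, axis=1)

    out = pd.DataFrame({"Product": row_names})
    for rank in range(k):
        out[f"#{rank + 1} Most Similar Product Name"] = col_names[order[:, rank]]
        out[f"#{rank + 1} Most Similar Product Score"] = top_scores[:, rank]
    return out


# Tiny demo matrix standing in for the real 6,000 x 6,000 one.
demo = pd.DataFrame(
    [[1.0, 0.2, 0.9],
     [0.2, 1.0, 0.5],
     [0.9, 0.5, 1.0]],
    index=["A", "B", "C"],
    columns=["A", "B", "C"],
)
result = top_k_similar(demo, k=2)
print(result)

# For the real file (hypothetical name), something like:
# sim = pd.read_csv("affinity_matrix.csv", index_col=0)
# top_k_similar(sim, k=10).to_csv("top10_similar.csv", index=False)
```

Held as 64-bit floats, a 6,000 x 6,000 matrix takes 6,000 * 6,000 * 8 bytes, about 288 MB, so this sketch loads everything at once; reading the CSV in row chunks would be an alternative if memory is tight.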
Has anybody dealt with something like this before? I'm stumped for solutions. Keep in mind that the matrix is a 340MB CSV file, and that is what I'm constrained to working with.
Thank you,
Michael