Python
arseny_sitdikov, 2021-06-20 22:48:33

How to write a pairwise column comparison function?

I need to write a function that compares columns pairwise (there are 1000+ columns).
That is, two columns are compared row by row, and if the proportion of identical values exceeds X, one of the two columns must be deleted.

All columns are numeric and contain NaNs.
A strict comparison is required, not correlation.

The pandas duplicate check only looks for exact matches, so another workaround is needed.
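
One possible approach, as a minimal sketch rather than a definitive solution: compare each kept column against all later kept columns in one vectorized NumPy step, and drop the later column of any pair whose share of equal rows exceeds the threshold. The function name drop_similar_columns and the parameters threshold and nan_equal are hypothetical names chosen for illustration. The sketch assumes the share is computed over all rows, and that two NaNs count as different unless nan_equal=True, since the question does not say how NaNs should be treated.

```python
import numpy as np
import pandas as pd


def drop_similar_columns(df, threshold, nan_equal=False):
    """Drop the later column of each pair whose share of equal rows exceeds threshold.

    Assumption: the share is the number of equal rows divided by the total row count.
    """
    values = df.to_numpy(dtype=float)
    n_cols = values.shape[1]
    keep = np.ones(n_cols, dtype=bool)

    for i in range(n_cols):
        if not keep[i]:
            continue  # a dropped column is never used as a reference
        rest = np.flatnonzero(keep[i + 1:]) + i + 1  # later columns still kept
        if rest.size == 0:
            break
        left = values[:, [i]]   # shape (n_rows, 1), broadcasts against right
        right = values[:, rest]
        equal = left == right   # strict equality; NaN == NaN is False
        if nan_equal:
            # Assumption: optionally treat a NaN in both columns as a match
            equal |= np.isnan(left) & np.isnan(right)
        share = equal.mean(axis=0)
        keep[rest[share > threshold]] = False

    return df.loc[:, keep]


df = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, np.nan],
    "b": [1.0, 2.0, 3.0, 4.0],
    "c": [9.0, 2.0, 7.0, np.nan],
    "d": [1.0, 2.0, 0.0, np.nan],
})
print(list(drop_similar_columns(df, threshold=0.7).columns))
print(list(drop_similar_columns(df, threshold=0.7, nan_equal=True).columns))
```

The loop runs once per column instead of once per pair, so for 1000+ columns there are about a thousand vectorized comparisons rather than roughly half a million Python-level ones. Each step allocates a boolean array of size rows by remaining columns, so with very many rows it may be necessary to process the later columns in chunks.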
