Recently we have developed a python library named
PyCM specialized for analyzing multi-class confusion matrices. A compare system has been added in
version 2 of this module in order to generally (not considering the subject of the problem) compare the resulted confusion matrices from different classification methods over a unique data-set.
Now, we are searching for a strategy which can validate the result of this option.
This strategy can be either a mathematical proof or a counterexample.
For example suggesting two close confusion matrices which had been compared and the comparison is validated can be helpful to answer this question.
P.S.1. In order to find out how this module works please read the Compare section at this document.
P.S.2. For further information visit the following links or ask your questions as a comment.