This ensures that a "win" is counted only if it is both statistically significant and practically relevant, so the ranking is not swayed by differences that are due to chance or too small to matter.
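As a rough illustration of the rule above, the following sketch counts a win only when both conditions hold and then ranks candidates by their number of wins. Everything named here is an assumption made for the example rather than something taken from the text: the `Comparison` record, the functions `counts_as_win` and `rank_by_wins`, the thresholds `alpha=0.05` and `min_effect=0.01`, and the choice to measure practical relevance as an absolute difference in the metric. The p-value is assumed to come from whatever significance test is already in use.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Comparison:
    """One pairwise comparison (hypothetical record, for illustration only)."""
    winner: str     # candidate with the better observed score
    loser: str
    p_value: float  # produced by whichever significance test is in use
    effect: float   # absolute difference in the metric (winner minus loser)


def counts_as_win(c, alpha=0.05, min_effect=0.01):
    """Return True only if the difference is significant AND large enough to matter.

    alpha and min_effect are placeholder thresholds, not values from the text.
    """
    statistically_significant = c.p_value < alpha
    practically_relevant = c.effect >= min_effect
    return statistically_significant and practically_relevant


def rank_by_wins(comparisons, alpha=0.05, min_effect=0.01):
    """Rank candidates by number of qualifying wins (ties broken by name)."""
    wins = Counter()
    for c in comparisons:
        # Touch both names so candidates with zero wins still appear.
        wins[c.winner] += 0
        wins[c.loser] += 0
        if counts_as_win(c, alpha, min_effect):
            wins[c.winner] += 1
    return sorted(wins.items(), key=lambda kv: (-kv[1], kv[0]))


comparisons = [
    Comparison("A", "B", p_value=0.001, effect=0.05),  # significant and relevant
    Comparison("A", "C", p_value=0.20, effect=0.08),   # large but not significant
    Comparison("B", "C", p_value=0.01, effect=0.002),  # significant but negligible
]
ranking = rank_by_wins(comparisons)
print(ranking)
```

In this toy input only the first comparison passes both checks, so A is credited with one win while B and C have none. Requiring both conditions is deliberately conservative: a large gap that is not significant and a significant gap that is negligible are both treated as non-wins.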