Occam’s razor The simplest explanation is best. For two hypothesis sets with the same empirical risk, which one is better? The one with the smallest cardinality.