Turn the question on it's head and make up a story where the math matches the observation in every circumstance. If you can, and I'm not sure I could, work backwards from there to find the breaking point. Or just remember that the map is not the territory, the finger that points to the moon is not the moon, and get on with things.
This was my attempt to make up a story where the math would match something real:
Statistically comparing two samples of equids would make some sense if Dr. Yagami had sampled 2987 horses and 8 zebras while Dr. Eru had sampled 2995 horses and 0 zebras. Then Fisher's exact test could tell us that they did, with high probability, not sample the same population with the same methods.
But in the actual case what we have is just a "virtual sample". I'm wondering if there are any conceivable circumstances where a virtual sample would make sense.
Go zebras! Bevtvany grkg erzbirq nf vg unf freirq vgf checbfr.