They didn't try using a different clock in the replication?
I get the impression that there is a huge infrastructure just for measuring time, and that it's easier to check every part of your apparatus ten times over than to build a whole new clock.
A mundane cause for a surprising result. Consider this unconfirmed for now, however unsurprising it sounds.
Source: Science/AAAS