Making Sense of Coronavirus Stats
[Update: Some have pointed out the definition of mortality rate should be that of deaths to some defined population, typically a median estimate over a period of time (week, month, year...) and not limited only to those infected. (New update here. The ratios I have been considering are called Case Fatality Rate.) It was also correctly pointed out that my second calculation was simply wrong. The ration should not be deaths/recovered but deaths/(deaths+recovered) -- that is deaths to the total population considered. ] From what I've seen WHO and other health organizations are saying the mortality rate for the new coronavirus outbreak is between 2 and 3 percent. That seems to be based on the ratio of the reported deaths to the reported cases of infection. That doesn't seem right to me. The last statistics I looked at (news report) was: Total - 75,768 Recovered - 16,329 (21.6%) Deaths - 2129 (2.8%) However, that leave us with a bit over 75% with an unknown end state. If I try to infer the outcome for all the reported infections I can think of two ways to estimate the end results. One, is to assuming the remaining cases will produce a similar outcome as has been observed so far. Using that assumption I can then iterate through the unknown cases using the 21.6% percent will recover, 2.8% will die and ~75% will move to the next round. The other way would be to assume the ratio of the current deaths to current recoveries is a good measure of the mortality rate. Using the first approach the total deaths approaches 8,740 people from the current population of 75,768. The second approach results in a higher number, 9879. Both of these numbers would suggest this version of the coronavirus is pretty bad, in terms of mortality rates. In the context of the other two coronavirus outbreaks, it seem closer to SARS, though a bit worse, than to MERS. Should I think this approach to estimating mortality rates for new diseases without out a long history (like the flu) might be