Oncogenic signalling and heterogeneity in cancer evolution
COVID | data analysis (new CAAT 4.3 release)
WARNING. I am not a medical doctor nor an epidemiologist. The analysis I am sharing here is only for the data geeks around that are curious. Please follow the advice of your national authorities and health system.
I have just published a new release of CAAT, Matlab code to analyse Johns Hopkins dataset on the SARS-CoV-2 pandemics. The usual caveat is that data is likely to be underestimates. Underestimation does not occur only because of lack of transparency but most of the times because of differences in definitions of COVID-related fatalities and efficiency of reporting systems across countries. For example, using the excess mortality statistics we know that about 30-50% under reporting is rather physiological because of deaths occurring outside the hospital settings or because people might die positive to COVID but not for COVID. This also accounted to a significant adjustment of statistics we have noticed in the past in Hubei. However, changes over time within a country might be more reliable and, therefore, there is still something to learn from this data.
I started analyzing data when the UK government decided to drop the policies set up for containment of SARS-CoV-2. Now that data has been extensively discussed in the UK, I curate CAAT only for others that would like to explore the data. However, at this release it is worth mentioning just the main observation. While the first acute phase of the pandemics is subsiding in Europe and Northern America, it is now flaring in South America at worrying pace.
Although I have explained this before, I should probably clarify how I evaluate the population at risk. To estimate how many people might die in each country in the (unrealistic) scenario where everyone would get ill, I used the age-dependent fatality rates published in The Lancet by Ferguson’s group and multiplied these values with the demographics of each country as reported by the UN.