George Rebane
In ‘Testing’s Tower of Babel’ I promised to inform readers of the reliability of testing in order to determine the fraction of a target population that has a defined attribute. Today the whole country is supposed to be interested in the extent of the C19 infection, and also the extent of the recovery from the disease. Such test results are supposed to inform government policy makers about the stage of the pandemic, and whether to respond with more draconian quarantines or begin reviving the nation’s economy – important information indeed.
Let me begin with the punchline of this treatise – to determine a population fraction is more complex than people (and policy makers) are led to believe, and the ‘naïve’ approach of just taking the number of those testing positive from a random sample and dividing that by the size of the sample gives answers that are significantly in error. That is apparent by just recognizing that a real test is unreliable, having less than perfect sensitivity – it misses infected people – and also specificity – it counts some uninfected as infected. Such a test will give a wrong answer for the fraction infected when using the naïve approach on a tested random sample from the target population.
That motivated me to take a closer look, and after pushing some squigglies, the solution to calculating the correct fraction of infected emerged. As expected, the formula involves sample size, and the test’s reliability parameters. All such theoretical developments need confirmation, so I wrote a program to simulate real test results that takes into account errors in all parameter values involved in such testing. The results were heartening, the derived correction formula nails the actual infected fraction, its confidence bounds are easily computed from the relevant input error distributions.
I provide the supporting details for all this in a technical note that you can download here – Download TN2004-2
As a teaser for some more noodling about assessing the state and progress of the C19 pandemic, we read that it’s not only the Chinese who have been lying through their teeth about what happened in Wuhan and what is still going on in the country. Our own mavens of morbidity have not been all that professional and meticulous about the data they publish. Here is a piece in the American Thinker that exposes our CDC which now confesses to having lied about C19 death stats. In ‘The CDC Confesses to Lying About COVID-19 Death Numbers’ we read, “How many people have actually died from COVID-19 is anyone’s guess … but based on how death certificates are being filled out, you can be certain the number is substantially lower than what we are being told. Based on inaccurate, incomplete data people are being terrorized by fearmongers into relinquishing cherished freedoms.” (H/T to reader)
This also has a bearing on testing. When composing a random sample, how do we treat those in the sample who have recently died of C19 (or something else) or have already been diagnosed with C19? Different approaches will inform different policy decisions. Our media are not smart enough to either understand or report on any of these considerations. And does anyone really care about such details, beyond just using their misunderstandings to cast aspersions at each other from their ideological perches?
Now for the downside on the call for all this testing. No one knows how reliable the tests are, especially the new ones. (more here) Somehow all of that has passed by our entire news industry, showing again what kind of pikers they are. Since we know neither the average sensitivity and specificity of the tests, nor their probabilistic dispersions, all of these calls for testing are nothing short of an exercise in politics and emotions. Has anyone even heard Drs Fauci or Brix bringing up test reliability, or showing even a smidgen of concern about that overarching reality in the use of tests and testing???? We'll see what kind of interest RR readers show in this issue.
CDC's guidance for reporting.
Guidance for Certifying Deaths Due to
Coronavirus Disease 2019 (COVID–19)
"If COVID–19 played a role in the death, this condition should
be specified on the death certificate. In many cases, it is
likely that it will be the UCOD, as it can lead to various life
threatening conditions, such as pneumonia and acute respiratory
distress syndrome (ARDS). In these cases, COVID–19 should
be reported on the lowest line used in Part I with the other
conditions to which it gave rise listed on the lines above it. "
https://www.cdc.gov/nchs/data/nvss/vsrg/vsrg03-508.pdf
Posted by: D | 10 April 2020 at 04:59 PM
Somebody calls BS....one man’s opinion.
‘Bullshit Numbers’
You are seeing a lot of covid-19 numbers thrown around. Virtually all of those numbers are bullshit
https://streetwiseprofessor.com/bullshit-numbers/
Posted by: Bill Tozer | 10 April 2020 at 05:41 PM
D@4:59, That makes sense if the lower the position of the causal listing, the less cause of death is attributed to the Xirus. Can't be sure from your last line- which suggests that pneumonia and ARDS would be likely entries, rather than heart failure and COPD which are the actual most dangerous underlying conditions to have going in. What am I not getting?
Posted by: L | 10 April 2020 at 06:04 PM
GeorgeR: I like the exposition of uncertainty in testing regimes. My bet is that a .95 is wildly optimistic for a correct positive test. Since the medical people are basically treating symptoms and not the underlying disease, the whole point of the test appears to be:
. Make people feel better. 'Something is being done about this'.
. Driving public policy decisions.
A couple of things about this latter case.
. The time to measure and institute a public policy decision is larger than the time the epidemic needs to change nature.
. Individual public policy decisions will make less difference than the built-in inaccuracy in measurement. It's hard to tell if they work.
In essence, good public policy become a combination of common sense and throwing everything at the problem.
I spent a few minutes last week pawing through epidemiology papers (praise be to sci-hub and the copyright infringing Russkies) and couldn't help feel that they often are taking a preconceived curve (either calculated, guesstimated, or measured earlier) and are fitting a series of maths to that.
None of this seems well suited to a static treatment what with the the pesky humans at it's base. The importance of initial conditions, change in behavior over time, change in the virus itself over time. It's like weather prediction where clouds have a mind of their own.
Posted by: scenes | 11 April 2020 at 08:06 AM
scenes 806am - You're right on the mark Mr scenes.
Posted by: George Rebane | 11 April 2020 at 09:40 AM