VincentYu comments on The Universal Medical Journal Article Error - Less Wrong

6 Post author: PhilGoetz 29 April 2014 05:57PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (189)

You are viewing a single comment's thread.

Comment author: VincentYu 06 April 2013 03:24:59AM *  34 points [-]

You claim that medical researchers are doing logical inference incorrectly. But they are in fact doing statistical inference and arguing inductively.

Statistical inference and inductive arguments belong in a Bayesian framework. You are making a straw man by translating them into a deductive framework.

Rephrased to say precisely what the study found:

This study tested and rejected the hypothesis that artificial food coloring causes hyperactivity in all children.

No. Mattes and Gittelman's finding is stronger than your rephrasing—your rephrasing omits evidence useful for Bayesian reasoners. For instance, they repeatedly pointed out that they “[studied] only children who were already on the Feingold diet and who were reported by their parents to respond markedly to artificial food colorings.” They claim that this is important because “the Feingold diet hypothesis did not originate from observations of carefully diagnosed children but from anecdotal reports on children similar to the ones we studied.” In other words, they are making an inductive argument:

  1. Most evidence for the Feingold diet hypothesis comes from anecdotal reports.
  2. Most of these anecdotal reports are mistaken.
  3. Thus, there is little evidence for the Feingold diet hypothesis.
  4. Therefore, the Feingold diet hypothesis is wrong.

If you translate this into a deductive framework, of course it will not work. Their paper should be seen in a Bayesian framework, and in this context, their final sentence

The results of this study indicate that artificial food colorings do not affect the behavior of school-age children who are claimed to be sensitive to these agents.

translates into a correct statement about the evidence resulting from their study.

This refereed medical journal article, like many others, made the same mistake as my undergraduate logic students, moving the negation across the quantifier without changing the quantifier. I cannot recall ever seeing a medical journal article prove a negation and not make this mistake when stating its conclusions.

They are not making this mistake. You are looking at a straw man.


Full-texts:

Comment author: Kawoomba 06 April 2013 07:36:12AM 3 points [-]

The number of upvotes for the OP is depressing.

It's a good example for forcing your toolset into every situation you encounter. If all you have is a hammer ...

Comment author: David_Gerard 06 April 2013 11:04:07AM 0 points [-]

Don't worry, we'll have Metamed to save us!

Comment author: PhilGoetz 06 April 2013 02:55:51PM *  0 points [-]

Well, dammit, I wanted to delete this and rewrite above, but you can't delete comments anymore. This is not retracted, but I can't un-retract it.

You are wrong, and you have not learned to reconsider your logic when many smart people disagree with you.

Comment author: Douglas_Knight 06 April 2013 03:33:31PM *  2 points [-]

You can delete retracted comments by reloading the page and clicking on a new delete icon that replaces the retract icon.

Comment author: arundelo 06 April 2013 03:46:05PM 3 points [-]

Only if no-one's replied to them.

Comment author: TimS 06 April 2013 04:26:11PM 2 points [-]

I'm not sure that's true. See here

Comment author: ArisKatsaris 06 April 2013 06:38:12PM *  4 points [-]

People can reply to any comments that they still see in their browser page, even though they've been "deleted", if the replier has not refreshed said browser page.

EDIT TO ADD: As I see that wedrifid also mentions below.

Comment author: Eugine_Nier 08 April 2013 04:34:16AM 1 point [-]

Possibly there is also a similar effect if the deleter hasn't refreshed his browser page.

Comment author: wedrifid 08 April 2013 04:43:45AM *  1 point [-]

Possibly there is also a similar effect if the deleter hasn't refreshed his browser page.

Possibly. Specifically it would be if you (as the example) had retracted the page then refreshed it (to get the 'delete' button available to you) and then there is an arbitrary period of time after which you click the delete button without first refreshing again. (Untested, but those are the circumstances under which it would be at all possible if the code is not specifically designed to prevent it.)

Comment author: Douglas_Knight 06 April 2013 04:47:24PM *  0 points [-]

Why are you not sure of facts that are subject to easy experiments? (update: arundelo is correct)

Comment author: TimS 06 April 2013 05:01:10PM 3 points [-]

Experiment clutters the venue, and being less blunt avoids the appearance of a status conflict.

Comment author: Douglas_Knight 06 April 2013 05:17:32PM 0 points [-]

If deletion is possible, there is very little clutter. If deletion is not possible, and the comment says "I can't figure out how to delete this," at least it discourages other people's experiments. But this thread is itself clutter, so I don't think that is your true rejection. As to bluntness, I conclude that my being less blunt caused you to confabulate bullshit.

PS - I experiment on the open thread.

Comment author: TimS 06 April 2013 05:23:28PM 1 point [-]

As to bluntness, I conclude that my being less blunt caused you to confabulate bullshit.

On reflection, it is probably more accurate for me to say, "I wasn't interested in experimenting, including for concern that the experimenting would look low status, and I have higher preferred ways of acting low status."

As for my own choice not to be blunt, you are not correctly modelling my thought process.

In short, I gave two reasons for my action, and you might be right that one was confabulation, but not the one you identify as confabulation.

Comment author: wedrifid 06 April 2013 05:25:26PM 1 point [-]

Why are you not sure of facts that are subject to easy experiments? (arundelo is wrong)

I have performed the experiment in question and it seems to support arundelo's claim. I am not able to remove this comment. At the very least it demonstrates that the experiment required to prove arundelo's fully general claim is false is not the 'easy' one.

Comment author: TimS 06 April 2013 05:45:08PM *  0 points [-]

Well, now I'm totally confused. Checking Eugine_Nier's account on ibiblio shows that the comment is missing. (Searching for the word "sarcasm" will get you to about when the comment took place, at least as of the date of this comment)

Comment author: wedrifid 06 April 2013 05:55:00PM 1 point [-]

Well, now I'm totally confused. Checking Eugine_Nier's account on ibiblio shows that the comment is missing. (Searching for the word "sarcasm" will get you to about when the comment took place, at least as of the date of this comment)

See my investigation. Short answer: race condition.

Comment author: Douglas_Knight 06 April 2013 05:34:16PM *  0 points [-]

Thanks actually experimenting. My beliefs were two months out of date. I stand by my objection to Tim's comment.

Comment author: wedrifid 06 April 2013 04:38:23PM *  -1 points [-]

I'm not sure that's true. See here

It is possible that the comment was banned by a moderator rather than deleted by the author. (If so, it will still appear if you look at the user's comment page.)

After retraction EDIT: TimS. I can't seem to delete this comment even after refreshing.

Comment author: TimS 06 April 2013 05:04:50PM 1 point [-]

As it happens, I remember what Eugine_Nier wrote, and I am certain it did not meet the local criteria for mod-blocking.

(Anonymous downvoter: What is it in wedrifid's post you'd like to see less of? Helpful commentary about the mechanics of this site is not on my list of things to downvote).

Comment author: wedrifid 06 April 2013 05:08:22PM *  6 points [-]

As it happens, I remember what Eugine_Nier wrote, and I am certain it did not meet the local criteria for mod-blocking.

Interesting. This suggests that a feature has changed at some point since the retraction-then-delete feature was first implemented. (I have memories of needing to be careful to edit the text to blank then retract so as to best emulate the missing 'delete' feature.)

I notice that I am confused. Investigates.

  • Testing deletion feature. Deletion of (grandparent) comment that you have already replied to: Fail. It is still not (usually) possible to delete comments with replies.
  • Check for moderator deletion. (ie. Moderator use of the ban feature, actual delete per se is extremely rare). Confirm absence of a reply on Eugine_Nier's page that fits that part of history. The comment is, indeed, deleted not banned.
  • Check timestamps for plausibility of race condition. Ahh. Yes. Tim, you replied to Eugine within 3 minutes of him writing the comment. This means that most likely Eugine deleted his message while you were writing your reply. Your comment was still permitted to be made despite the deleted parent. The reverse order may also be possible, depending on the details of implementation. Either way, the principle is the same.
Comment author: TimS 06 April 2013 06:42:29PM *  0 points [-]

ArisKatsaris suggests browser refresh, not timestamps, is the issue.

Comment author: Eugine_Nier 08 April 2013 04:36:39AM 2 points [-]

For the record, I did in fact delete the comment.

Comment author: buybuydandavis 07 April 2013 03:01:06AM *  0 points [-]

You claim that medical researchers are doing logical inference incorrectly. But they are in fact doing statistical inference and arguing inductively.

Jaynes argued that probability theory was an extension of logic, so this seems like quite a quibbling point.

Statistical inference and inductive arguments belong in a Bayesian framework. You are making a straw man by translating them into a deductive framework.

They do, but did the paper he dealt with write within a Bayesian framework? I didn't read it, but it sounded like standard "let's test a null hypothesis" fare.

No. Mattes and Gittelman's finding is stronger than your rephrasing—your rephrasing omits evidence useful for Bayesian reasoners.

Which is not a valid objection to Phil's analysis if Mattes and Gittelman weren't doing a Bayesian analysis in the first place. Were they? I'll apologize for not checking myself if I'm wrong, but right now my priors are extremely low so I don't see value in expending the effort to verify.

Their paper should be seen in a Bayesian framework

If they did their calculations in a Bayesian framework. Did they?

Comment author: Matt_Simpson 07 April 2013 03:46:29AM 4 points [-]

hey do, but did the paper he dealt with write within a Bayesian framework? I didn't read it, but it sounded like standard "let's test a null hypothesis" fare.

You don't just ignore evidence because someone used a hypothesis test instead of your favorite Bayesian method. P(null | p value) != P(null)

Comment author: buybuydandavis 07 April 2013 03:54:38AM *  3 points [-]

I ignore evidence when the evidence doesn't relate to the point of contention.

Phil criticized a bit of paper, noting that the statistical analysis involved did not justify the conclusion made. The conclusion did not follow the analysis. Phil was correct in that criticism.

It's just not an argument against Phil that someone might take some of the data in the paper and do a Bayesian analysis that the authors did not do.

Comment author: Matt_Simpson 07 April 2013 04:31:48AM 3 points [-]

It's just not an argument against Phil that someone might take some of the data in the paper and do a Bayesian analysis that the authors did not do.

That's not what I'm saying. I'm saying that what the authors did do IS evidence against the hypothesis in question. Evidence against a homogenous response is evidence against any response (it makes some response less likely)

Comment author: buybuydandavis 07 April 2013 04:50:42AM 3 points [-]

What they did do?

Are you saying the measurements they took make their final claim more likely, or that their analysis of the data is correct and justifies their claim?

Yes, if you arrange things moderately rationally, evidence against a homogenous response is evidence against any response, but much less so. I think Phil agrees with that too, and is objecting to a conclusion based on much less so evidence pretending to have much more justification than it does.

Comment author: Matt_Simpson 07 April 2013 05:25:35AM 1 point [-]

Ok, yeah, translating what the researchers did into a Bayesian framework isn't quite right either. Phil should have translated what they did into a frequentist framework - i.e. he still straw manned them. See my comment here.

Comment author: PhilGoetz 07 April 2013 03:39:20PM 1 point [-]

That's not what I'm saying. I'm saying that what the authors did do IS evidence against the hypothesis in question.

I know that. That's not the point. They claimed to have proven something they did not prove. They did not present this claim in a Bayesian framework.

Comment author: PhilGoetz 07 April 2013 03:57:39PM *  -2 points [-]

No. I am not attacking the inductive argument in your points 1-4 above, which is not made in the paper, is not the basis for their claims, and is not what I am talking about.

You speak of the evidence from their study, but apparently you have not looked at the evidence from their study, presented in table 3. If you looked at the evidence you would see that they have a large number of measures of "hyperactivity", and that they differed between test and control groups. They did not find that there was no difference between the groups. There is always a difference between the groups.

What they did, then, was do an F-test to determine whether the difference was statistically significant, using the assumption that all subjects respond the same way to the intervention. They make that assumption, come up with an F-value, and say, "We did not reach this particular F-value, therefore we did not prove the hypothesis that food dye causes hyperactivity."

THEY DID NOT ASK WHETHER FOOD DYE INCREASED OR DECREASED HYPERACTIVITY BETWEEN THE GROUPS. That is not how an F-test works. They were, strictly speaking, testing the hypothesis whether the two groups differed, not in which direction they differed.

THERE WAS NO EVIDENCE THAT FOOD DYE DOES NOT CAUSE HYPERACTIVITY IN THEIR DATA. Not even interpreted in a Bayesian framework. They found a difference in behavior, they computed an F-value for 95% confidence assuming population homogeneity, and they did not reach that F-value.