@yudapearl
Echoing 
@f2harrell
's journey from a frequentist to a Bayesian statistician, I am
re-posting my journey from a Bayesian to a "Half Bayesian" AI
researcher:  https://ucla.in/2nZN7IH
Readers who have seen it before can just skip, and those who have not,
can take this re-posting as a confirmation that I still stand behind
every word of my 2001 confession.



Mar 27
I improved my second most popular blog article:
https://fharrell.com/post/journey #Statistics #bayes
Norm Matloff 一啲都唔明
@matloff
Even more eloquently written than 
@f2harrell
's, but still missing the point IMO. The key word in your quote of
Savage in the opening statement is "know." If the word is literally
true, than none of us frequentists would object (nor would we consider
it non-frequentist). Empirical Bayes is fine, for instance.

The problem occurs when "know" is replaced by "feel," in which case the
analysis becomes subjective and possibly very biased. One can use the
data to check frequentist assumptions but one cannot check feelings.

Feelings should have no place in scientific research, especially medical
research, which is a public good. If someone wants to incorporate their
feelings into their own private analysis of the stock market, say, good
for them.


Judea Pearl
@yudapearl
·
Mar 27
All scientific knowledge is based on some assumptions. Are you proposing
to purge science from Bayes analysis?


Juan Carlos Silva
@chobitoso
·
Mar 27
I found these discussions somewhat repetitive.  Even some frequentist
algorithms and interpretations rely on Bayesian analysis.  For instance,
diagnostic testing is a pure Bayesian exercise.  They are not completely
independent perspectives.
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
Please explain re diagnostic testing.
Juan Carlos Silva
@chobitoso
·
Mar 27
Simultaneous or in tandem??

Still a pure Bayesian exercise…
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
Sorry, I don't know the term. Please explain.


Juan Carlos Silva
@chobitoso
·
Mar 27
In epidemiology and other fields, you would classify a patient with a
test that has two properties sensitivity and specificity.  However these
properties are affected by a third variable, the prevalence of the
condition.  Thus, you need to update the tests results even if you…


Post

See new posts
Conversation
Judea Pearl
@yudapearl
·
Mar 27
Echoing 
@f2harrell
's journey from a frequentist to a Bayesian statistician, I am
re-posting my journey from a Bayesian to a "Half Bayesian" AI
researcher:  https://ucla.in/2nZN7IH
Readers who have seen it before can just skip, and those who have not,
can take this re-posting as a
Show more
Quote
Frank Harrell
@f2harrell
·
Mar 27
I improved my second most popular blog article:
https://fharrell.com/post/journey #Statistics #bayes
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
Even more eloquently written than 
@f2harrell
's, but still missing the point IMO. The key word in your quote of
Savage in the opening statement is "know." If the word is literally
true, than none of us frequentists would object (nor would we consider
it non-frequentist). Empirical
Show more
Judea Pearl
@yudapearl
·
Mar 27
All scientific knowledge is based on some assumptions. Are you proposing
to purge science from Bayes analysis?

Juan Carlos Silva
@chobitoso
Mar 27
I found these discussions somewhat repetitive.  Even some frequentist
algorithms and interpretations rely on Bayesian analysis.  For instance,
diagnostic testing is a pure Bayesian exercise.  They are not completely
independent perspectives.


Norm Matloff 一啲都唔明
@matloff
Mar 27
Please explain re diagnostic testing.

·
Mar 27
Simultaneous or in tandem??

Still a pure Bayesian exercise…


Norm Matloff 一啲都唔明
@matloff
Mar 27
Sorry, I don't know the term. Please explain.


Juan Carlos Silva
@chobitoso
·
Mar 27
In epidemiology and other fields, you would classify a patient with a
test that has two properties sensitivity and specificity.  However these
properties are affected by a third variable, the prevalence of the
condition.  Thus, you need to update the tests results even if you…


Norm Matloff 一啲都唔明
@matloff
That's entirely frequentist. It's in Fisher LDA for instance, and it is
part of the intercept term in the logit model. It is thus part of the
process of estimating from the data, again thoroughly, classically
frequentist.

Now if the mix in the data is not representative of the population, one
can then do "what if" analysis, looking at various cases. That is
frequentist as long as one does not model the disease prevalence by a
distribution, which becomes subjective.


Norm Matloff 一啲都唔明
@matloff
·
Mar 27
1. I personally do not use (subjective) Bayesian methods. 2. If someone
wants to use such methods for their personal, private use, that's fine
with me. 3. But (subjective) Bayesian methods should not be used in
settings affecting the public, e.g. medical research.



Biostatsfun
@biostatsfun
Mar 27
And the issue raised by Frank is decision making based on p-values. 

A clinician understands pr(benefit) better than p-values.


Norm Matloff 一啲都唔明
@matloff
Mar 27
What clinicians understand or not is a whole other interesting topic.
:-) Re p-values, DON'T USE THEM.

But just because a clinician may "understand" superstition doesn't make
it right.



Biostatsfun
@biostatsfun
·
Mar 27
But you check prior sensitivity, what’s the issue? 

How does justifying a prior = frequentist? 

No, I wouldn’t equate plausibility with feelings. On feelings, I’d trust
a physicians “feelings” over my own.


Norm Matloff 一啲都唔明
@matloff
Mar 27
Fine, but the problem is that one doctor's feelings will be different
from another's. That's why it's not scientific.

As to checking sensitivity, think of the implication: If one gets
essentially the same results with different priors, one is basically
back to the frequentist realm. If one gets different results with
different priors, then which one is correct? It's all superstition.


Judea Pearl
@yudapearl
Mar 27
All scientific knowledge is based on some assumptions. Are you proposing
to purge science from Bayes analysis?
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
Please reread my point about checking assumptions.



Dylan Armbruster
@dylanarmbruste3
Mar 27
Would you say you lean more towards Frequentist?
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
Not lean towards. 100% for anything that affects the public.
·
Mar 27
What do you mean. How would you classify yourself then?
Norm Matloff 一啲都唔明
@matloff
·
Mar 27
1. I personally do not use (subjective) Bayesian methods. 2. If someone
wants to use such methods for their personal, private use, that's fine
with me. 3. But (subjective) Bayesian methods should not be used in
settings affecting the public, e.g. medical research.


Frank Harrell
@f2harrell
If you think that Bayesian methods are more subjective than frequentist
ones you have not deeply examined frequentist methods.
4:39 AM · Mar 28, 2025
·

Norm Matloff 一啲都唔明
@matloff
Mar 28
I think this is at least the third time you and I have debated freq. vs.
Bayes here on Twitter/X. I've looked deeply at whatever you've brought
up, but you now say I've overlooked some points in your newly-expanded
article. I will definitely take a look.


Frank Harrell
@f2harrell
Mar 28
This is a bit of an oversimplification, but I don't love Bayes because
it's perfect.  I love Bayes because of (1) its problem-solving
capabilities and (2) frequentist approaches are deeply flawed.


Norm Matloff 一啲都唔明
@matloff
Mar 27
Even more eloquently written than 
@f2harrell
's, but still missing the point IMO. The key word in your quote of
Savage in the opening statement is "know." If the word is literally
true, than none of us frequentists would object (nor would we consider
it non-frequentist). Empirical


Norm Matloff 一啲都唔明
@matloff
OK, I've gone through your article again, and really don't see anything
new or view-changing. 90% of it is on the problems with hypothesis
testing, which as you know I oppose. 

(You probably don't remember this, but the very first interaction I had
with you was when in some online forum you said that R should not be
reporting the "1 star, 2 stars, 3 stars" p-values paradigm, and I said
something like "YES!! Finally glad to see someone other than me say
it.")

A couple of your statements did bring up very important issues (maybe
these are among the "new" ones). First, "I came to not believe in the
possibility of infinitely many repetitions of identical experiments, as
required to be envisioned in the frequentist paradigm." I disagree; on
the contrary, we all go through life implicitly making decisions on this
basis, whether we are buying insurance, gambling in a casino or
whatever. Ask any gambler (I'm not one) what it means that the
probability of a payoff is 0.22, and after some thought he/she will
indeed talk in terms of repeated trials. 

Ironically, I hope you will recall that I've often said, "We are all
Bayesians." We do have our priors in the numerous decision-making
contexts we encounter in life, and informally act on those priors.
Again, this is fine with me -- when I said "We all" above, I meant it --
but priors should have no place in scientific research, due to the
subjectivity.

The other statement in your article that caught my eye concerned
frequentists' need "to completely specify the experimental design,
sampling scheme, and data generating process..." I would agree,
providing one says "to contemplate the implications of" rather than "to
completely specify." In fact, there was an example of this earlier in
this thread, when we were discussing the overall prevalence of a disease
or condition. I said that we must look at the nature of the sampling
process, asking whether it is representative of the target population. I
then said we might engage in "What if" questions along these lines.


Arman Oganisian
@StableMarkets
Mar 28
How would you answer a doc who wants the probability that the patient in
their clinic now will relapse w/n 1 yr? not “the proportion of patients
drawn from an infinite subpopulation of patients with the same features
as that patient” but the probability for that specific patient.


Norm Matloff 一啲都唔明
@matloff
·
Mar 28
The doc should first say "x% of people like you have a relapse." But a
good doctor should add, "Of course, there are many factors underlying
this, some of which we know and some we don't know, so it's possible
that one of the latter factors makes your case very different."

Note that the above applies to both frequentist and Bayesian analyses.
The only difference is that the probabilities in the latter case are not
real data, just personal hunches that may vary from one doctor to
another. An ethical, responsible Bayesian physician should disclose
this.

Norm Matloff 一啲都唔明
@matloff
Going a little further, the doctor should give the patient the
probability of a relapse, given those known factors, e.g. age, if it is
available. Again, as a patient, this is what I would want, and I'm
pretty sure most patients would want it, even if they have no statistics
background.

Interestingly, that's what the Kaiser system apparently does in reports
for blood workups. The "Normal Range" in their graphs is tailored to the
patient's age and gender (though it doesn't say so), a nurse told me.


Frank Harrell
@f2harrell
Mar 28
Thanks for all those comments Norm.  I have to take issue with your
belief that life decisions are made like sampling statisticians think or
that gamblers work like that.  A resounding "no" to those two.  Life and
gambling are all about playing the odds in one-time situations.


Norm Matloff 一啲都唔明
@matloff
Actually, after I posted this tweet, I put the question to several LLMs,
asking how people perceive that 0.22 figure for winning a game. All of
them cited repeated trials as one of the ways that number is perceived.
(The other ways they cited didn't really pertain to the question, for
example noting that many people tend to underestimate probabilities.)

One can dismiss responses from LLMs -- I didn't ask the LLMs whether
they are frequentist or Bayesian :-) -- but still I think their
responses here are worth considering, don't you agree?

The fact that people are making one-time decisions is not really
relevant. People make many many one-time decisions of various kinds over
their lifespans, and in principle many of those will involve
probabilities of 0.22 or close to it. So there is in fact a long run to
consider, even if it consists of a series of one-time settings. 

not saying that the gambler in this scenario will say, "Hmm, if I play
this game many times...” but if you actually ask him/her what the 22%
figure means, they will give it some thought and then give some sort of
answer based on repeated trials. 

I think the situation becomes clearer if one asks the same question
about expected value, E(X). Putting aside the point that it is misnamed,
how is it perceived by people? Of course, most people have never heard
the term, but if you explain that it is a long run average based on the
probabilities of the events in question, they will immediately
understand, and accept that it is a useful quantity to have, even though
you bring it up in the context of one-time decision. I would humbly
submit that explaining expected value in a Bayesian context is a bit of
a challenge. If you have experience with this, I would certainly like to
hear it.


Frank Harrell
@f2harrell
Mar 28
No, I don't think that idea of probability in that setting is worth
considering.  A successful poker player is good at estimating
probability of ultimately winning the hand in the one-time situation
(one-time because it's not only the cards; it's also the players).


Norm Matloff 一啲都唔明
@matloff
All true (in the second sentence), but still not addressing the issue of
how that player is assessing/interpreting the 0.22 probability. 

As I said, the one-time nature is irrelevant, as this player, can and I
claim does view this in the context of a lifetime of poker with 0.22
probability situations or a lifetime of 0.22 probability events of
various kinds (games and nongames). 

Note the word situations; this player may play with different people, be
dealt different hands and so on, but will encounter various situations
which, though basically different in lots of ways, still have probabity
0.22 (or near it). In other words, there are repeated trials with the
same probability even though the trials are qualitative different. It's
like tossing many different but fair coins; we still have probability
0.5 in each toss, even though it's with a different coin each time.


Arman Oganisian
@StableMarkets
Mar 29
What I mean is the doctor didn’t ask for the rate in the subpopulation
of patients who share certain features with patient i. They asked for
the probability that that particular patient i would relapse. The valid
frequentist response doesn’t answer this.



Arman Oganisian
@StableMarkets
·
Mar 29
From a Bayes perspective this is a well-defined request for the
posterior predictive probability of relapse Y_i given data on previous
i-1 subjects the doc has seen: P(Y_i =1 | y_1:i-1 ). It’s not
equivocating - just viewing probability as more than just sampling
error.
Arman Oganisian
@StableMarkets
·
Mar 29
There are (at least?) two valid frequentist responses. 1) Norm’s
response, which answers a different question. 2) saying that the
probability of relapse is either 100% or 0% and we won’t know for 1 yr -
see Blackwell. Going with 2) would ensure no doc works with me ever
again!


Norm Matloff 一啲都唔明
@matloff
Comments:

1.  Putting such questions to mathematicians, e.g. Blackwell, is
generally counterproductive. They haven't had much if any experience
working on real-life problems with real-life data, and tend to
appreciate the mathematical eloquence of Bayesian methods. Same BTW for
philosophers, specifically meaning @learnfromerror . (She blocks me,
BTW, so if you add her to this conversation, I won't see what she says.)

2. As a patient, I want clinicians to give me reasonable assessments.
Even if they give me a Bayesian answer, it should mention the role of
unknown factors in the studies etc. Earlier in this thread, I talked of
"ethical, responsible" physicians, and the same goes for statisticians.
To say, "I'll give them an overly simplistic answer, to shut them up,"
is unethical and irresponsible.

3. It's wrong to say "The probability is either 1 or 0." My usual
explanation is to refer to the Monty Hall game show example. (A couple
of you brough in game theory; well, this is my game theory. :-) )


Frank Harrell
@f2harrell
Mar 28
di Finetti is probably a good source here.


Norm Matloff 一啲都唔明
@matloff
Mar 28
I'd happy to look at a specific reference if you have one, but I must
say a priori that I doubt that game theory -- mathematical derivations
and proofs -- can shed any light on the topic at hand here, which
involves mental perceptions of probability and expected value.