Counterfactuals and Grue
All of the emeralds we’ve observed have been green. They’ve also been
grue—either green and observed or blue and unobserved. Yet we take our
observations to support or confirm the hypothesis that all emeralds are
green, but not the hypothesis that all emeralds are grue. We inductively
project the ‘green’ predicate, but not the ‘grue’ predicate. The new
riddle of induction is roughly the challenge of explaining and
vindicating these inductive policies.
One popular strategy for answering the riddle appeals to
counterfactual differences between green and grue. For a given
observed green emerald, \(\textrm{e}\),
the following counterfactuals seem true:
(I)
(II)
If \(\textrm{e}\) hadn’t been
observed, it still would have been green. But if it hadn’t been
observed, it wouldn’t have been grue. On this strategy, counterfactuals
like these are key to understanding why projecting ‘green’ is warranted
but projecting ‘grue’ is not.
The counterfactual strategy comes in different forms. Not all of them
appeal to (I) and (II). And not all that appeal to (I) or (II) do so in the exact
same way. For some it is our knowledge of such counterfactuals
that is key (Jackson
1975). For others it is enough that we believe such
counterfactuals (Schramm 2014; Okasha
2007). For still others, the truth
of (I) and (II) is what
matters (Godfrey-Smith
2003, 2011). These differences are important,
and will be relevant at several points below, but I won’t belabor them.
It has been claimed that every counterfactual approach to grue
fails for the same reason—counterfactual symmetry (Roskies 2008; Dorst 2016, 2018). I will respond to the challenge
here, but obviously, even if my response is successful, counterfactual
responses to grue might fail for other reasons.
Counterfactual Symmetry
The grue theorist can respond to counterfactual approaches by
rejecting (I) and (II) and
instead accepting:
(III)
(IV)
Given that the antecedent is possible (\(\lozenge\neg\textrm{Observed(e)}\)), (I) and (III) are
incompatible, as are (II) and (IV). The challenge is roughly that if we can
vindicate our practices by appealing to (I) and
(II), the grue theorist can vindicate gruesome
practices by instead appealing to (III) and (IV). The exact nature of the appeal will, of course,
depend on the details of the counterfactual account on offer. Taken
generally, the claim is that the gruesome practice is symmetrical to our
non-gruesome practice, in the epistemically relevant way.
Some counterfactual theorists will be fine with this. Alfred Schramm
merely attempts to show that at most one out of ‘all emeralds are green’
and ‘all emeralds are grue’ is confirmed by our evidence. And Samir Okasha’s position makes
warrant for an inductive inference relative to our beliefs. If you believe (I) and (II), you are warranted
in concluding that all emeralds are green. If you instead believe (III) and (IV), you are
warranted in concluding that all emeralds are grue. These versions of
the counterfactual strategy aim for a somewhat limited conclusion. They
want to show that projecting ‘green’ but not ‘grue’ is warranted for
us, given our beliefs.
This is compatible with gruesome practices being warranted relative
to alternative beliefs. Admitting this is not paradoxical, nor need it
collapse into epistemic relativism. Nearly everyone accepts this kind of
distinction. Different background beliefs warrant different conclusions.
The kind of epistemic warrant being used here is broadly
internalist; it is based on factors internal and accessible to
the agent, such as their beliefs. Call this kind of internalist
epistemic warrant justification. There is also an externalist
kind of epistemic warrant, call it entitlement. Entitlement can
depend on factors completely outside of and inaccessible to an agent. In
the case at hand, perhaps the mere truth of (I)
and (II) means that we are entitled to ‘green’
projections, but not ‘grue’ projections.
So both justification and entitlement to ‘green’ projections can be
secured using counterfactual strategies. This is significant; most
attempted replies to grue don’t even get this far. But while it
certainly shouldn’t be dismissed, we may hope for a bit more. On the
internalist front, we might hope for non-question-begging background
arguments that beliefs in (I) and (II) are justified, while beliefs in (II) and (IV) are not. On the
externalist front, we might hope for non-question-begging arguments that
(I) and (II) are true, while
(II) and (IV) are not.
These and other hopes seem to hinge on our being able to successfully
defend a counterfactual theory based on (I) and
(II) against a gruesome counterfactual theory
based on (II) and (IV).
Can counterfactual attempts to solve grue successfully rebut the
challenge of counterfactual symmetry? This is our question. To answer
it, we’ll need to distinguish between two different ways of pushing the
symmetry challenge. I think that both ways fail, but they fail for
different reasons.
The Epistemological Version
Goodman originally defined ‘grue’ as being either green and observed
before \(t\), or blue and unobserved
before \(t\). Above, I followed the
common practice of simply assuming that \(t\) is now. Either way, the
definition of ‘grue’ involves a temporal element. A natural first
thought is that this temporal element is what defeats the projection of
‘grue’. In response to this, Goodman introduced a parallel term,
‘bleen’, meaning observed and blue or unobserved and green. He then
noted that ‘green’ and ‘blue’ are definable using ‘grue’ and ‘bleen’
(Goodman
1955). In a language that starts with ‘grue’ and ‘bleen’, it
is ‘green’ and not ‘grue’ that has an explicit temporal element in its
definition. This shows that whether a term’s definition includes a
temporal element—and even whether a term is defined at all—is
language-relative.
This point is much cleaner than the counterfactual symmetry point.
All Goodman needed to show was that a definition existed for ‘green’
that included an explicit temporal element. The correctness of Goodman’s
definition is common ground in the debate. Some have doubted the
possibility of a grue/bleen mother tongue, and others have pointed to
lingering temporal asymmetries between ‘grue’ and ‘green’, but nobody
challenges the accuracy of Goodman’s definition of ‘green’ in terms of
‘grue’ and ‘bleen’. The definitional facts are agreed upon by all
parties.
Not so for the counterfactual facts. Counterfactual claims are about
the world and its features. This match would light, if struck.
This sugar cube would dissolve, if placed in water. That star
would collapse into a black hole, if it were twice as massive.
Those who disagree with us about the world and its features may well
disagree with us about which properties are independent of our
observational procedures. So unlike grue theorists who adopt Goodman’s
symmetrical definitions, grue theorists who endorse symmetrical
counterfactuals disagree with us about matters of fact.
The question is whether there is truly symmetry here and, if so, what
kind? One idea is that there is epistemological symmetry. Adina
Roskies (2008) has
pushed something like this form of the counterfactual symmetry point,
calling it the problem of ‘counterfactual robustness’. Her direct target
is Jackson’s theory, which required knowledge of counterfactuals like
(I) and (II). Roskies claims
that any route to this knowledge is question-begging against proponents
of (III) and (IV).
This version of the counterfactual symmetry challenge isn’t merely
the original grue puzzle in another guise, at least on the most natural
understanding. It is instead akin to a skeptical challenge
based on grue. It asks us how we know that the world isn’t radically
different than we think it is. How do we know that color features don’t
depend on observations? The challenge is now to vindicate our overall
picture of physical reality, not just to defend some of our local
inductive policies.
Of course, from the earliest presentations, Goodman tied the grue
puzzle to issues of laws and counterfactuals. So you might question my
claim that the epistemological symmetry challenge substantively differs
from the original grue puzzle. I agree that in its more expansive
formulations, Goodman’s puzzle concerns not only induction proper but
also more general explanatory reasoning, including abduction or
‘inference to the best explanation.’ This point was perhaps first
clearly made in the literature by John Moreland in an excellent but
little-noticed discussion published in 1976:
What is misleading in Goodman’s formulation of the Riddle is that it
mixes questions of induction with questions of abduction. It is not just
a question of which [hypothesis] to project. We have seen that in the
appropriate circumstances either might be projected. We wish normally to
reject [the gruesome hypothesis] out of hand (regardless of the
evidence) because in most situations [the gruesome hypothesis] would not
be accepted as an explanatory hypothesis; and this is a question of
abduction, not induction. […] it does seem important to distinguish
between the question of whether or not [the gruesome hypothesis] is to
be projected in a given situation and the question of whether or not
[the gruesome hypothesis] would ever be formulated as an explanatory
hypothesis and, thus, made a candidate for inductive confirmation. (Moreland 1976,
376)
In the case of Jackson’s counterfactual theory, a division like this
falls out quite naturally. We start with the question of which
predicates we can project, but answering that question involves an
appeal to background knowledge of counterfactuals. This background
knowledge is what is questioned by Roskies. There are at least two
questions here, and they are not the same. Proponents of counterfactual
theories are independently committed to distinguishing them.
In fact, everyone must distinguish between questions of
induction and questions of abduction, not just counterfactual
theorists. So this isn’t an ad hoc
move of desperation in the face of refutation. Nor does distinguishing
these questions mean that a unified inductive logic, covering all
reasonable non-deductive reasoning, is impossible.
With this distinction noted, Jackson can plausibly explain how it is
that we know (II) (which is what, on his account,
blocks the projection of ‘grue’). The overall answer is likely a very
long story. In short: an extended process of
observation, induction, deduction, and—most crucially—abduction or a
related epistemic process led to our overall theory of the natural
world. This overall scientific story entails (II),
so knowing this, our overall theoretical knowledge transfers from our
background theory to (II). Unless
everything is always up for grabs, it is perfectly
legitimate to appeal to our fundamental beliefs about the natural world
when evaluating some particular inductive inference involving a newly
introduced predicate. No question is begged in the process. Toward the
end of her discussion, Roskies herself indicates openness to this type
of reply to her challenge. She says her goal was only to show that a
Jackson-style counterfactual account required supplementation.
I don’t disagree completely, but we should put the point somewhat
differently. We should say that Jackson’s account of projection is fine
as it stands, but add that it appeals to background knowledge that must
itself be explained, in the long run. That explanation will involve not
the original anti-grue reasoning, but instead general explanatory
reasoning about the world, so there is no circularity. If general
explanatory considerations tell against the overall grue position,
including the alternative counterfactuals, then Jackson has an answer to
the symmetry challenge.
It is worth noting that in requiring knowledge of
counterfactuals like (II), Jackson’s account is
extremely demanding. Every other counterfactual theory
of projection requires much less of us. This is important. I already
mentioned that variant theories like Okasha’s and Schramm’s require only
belief in the relevant background counterfactuals, not
knowledge. And other counterfactual theories, like
Godfrey-Smith’s, require only the truth of the relevant
counterfactuals. I highlight these points to stress that, by considering
the symmetry challenge as aimed at Jackson’s original counterfactual
approach, we have been considering it in its strongest form. Other
counterfactual theories should do at least as well at answering
the challenge.
Whatever form the counterfactual theory takes, there is no
epistemological symmetry between us and the grue theorists with respect
to these counterfactuals. If all parties understand counterfactuals as
we do, then there are good reasons for preferring (I) and (II) over (III) and (IV). These reasons
are general and theoretical and explanatory, but they aren’t
question-begging. Of course, this assumes that the challenge is posed
using our understanding of counterfactuals. There is another
way of pressing the counterfactual symmetry challenge. This more radical
approach has recently been pursued by Christopher Dorst in critical
discussions of the theories of both Alfred Schramm and Wolfgang
Freitag. Here I’ll be discussing the general
merits of the challenge, not its justice as an objection to any
particular counterfactual theory.
The Similarity Version
Consider how we semantically evaluate counterfactuals like (I), (II), (III), and (IV). Obviously,
we used and asserted and evaluated counterfactuals long before anyone
came up with an explicit semantic theory for counterfactuals. Still, a
semantics is useful for codifying the truth conditions our practices
assign to counterfactuals. The usual counterfactual semantics derives
from Stalnaker and Lewis and uses a similarity metric over the space of
possible worlds. Here’s a simplified version of this
kind of semantics:
What exactly ‘similarity’ comes to here has been much discussed. There is broad agreement over
cases, but the precise analysis is tricky. Sometimes ‘similarity’ is
claimed as subjective—including by Goodman himself.
Yet if subjectivity about similarity is combined with a
similarity-semantics for counterfactuals, and then fed into our
scientific and inductive practices as the counterfactual strategy
requires, absurdities result. This will be
illustrated below.
Let’s first assume that the relevant notion of ‘similarity’, though
context-sensitive, is not completely subjective. Given what we
mean by ‘most similar’ in this semantic clause, the only way for (III) and (IV) to be true
while (I) and (II) are false
is for the world to be wildly different in the manner discussed in the
previous section. Yet there is another option. Grue theorists could
appeal to radically different ‘similarity’ judgments, and then use those
judgments in their counterfactual semantics without otherwise
disagreeing with us about the world.
This involves saying that a world in which an observed green emerald
\(\textrm{e}\)—this very one—is
unobserved and blue, is more similar to our world than is a
world where \(\textrm{e}\) is
unobserved but green. This is bizarre to us, given what we mean by
‘similar.’ Perhaps there are possible worlds where \(\textrm{e}\), this very thing, is blue and
not green. But given everything we know and believe about physics,
chemistry, optics, and more, such a world must be very dissimilar to our
world. The imagined grue theorist denies this. They agree with us about
which worlds are possible. And they also agree with us about the facts
in this world, but they disagree about how similar certain worlds are to
this world.
Something like this reply might be implicit in some of Goodman’s
later discussions of gruesome matters. More recently it has been
explicitly pursued by Dorst in reply to recent counterfactual
approaches:
We are thus examining the same world in both cases, so only one of
the two counterfactuals can possibly be true. […] But which one is true?
That will evidently depend on the similarity metric we impose on the
space of possible worlds. On our traditional understanding of
‘similarity,’ the closest (most similar) world where the emeralds in our
evidence class were not observed before 2020 will be one in which they
are green and not grue. Surely, however, a ‘grue’-speaker would have
exactly the opposite conception of ‘similarity.’ After all, he thinks
grue things all “look alike,” so it is only natural that his conception
of similarity would reflect that. […] So if we appeal to counterfactuals
to justify the ‘green’ induction over the ‘grue’ induction, the
‘grue’-speaker will have a precisely symmetric justification open to
him. (Dorst 2016,
153)
This understanding of the counterfactual symmetry challenge differs
from the epistemological understanding discussed above. In some ways it
is a more radical and troubling challenge.
There is no accounting for taste, and maybe there is no accounting
for weird similarity judgments either. Yet meaning is determined by use.
It’s plausible that anyone who clear-headedly used the term ‘similar’ so
differently would no longer mean what we mean by the term. If they then
use their alternative notion of ‘similarity’ in giving a counterfactual
semantics, this difference in meaning will also infect terms like
‘would’ and ‘counterfactual.’ But the real issue is not about semantic
theory. The real issue is use—the use that the formal semantic
theory was meant to codify. Drastic changes in use lead to changes in
meaning. If these grue theorists use counterfactuals in a way that
aligns with their ‘similarity’ judgments and not ours, then they no
longer mean what we mean by counterfactuals.
If this “change of meaning” charge is true, it provides a response to
the similarity version of the counterfactual symmetry challenge. The
response is that, in adopting this version of the challenge, grue
theorists have changed their language significantly. They have changed
it so much, in fact, that they no longer disagree with us. Our
dispute has devolved into a merely verbal dispute, with no direct
disagreement.
In order to see this, it is important to understand that the kind of
linguistic change involved here is not the simple change of
moving to a language in which there are primitives for ‘grue’ and
‘bleen’ but all else remains the same. In that type of language,
‘similar’ still has the same meaning it has in our language. So those
‘grue’-speakers will agree with us about (I), (II), (III), and (IV) or rather, about their translations
into the grue language. That change did not amount to a difference in
worldview, only a difference in language. This shows that the radical
counterfactual similarity charge is not backed up by the possibility of
grue/bleen languages of the kind discussed by Goodman.
Instead a much more radical linguistic change is required, one that
systematically alters the truth conditions of counterfactuals.
Some may quibble. Has meaning really been changed, they will
ask. Anyone who thinks meaning is closely tied to use will say
yes. And since almost everyone thinks that meaning is closely
tied to use, almost everyone will say yes. Even Quine, the
arch-critic of analyticity, argued that drastic meaning changes
undermine simple homophonic translations (Quine 1970).
So I don’t think my claims about meaning change beg any significant
questions about analyticity or the like.
We could argue for meaning change here theoretically, by
appealing to widely accepted theoretical principles of interpretation or
translation—charity, humanity, rationality, and so on.
But the central point is probably best illustrated more simply, by
reflecting on simple applications of our actual practices of translation
and interpretation. Imagine that you encounter someone who
clear-headedly makes ‘similarity’ judgments that align with those of our
imagined grue theorist. Even after all of the facts are in, they
continue to disagree with you. They say that grue things are ‘more
similar’ to each other than green things, even more similar with respect
to ‘color,’ and that grue things, but not green things, ‘look alike.’
After you convinced yourself that these divergences are not caused by
some perceptual deficiency or a mistake about the factual situation, you
would conclude that your interlocutor simply spoke a different language
than you did. They simply do not mean what you mean by ‘similar’ or the
like.
This meaning change diagnosis is the best and most appealing way to
understand the apparent disagreement here. To some extent though, it can
be left to one side. The crucial point concerns the differences in
practical language use. Even those who think there is a difference of
opinion, not a difference of meaning, must admit that the gruesome
practices differ wildly from our own. Let me provide a concrete
illustration of this by considering what happens after time
\(t\), where \(t\) is the time to which the definition of
‘grue’ is indexed. Let’s update Goodman’s original definition with a
predicate for ‘observed before 2020’: \[
\textrm{Grue2020}(\alpha) \leftrightarrow (\textrm{Green}(\alpha) \land
\textrm{Observed2020}(\alpha)) \lor (\textrm{Blue}(\alpha) \land
\neg\textrm{Observed2020}(\alpha))
\] Something is grue2020 just in case it is either green and
observed before 2020 or blue and not observed before 2020. Since it is
now past 2020, we can observe previously unobserved emeralds without
them being observed2020. What happens when we do is instructive. On
January 1st, 2020, the grue defender is committed to the following for
previously observed emerald, \(\textrm{e}\):
(III*)
(IV*)
Now let us observe a previously unobserved emerald, \(\textrm{m}\).
1. |
\(\neg\textrm{Observed2020(m)} \land
\textrm{Emerald(m)}\) |
(assumption) |
2. |
\(\forall
x(\textrm{Emerald}(x) \rightarrow \textrm{Grue2020}(x))\) |
(inductive projection made by the grue
defenders, backed up by (III*) and (IV*)) |
3. |
\(\textrm{Grue2020(m)}\) |
(1,2) |
4. |
\(\textrm{Blue}(\textrm{m})\) |
(1,3 and the definition of
‘Grue2020’) |
In other words, these grue theorists can prove to themselves that
\(\textrm{m}\) is blue, and then they
look to the world and see that it’s green. Saying that \(\textrm{m}\) is green does not beg any
questions here. This is because the similarity-based symmetry challenge
differs from the epistemological challenge. The radical grue theory
under consideration is supposed to agree with us about all of the
physical facts, including facts about the color of emeralds. They were
supposed to differ from us only over similarity-based claims.
Related arguments have been used elsewhere in the massive grue
literature, for different purposes. The purpose here is to
illustrate that when counterfactuals connect to induction, as proponents
of the counterfactual strategy believe that they do, our practice of
evaluating counterfactuals is not isolated. It instead feeds into a
cluster of related physical notions, including ‘nomological modality’,
‘laws’, ‘dispositions’, and ‘causes’.
So if you change what you count as ‘relevantly similar,’ you change a
great deal indeed. Someone who changes what is meant
by counterfactual terms would be ill-advised to fit their alternative
‘counterfactual’ notions into the same conceptual space as our notions.
Doing so leads to radically different ways of reasoning about and
interacting with the same natural world.
Claiming that similarity itself is entirely subjective doesn’t change
this. If you say that, and then use similarity-relations to analyze the
counterfactuals which underwrite justified inductive inferences, you
descend into a subjectivist nightmare.
Use any alternative counterfactual practice and you will likely find
yourself with many false beliefs and many frustrated expectations. You
could get lucky, but I wouldn’t bet on it. Neither would you. Induction
is not a pointless game we play for our amusement, it is instead a
crucial part of how we reason about and master the physical world that
surrounds and includes us.
So, on neither reading does the counterfactual symmetry charge lead
to genuine and troubling symmetry between our position and the grue
theory. If the symmetry challenge is posed using our
counterfactual notions, then we have non-question-begging
epistemic reasons for favoring our counterfactuals over theirs.
And if it is posed using some alternative notion of
counterfactual similarity, then we have non-question-begging
practical reasons for favoring our counterfactual practice over
theirs. Either way, we have non-question-begging reasons for favoring
our practices over the gruesome practices. Counterfactual approaches to
grue might fail for other reasons, but the counterfactual symmetry
charge doesn’t stick. And despite its superficial appeal, the
argumentative strategy it exemplifies is quite risky. I will close by
explaining this.
Philosophical Fair Play
Twentieth-century philosophy was replete with overtly
semantic, broadly skeptical challenges. These
challenges attacked some of our most cherished doctrines using clever
semantic tricks, principally clever redefinitions of crucial terms. The
targets differed, as did the particular semantic tactics employed. Yet a
general similarity between these challenges is easily recognized,
provided it isn’t overstated. Quine’s translation argument, Putnam’s
model-theoretic argument, and Kripkenstein’s skeptical paradox all fit
into this model (Quine 1960; Putnam 1980; Kripke
1982). So too, does Goodman’s grue puzzle.
Seen from this perspective, the overall dialectic surrounding the
counterfactual symmetry challenge is quite familiar. A challenge has
been posed by a semantic skeptic. One of our treasured assumptions is
under threat. We rush in gallantly to offer a defense. Alas, the
semantic skeptic uses a version of the original re-definition move yet
again. This time on the very defense we have offered. The defense itself
is seen by the skeptic as “just more theory” to be reinterpreted, just
more grist for the skeptical mill.
More often than not though, this move is not quite fair. When a
constraint is used to screen off some skeptical reinterpretation,
reinterpreting the statement of the constraint misses the mark.
If we respond to Kripkenstein by claiming we mean addition and not
quaddition by ‘plus’ because we execute the addition algorithm in
response to ‘plus’ queries, talk of “quaddition algorithms” misses the
point. The constraint concerns what we do, not what we say
about what we do. Likewise with Quine’s challenge, and Putnam’s.
Likewise too, with Goodman’s grue challenge.
Almost the same exact dialectic pops up again and again, all across
the philosophical landscape, so the point is worth belaboring. Skeptical
reinterpretation is risky. Great care must be taken whenever
the move is attempted. In the present context, we have seen that blithe
appeals to gruesome counterfactuals come with baggage. The
counterfactual symmetry claim has hidden costs. Either a commitment to
absurd factual claims or an unnoticed change of topic. In contexts like
this, we must always take care to tease out all ramifications of the
skeptic’s maneuvering. The semantic skeptic’s tricks are ever so easy to
apply, but they can very quickly take us into uncharted waters, where
monsters lurk. In these waters, merely ersatz symmetry is often mistaken
for the real thing.
Acknowledgements
Thanks to Rosa Cao, Christopher Dorst, Eli Hirsch, Alfred Schramm,
Douglas Stalker, and Daniel Waxman.