Modus Tollens Probabilized

CARL G. WAGNER
University of Tennessee, U. S. A.

Abstract

We establish a probabilized version of modus tollens, deriving from
p(E|H) = a and p(Ē) = b the best possible bounds on p(H̄). In particular,
we show that p(H̄) → 1 as a, b → 1, and also as a, b → 0.

Keywords

conditional probability, modus ponens, modus tollens, probability logic

1 Introduction

Given an exact or approximate specification of the probabilities of propositions
A1, . . . , An, what are the narrowest bounds that can be placed on the probability of
some other proposition B? This problem was first investigated by George Boole
[B54], whose work was revived by Theodore Hailperin [H65, H84, H96] in a
sequence of publications that offered a fully rigorous, comprehensive treatment
of the problem from the standpoint of linear programming. We follow Hailperin
in calling this problem Boole’s General Probability Problem.

While the propositions in question may be arbitrary, special interest naturally
attaches to cases in which the inference from the conjunction of the “premisses”
Ai to the “conclusion” B is truth-functionally valid, symbolized in what follows by
A1, . . . , An � B. For solutions to Boole’s General Probability Problem in such cases
may be expected to reveal the degree of confidence in the premisses Ai sufficient
to ensure some desired degree of confidence in B, as measured by their subjective
probabilities. Elementary examples include the valid argument

A1, . . . , An � A1 ∨···∨ An,

which has the probabilistic counterpart

p(Ai) = ai ⇒ max{ai} 6 p(A1 ∨···∨ An) 6 min{1, a1 +···+ an}, (1)

and the valid argument

A1, . . . , An � A1 ∧···∧ An,

1


2 Proceedings in Informatics

which has the probabilistic counterpart

p(Ai) = ai ⇒ a1 +···+ an −(n − 1) 6 p(A1 ∧···∧ An) 6 min{ai}. (2)

The right-hand inequality in (1) and the left-hand inequality in (2) are, respec-
tively, the familiar inequalities of Boole and Bonferroni. That the bounds in (1)
and (2) are the best possible was apparently first shown by Maurice Fréchet [F35].

A number of valid arguments of propositional logic involve the truth-function-
al connective −→, variously termed the conditional, the indicative conditional,
and material implication, the truth table of A → B being identical with that of
Ā ∨ B. In the next two sections we consider probabilistic counterparts of modus
ponens (E → H, E � H) and modus tollens (H → E, Ē � H̄). In section 2, a premiss
such as E → H is simply replaced by its probability p(E → H). In section 3, it is
replaced by the arguably more salient conditional probability p(H|E). Our results
on the conditional probability counterpart of modus tollens are new, and exhibit a
feature that is unusual in these sorts of problems, namely, that a high probability
of H̄ is entailed not only when p(E|H) and p(Ē) are high, but also when they are
low.

2 Probabilities of Conditionals

The valid argument E → H, E � H, known as modus ponens or the rule of detach-
ment, has the probabilistic counterpart

p(E → H) = a and p(E) = b ⇒ a + b − 1 6 p(H) 6 a, (3)

for all a, b such that a + b > 1, the bounds in (3) being the best possible. These
results follow immediately from the observations that

p(E → H) = p(Ē ∨ H) 6 p(Ē) + p(H)

and

p(H) = p(E ∧ H) + p(Ē ∧ H) 6 p(E ∧ H) + p(Ē) = P(E → H),

along with easily constructed examples of probabilities p satisfying the antecedent
conditions of (3) and, respectively, (i) p(Ē ∧ H) = 0 and (ii) p(Ē ∧ H̄) = 0. Hail-
perin [H84] has also given an interesting proof using linear programming. Note
that the condition a + b > 1 is necessary and sufficient to ensure that the set of
probabilities satisfying the antecedent conditions of (3) is nonempty.

It follows from (3) that

p(E → H) > 1 − ε and p(E) > 1 − ε ⇒ p(H) > 1 − 2ε (4)

for all ε 6 1/2, as noted by Patrick Suppes [S66].


Carl G. Wagner: Modus Tollens Probabilized 3

From the fact that p(H → E) = p(Ē → H̄) it follows trivially from the above
that the valid argument H → E, Ē � H̄, known as modus tollens, has the proba-
bilistic counterpart

p(H → E) = a and p(Ē) = b ⇒ a + b − 1 6 p(H̄) 6 a, (5)

for all a, b such that a + b > 1. In particular,

p(H → E) > 1 − ε and p(Ē) > 1 − ε ⇒ p(H̄) > 1 − 2ε (6)

for all ε 6 1/2. Understandably, not much is usually made of (5) and (6), since
they are trivial variants of (3) and (4). We include them here only for purposes of
comparison with results in the next section.

3 Conditional Probabilities

3.1 Adam’s Thesis

Truthful speakers do not assert propositions that they do not take to be highly
probable. As David Lewis [L76] puts it, “assertibility goes by subjective prob-
ability.” But Ernest Adams [A65, A75] has pointed out that conditionals seem
to be an exception to this rule, with the assertibility of E → H going not by the
probability p(E → H) of the conditional but, rather, by the conditional probability
p(H|E).

Adam’s evidence consists of a number of cogent examples that accord with
his thesis. Lewis [L76] offers an additional argument: When p(E → H) is high
because p(Ē) is high and, moreover, p(H̄|E) is high, then it is pointless, and in
fact misleading, to assert E → H, rather than Ē. Taking the product p(Ē)p(H̄|E)
as a simple measure of the lessening of assertibility of E → H, we then arrive
at p(E → H) − p(Ē)p(H̄|E), easily shown to be equal to p(H|E), as a suitable
measure of the assertibility of E → H.

Frank Jackson [J79] offers a further argument in support of Adam’s thesis:
Suppose that we have assessed the prior probability p(E → H). Further inquiry
convinces us of the truth of E, and in such a way that the revision of p to the
posterior q(·) = p(·|E) is judged to be reasonable. Then

q(E → H) = p(E → H|E) = p(H|E) and q(E) = 1

and so q(H) > p(H|E) by (3). A large value of p(E → H) is of no utility in such
dynamic cases of modus ponens. What matters is the value of P(H|E).

Even apart from the above discussion, it is clear that conditional probabilities
are salient in a way that probabilities of conditionals are not. For the probability
that H is true, given that E is true, is conceptually simpler and easier to assess
than the probability that E is false or that E and H are both true. Moreover, when


4 Proceedings in Informatics

the probabilities in question are observed or estimated relative frequencies, prob-
abilities of conditionals are of virtually no interest.

For the above reasons it is of interest to investigate probabilistic versions of
valid arguments such as modus ponens and modus tollens in which conditional
probabilities play a central role, a subject to which we turn in what follows.

3.2 Modus Ponens for Conditional Probabilities

Replacing p(E → H) by p(H|E) yields the following variant of (3): If 0 6 a 6 1
and 0 < b 6 1 , then

p(H|E) = a and p(E) = b ⇒ ab 6 p(H) 6 ab + 1 − b, (7)

the bounds in (7) being the best possible. This result follows immediately from
the fact that

p(E)p(H|E) = p(E ∧ H) 6 p(H) = p(E ∧ H) + p(Ē ∧ H) 6 p(E ∧ H) + p(Ē),

along with easily constructed examples of probabilities p satisfying the antecedent
conditions of (7) and, respectively, (i) p(Ē ∧ H) = 0 and (ii) p(Ē ∧ H̄) = 0 Hail-
perin [H96] has also given an interesting proof using linear programming.

It follows from (7) that if 0 6 ε < 1, then

p(H|E) > 1 − ε and p(E) > 1 − ε ⇒ p(H) > (1 − ε)2, (8)

as noted by Suppes [S66].

3.3 Modus Tollens for Conditional Probabilities

Unlike the case of modus tollens for probabilities of conditionals, we cannot make
short work of the present subject by invoking the results of section 3.2. For it is not
generally true that p(E|H) = p(H̄|Ē). Indeed, from p(E|H) = a, with 0 6 a < 1,
one can conclude only that 0 6 p(H̄|Ē) 6 1, as shown by Hailperin [H96]. The
following theorem, which seems not to have appeared in the literature, delineates
what might reasonably be termed modus tollens for conditional probabilities.

Theorem 1 Let p(E|H) = a and p(Ē) = b.
If (i) 0 < a, b < 1, then

max{(1 − a − b)/(1 − a), (a + b − 1)/a} 6 p(H̄) < 1. (9)

If (ii) a = 0 and 0 < b 6 1, then

1 − b 6 p(H̄) < 1, and (10)

If (iii) a = 1 and 0 6 b < 1, then

b 6 p(H̄) < 1, (11)

and these bounds are the best possible.


Carl G. Wagner: Modus Tollens Probabilized 5

Proof. Let A be the algebra of propositions generated by E and H. Under each
of the conditions (i), (ii), (iii) above (and only under these conditions) the set P
of probabilities p on A satisfying p(E|H) = a and p(Ē) = b is nonempty and
consists of precisely those p such that p(E ∧ H) = at, p(E ∧ H̄) = 1 − b − at,
p(Ē ∧H) = (1−a)t, and p(Ē ∧H̄) = b−(1−a)t, where the parameter t is subject
to restrictions ensuring that these quantities are all nonnegative.

Under condition (i), we have

0 < t 6 min{b/(1 − a), (1 − b)/a}. (12)

Since p(H̄) = 1 −t, the inequalities in (9) follow from those in (12).
It is clear that the upper bound in (9) is the best possible. To show that this is

also true for the lower bound requires consideration of two cases. (1). If a + b 6 1
then b/(1 − a) 6 (1 − b)/a, and so t = b/(1 − a) is an allowable value of the
parameter t. For the probability p defined by this value of t, we have p(H̄) =
(1−a−b)/(1−a). Since a + b 6 1, we have (1−a−b)/(1−a) > (a + b−1)/a,
and so p(H̄) attains the lower bound in (9). (2). If a + b > 1, then (1 − b)/a <
b/(1 − a), and so t = (1 − b)/a is an allowable value of t. For the probability p
defined by this value of t, we have p(H̄) = (a + b−1)/a. Since a + b > 1, we have
(a + b − 1)/a > (1 − a − b)/(1 − a), and so p(H̄) again attains the lower bound
in (9).

Under condition (ii), we have

0 < t 6 b, (13)

which, with p(H̄) = 1 − t, implies (10), the lower bound in (10) being attained
when t = b.

Under condition (iii), we have

0 < t 6 1 − b, (14)

which, with p(H̄) = 1 − t, implies (11), the lower bound in (11) being attained
when t = 1 − b. 2

The above analysis reveals an interesting feature of modus tollens for con-
ditional probabilities, namely, that p(H̄) may be made as close to 1 as we wish
not only by making p(E|H) and p(Ē) sufficiently large, but also by making them
sufficiently small. This is of course intuitively reasonable. To take a very concrete
example, suppose that we have a finite set of spheres, each either hollow (H) or
solid (H̄), and each painted either ecru (E) or white (Ē). If most of the hollow
spheres are ecru, but most of the spheres are white, then most of the spheres must
be solid. Similarly, if few of the hollow spheres are ecru and few of the spheres
are white, then most of the spheres must be solid.

We conclude with an elaboration of the above in the spirit of Suppes’ inequal-
ities (4), (6), and (8).


6 Proceedings in Informatics

Corollary 1 Let 0 < ε 6 1/2. If

p(E|H) > 1 − ε and p(Ē) > 1 − ε, or (15)
p(E|H) 6 ε and p(Ē) 6 ε, then (16)

p(H̄) > (1 − 2ε)/(1 − ε). (17)

Proof. That (15) implies (17) follows from setting a = b = 1 − ε in (9) and
verifying by partial differentiation that (a + b − 1)/a is increasing in both a and
b. That (16) implies (17) follows from setting a = b = ε in (9) and verifying by
partial differentiation that (1 − a − b)/(1 − a) is decreasing in a and b. 2

Note that in each of the probabilistic inferences (4), (6), and (8) there is
some degradation in probability in passing from premisses to conclusion. The
same is true for the inference from (15) to (17), but this case of modus tol-
lens for conditional probabilities always involves less degradation in probabili-
ties than modus ponens or modus tollens for probabilities of conditionals and, if
ε < (3 −

√
5)/2

.
= 0.38, it involves less degradation than modus ponens for con-

ditional probabilities.

References

[A65] E. Adams, The logic of conditionals, Inquiry 8: 166–197, 1965.

[A75] E. Adams, The Logic of Conditionals, Reidel, Dordrecht, 1975.

[B54] G. Boole, An Investigation of the Laws of Thought, on Which are
Founded the Mathematical Theories of Logic and Probabilities, Walton
and Maberly, London, 1854.

[F35] M. Fréchet, Généralizations du théoreme des probabilités totales, Funda-
menta Mathematicae 25: 379–387, 1935.

[H65] T. Hailperin, Best possible inequalities for the probability of a logical
function of events, American Mathematical Monthly 72: 343–359, 1965.

[H84] T. Hailperin, Probability logic, Notre Dame Journal of Formal Logic 25:
198–212, 1984.

[H96] T. Hailperin, Sentential Probability Logic, Lehigh University Press, Beth-
lehem, 1996.

[J79] F. Jackson, On assertion and indicative conditionals, The Philosophical
Review 88: 565–589, 1979.

[L76] D. Lewis, Probabilities of conditionals and conditional probabilities, The
Philosophical Review 85: 297–315, 1976.


Carl G. Wagner: Modus Tollens Probabilized 7

[L86] D. Lewis, Probabilities of conditionals and conditional probabilities II,
The Philosophical Review 95: 581–589, 1986.

[S66] P. Suppes, Probabilistic inference and the concept of total evidence, in
Aspects of Inductive Logic, J. Hintikka and P. Suppes, eds, North-Holland,
Amsterdam, 49–55, 1966.

Carl G. Wagner is with the Department of Mathematics, University of Tennessee,
Knoxville, TN, U.S.A. 37996-1300. Email: wagner@math.utk.edu