Abstract: The Law of Non-Contradiction has been high orthodoxy in Western philosophy since Aristotle. The so-called Law has been the subject of radical challenge in recent years by dialetheism, the view that some contradictions are indeed true. Many philosophers have taken the Law to be central to many of our most important philosophical concepts. This book mounts the case against this view. Starting with an analysis of Aristotle on the Law, it discusses the nature of truth, rationality, negation, and logic itself, and argues that the Law is inessential to all of these things. The book develops Priest’s earlier ideas in In Contradiction.
Preface
In Contradiction, which was first published in 1987, provided a sustained argument for and defence of dialetheism. My thoughts on the topic have continued to evolve since that time, however. A general discussion of the evolution can be found in the second edition of that book, which is a companion volume to this one. One particular area of development concerns the implications of dialetheism for certain key philosophical notions, such as truth and rationality—and vice versa. That is what this book is about.
The book draws on papers that I have published since 1987. In particular, it draws on the following material, which, however, has been refashioned in some places—at some points substantially—in the cause of coherence:
Chapter 1: ‘To Be and Not to Be—that is the Answer: Aristotle on the Law of Non-Contradiction’, Philosophiegeschichte und Logische Analyse, 1 (1998), 91–130.
Chapter 2: ‘Truth and Contradiction’, Philosophical Quarterly, 50 (2000), 189–95.
Chapter 3: ‘Perceiving Contradictions’, Australasian Journal of Philosophy, 77 (1999), 439–46; and ‘Could Everything be True?’, Australasian Journal of Philosophy, 78 (2000), 189–95.
Chapter 4: ‘What Not? A Defence of a Dialetheic Account of Negation’, in D. Gabbay and H. Wansing, (eds.) (1999), What is Negation?, Dordrecht: Kluwer Academic Publishers, 101–20.
Chapter 5: ‘Boolean Negation and All That’, Journal of Philosophical Logic, 19 (1990), 201–15.
Chapter 6: ‘Can Contradictions be True? II’, Proceedings of the Aristotelian Society, suppl. vol. 67 (1993), 35–54; and ‘Rational Dilemmas’, Analysis, 62 (2002), 11–16.
Chapter 7: ‘Why it's Irrational to Believe in Consistency’, in B. Brogard and B. Smith (eds.), Rationality and Irrationality, Proceedings of the 23rd International Wittgenstein Symposium, Vienna: öbv&hpt Verlagsgesellschaft mbh & Co. (2001), 284–93.
Chapter 8: ‘Paraconsistent Belief Revision’, Theoria, 68 (2001), 214–28.
Chapter 9: ‘Inconsistency and the Empirical Sciences’, in J. Meheus (ed.) (2002), Inconsistency in Science, Dordrecht: Kluwer Academic Publishers, 119–28.
Chapter 10: ‘On Alternative Geometries, Arithmetics and Logics: A Tribute to Łukasiewicz’, Studia Logica, 74 (2003), 441–68.
Chapter 11: ‘Validity’, in A. Varzi (ed.) (1999), The Nature of Logic, Stanford: CSLI Publications (European Review of Philosophy), 183–206.
Chapter 12: ‘Logic: One or Many?’, in J. Woods and B. Brown (eds.), Logical Consequences: Rival Approaches, Oxford: Hermes Scientific Publishers Ltd. (2001), 23–38.
end p.vii
I am grateful to the editors, professional associations, and publishers in question for permission to reuse the material.
Footnotes in those papers thanked all those whose constructive criticism helped in their production. These included by name: Jonathan Adler, Diderik Batens, JC Beall, Paddy Blanchette, Andrew Brennan, Deb Brown, Stewart Candlish, Max Cresswell, Nick Denyer, André Fuhrmann, André Gallois, Jay Garfield, Len Goddard, Bill Grey, Ian Hinckfuss, Colin Howson, Dominic Hyde, Arnie Koslow, Roger Lamb, Julian Lamont, Isaac Levi, David McCarty, Gary Malinas, Joke Meheus, Winston Nesbitt, Daniel Nolan, Thomas de Praetere, Greg Restall (several times), Hans Rott, Steve Schiffer, Hartley Slater, Timothy Smiley, Roy Sorensen, Gören Sundholm, Richard Sylvan, Koji Tanaka, Ellen Watson, Peter Woodruff, Peter Unger, Achille Varzi, and Byeong Yi.
A version of the whole book was given in a series of Arché seminars in the Department of Logic and Metaphysics, University of St Andrews, Candlemas term, 2004. I am grateful to all the people who provided helpful comments and criticisms there, too; and especially to Ross Cameron, Roy Cook, Neil Cooper, Hud Hudson, Daniel Nolan, Agustin Rayo, Stephen Read, John Scorupski, Stewart Shapiro, Robbie Williams, Crispin Wright, Elia Zardini. Collective thanks go to Arché and the University of St Andrews for the congenial atmosphere they provide, to the University of Melbourne for their graciousness in allowing me to take periods of research leave there, and to my colleagues at the University of Melbourne for their never-failing support and stimulation. Finally, my thanks go to the staff of Oxford University Press, particularly Peter Momtchiloff for his thoughtful guidance with the book, and to Peter Milne, who read the penultimate draft of the book for the Press, and gave detailed and perceptive comments which much improved the final product.
GP
Melbourne
2005
end p.viii
Introduction
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
Dialetheism is the view that some contradictions are true: there are sentences (statements, propositions, or whatever one takes truth-bearers to be), α, such that both α and ¬α are true, that is, such that α is both true and false. 1 Dialetheism is what this book is about.
There are many questions that one might ask about the view. One—perhaps the most obvious—is why one might suppose it to be true. In Contradiction 2 gives a number of reasons for supposing it to be so. These concern the paradoxes of self-reference, motion and change, and the conflict of norms, particularly legal norms. Doubtless there are other reasons. But that is not what this book is about.
Another question one might ask is as to the history of the view. The short answer is that in Western philosophy it flies in the face of orthodoxy, in the shape of the so-called “Law of Non-Contradiction”. The long and more adequate answer is a complex one. But this book is not about that either. 3
Yet another question is that of how dialetheism bears on the history of philosophy in general. Philosophers often seem to be pushed into the inconsistent despite themselves. The situation suggests itself as a prime locus of the relevance of dialetheism. It arises, in particular, when the issue of the limits of thought or language is broached. Think of Kant's problem with speaking of noumena, or Wittgenstein's problem of saying the unsayable (in the Tractatus). Beyond the Limits of Thought 4 discusses that situation. This book is not about that either.
One more question that one might ask about dialetheism concerns the details of a formal logical theory appropriate for the view. It is clear that a paraconsistent logic is required, but how does it work? Though the book alludes to details of this kind occasionally, it is not about that either. 5 (Some knowledge of formal logic is certainly required to read parts of the book; but I have deliberately omitted from it any heavy-duty logic.)
What this book is about is the relation of dialetheism to certain core philosophical notions. Since dialetheism is the view that some sentences and their negations are true, it obviously concerns truth and negation explicitly. The first part of the book is about the first of these, truth; the second is about the second, negation. Dialetheism, as defined, does not require it to be rational to believe some contradictions. However, the view would be of limited significance if it were never rational to believe contradictions. 6 Hence, rationality is also intimately connected with dialetheism. The third part of
end p.1
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the book is about this. For Aristotle, the “Law of Non-Contradiction” was a principle of metaphysics, not logic. Since at least Leibniz, however, it has been taken to be a logical principle, indeed a central such principle. Dialetheism now, therefore, bears on the nature of logic. The final part of the book concerns this. The contents of each part of the book, in more detail, are as follows.
PART I: TRUTH
The only major defence of the “Law of Non-Contradiction”—and so the only major critique of dialetheism—in the history of Western philosophy was given by Aristotle in book Γ of the Metaphysics. The defence is highly problematic. It is not clear what, exactly, his arguments are meant to establish, or how, exactly, they are meant to establish it. Chapter 1 of the book is a commentary on the relevant passage, as well as on some arguments that other commentators have claimed to find in it. As we will see, none of the arguments in question succeeds in discrediting dialetheism. The chapter also raises a number of issues that are taken up in subsequent chapters. Chapter 1 is the longest chapter of the book, and detailed analysis of texts is not to everyone's taste. If one is keen to get on to matters of more philosophical substance, this chapter can be skipped, or read fast, provided one is prepared to accept that Aristotle's defence of the law—and its modern descendants—are badly flawed. However, since Aristotle's is the canonical defence of the law, a detailed analysis of the whole text is necessary, lest I be thought to be hiding something.
If contradictions can be true, what does this tell us about truth: what must truth be like if contradictions are such that they may be true? This is the topic of Chapter 2. The answer is, perhaps surprisingly, ‘pretty much any way one likes’. As we will see, all the standard accounts of truth are compatible with dialetheism; indeed, a number of them actually lead in its direction.
Once one has conceded that some contradictions can be true, though, what is there to stop us supposing that all contradictions are true? This is the topic of Chapter 3. That question might, in fact, be understood in different ways. One is as the question: how does one know that not all contradictions are true? Another is as to how one would go about arguing with someone who held all contradictions to be true. Both questions are addressed in the chapter.
PART II: NEGATION
Just as one may ask what truth must be like if contradictions can be true, so one may ask what negation must be like if contradictions can be true. But in the case of negation, unlike that of truth, the answer cannot be so liberal. In particular, if some and only some contradictions are true, negation must not be explosive: a sentence and its negation cannot imply everything. Is there a credible account of negation according to which it is like this? The answer to this question is ‘yes’, as Chapter 4 shows.
end p.2
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
But even if that is right, is there not a perfectly good notion of negation which is explosive, and so, for which, at least, dialetheism is ruled out? It is standardly assumed that there is, in the shape of “Boolean negation”. This assumption is taken up and rejected in Chapter 5.
And speaking of rejection, it is, again standardly, assumed that there is a close connection between negation and rejection: to negate is to reject. The first part of Chapter 6 investigates the connection between negation, rejection, and the closely related notion of denial. These notions are connected, but the connection is by no means as simple-minded as the orthodox assumption has it. The chapter also takes up the connection between rejection and truth—or, more accurately, untruth. This will take us into the territory of rationality, in the shape of a discussion of rational dilemmas.
PART III: RATIONALITY
Chapter 7 takes up squarely the question of rationality. Again, we may ask what rationality must be like if contradictions are such that some of them may be believed rationally. This chapter provides an answer. Though it is frequently assumed that belief in a contradiction is the nadir or rationality, an account of rationality that allows for the possibility of this is not so far from orthodox accounts, as we will see.
We do not only believe or not believe things rationally, we revise or fail to revise our beliefs rationally. And it is often assumed that consistency in the fulcrum of this procedure: rational belief-revision is a form of consistency-maintenance. This is not, in fact, correct. As Chapter 8 shows, belief-revision works perfectly intelligibly in the environment of inconsistency.
Discussions of the belief of inconsistencies, and so of their rational belief, are often carried out in the domain of the a priori, and specifically in the domain of conceptual paradoxes of certain kinds. What of the domain of the a posteriori? Do the same considerations obtain in the empirical sciences? Chapter 9 returns an affirmative answer to this question.
PART IV: LOGIC
It is sometimes thought that consistency is integral to a coherent account of logical validity. The lie to this is already given by formal paraconsistent logics, which provide a coherent notion of validity allowing for inconsistency. Such logics are not (currently) orthodox, though. In other words, orthodox beliefs concerning validity must be revised to make way for this possibility. Can views about logic be revised? Not only can they be, they have been, as Chapter 10 demonstrates. But what, then, must logic be like if belief about it is rationally revisable? Chapter 10 answers that question. Logic, it will turn out, is simply a fallible theory about crucial notions such as validity.
But what, then, is validity? Chapter 11 returns an answer to that question. It provides a schematic account of validity on which all logicians, paraconsistent and non-paraconsistent, deductive and non-deductive, may agree.
end p.3
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Finally, if logical theories may be revised, it is clearly possible for there to be different theories that vie for acceptance, that is, for there to be rival theories. But what, exactly, does rivalry in logic amount to? And must there be a uniquely correct logic, or might more than one logic be correct? The issue of logical monism and logical pluralism is the topic of Chapter 12.
By the end of the book we will have an integrated and coherent account of the connections between dialetheism and the notions of truth, negation, rationality, and validity, together with a number of closely associated notions. The account is perfectly general, in the sense that it may be subscribed to by dialetheist and non-dialetheist alike. But if the account, or something like it, is correct, it has particular significance for dialetheism. Many philosophers have taken consistency to be a sine qua non of the notions in question. Hence, it might well be supposed—and many philosophers have supposed—that the behaviour of these notions rules out the very possibility of dialetheism, the view that some truths are false. The supposition is, then, mistaken. These core notions of philosophy provide no ground to doubt that the conjunction of all truths is also false, that is, to doubt truth to be a liar.
end p.4
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
PART I TRUTH
end p.5
1 Aristotle on the Law of Non-Contradiction
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
1.1 INTRODUCTION
A number of the Presocratic philosophers endorsed explicitly contradictory views. In book Γ of the Metaphysics, Aristotle took these in his sights, and defended what was to become known as the Law of Non-Contradiction. This was a crucial moment in the history of philosophy. With the exception of Hegel and his fellow-travellers, and whilst Aristotle's opinion on nearly every other matter has been overturned—or at least challenged—nearly every Western philosopher and logician has accepted the authority of Aristotle on this matter. There is hardly a defence of the Law since Aristotle's, worth mentioning.
Of recent years, things have taken a novel twist. For developments in contemporary logic itself have made it possible to countenance, if not the old Presocratic views, at least others that endorse the truth of some contradictions. It therefore becomes crucial to recharge the debate that has lain dormant for over two millennia, and ask: did Aristotle settle the matter? Addressing this question is the aim of this chapter.
The matter has already been addressed by a few commentators in the last hundred years. 1 Some have come out in Aristotle's favour; others against. All the commentators so far, however, believed that Aristotle's conclusion was correct, even if his arguments were incorrect. 2 I do not. The following confrontation of views will therefore be a real one, not merely an academic exercise.
I will proceed by producing a commentary on Aristotle's text, with suitable interpolations where appropriate. But the analysis does not pretend to be a scholarly one; I am not competent to do this, and there are already excellent scholarly commentaries available, e.g. Ross (1924), Kirwan (1993), De Rijk (2002). What interests me is not so much the niceties of exegesis as whether there is any interpretation of what Aristotle says that will establish what he wishes. As I warned in the Introduction, this chapter is the longest and, in some ways, most detailed in the book. Those who are not interested in textual commentary may read the chapter quickly, or skip over it and come back to parts of it if or when necessary.
end p.7
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
1.2 THE LAW OF NON-CONTRADICTION box(5 b 18–22)
Our text, then, is Metaphysics Γ, 1003 a 21–1012 b 34 (future references are abbreviated). The arguments we are concerned with occur largely in chapter 4, but let us start with a quick look at the whole book. In the first three chapters Aristotle explains that there is a study whose job is to investigate the most fundamental features of “being qua being”, i.e. the properties that all entities have merely in virtue of being entities. It turns out that these are the Laws of Non-Contradiction (LNC) and Excluded Middle (LEM). Chapter 4 contains arguments against those who would violate the LNC. Chapters 5 and 6 attack the arguments that were supposedly given by various Presocratics for violating the Law. Chapter 7 defends the LEM; and chapter 8 deals with both laws, but adds essentially no new arguments concerning the LNC. 3
Let us now pick up the text towards the end of chapter 3. Apparently following Plato, 4 Aristotle states the LNC thus (5 b 18–22): 5
For the same thing to hold good and not hold good simultaneously of the same thing and in the same respect is impossible (given any further specifications which might be added against dialectical difficulties).
Two comments are in order here. First, the principle, as stated, says that it is not possible that there is an object, a and property, F, such that Fa ¬Fa. This is obviously not the most general form of the law, which would be more like: it is not possible for there to be a proposition, α, such that α ¬α. Aristotle appears to be ignoring those cases where α is of something other than simple subject/predicate form. I do not think that this is the case, however; Aristotle is simply assuming that the general case can be reduced, in some way or other, to this one. Indeed, in a later chapter he states the Law in what would appear to be the more general form: ‘opposite {i.e. contradictory} assertions are not simultaneously true’ (11 b 14).
Secondly, Aristotle realizes that there are many apparent violations of the LNC: a top is moving (has angular velocity) and not moving (has no linear velocity); the Channel Tunnel is in England (at one end), but also in France, and so not in England (at the other); capitalism is private production (in that the means of production are privately owned), but public production (in that the means are worked communally). But these apparent violations are due to the fact that we have not spelled out the object or the property finely enough: once we say which part of the object we are referring to, and in what respect the property is claimed to apply, the apparent violations disappear.
end p.8
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Aristotle is rather vague about what qualifications may be made to save the Law from apparent counter-examples. And given a putative counter-example, it is always possible for a defender of the Law to try to disarm it with suitable qualifications. For example, some have tried to solve the Liar Paradox by arguing that the relevant sentence is true in one context, false in another, or true in one tokening and false in another. 6 These suggestions have to be taken on their individual merits (and demerits). Note, however, that there is no reason to suppose that such a device will always work, unless, in advance, one assumes the LNC. That this device is sometimes available does not, therefore, constitute a defence of the LNC. 7 Aristotle pursues the matter no further.
1.3 THE FIRMEST OF ALL PRINCIPLES (5 b 22–35)
After stating the LNC, Aristotle goes on, next, to argue that the LNC is the ‘firmest’ of all principles since no one can believe anything of the form α ¬α (5 b 22–27). This is, prima facie, a rather strange thing to say. After all, Aristotle takes up the challenge to defend the LNC precisely because some people appear to believe things of this form. And whether or not they did, I certainly do. I believe, for example, that the Russell set both is and is not a member of itself. 8 Aristotle points out that what people say, they may not necessarily believe. This is quite true, but hardly sufficient to show that people such as I do not believe contradictions. People are not infallible about what they believe, but that someone sincerely asserts something (and is clear that what they assert is what they mean) is very strong prima facie evidence that they believe it. And someone who would fly in the face of this evidence had better have pretty good reasons.
Aristotle does go on to give such a reason (5 b 28–33). Essentially it goes as follows. If someone believes α ¬α then they believe α and they believe ¬α; but if they believe ¬α then they don't believe α (believing α and believing ¬α are contraries). Hence it follows that they both believe and do not believe α—a violation of the LNC. This is a hopeless argument. For a start, it will work only if it is impossible to have violations of the LNC of a certain form—something still moot at this stage of the argument; more importantly, it begs the question against someone who claims that they believe contradictions. Such a person will not accede to the claim that believing α and believing ¬α are contraries. In the absence of further considerations, their beliefs simply refute this. 9
end p.9
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
There are arguments in chapter 4 that can be interpreted as attempts to show that one cannot believe a contradiction. We will come to these in due course. For the present, there appears to be no reason to believe that I do not believe what I believe I believe.
The chapter closes (5 b 33–35) by Aristotle claiming that the LNC is the most fundamental principle of demonstration: ‘it is, in the nature of things, the principle of all the other axioms …’. What this means is not entirely clear. There are many principles of demonstration that do not depend on the LNC in any obvious sense. For example, in the semantics for some paraconsistent logics, sentences are allowed to be both true and false, and so the LNC fails. Yet many other things hold as logical truths (e.g. the LEM) or as valid inferences (e.g. α α V β). Aristotle even recognizes this elsewhere. In the Posterior Analytics (A11, 77 a 10 ff.) he says:
The impossibility of joint affirmation and denial is presupposed in no proof (syllogism) unless the conclusion itself was also to have demonstrated such. Then it is demonstrated insofar as one accepts that it is true to predicate the major term of the middle term and not true to deny it. But as far as concerns the middle term and likewise the minor term, it makes no difference to hold that it is and is not. 10
Aristotle's point is simply illustrated: if all As are Bs and all Bs are Cs, it follows that all As are Cs, even if some As are also not Bs. After all, they are still Bs as well, and so Cs.
Maybe, then, what Aristotle meant by the claim that the LNC is the principle of all the other axioms was not that each of the others presuppose it, but that the very notion of (deductive) inference presupposes it. 11 The best argument for this that I can construct goes as follows. If it is logically possible for a contradiction to be true, then it is logically possible for anything to be true. But the inference α β is valid iff it is impossible for α ¬β to be true. Hence the inference is not valid. This argument fails, however. First, if contradictions may be true then, even if α ¬β is possible, it may be impossible also, and hence the inference α β may still be valid (even if invalid as well). 12 Secondly, the definition of validity is tendentious anyway. A more adequate modal definition of validity is that the inference α β is valid iff, necessarily, if α is true then β is true. The suggested definition reduces to this if (and only if) one accepts (an incorrect) material account of the conditional. One might argue that even without this account, since α ¬β entails ¬ (α → β), (α ¬β) entails ¬ (α → β) (and so ¬□ (α → β)), but both of these inferences are problematic once the possibility of
end p.10
inconsistency arises. For example, the second fails in the semantics of chapter 6 of In Contradiction. 13
1.4 ARISTOTLE'S OPPONENTS
Before we turn to chapter 4 itself, it will be important to take note of who Aristotle's opponents are. As Aristotle stated the LNC, anyone who thinks that some contradictions are true, a simple dialetheist, like me, is an opponent. However, it is clear that often Aristotle is attacking opponents with stronger views, who hold all contradictions to be true. Two such groups are mentioned explicitly: Herakliteans and Protagoreans.
Aristotle's views concerning the beliefs of these two groups can be extracted from what he says, and particularly the arguments he marshals against them in chapters 5 and 6. It is clear that he draws heavily on Plato's account of both views in the Theaetetus. Whether his views are historically correct is a moot point. 14 This is not important here, however, since our concern with these matters is only as an aid to understanding Aristotle's arguments in chapter 4.
Herakliteans, according to Aristotle, thought that everything about the physical world was in a state of flux, and that the way to describe this was in a contradictory way. A change from α to ¬α was thus to be described by the contradiction α ¬α. It would seem that this state was sui generis. In particular, it was not an amalgam of the states described by α and ¬α separately; and hence both the conjuncts were simply false of the transition state. (See, Theaetetus 157, a, b, which can be interpreted as attributing such views to Herakliteans. 15 ) Such Herakliteans must therefore have rejected the law of conjunction elimination: α β α (and β). It is worth noting that Aristotle himself may well have rejected this law. At Prior Analytics 57 b 3 Aristotle argues that contradictories can't both entail the same thing. Now suppose that α ¬α entails α and ¬α. Then by contraposition (which Aristotle endorses immediately before this), ¬α and ¬¬α each entail ¬ (α ¬α). Łukasiewicz (1957), 49 f., discussing this passage, argues that Aristotle was simply making a logical mistake. There may well be more to it than this, however, as we shall see in 1.13.
Some Herakliteans, such as Cratylus, held an even more extreme view: all things, including even meanings, are in a state of flux. And if this is so, then it makes it very difficult to say anything determinate about the world at all. Hence, for such people, even the contradiction α ¬α could describe a situation only in an unsatisfactory and indeterminate way. (See, e.g. Theaetetus 183, a, b.) Cratylus, indeed, came in the end to the view that nothing could be said, and so gave up speech all together (10 a 10–15).
end p.11
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Protagoreans, again according to Aristotle, were rather different. They held everything to be true, and a fortiori, every contradiction. In fact, assuming conjunction-elimination, these two claims are equivalent. Aristotle argues that Protagoreans are committed to every contradiction in chapter 5 (9 a 9): ‘… if everything that is thought or imagined is true, it is necessary that everything should be simultaneously true and false.’ This strikes me as quite unfair to Protagoras—even Aristotle's Protagoras. For a start, Protagoras thought that whatever someone believes is true for them, not true simpliciter. Arguably, at least, he thought that nothing was true simpliciter (a view Aristotle attacks in chapter 8). More importantly, even ignoring this distinction, that everything is true follows from Protagoras' views if (but only if) everything is believed by someone. This is an empirical premise, which I see no reasons to believe to be true.
For these reasons, I will not call the people Aristotle has in mind here ‘Protagoreans’. I will call the view that everything is true ‘trivialism’; and I will call someone who believes everything a ‘trivialist’. The principle that not everything is true I will call the Law of Non-Triviality (LNT). Trivialism clearly entails dialetheism. If a contradiction entailed everything, then dialetheism would entail trivialism, and hence these two views would not be distinct. However, I, for one, do not accept this entailment; and as far as I am aware, there is no textual evidence to suggest that Aristotle did either. Indeed, there is evidence that he did not. 16
It is not clear that anyone has ever endorsed trivialism, as such; 17 but the question of what one can say against it is an interesting one for dialetheism: for a dialetheist cannot argue that triviality is impossible simply on the ground that inconsistency is impossible. Hence, in what follows, we will also consider the import of what Aristotle has to say for trivialism.
A couple of observations about trivialism should be made straight away. For a start, we know that the world is not trivial. This is a necessary truth whose proof is as follows. If the world is trivial it is not trivial (since everything follows). Hence the world is not trivial (by the LEM or consequentia mirabilis).
More interestingly, one may argue that one ought to reject the view that the world is trivial as follows. If the world is trivial then everything follows. In particular, it follows that it is irrational to believe that the world is trivial. Hence, if one believes that the world is trivial then one should believe something and, at the same time, believe that it is irrational to believe it. This is irrational. One ought to reject something if it is irrational to believe it. 18
end p.12
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
One might object to the last step, on the ground that one ought not to reject something irrational if it is also rational to believe it. 19 Whether or not this is so is beside the point here. For it is not rational to believe that the world is trivial. There is no evidence, there are no reasons, to believe that the world is trivial (or even the much lesser claim that I am a frog). And it is not rational to believe that for which there is no evidence. Thus, one ought to reject the claim that the world is trivial.
Of course, the trivialist can accept all these conclusions—after all, they accept everything. The trivialist is a slippery protagonist, and there will be a lot more to be said about them in Chapter 3. This will do for the present.
1.5 DEMONSTRATION BY REFUTATION (5 b 35–6 a 28)
Let us now turn to chapter 4 itself. This contains a number of arguments for the LNC. Ross and Kirwan both isolate seven different arguments, though they cut the cake slightly differently. I will follow Ross's enumeration. Of these seven, the first is the most complex, and takes up as much space as the other six put together. But before we discuss the arguments we need to get clear about what, exactly, they are supposed to demonstrate.
At the start of chapter 4 (5 b 35–6 a 10), Aristotle claims that the LNC is so fundamental that it is impossible to demonstrate it. By ‘demonstration’, here, he means something quite specific. A demonstration is a deductive argument from “first principles”. The principles are both more certain than that which is to be demonstrated, and also, in some sense, explain it. Even granting this notion of demonstration, it is not clear that the LNC cannot be demonstrated, since it is not clear that it is a first principle. Consider, for example, the Law of Identity, □ (α → α). Though nothing is completely uncontentious, there is hardly any disagreement about the correctness of this Law. 20 And given this, ¬ (α ¬α) follows from one application of the rule of inference: □(α → β) ¬ (α ¬β).
One might object that the principle of inference here simply presupposes the LNC. This is moot: it is valid, for example, in the semantics of In Contradiction, chapter 6, according to which contradictions may be true. But whatever one says about this particular case, the point remains: it is not at all obvious that no proof of the LNC in Aristotle's sense is possible.
This is all a side issue, though. For the point of this paragraph is to distinguish this kind of proof of the LNC from the kind of proof that Aristotle thinks he can give. Aristotle calls it a ‘proof by refutation’. Someone who disbelieves the LNC can be refuted ‘if only the disputant says something’ (6 a 12); and if they will not, they are ‘similar to a vegetable’ (6 a 16). Presumably, this is directed at the mature Cratylus. The point, presumably, is not just to abuse him, but to note that if someone refuses to say anything, then debate is pointless. We are close to one of Aristotle's later arguments at this point, so I will take up the issue at the appropriate juncture. The important question here is what kind of proof Aristotle thinks he can give against an opponent
end p.13
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
who will speak. And here we meet the first exegetical problem: what, exactly, is a proof by refutation? For the moment, let us stick with the first refutation. We will come to the others later.
Clearly, a demonstration by refutation would seem to be some kind of reductio argument. But there are many things that can go by this name. For a start, reductio ad absurdum can simply be a principle of inference of the following form: from a proof that α entails β ¬β, infer ¬α. In standard logic it is deductively valid, and though it is not a valid syllogism, Aristotle recognized it as a legitimate mode of inference. (See Prior Analytics, 50 a 29–39.) One thing that might be meant by a proof by refutation is simply an argument with this form. This, however, is not what is going on. For a start, such an inference does not depend on anyone saying anything. A reductio argument in this sense may start by making an assumption, α, but no one has to assert this. Moreover, by the end of the argument the assumption has been discharged. Hence we end up with a categorical proof of ¬α which is just what, according to Aristotle, we cannot have. Another problem with interpreting the argument in this way is that the form of inference cannot be assumed to be valid if one supposes that there are true contradictions (at least, not without a lot of further argument). If α entails something true, nothing follows. Hence, Aristotle's arguments would beg the question. 21
Another thing that is often meant by a reductio argument is getting an opponent to retract a view held, by showing them that it leads to an absurd consequence. The consequence may be a contradiction, but it may also be something else: some non-contradictions (e.g. that I am a frog) are more absurd than some contradictions (e.g. that the Liar sentence is both true and not true). All that is necessary is that the consequence be one that the person regards as absurd. Arguments of this kind are sometimes called ad hominem, and some kinds of ad hominem arguments are fallacies (notably those where a view is attacked by attacking the character of the person who holds it). There is nothing fallacious about this kind of argument, however. Provided the person's views really do have these conclusions (at least, according to the person in question), there is nothing illegitimate about this form of argument.
It is sometimes suggested that such a form of argument must be useless against someone who denies the LNC. 22 This is not true. If the absurdum to which the view leads is a contradiction then there is no a priori guarantee that it is one that the person will regard as absurd, and so no guarantee that it will work. But the person need not accept everything, certainly not every contradiction; hence, the argument may well be effective. This is nicely illustrated by Aristotle himself in a later chapter. At 10 a 32– b 1 Aristotle argues against Herakliteans as follows. If all contradictions (about the physical world) and no other things, were always true, then nothing changes. Assuming that moderate Herakliteans do not want to deny their major thesis (after
end p.14
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
all, the situation that this describes is not itself, presumably, in a state of change), this consequence is unacceptable.
The situation concerning an ad hominem argument is quite different if the opponent is a trivialist. For such a person accepts everything: there is therefore no consequence of their view that will have the slightest effect. 23 There is therefore no point in entering into discussion with them. Moreover, there is no point for them either, since anything they would like you to believe, they already believe you believe. Again, we are very close to one of Aristotle's arguments here. So I will pursue this issue further when we reach the appropriate point in the text.
Is Aristotle to be interpreted as giving an ad hominem argument, in the sense just explained, here, then? This is certainly what he means by the phrase ‘demonstration by refutation (elenchus)’ elsewhere (e.g. Prior Analytics, bk. 2, ch. 20). So this is a natural supposition. 24 It is, however, problematic. For this kind of argument to work, it is necessary that the opponent have the views targeted; it is not necessary that they express them. Yet, as we have seen, Aristotle insists that the opponent must say something for the process of refutation to work. Moreover, it will turn out that the person does not even have to claim anything: all they have to do is say something they take to be meaningful. Finally, such an argument cannot possibly work against a trivialist, and surely this must have been obvious to Aristotle. Yet the trivialist is one of the people on whom Aristotle explicitly sets his sights.
Another interpretation of what Aristotle is up to is as follows. As we have noted, Aristotle requires of a demonstration, properly so-called, that the premises are general principles that explain the thing deduced. But we might have a deductive argument whose premises are not like this, but which are undeniable. This would clearly give an argument of sorts for the LNC. Let us give the word ‘demonstration’ to Aristotle, and use instead the word ‘reason’. Some commentators 25 have suggested that Aristotle is giving a reason for the LNC. Most obviously, one can take Aristotle to be arguing that if utterances are to make sense, as they do, the LNC is necessary. One might call this a transcendental argument for the LNC. 26
This strikes me as the most plausible interpretation, but it is not without its problems. The argument starts, not from some undeniable truths, but from an opponent saying something meaningful. It might be suggested that this is just a device, and what Aristotle is doing is showing that the LNC follows from the fact that there is meaningful discourse. But the opponent seems to be much more integral to the plot than this. In the passage in question, Aristotle is very concerned that they say something; and a few sentences later he is concerned that he should not beg the question: it is therefore necessary for the opponent to say something meaningful. This all sounds very ad hominem.
In the light of problems of this kind, some commentators have suggested that what Aristotle is attempting in this chapter is not any kind of demonstration of the LNC at all; what he is attempting to establish is something about the Law, e.g. that it must be
end p.15
believed, or that it is a necessary condition for thought. 27 As such, he is not attempting to convince a dialetheist or a trivialist, nor does an appeal to the LNC (or LNT) beg the question: consistency or non-triviality can be taken for granted. I will call this kind of interpretation of the argument perinomic.
As an interpretation of the text, a perinomic interpretation strikes me as an act of desperation. Not only does it go against what Aristotle explicitly says he is doing. But the conclusion of the major argument is exactly a statement of the LNC, not something about it. Moreover, if this is what he is up to, why bring in the opponent saying something at all; and why call it a refutation?
The exegetical situation is desperate, then. 28 It is made impossible when we bring in the other six arguments Aristotle deploys. These are a motley crew, but the most striking thing about them is that none of them starts from an opponent's saying anything. The interlocutor has disappeared entirely. So what does Aristotle think he is up to? I doubt that there is any single answer to this question, or even that Aristotle has any clearly thought-out aims. He is just shooting with everything he can think of. As we shall see, the other arguments are most naturally interpreted in various of the ways we have considered, sometimes with respect to the LNC; sometimes with respect to the LNT—for Aristotle often slides blithely between the two.
At any rate, we have four things that any one argument might be trying to do: argue by reductio ad absurdum, provide an ad hominem argument, give a reason, and provide a perinomic conclusion. These will provide us with a set of yardsticks against which to measure what Aristotle actually achieves in his seven arguments.
1.6 FIRST REFUTATION: PART I (6 a 28– b 34)
We now come to the first of Aristotle's arguments. This is long, complex, and can be divided into two major parts. But before Aristotle states the first, he has a throwaway remark. (Kirwan treats this as a separate argument.) The opponent of the LNC is invited to say something meaningful. It will turn out that they say ‘man’. Aristotle then says (6 a 28–30):
First, then, it is plain that this at least is itself true, that the name {‘man’} signifies to be or not to be this particular thing, so that it could not be that everything was so-and-so and not so-and-so.
Let us say that something is true simpliciter if it is true and not false. The argument is that if it is true that ‘man’ signifies man (the property of being a man, or what it is to be a man, as Aristotle puts it later), it is true simpliciter. The argument is simply a non sequitur for a simple dialetheist or a trivialist. Nor, for different reasons, need either dissent from the conclusion. It has more bite against a Heraklitean:
end p.16
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
for if the situation were of the kind where ‘man’ both does and does not signify man, it would not be one where ‘man’ signified man.
We now get the first part of the main argument (6 a 31–6 b 34). In essence it is simple. If we do not rule out ambiguity, then it is clear that there can be apparent violations of the LNC. We might say that Hipparchia, for example, is a man (human) and not a man (a woman). Aristotle has already said that such “dialectical” violations of the LNC are not what is at issue. Hence we select one meaning of the word and fix on it. In the case at point, we fix ‘man’ to mean human (two-footed animal, as Aristotle puts it).
Aristotle then says that if an objector (6 b 6–8):
were to assert that {the word} signified infinitely many things, it is obvious that there would be no statement. For not to signify one thing is to signify nothing, and if names do not signify, discussion is eliminated with others; and in truth, even with oneself …
This is slightly odd. Even if the word means infinitely many things, we can still fix on one of them. I think that for what Aristotle has in mind here ‘indefinite’ is a better word than ‘infinite’. The objector he has in mind is the extreme Heraklitean, like Cratylus, who thinks that meanings are in a state of flux; in which case, a word has no definite meaning. Any such person who attempts to engage in discussion with others—or even think—therefore refutes themself. One might take issue with this argument; as Derrida—a modern proponent of this view—I am sure, would. 29 But a defence of such views is not on the agenda here. So let us grant Aristotle this step, and move on.
Aristotle concludes this step of the argument as follows (6 b 28–34):
It is accordingly necessary, if it is true of anything to say that it is a man, that it be a two-footed animal … and if that is necessary, it is not possible that the same thing should not be, at that time, a two-footed animal … Consequently it is not possible that it should be simultaneously true to say that the same thing is a man and not a man.
Following Dancy (1975), let us call this passage the ‘clincher’. Just before this (and after the discussion of ambiguity), there is a strange paragraph that appears to interrupt the argument (6 b 13–28). I will discuss this in the next section. Let us deal with the main argument first.
The structure of this is ambiguous, depending on the scope of the modal operator ‘necessarily’. Let us write Mx for ‘x is a man’ (Aristotle says ‘it is true to say of x that it is a man’, but these mean the same for him), Tx for ‘x has the property of being a two-footed animal’, and let a be an arbitrary object. Then the argument is one of the following:
(1) □ (Ma → Ta)
(2) ¬ (Ma ¬Ta)
(3) ¬ (Ma ¬Ma)
end p.17
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
(1′) Ma → □Ta
(2′) Ma → ¬ ¬Ta
(3′) ¬ (Ma ¬Ma)
The first interpretation is simpler. (1) is straightforward because of what we have taken ‘man’ to mean. (2) follows, provided that α → β entails ¬ (α ¬β). This inference holds on many accounts of the conditional, though it fails in others. Notably, relevant logics like B that contain identity, α → α, but no truth-functional tautologies, and in particular, ¬ (α ¬α). In virtue of this, it could well be argued that this inference begs the question in this context. (3) follows from (2) since Ma and Ta are necessarily equivalent, and so inter-substitutable in modal contexts.
I will call the second argument the primed version. If this is the correct interpretation it turns on a certain kind of essentialism. There will be a lot more to be said about this later. For the present, the following will suffice. According to Aristotle, some properties of objects are accidental (contingent) and some are essential (necessary). Properties like being a man are essential; properties like being white are not. (If Aristotle ceased to be a man, he would cease to be; if he turned green, he would still be the same old Aristotle we all love.) One might take issue with this picture, but if it is correct, then (1′) is true. (2′) follows simply from (1′). What about (3′)? Given that Ma and Ta are inter-substitutable in modal contexts, this is equivalent to ¬ (Ma ¬Ta). The question then is whether ¬ (α ¬β) follows from α → ¬ ¬β. It does, provided that the conditional is strict, 30 a point one might well contest: couldn't the matter of whether or not a property is itself essential change from world to world? In our world, being a man is an essential property, but in a fairy-tale world, where Aristotle turns into a frog (and is still Aristotle), it is not.
We have now looked at the two interpretations of the argument. Which of these Aristotle intends is a question that can be answered with plausibility either way. 31 The major argument for interpreting it in the second way is that it appears to relate better to part 2 of the argument. I will examine and reject this claim later. The major argument against interpreting it in the second way is that if this were correct, the argument would establish at most the LNC for sentences where the predicate (or at least its disambiguation) is an essential one, whereas Aristotle's aim—which is stated again at the end of part 2 (7 b 18)—is to establish it quite generally. 32 Hence, I think that the first version is the more plausible. 33
end p.18
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Let us now turn to the question of whether or not the argument succeeds in its aim. Suppose, first, that its aim is to give a reason for the LNC. We have seen that both versions of the argument have steps at which one might cavil. I think, however, that these are relatively minor problems. Let us grant that the conclusion follows.
The first major problem is that the argument does not establish the LNC in general, as Aristotle requires. This is obvious with the primed version, as I have already observed. But both versions establish it only for subject/predicate sentences. And even if Aristotle thought that all sentences were subject/predicate, we now know better. Actually, this problem may be repaired, at least given certain assumptions. The argument (on the first interpretation) clearly generalizes to all atomic sentences, and in certain paraconsistent logics (e.g. LP) one can show that if every atomic sentence behaves consistently, every sentence does. Alternatively, if we interpret the argument in the first way, we may generalize the argument in a natural way. Let α be any statement and let β be a disambiguation. Then □ (α → β). Now, as before ¬ (α ¬β). (This strategy is not open to us if we interpret the argument in the primed way: there is no reason to suppose that every α is equivalent to a necessitive, □β.)
Dancy (1975), 38 ff., who interprets the argument in the first way, objects to it as follows. First, it is necessary to find some non-trivial definition of ‘man’ for the argument to proceed. We cannot just use the same word, or we could be fairly accused of begging the question. Secondly, most words do not have definitions. So, as a general argument for the LNC, the argument fails. Now it is not clear to me why using the word ‘man’ itself would make the argument any more question-begging than the one that Aristotle actually gives. The reason for choosing a different phrase is simply disambiguation. Moreover, I do not think it necessary that one be able to find a definition. The important point is that, once disambiguated, ‘man’ has a fixed and determinate meaning. (It means ‘one thing’.) It is not even necessary that we should have to be able to spell out what that is. We could just call it ‘X’, and then get on with the rest of the argument.
With certain provisos, then, the argument appears to work. The important—in fact, crucial—point, however, is that it establishes nothing that someone who violates the LNC need disagree with. This is obvious if the person is a trivialist. But it is equally true for a simple dialetheist. Though some logicians have suggested that a suitable paraconsistent logic should not validate the LNC in this form, many paraconsistent logics, e.g. LP, render it a logical truth (see In Contradiction, ch. 5). And their modal extensions render its necessitation a logical truth too. Of course, given any truth of the form α ¬α the law will generate a “secondary contradiction” of the form (α ¬α) ¬ (α ¬α). But this is no more problematic than the original contradiction.
Whitaker ((1996), 199) considers this reply, but rejects it on the ground that if someone ‘genuinely accepts the truth of the Principle of Contradiction, then they no longer believe that contradictions are possible …’. But this reply is entirely
end p.19
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
question-begging. Why can they no longer believe this? Because it would be contradictory! I suspect that what Whitaker means by ‘genuinely accepts the truth of the Principle’ is ‘refuses to accept contradictions’. But subscribing to ¬ (α ¬α) does not force one to do this at all. Hence Aristotle's argument does not establish the LNC in this sense.
Could we not interpret it as establishing this? Let us use as an operator for denial (see Chapter 6, especially 6.4). Suppose that we take the conclusion of the argument to be (Ma ¬Ma). Then, in either version, the argument starts off with an assertion and ends with a denial. No argument is like this: an argument is a sequence of appropriately related assertions. We could, of course, invent a new rule: given an assertion of ¬α, deny α (from ¬α go to α). In the present context, such a rule would obviously beg the question, though: dialetheists think that one can assert ¬α without denying α. As we will see in 6.2, the rule is fallacious, even from a non-dialetheic perspective. 34
Interpreted in the only way that both makes sense and is legitimate, then, the argument is sound, but has no sting for a dialetheist or trivialist. (Whether it causes a problem for Herakliteans, I leave scholars to argue about.) The other obvious interpretation of the argument is as ad hominem. But similar comments apply here. Interpreted in this way, it shows that one who violates the LNC is committed to certain secondary contradictions. But this is not something that should faze them. A dialetheist who believes α and ¬α need not feel constrained to retract the claim that α, when they assert its negation; similarly, someone who endorses LP as the correct logic, and who asserts α ¬α, need not feel constrained to take this back when they assert ¬ (α ¬α). 35 There being no other natural interpretations of the argument, let us move on.
1.7 FIRST REFUTATION: INTERLUDE (6 b 34–7 a 20)
We are not finished with the first refutation yet. There is a major second part. But before Aristotle gives this, there is another apparent interpolation. This seems to relate to the passage just before the clincher that I mentioned in the previous section. So let me deal with these two passages together.
The passage just before the clincher starts as follows (6 b 13–28):
Then it is not possible that ‘to be a man’ should signify just what ‘not to be a man’ [signifies], if ‘man’ signifies not only about one thing but also one thing (for we do not want to count
end p.20
as signifying one thing this, viz. signifying about one thing, since in this way ‘artistic’ and ‘pale’ and ‘man’ would signify one thing, so that all will be one, because synonymous). And it {a thing?} will not be to be and not be the same thing unless homonymously, as if others were to term not-man what we term man. But what is found perplexing is not whether it is possible that the same thing should simultaneously be and not be a man in name, but in actual fact.
The paragraph then goes on to say something about not being a man and being a not-man; its sense would appear to be that these two things are much the same in this context, but I find it difficult to make any sense of the details of this argument. In any case, Aristotle merely seems to be setting aside a possible source of complication.
Prima facie, the import of the above passage is to establish that ‘to be a man’ and ‘not to be a man’ mean something different; and the fact that the argument comes immediately before the clincher, which starts with ‘accordingly’, would seem to suggest that if ‘man’ and ‘not man’ meant the same thing then it might be false of something that was a man to say that it was a two-footed animal—or at least, that someone might suppose this. But it is not at all obvious why this might be so. 36
My guess about what is going on in this bit of text is as follows. Aristotle envisages an objector who says something like this—I change ‘two-footed animal’ to ‘rational animal’ to make things a bit punchier: ‘Look, Aristotle, it's quite clear that contradictions can be true. Just consider Hipparchia; she both is and isn't a man. So it's not necessarily true that if anything is a man it is a rational animal. Hipparchia is a woman, so she's not rational. You are probably going to say that “man” and “not man” have two different meanings here, so the objection doesn't count. But “man” and “not man” do have the same meaning here. They both mean her’.
Against this, Aristotle replies as follows—the paragraph of the text just scrambles it a bit: ‘Hipparchia may be a man and not a man, but only in different senses of the word “man”. And of course, if you are going to allow different meanings of the word, there is no problem about finding contradictory sentences. But that's not what we were talking about. Provided you stick to a single meaning of “man”, i.e. human, then it is necessarily true that anything that is a man is a rational animal. And saying that “man” and “not man” mean the same because they both mean her is just confusing what something means with what it is true of. 37 “Man” and “not man” can't mean the same, or pretty much everything would mean the same, so everything would be truly predicable of an object, and all would be one (as I shall go on to argue in my second refutation).’
This, at any rate, is the best sense that I can make of the text. Anyway, if the point of the text is that ‘man’ and ‘not man’ mean something different, this seems clear enough for us not to make a fuss about it: they clearly have different extensions. And
end p.21
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
even a trivialist must concede that they mean something different: they agree with everything. 38
Let us turn now to the second passage (6 b 34–7 a 20). This comes immediately after the clincher, and starts by saying ‘The same argument applies also in the case of not being a man.’ And so one might expect to find the main argument repeated, with ‘not man’ replacing ‘man’; but we do not. What we do find is a swift argument for the claim that ‘man’ and ‘not man’ mean something different: ‘man’ and ‘pale’ obviously mean something different; but ‘not man’ is much more “strongly opposed” to ‘man’ than ‘pale’ is. 39
Aristotle then continues (7 a 7–9): ‘If this is not possible, what we have stated follows, if he [the objector] will answer the question asked’. From the next few lines we gather that the question asked was something like ‘Is this a man?’ or ‘What is this?’ (pointing to a man). Since no such question has been asked explicitly, presumably Aristotle is thinking that the original speech-act of his opponent was prompted by a question of this kind.
Aristotle goes on by imagining that in answering the question, the respondent says ‘man’, but insists on adding ‘and not a man’ too. Aristotle objects to this. Why should he do this? He originally asked for one significant utterance. Now he has two; and he has already said that the argument can be run just as well for the second. So, it would seem, he should welcome the addition. The answer, I think, is that he has Herakliteans in his sights at this point. Aristotle's aim is to get his opponent to concede that ‘man’ is meaningful by asking ‘what is that?’ (pointing to a man). A Heraklitean will volunteer only ‘man and not man’, and this is not sufficient to continue the argument. Even if (sufficiently disambiguated) this phrase refers to some property or other, all the argument will give is: it is not possible for something to be (a man and not a man) and not (a man and not a man), which is an instance of the LNC with very little generality. Hence Aristotle's argumentation now becomes intelligible.
Since I have no desire to defend Herkliteanism, or, at least, Aristotle's version of it, I will not discuss this part of the text further. A simple dialetheism has other ways out of Aristotle's net anyway, as we have seen.
1.8 ARISTOTLE ON SUBSTANCE
We now come to the second main part of the first refutation (7 a 20– b 18), which introduces some major new considerations. Violators of the LNC are said to destroy
end p.22
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the notion of substance. To try to sort out what is going on here we need to get a few facts about Aristotle's ontology straight. 40 According to him, the ultimate constituents of the world are objects, such as Hipparchia. These, Aristotle calls ‘substances’ (‘primary substances’, in the Categories). One important role they have is to be the bearers of properties. That there are such things is not, of course, uncontentious; but it is a recognizably modern view. What follows is not. Objects such as Hipparchia are, logically speaking, composites of matter and form. A typical form is: being a man. The form of an object gives it its identity. Consequently, one says what an object is (and not merely what properties is has) when one gives its form, as in: Hipparchia is a man. Just to confuse matters, Aristotle calls form ‘substance’ too (‘secondary substance’ in the Categories), in the sense of being the substance of something. For example, he might say, the substance of Hipparchia is manhood/being a man/what it is to be a man/to be a man, as it is often put—or translated.
Consequently, we must distinguish between two senses of a sentences of the kind ‘a is b’ for Aristotle. In the first, b gives the form of a. And since a could not exist unless it had this form, such sentences, if true, are necessarily true (as is required in the primed version of the argument of part 1): b gives the essence of a. In the second kind, ‘a is b’ is to be interpreted in such a way that b is merely giving one of the properties, or accidents, of a. The property b “coincides” with a, as Kirwan translates it. Such sentences, if true, are contingently true.
For Aristotle, the essence of each object defines its species. Thus the essence of Hipparchia is being a two-footed animal—animal being the genus, and two-footed the differentia of the species. Since things belong to a single species, it follows that objects have a single essence; it also follows that properties that are trans-generic, like not being a man, are not essential.
Modern philosophy has done away with Aristotle's analysis of substance. There is but one notion of predication, and about the best one can do to make Aristotle's distinction concerning predication is to mark off those properties an object has necessarily (its essential properties) from the others, 41 by the use of a modal operator, as I did in discussing part 1 of the argument. This distinction is not the same, however. It is no part of modern essentialism, e.g. of Kripke's kind, that objects have a single necessary property.
1.9 FIRST REFUTATION: PART II (7 a 20– b 18)
With these preliminary comments out of the way, let us return to the text. This continues (7 a 20) as follows:
Those who say this entirely eliminate substance and what it is to be. For it is necessary for them to maintain that all things are coincidences and that there is no such thing as what to be a man or to be an animal [is].
end p.23
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
The claim is clearly that someone is destroying the notion of substance, in the sense of essence (substance of). It is less clear who this person is.
The text goes on to argue: (i) if ‘man’ refers to the substance of man, it refers to one thing; (ii) hence if ‘man’ and ‘not man’ meant the same thing ‘man’ would refer to the substance of ‘not man’; (iii) but the substances (meanings) of ‘man’ and ‘not man’ are different (which was already argued for in the interpolation to part 1 of the first refutation, 6 b 13–28); hence ‘man’ would refer to (at least) two things. Contradiction. Hence, ‘man’ does not refer to the substance of man, by reductio.
From this, it is clear that the person who is destroying substance is the person who claims that ‘man’ and ‘not man’ mean the same thing. It also seems to be clear that given that ‘man’ and ‘not man’ do mean something different, and given that we specified that ‘man’ should refer to just one thing, this argument works. 42 If ‘man’ and ‘not man’ mean the same, ‘man’ is not an essential property.
But we are not finished yet. It is easy enough for someone simply to reject the claim that being a man is an essential property. Many modern philosophers (such as Quine) have, after all, rejected essentialism, and so the existence of substance in Aristotle's sense. In the next paragraph Aristotle goes on to give an argument for the existence of his substances. The argument is to the effect that if there are any properties, even accidental ones, there must be substances for these to be properties of. One might take issue with some of Aristotle's arguments for this. One might even bite the bullet and accept an ontology of properties but no bearers. However, someone who does this is certainly flying in the face of common sense. So let us grant that this follows, and that it is unacceptable. Still the argument has an obvious failing. For even granting that there are substances simpliciter (bearers of properties), it does not follow that there are substances of (essences). Whether Aristotle attempts to make good this lacuna elsewhere in his work, or whether he is simply a victim of ambiguity, I leave Aristotelian scholars to argue about. The point is that rejecting essentialism does not necessarily entail rejecting a subject/property ontology.
It remains to discuss how part 2 of the argument relates to part 1, and hence evaluate its significance. Some commentators have interpreted part 2 as an entirely separate argument. 43 This would seem to be manifestly incorrect, since the passage in question finishes (7 b 17–18):
Consequently, there will be something signifying substance even in this case {i.e. if all properties are accidental}. And if this is so, it has been shown that it is impossible to predicate contradictories simultaneously.
Hence we are still with the main argument. So, how does it fit in? There would seem to be two major possibilities. The first is that we interpret the main argument as the second (primed) version. This appeals to the existence of essential properties, which Aristotle is defending here.
This strikes me as the less plausible interpretation. For a start, it makes Aristotle's defence of the existence of substances (simpliciter) irrelevant—though if Aristotle is
end p.24
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
confused about the two notions of substance this might carry little weight. More importantly, if this interpretation is right then, as I have already said, the whole argument establishes the LNC only for statements with essential predicates, and this would presumably have been obvious to Aristotle. And yet his conclusion reiterates the more general conclusion.
The other major possibility is that in part two of the argument Aristotle is providing yet another argument against the person who claims that ‘man’ and ‘not man’ mean the same. It is this, after all, that is said to entail the denial of substance. If this is what is going on, then the argument is vulnerable to anti-essentialist objections (though not ones I would make). More importantly, this possibility was introduced as part of a (rather implausible to the modern ear) way of avoiding the clincher. Hence what we have here is not a part of the main argument at all, but a reply to one objection that no one is now likely to make. As such, it is irrelevant to our evaluation of the main argument.
1.10 THE ANSCOMBE/CRESSWELL INTERPRETATION
We have now dealt with the first refutation; before we move on to the others it is worth discussing what some further modern commentators have made of the first refutation.
Following Anscombe, 44 Cresswell has given a rather different interpretation of the text. 45 According to this, it is important that Aristotle start by taking a substance (essential) predicate, man. In a true such predication, ‘a is b’, ‘b’ is said to signify a, where ‘signify’ here is a technical term for Aristotle, quite distinct from ‘means’ and ‘is true of’. Its use, according to Cresswell, is in giving the truth conditions of essential predications. ‘a is b’ is true just if ‘a’ and ‘b’ signify the same thing, just as the sentence ‘George Elliot is Mary Ann Evans’ is true just if the two names flanking the identity sign refer to the same thing.
Given this background, Cresswell spells out the core of Aristotle's argument thus:
1. If F and G are both substance predicates, then if x is F and x is G, F and G signify the same thing (viz. x).
2. F and not-F never signify the same thing.
3. Hence, no x is both F and not-F.
1 is not argued for in the text, but is, according to Cresswell, Aristotle's account of predication to be found elsewhere in his writings (though 6 a 32–4 is supposed to be a cryptic explanation of the technical sense of ‘signify’). The heart of the argument is in the passage just before what I have called the clincher. It first states 2 at 6 b 13, and then defends it against possible objections and misunderstandings. The clincher merely ‘states the conclusion in explicitly modal terms’. The rest of the text then ‘draws
end p.25
out the consequences of the proof’. In particular, running the argument in reverse, if something is both F and not F, then F cannot be an essential predication. Hence, “substance has been destroyed”. Part two of the argument expostulates, and defends the notion of substance.
I find this interpretation of the text less plausible than the one I have given, for several reasons. First, if it is correct, then the argument establishes the LNC only for essential predications (as Cresswell agrees). This is at odds with Aristotle's explicit statement at the start (5 b 19–21), repeated at the end of the first refutation (7 b 17), and again at 11 b 14, that the LNC holds in general. Secondly, this interpretation seems to make less sense of the text. The crucial premise 1 is not only not argued for in the text, but is not even stated. The clincher, though it appears to be the coup de grâce of the argument, is not. It certainly does more than state the conclusion in modal terms: it is clearly an argument. But since it depends on none of the considerations about signification that Cresswell adduces, it shoots off at a tangent. Third, since premise 1 depends on Aristotle's technical notion of signification, there is absolutely no reason why an opponent should accept it. Worse, and most tellingly of all, another premise is required to make the argument work (as Cresswell notes), that not-F (not-man) is an essential predicate. But for Aristotle, if F is an essential predicate, then not-F isn't. So the argument is not sound, even in Aristotle's own terms!
Anyway, even if this is the correct interpretation of the argument, it need not worry a simple dialetheist. First, it establishes only that some contradictions are not true, not all of them. Secondly, it depends on Aristotle's theory of substance (in fact, more than this, as we have just seen), and we have already noted that this may be rejected without problem. Third, it depends also on a technical notion of signification and the “identity theory” of predication, which is now defunct 46 (and no sensible person would suppose that ‘man’ and ‘not-man’ mean the same thing in the non-technical sense). And finally, it shares the same failing as the argument as I have interpreted it: the dialetheist and the trivialist may accept both the conclusion of the argument and its negation.
1.11 SOME MODERN VARIATIONS I: TALKING OF OBJECTS
Some other commentators have given what seem to me to be looser interpretations of the text. One is Irwin (1988), sect. 98. He summarizes the upshot of Aristotle's argument as follows (p. 183, italics original):
(1) To deny every instance of PNC {LNC}, O must say that it is possible for the same subject, man, to have the contradictory of each of its properties. (2) If O is speaking of the same subject, he must acknowledge that he signifies the essence, the property that makes the subject of the two predications the same subject. (3) Hence, since the subject man is essentially F {two-footed animal}, a subject that is not F is not the same subject as the one signified by the first occurrence of ‘man’. (4) Hence when O says that man is not F, O is committed to denying that this subject
end p.26
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
(the one that is not F) is the same subject that {sic} as the one that he said is F, since being F is the essential property of man.
As the first sentence shows, Irwin is actually confusing the LNT with the LNC, and defending the latter. At its core, his argument is simple. Let a be any man. Then since F is an essential property, □Fa, and so Fa. Now suppose that ¬Fb; then a ≠ b. Thus, the same thing cannot have contradictory essential properties.
Note that the argument, as given, establishes something weaker than the LNT: the LNC for essential predicates (the same as the Anscombe/Cresswell interpretation). Note, further, that the fact that F is essential is playing no real role in the argument. Since □Fa is used to infer Fa, F could, in fact, be any predicate. So we do have an argument for the LNC. Unfortunately, it is fallacious, or at least question begging. This is so for two reasons. There is no reason to challenge Leibniz' Law here: a = b ¬ (Fa ¬Fb). But in this context one cannot assume its contrapositive: Fa ¬Fb a ≠ b. Substituting ‘a’ for ‘b’, this gives that any object with contradictory properties has the very particular contradictory property of being non-self-identical (a = a being a logical truth). Now, it may make sense to suppose that an object is non-self-identical sometimes (e.g. in dialectical contexts involving change), but the mere fact that it has contradictory properties should not entail that it has that particular one. It is not, therefore, surprising that the contraposed form of inference fails in standard paraconsistent logics. 47 Secondly, even if it did not fail, if the LNC does, the fact that a ≠ b does not rule out the fact that a = b. So the same object can satisfy both F and ¬F. It just has the further property of being non-self-identical.
Another account of the argument is given by Lear (1988). 48 His summary of its import is as follows (p. 260):
If ‘not-man’ could be said of the very same thing of which ‘man’ is said, there could be no substance, for there would be nothing which was just what it was to be a man. In Aristotle's view this is tantamount to destroying the possibility of discourse, for there is no longer a subject about which to make any affirmation or denial …
This is a version of the Anscombe/Cresswell interpretation, which I have already discussed. However, again, it is clearly directed against trivialism rather than dialetheism: there is no substance only if ‘a is P and not P’ is true for every essential predicate, P.
Lear continues (p. 263):
Aristotle argues that an opponent of the principle of non-contradiction must eliminate substance, that there is nothing that his statements are about. But that an opponent cannot say anything follows only if one assumes that the correct account of language-use is the one Aristotle gives: that to say anything is to affirm or deny something of a subject … Why could not a … sophisticated opponent completely reject this world-view and theory of language? Could he not hold that … Aristotle's argument shows only that we must give up the picture of the world as composed of substances and properties?
end p.27
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Lear is right that the argument (thus interpreted) presupposes Aristotle's theory of substance, so that its conclusion may be avoided by rejecting this. However, he also seems to think that if one does this, then one must give up the picture of the world as composed of subjects and their properties—and so the claim that one can express oneself by predicating things of objects. If he does think this, he is incorrect. We can give up a picture of the world as composed of properties and substances in Aristotle's sense, without giving up an ontology of objects and properties. The more damaging conclusion would follow only if the non-existence of substances simpliciter, i.e. bearers of properties, followed from the non-existence of substances of, i.e. essences. But not even an essentialist must hold this.
1.12 SOME MODERN VARIATIONS II: MEANING
Lear is not finished yet though. He thinks that even if one rejects a subject/property ontology one can still extract a damning argument from Aristotle, to the effect that violations of the LNC have unacceptable consequences. Before we turn to this, let us look first at a similar argument in McTaggart: 49
… it is impossible … to assert anything without involving the law of {non-}contradiction, for every positive assertion has meaning only in so far as it is defined, and therefore negative. If the statement ‘All men are mortal’, for example did not exclude the statement ‘Some men are immortal’ it would be meaningless. And it only excludes it by the law of {non-}contradiction.
McTaggart claims that a proposition has meaning only if it rules something out. It is not immediately clear whether this claim is made in defence of the LNC or of the LNT. If it is the LNC, then a dialetheist might attempt to reply that even if a contradiction does not rule out its negation, it still rules something out. A trivialist does not have this reply since they believe everything, and nothing is ruled out. But whether directed against dialetheist or trivialist, the argument fails: the claim that a meaningful proposition must rule something out is just plain false. Consider, for example, the claim that everything is true. This entails everything, and so “rules out” nothing. But it is obviously meaningful (or what is Aristotle arguing against a lot of the time?). Moreover, consider its negation ‘something is not true’. This is obviously true (it is entailed by: ‘snow is black’ is not true), and so meaningful. How could a meaningful sentence have a meaningless negation?
Maybe, then, “ruled out” has to be construed differently. We might take it to mean that there are certain situations (or worlds) in which the statement fails. This, at least, makes ‘everything is true’ rule something out. Unfortunately, this account of meaning is equally obviously false. Consider any necessary truth, such as ‘132 = 169’. This rules nothing out: it is true in all possible worlds. Yet it is not meaningless: its truth can be a substantial discovery. 50
end p.28
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Now let us return to Lear. His argument goes as follows: 51
… one might wonder why Aristotle did not formulate a more abstract argument, one which is independent of his particular theory of substance. Certainly he had an argument immediately to hand. For within the details of his proof by refutation a valid point is being made that transcends both his theory of substance and his philosophy of language. An assertion divides up the world: to assert that anything is the case one must exclude other possibilities. This exclusion is just what fails to occur in the absence of the principle of non-contradiction, even when it is construed in its most general form: for any statement S, it is not the case that both S and not- S.
The paragraph then continues, but let us first deal with this. A preliminary comment: I, at least, find it impossible to read this argument into Aristotle. However, since my main concern is with substantive arguments against violation of the LNC, this is not presently important.
As is clear, the argument just recycles McTaggart, and therefore fails for reasons that we have already seen. It also has further problems. It is claimed that if the LNC fails, a proposition (‘proposition’ seems to be the better word here; it does not have to be asserted) does not divide up worlds (or situations) into those where it holds and those where it fails. This does not follow at all. All that the violation of the LNC requires is that for some αs there be an overlap between the worlds where α holds, and those where ¬α holds. This is quite compatible with each proposition dividing worlds in the required way 52 —and even with each proposition failing at some world (something that does not happen classically, note).
I suspect that Lear is thinking as follows: the worlds at which ¬α holds are exactly those where α fails. This division provides the required partition. Maybe so. But if these truth conditions for negation are not the correct ones, it hardly follows that there is no partition. 53
We are not finished with Lear yet, but what follows raises quite new considerations. Before we turn to these, let us consider some other commentators who relate the LNC to the possibility meaning. The view that violation of the LNC is quite compatible with the possibility of meaningful discourse is defended in Dancy (1975), 34 ff. Dancy then goes on (p. 37) to raise the question of what would happen to meaning if someone were prepared to endorse all contradictions. Suppose that one met a trivialist, who was prepared to assent to anything in any situation. One could not then use what they say and do to get any kind of fix on what they mean. If one adopted the methodology of radical translation, one might well conclude that their language had no meaning. 54 But this would be too fast (as Dancy notes). Suppose that I start to read Heraklitus, or Protagoras, or whatever; in a fit of madness I come to the conclusion that they were right: everything is true. In virtue of this I start to utter indiscriminately. Does this
end p.29
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
mean that I have ceased to speak English, that ‘table’ in my mouth has ceased to mean table? I don't think so. How could a change in my beliefs, a private phenomenon, affect meanings, a public phenomenon? When I point to a chair, and say ‘this is and is not a table’, I mean exactly what I say. This, after all, is what I have become persuaded of. Loony it may be; literally meaningless it is not.
It might well be asked why I should bother to say anything at all, once I had become persuaded that everything is true. This is a fair question, but a different one; and since it is precisely an issue that some of Aristotle's other arguments raise, let us defer consideration of it until it comes up there. Further, suppose that everyone became convinced of the truth of trivialism, more or less simultaneously—what would happen to meaning then? Again, I will defer a discussion of this to a later point.
Taking off from a discussion of Putnam, 55 Thompson (1981) also suggests that violation of the LNT results in a destruction of meaning. The relevant passage goes as follows (p. 459):
We might say that the minimal principle of contradiction {LNT} is true because we find from experience that it is impossible to express ourselves in a language without accepting the rule that not every statement is true … Failure to conform to the principle results in our meaning nothing at all. Whatever we say we mean, we would have to say it is true that we do not mean it. The qualification that in a sense we mean it and in a sense we don't will not help. For any sense in which we say we mean something, we must say that it is true that we do not mean it in that sense. Meaning in a sense becomes impossible.
Now, note first that the argument is not to the effect that if the LNT were false then there would be no meaning. (This is trivially true: every conditional starting ‘if everything were true …’ is a necessary truth.) Rather, the point is that if someone believes everything to be true (fails to “conform” to the LNT) there would be no meaning. Now, this is not true, as we have just noted. The person who believes that all is true will say that any utterance both means and does not mean anything one likes (they will say everything, after all). But that does not imply that their utterances have no determinate meaning. One might wonder of such a person why they should bother to say anything at all: what they could mean by any utterance. But, as I have just said, that is another matter, which we will come to in due course.
Before we move on to a different topic, let me make one final observation. A number of the arguments in this section can be thought of as attempting to give a transcendental argument for the LNT. Specifically, non-triviality is argued to be a necessary condition for the meaningfulness of language. And since we do have meaningful language, we must have non-triviality. Leaving aside the fact that none of these arguments appears to work, there is a general and profound problem about any attempt to provide a transcendental justification for the LNT. It is not that such arguments are difficult to find; the problem, rather, is that they are far too easy. That everything is true entails everything; in particular, for any α, xTx → ¬α. Hence, by contraposition
end p.30
α → ¬ xTx. In other words, the LNT is a necessary condition for everything. There would seem to be little to be gained from a conclusion that can be bought so cheaply. 56
1.13 SOME MODERN VARIATIONS III: NEGATION AS CANCELLATION
Let us now return to Lear. The last passage of his that I quoted (attempting to give an argument for the LNC that is independent of Aristotle's view of substance) continues: 57
One cannot assert S and then directly proceed to assert not-S: one does not succeed in making a second assertion, but only in cancelling the first assertion. This argument does not depend on any theory of substance or on any theory of the internal structure or semantics of statements. It is a completely general point about the affirmation and denial of statements.
Note that this argument is quite distinct from the one given in the first part of the paragraph. That one is about an arbitrary proposition: if the LNC fails, it rules nothing out, and so is not meaningful. The argument is specifically about contradictory propositions, and is to the effect that such a proposition has no content; a fortiori, it has no true content. The argument hinges on quite specific claims about the behaviour of negation. Let us return to it in a moment, after a few appropriate background comments. 58
One may distinguish between three accounts of the relationship between negation, contradiction, and content. (1) A cancellation account. According to this, ¬α cancels the content of α. Hence, a contradiction has no content. In particular then, supposing that an inference is valid when the content of the premises contains that of the conclusion, a contradiction entails nothing—or nothing with any content; it may entail another contradiction. (2) A complementation account. According to this, ¬α has whatever content α does not have. Hence α ¬α has total content, and entails everything. (3) An intermediate account, where the content of ¬α is a function of the content of α, but neither of the previous kinds. According to this account, α ¬α has, in general, partial content, neither null nor total. Hence, contradictions entail some things but not others.
An account of kind (3) is given in relevant and paraconsistent logics. An account of kind (2) is packed into orthodox modern logics, such as “classical” and intuitionist logic. An account of kind (1) is clearly distinct from either of these. It appears to have been an influential account in Ancient and early Medieval logic. Arguably, Aristotle subscribed to something like it, since he appears to have rejected the claim that α ¬α
end p.31
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
entails α. (See 1.4. We will have further evidence of this later.) It appears in Boethius and Abelard. It is intimately connected with principles such as ¬ (α → ¬α), which are built into modern connexive logics. 59
Though the cancellation and complementation accounts are quite distinct, some modern writers have run them together. A notable example of this is Strawson. In his book, Introduction to Logical Theory (1952), he gives the orthodox truth-tabular account of negation, and orthodox account of validity, according to which contradictions entail everything. But in his informal account of contradiction we read (p. 2 f.):
Suppose a man sets out to walk to a certain place; but when he gets half way there, he turns round and comes back again. This may not be pointless. But, from the point of view of change of position, it is as if he had never set out. And so a man who contradicts himself may have succeeded in exercising his vocal chords. But from the point of view of imparting information, or communicating facts (or falsehoods) it is as if he had never opened his mouth … The point is that the standard function of speech, the intention to communicate something, is frustrated by self-contradiction. Contradiction is like writing something down and erasing it, or putting a line through it. A contradiction cancels itself and leaves nothing.
This is a clear statement of a cancellation view of negation—and quite different from an orthodox account. On an orthodox account, someone who asserts a contradiction is much worse off than someone who has asserted nothing: they are committed to everything.
Now let us return to Lear's argument. It is clear that this invokes a cancellation account of negation, and stands or falls with it. (And if I am right about Lear's thinking in the first part of his argument, he, too has confused a cancellation account with a complementation account.) Before we evaluate that account, we need to get clearer as to what, exactly, it is. Though Lear does not explain it, I take it to be something like this. Speaker's assertions (and here, ‘assertion’ does seem the appropriate word) normally convey information. Normally, when they make a new assertion this adds to the stock of information conveyed. But when the stock contains the information α, an assertion of ¬α adds nothing, but merely removes α.
Even filled out like this, the account is obviously sketchy and unsatisfactory. What does an assertion of ¬α do if α is not in the information store? It must do something; negative statements do, after all, have content. So presumably that content is merely added to the store. So why doesn't it do this if α is already there? (Inconsistent data bases are not news.) And what happens if α and β are in the information store, and ¬ (α β) is asserted? A natural suggestion is that we delete either α or β; but we have, in general, no way of knowing which. So presumably we can only add ¬ (α β) to the store. But if we can have α, β and ¬(α β) in the store, why not α and ¬α?
In any case, and for quite general reasons, the cancellation account of negation doesn't stand up to inspection. For a start, in a borderline situation of, e.g. rain, one might say that it both is and isn't raining. This is, perhaps, something of a special case; but it shows clearly that negation does not have to function as a cancellation operator.
end p.32
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Or consider another sort of situation. One can, in considering one's beliefs, come to assert contradictory statements, and in so doing discover that they are inconsistent. The assertion of ¬α in this context does not “cancel out” the assertion of α —whatever this might be supposed to mean. The assertion of ¬α is providing more information about what it is one believes, not less. And it is precisely the inconsistent nature of this information that gives one pause; if what one said had no content, it would have no unsatisfactory content, so it is difficult to see why one should bother to revise one's beliefs at all.
Moreover, even if one does try to resolve the contradiction in such a situation, until one succeeds, one may well continue to use parts of the inconsistent information. (Think of scientific theories that are known to be inconsistent.) It is certainly not “cancelled out”. Consider an extreme case, the paradox of the preface. A person writes a book and thereby asserts the conjoined truth of all of the claims in it. Being aware of the overwhelming inductive evidence, they also assert that there are mistakes in the book, i.e. the denial of that conjunction. This does not cancel out the claims in the book. Indeed, in this case, it might even be argued that believing the inconsistent totality of information is the rational thing to do, something that would make no sense if bits of it cancelled out other bits. 60 The account of negation as cancellation therefore fails, as does the second part of Lear's argument.
In the last three sections of this chapter we have looked at a number of philosophers who have been inspired by Aristotle's first refutation. In the end, none are any more successful than was Aristotle himself. Let us now return to Aristotle's text, and to his other refutations.
1.14 SECOND REFUTATION: ELEATIC MONISM (7 b 18–8 a 2)
The second refutation (7 b 18–8 a 2) starts: ‘if contradictories are all simultaneously true of the same thing, it is plain that everything will be one. For the same thing will be both a warship and a wall and a man.’ And a bit later we have: ‘we also get the doctrine of Anaxagoras, that “every article is mixed together”; so that nothing is truly one.’
The premise of the argument is that every contradiction holds of all objects. Aristotle's conclusion is not entirely clear, though. In the first argument, what he is entitled to is that every object has exactly the same properties. It follows that every object is the same, in this sense. And maybe this is what he intends. The stronger conclusion that there is but one object follows if we are allowed to use the Identity of Indiscernibles. In the second argument, what Aristotle is entitled to is that nothing is a single stuff, but equally all stuffs. But, again, using an appropriate identity principle, we can wring out a conclusion about identity. If a has every property, it has both P and ¬P (for any P). But if Pb and ¬Pc then b ≠ c. Hence, a ≠ a.
To what extent Aristotle subscribed to these identity principles, I leave scholars to argue about. It is clear that one might take issue with both. (We have already noted that the second arguably fails, in discussing Irwin's argument in 1.11.) But we can
end p.33
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
get the conclusions about identity from a much weaker premise and unproblematic identity principles. Suppose that just one thing, a, has all properties. Then it has both the property of being identical with b, so a = b, and that of being different from a, so a ≠ a. The view therefore entails that there is but one thing, that is both identical with and different from itself. Parmenidean monism with a Heraklitean twist!
The most natural way of interpreting this argument is as ad hominem. As such, the argument clearly fails against a simple dialetheist since the required premise is, at the very least, that some object have every property. Presumably, it would have more success against a Heraklitean, since such a person is unlikely to buy into Parmenidean monism. It will fail against a trivialist: all ad hominem arguments always fail against a trivialist, as we have noted.
Aristotle then goes on to argue (7 b 26 ff.):
These people {who hold that every contradiction is true} seem, therefore, to be stating something indefinite; and while they consider that they are stating that which is, their statement is actually concerning that which is not (for the indefinite is what is potentially and not in complete reality).
I am really not sure I understand this argument. The best spin I can put on it is this. Suppose that for every P, Pa ¬Pa then (i) a is neither definitely P, as opposed to ¬P, nor definitely ¬P as opposed to P. But (ii) what is indefinite in this way is only potential. And (iii) what is potential does not really exist. Hence, (iv) trivialists cannot be describing what they think they are, that which exists. If this is the argument, it would seem to be ad hominem. It must therefore fail, as all such argument must. (Of course the trivialist thinks that they are talking about what does not really exist: they think everything.)
We might, instead, take (iv) to be: what is inconsistent cannot exist; and hence interpret the argument as giving a reason for the LNT. But even this fails. The problem is with (ii). According to Aristotle, if something is not actualized, it may be only potentially P and potentially ¬P, and so indefinite in that sense. (See 9 a 31–5. 61 ) But it does not follow (even for Aristotle, I think) that if it is indefinite in the (different) sense of (i), it is merely potential. In any case, the step just begs the whole question. Someone who takes contradictions to be true, just is affirming that the actual, and not merely the potential, can be indefinite in that sense, that is, contradictory.
On Kirwan's translation, the text continues: ‘On the other hand their statements, at least, must affirm or deny everything of everything’ (and goes on to give a new—and apparently otiose in the context—argument for this). This might be taken to suggest that the argument in question is a simple reductio. The absurd consequence is the very possibility of talking about the non-existent, of saying anything true or false about it at all. I doubt that this is the correct interpretation, since Aristotle himself often talks about the potential. But in any case, even if it is, the argument fails, for it is certainly possible to talk about mere possibilia. We often talk about events that may
end p.34
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
never, in fact, happen: a planned holiday, the birth of a child, retirement. Moreover, and crucially, the whole argument still begs the question, exactly as before, at step (ii).
1.15 THIRD REFUTATION: THE LEM AND ASSERTION (8 a 2–7)
Aristotle's next, and third, refutation is at 8 a 2–7. (Kirwan treats it as part of the previous argument.) It goes as follows:
another {consequence of every contradiction being true} is that it is not necessary either to assert or deny. For if it is true that he is a man and not a man, plainly he will be neither a man nor not a man …
This appears to be an argument to the effect that violators of the LNC are committed to a denial of the LEM, too. (Kirwan and Ross both interpret it in this way.) Simply, α ¬α entails ¬¬α ¬α by double negation, which entails ¬ (¬α V α) by De Morgan. This argument obviously need not trouble a trivialist. The entailment need not trouble a simple dialetheist either. The fact that ¬ (¬α V α) does not rule out ¬α V α. Thus, occasional violations of the LNC do not prevent the LEM from being logically valid. (It is valid, for example, in the semantics of LP; see In Contradiction, ch. 5.)
There is another possible interpretation of the argument. If neither α nor ¬α is true, it is not necessary to say either of these things. If this is true for all α, it is not necessary to say anything. Hence, if a trivialist says something—and does not adopt the silence of Cratylus—we have an ad hominem argument against them. They cannot really believe what they say. In this form, the argument is not very persuasive. For they do believe what they say: they believe everything. In any case, people often do what is unnecessary.
There is a deeper worry here, though. If someone believed that everything is true, there would be no point in their asserting anything. The point of asserting something is to get one's hearers to believe (or believe that one believes) the thing in question. But if one believes everything, one already believes that they believe it (or that they believe that one believes it). This still does not give an ad hominem argument against a trivialist who asserts things. The trivialist may have no point in asserting; but people do engage in pointless activity, e.g. for amusement. The importance of the observation is rather different. For a trivialist, assertion, as an act of communication, is pointless. In fact, all communicative activity (commanding, questioning, etc.) is pointless. For the point of communicative activity is, in the first instance, to induce certain mental states in the hearer. But the trivialist already believes the hearer to be in that state. Similarly, there is no point in a trivialist even listening to the communicative attempts of another. For whatever information they might hope to gain from the communication, they are already in possession of it: whatever the beliefs, desires, etc., of the other, the trivialist already believes the other to have them. 62 A general rejection of trivialism
end p.35
is therefore integral to the rationale of communication, and hence the possibility of social life. 63
This does not, of course, show that the trivialism is false. Indeed, we could all come to believe it true and take the consequences. But it at least establishes something important about it. This argument therefore has an important and sound perinomic interpretation. It can be thought of as a transcendental argument, not for the LNT, but for the fact that trivialism is generally rejected.
1.16 FOURTH REFUTATION: AN ARGUMENT BY CASES (8 a 7–34)
This brings us to the fourth refutation (8 a 7–34), which starts by distinguishing between some contradictions being true and all being true. What follows is aimed at the latter. Aristotle then distinguishes between two cases: (i) anything may be asserted iff it may be denied; (ii) anything that is asserted may be denied, but not vice versa. There would seem to be another case, viz. (iii): everything denied may be asserted, but not vice versa; but Aristotle ignores this. (Why, I am not sure. 64 ) In any case, what follows is aimed squarely at case (i). In such a case there are two further possibilities; for all α either: (a) neither α nor ¬α is true separately (Herakliteanism); or (b) both are true (trivialism).
To wrap up the first case, Aristotle continues (7 a 20–3):
If it is not true to state {the contradictories} separately, then not only does he {the Heraklitean} not state these things but nothing whatever is—and how can things-that-are-not walk and talk. Also, everything would be one, as we said before …
The first complaint is puzzling. How do we get from the fact that the conjuncts of a true contradiction are not themselves true to the claim that nothing exists? I see no plausible way of doing this. Maybe this is a version of the argument that we met in connection with the second refutation, to the effect that what actually exists cannot be inconsistent. If it is, I have already dealt with it. The second complaint is that if all contradictions, and nothing else, hold of any object then all objects are identical. This recycles a version of another argument we met and dealt with in connection with the second refutation.
Against the second case, Aristotle argues (8 a 27–30):
Equally, even if it is possible to have the truth in stating all things separately, the result we have stated {that all things are one} follows; and in addition it follows that everyone would have
end p.36
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the truth and everyone would be in error, and [the disputant] himself is in error by his own omission.
The new argument here is clearly another ad hominem argument (compare Theaetetus 171 a, b), and fails against a trivialist, as all such arguments must. But a version of it might be thought to work against a simple dialetheist. Such a person asserts both α and ¬α. So they are in error by their own lights; and a person is likely to reject the claim that they are in error.
This argument is no better, though. Suppose that we take being in error to mean simply ‘stating an untruth’ (as Ross, (1924), 267, glosses it). Then, given dialetheism, there is a simple fallacy in moving from ‘¬α is true’ to ‘α is not true’. But even ignoring this, the argument will not work since it begs the question. A dialetheist who believes that α is both true and not true need not accept that believing α is an error. They are, after all, believing what is true. 65 Believing α would be an error if there were no evidence for it, but that is a different matter.
Aristotle appears to complete the argument against this horn of the dilemma as follows (8 a 30– b 2):
in response to this person there is nothing for an investigation to deal with; for he says nothing. For he says neither that it is so-and-so nor that it is not so-and-so, but that it is both so-and-so and not so-and-so; and again he also denies both of these, saying that it is neither so-and-so nor not so-and-so.
At first, it might appear that Aristotle is going to make the point that if someone believed that everything is true, they would not bother to investigate anything any more. 66 But reading on, it becomes clear that his point is that the violator in question has said nothing, and so may be ignored. Prima facie, this is obviously false. The violator has said two things (in fact, three, α, ¬α and α ¬α). Aristotle is presumably, therefore, thinking that the assertion of ¬α “cancels out” the assertion of α and vice versa, and so appealing to the cancellation view of negation. 67 (This still leaves the conjunction, but presumably the violator will assert the negation of this too.) I have already discussed and rejected this view of negation, in 1.13; so we need pursue the matter no further.
Actually, this passage would appear to be more at home arguing against the first horn of the dilemma (as Kirwan notes, p. 104), but the point is the same. Such a person has said something: α ¬α (unless the person is an extreme Heraklitean; in which case, the complaint has some force). So the thought must be that an assertion of ¬ (α ¬α) cancels this out. Similar comments therefore apply.
end p.37
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
1.17 FIFTH REFUTATION: THE TRUTH-CONDITIONS OF NEGATION (8 a 34– b 2)
The fifth refutation is a swift one, and can be dealt with equally swiftly (8 a 34– b 2):
if whenever an assertion is true its denial is false, there can be no such thing as simultaneously asserting and denying the same thing truly. However, they {violators of the LNC} would doubtless assert that this is the question originally posed.
Whoever this argument is aimed at, its conclusion is simply a non-sequitur. 68 In the semantics of many paraconsistent logics (such as LP), α is true iff ¬α is false (and vice versa). Yet things of the form α ¬α can be true. If ‘false’ is taken to mean ‘not true’, the situation is slightly different. The argument is then: α is true iff ¬α is untrue. Hence, (by the truth conditions of conjunction) α ¬α is untrue. The truth conditions of negation are now contentious, but this is unimportant in the end. If certain contradictions can be true, then there is no a priori reason why ‘α is true and not true’ cannot be true. Indeed, this is exactly what happens in the case of the Liar Paradox, ‘this sentence is not true’. The argument does, therefore, beg the question, as Aristotle suggests, at least against the simple dialetheist. It does not beg the question against a trivialist; nothing does; but we have already seen that no ad hominem argument against a trivialist will work. Let us move on.
1.18 SIXTH REFUTATION: PART I, THE VEGETABLE (8 b 2–12)
What Ross (and Kirwan) call the sixth refutation has two quite distinct parts. The first (8 b 2–12) goes as follows:
Again, are we to say that he who believes that things are in a certain state or are not, is in error, while he who believes both has the truth? For if he has the truth, what can be meant by saying that the nature of things-that-are is of that kind? If he does not have the truth, but has more truth than one who believes the former way, then the things-that-are would already be in some state, and that would be true and not simultaneously also not true.
The exact argument is, again, opaque, but seems to be this. Suppose that a believes α ¬α, and that b believes just one conjunct, say α. What is a to say about b's beliefs? a may say that b is mistaken—and presumably will, if they are a Heraklitean. But then what could it be for things to be like that? Alternatively, a may say that what b believes is true, but that a has more truth. In this case, something, presumably α, would be ‘true and not simultaneously also not true’.
Both horns or Aristotle's dilemma are unpersuasive. The rhetorical question of the first horn is lame. The Heraklitean will reply: ‘What could it be like? In a state of flux; just like that, Aristotle’. The second horn employs the invalid inference from truth
end p.38
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
to truth simpliciter that we met in the throw-away argument at the beginning of the first refutation.
Aristotle then goes on to complain about a person who asserts all contradictions (8 b 7–12):
there will be nothing for such a person to speak or say; for he simultaneously says this and not this. And if a man believes nothing, but considers it equally so and not so, how would his state be different from a vegetable's?
At the start of this it looks as though Aristotle is going to make the point about communication ceasing, which we have already discussed on connection with the third refutation. But by the second clause it is clear that Aristotle is claiming that the person says nothing. This has some force against an extreme Heraklitean. But against anyone else, it is just false (unless one buys into the cancellation view of negation that we have already discussed and rejected). Someone who asserts that something both is and is not a table says exactly that. (Recall the discussion of Dancy in section 1.12.) Aristotle then goes on to make the corresponding (and similarly fallacious) point about belief. Then comes the ad hominem crunch. Whether this is licit or illicit, is of no importance here. For even if the rest of the argument were correct, the final step would fail. The fact that someone has no beliefs does not mean that they have no other mental states: they may still be thinking, desiring, having emotions, having experiences, etc. So such a person is hardly a vegetable.
There is another point that can be extracted from this passage, however. Maybe Aristotle means that there is no more point in discussing with such a person than there is with a vegetable. The vegetable will not agree with anything; the trivialist will agree with everything. This point is right: the rejection of trivialism is a precondition of discussion. This is a special case of the perinomic point that a rejection of trivialism is a precondition of communicative activity in general, that we met already in connection with the third refutation.
1.19 SIXTH REFUTATION: PART II, ACTION (8 b 12–31)
The second part of the sixth refutation (8 b 12–31) goes as follows:
From which it is quite obvious that nobody actually is in that condition, neither those who state the thesis nor anybody else. For why does anyone walk to Megara rather than stay where he is, when he considers that he should walk there? Why does he not proceed one morning straight into a well or over a precipice, if there is one about: instead of evidently taking care to avoid doing so, as one who does not consider that falling in is equally a good thing and not a good thing? It is plain that he believes that one thing is better and another not better. And if so, he must also believe that one thing is a man and another not a man, one thing sweet and another not sweet. For he neither seeks nor believes everything indifferently when, considering that it is better to drink water or see a man, he thereupon seeks to do so; and yet he ought to do so if the same thing were equally a man and not a man.
This is an important argument. Essentially, it is to the effect that we can tell that someone does not believe a contradiction by looking at their non-linguistic behaviour.
end p.39
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Hence, we have an ad hominem argument against someone who claims to believe a contradiction.
Suppose, first, that this is an argument against a simple dialetheist. An immediate problem for the argument is that there are lots of contradictions that one might believe that have no practical ramifications in any immediate sense; and where, therefore, it is difficult to argue from the inappropriateness of behaviour. A paradigm example is the Liar Paradox: the Liar sentence is both true and not true. This would appear to have no consequence for action.
But even with contradictions that have practical ramifications, the argument will not work. Suppose, for example, that I believe that Jesus is a man and not a man. (Not necessarily the Christian Jesus, but he is not excluded.) Nothing at all follows from this about how I should act. It is also necessary to know my desires. Suppose, then, that I desire to see a man (any man). If I believe that Jesus is a man and not a man, then I believe he is a man; and if he is the most convenient man available, I will take steps to see him. Equally, if I desire (at a different time) to see something that is not a man, I may take steps to see him. In this way I actually demonstrate my inconsistent beliefs. I may even desire (at the same time) both to see a man and to see something that is not a man. I may demonstrate my belief that Jesus is both a man and not a man by satisfying both desires in the one act. The argument is therefore a failure. 69
The situation is different if it is trivialism that is in question (which is what, presumably, Aristotle has in mind). Trivialists believe that everything is a man and not a man, and both desire and do not desire to see a man (or at least, they believe that they do). There is therefore no reason to do anything, as opposed to anything else—at least as far as seeing a man goes, and for the trivialist the same holds true of everything else. Aristotle infers that if we see a person engaging in apparently purposeful activity, we can therefore infer that they are not a trivialist.
Naturally, if someone behaved apparently randomly, then we could not take them to demonstrate purposeful activity, and so the argument would get no grip on their beliefs. Still, such a person is not likely to last very long. But even if a person does appear to show purposeful activity, I think that it is impossible to deploy the argument. For such a person might say: since I believe everything, there is no purpose to my activity. But I am well trained: I do whatever comes naturally. At certain times of the day I seek out food or a bed. Not because of any beliefs, but simply because that is what I am moved to do. Such a person might even compose poetry or philosophy if that is what the spirit moves them to do. 70 Alternatively, the trivialist in question might simply be suffering from some cognitive defect such as bad memory (one does not, after all, always remember all of one's beliefs) or bad faith (actively repressing certain beliefs).
end p.40
But there is an important point to be learned here. On reflection, a trivialist can have no reason for doing any one thing, as opposed to any other. Suppose, for example, that they desire to be at the North Pole. Since they believe that they are not at the Pole, they have reason to move; but since they believe that they are at the Pole, they have reason to stand still. Hence, they have no reason to do either as opposed to the other.
It might be suggested that the tie in this situation could be broken by some other fact, e.g. another desire. Some have argued that desires just are certain kinds of beliefs. 71 If this is right, then trivialists have all desires, and hence there is no hope of appealing to a particular desire to break the tie. But even if desire is sui generis, such a suggestion would still seem to fall to the fact that the trivialist believes everything. Suppose, for example, that the trivialist has the desire to act on beliefs of a certain kind. This will not help; for since they believe everything, they believe every belief to be of this kind. Or suppose that they desire to act on this belief more than that. Then since they believe that this belief is that belief, they will desire to act on that belief more than this.
I will have more to say about this argument in Chapter 3. But this will suffice for the present. What we have seen is that rational, purposeful, activity requires a rejection of trivialism. Our prior conclusion that the rationale of communicative activity presupposes a rejection of trivialism is just a special case of this. 72 For communication is one species of purposeful activity. In particular, then, if trivialism were generally accepted, the institution of communication would, presumably, cease. This would be the case—to return to a question raised in section 1.12—if everyone, more or less simultaneously, became convinced of the truth of trivialism. Whether any words that people uttered under such circumstances retained their old meanings, took on new ones, or lost meaning all together, depends on how, exactly, meanings supervene on intention and action; and this is too big an issue to take on here.
We have, at any rate, a firm and general conclusion concerning trivialism. A belief in trivialism is incompatible with reflective, purposeful action; and, in particular, with communication. This does not show that trivialism is false, or that no one cannot suppose it to be true. But its rejection is a precondition of this central feature of what it is to be a person (and not a vegetable). We can therefore interpret Aristotle's argument as perinomic, establishing this important fact.
1.20 SEVENTH REFUTATION: TELEOLOGY (8 b 31–9 a 6)
The seventh, and final, refutation, which brings the chapter to a close, is a swift one (8 b 31–9 a 6). Even if everything is true, some things are truer than others.
end p.41
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
For example, ‘2 is even’ is truer than ‘3 is even’. But then:
there must be something true which the more true view is nearer. And even if that is not so, at least there is something more firm or truthlike, and we are rid of the unadulterated thesis which would prevent us from having anything definite in our thinking.
Whatever Aristotle is trying to achieve here, it would seem to fail, since it contains a number of fallacious steps. For a start, there is no reason, as far as I can see, that a trivialist must agree that some things are truer than others. They might reject talk of degrees of truth as just nonsense, and they are not committed to believing nonsense statements as true. 73 Certainly, a dialetheist is not committed to the claim that ‘snow is white’ is truer than ‘this sentence is not true’. Secondly, and in any case, the existence of degrees of X ness does not imply a maximum, such that the degree of X ness is defined by proximity to this. Think of tallness, for example. Finally, as we saw in section 1.13, provided that one eschews a cancellation account of negation, a contradiction, α ¬α, can have a definite, non-empty, content—and so, therefore, may thinking one. This argument, then, can simply be ignored.
1.21 CONCLUSION
We have now reviewed all of Aristotle's arguments, as well as various others inspired by them. Let us step back and see what, if anything, they succeed in achieving. They do not provide any kind of argument against dialetheism. Neither do they provide any kind of argument against a trivialist. As we noted, nothing can do this. Nor do they give any transcendental reason for the LNC, or the LNT (as we saw, there are problems concerning the latter enterprise in any case). They do show, however, that a rejection of triviality is a precondition for reflective purposive activity, and especially for the institution of communication. This is a lot less than Aristotle advertised, but it is still an interesting and important conclusion. We will come back to the matter in Chapter 3. It is now time to turn to truth itself.
end p.42
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
2 Theories of Truth
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
2.1 INTRODUCTION
Aristotle's arguments against dialetheism are, as we saw in the last chapter, a failure. Where might one look for successful arguments? A natural place is the nature of truth itself. Maybe there is something about this that rules out contradictions from being true.
Characterizing contradiction is relatively easy: contradictories are any things of the form α and ¬α. This definition hides a difficulty, though. What sort of things are we talking about here: sentences, statements, propositions, beliefs? This is a thorny issue. 1 Fortunately, then, nothing much seems to turn on the niceties of the question for present purposes. I will simply assume that α and its ilk are truth-bearers, whatever those are required to be. I will use angle brackets to refer to such bearers. Thus, <α> is the name of the truth bearer α. If we write T for ‘is true’, the question of the possibility of true contradictions is that of whether there can be an α such that T <α> and T <¬α>.
Characterizing truth is much harder. Indeed, this is an old philosophical chestnut. There are, of course, many theories of truth. 2 Each of them gives an account of the nature of the beast. What I will do in this chapter is look at a number of such theories to see whether there is anything in them inimical to dialetheism, that is, in favour of the “Law of Non-Contradiction”. There are certainly arguments for the “Law” which appeal to other considerations, but these will not be our concern in this chapter. 3 The question on the agenda is whether there is anything about the nature of truth that rules out dialetheism; and if there is, this should follow from a theory that spells out that nature.
It would be impossible to look at all the theories of truth that have been given. I will therefore restrict myself to the major ones. The traditional accounts are the correspondence, coherence, and pragmatist theories of truth. More modern accounts include the deflationist, semantic, and teleological accounts. It is these six theories that I will discuss—starting with the newer ones. Naturally, each of these accounts comes in different versions, sometimes very different versions; but in each case, there
end p.43
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
is a main motivating idea, which different versions develop in different ways. Some of these ways may, in fact, build in the impossibility of true contradictions. But, I shall argue, in every case there is nothing about the idée maîtresse that requires this; and if there are particular versions that render dialetheias impossible, there are, equally, versions that do not. Finally, let me emphasize that I am not at all concerned here with evaluating any of these accounts of truth and determining which theory—if any—is correct. 4 My concern is solely with the bearing of each of these theories on dialetheism.
2.2 THE DEFLATIONIST THEORY OF TRUTH
Let us start with the deflationist theory. The first major proponent of this was Ramsey, and more modern versions have been endorsed by a number of people, notably, Horwich (1990). According to this account, to say that α is true is to say neither more nor less than α itself. That is, truth is simply that property (or predicate), T, which satisfies the T-schema:
T <α> ↔ α
for every truth-bearer, α, and a suitable biconditional, ↔.
Is there anything inimical to dialetheism in this account? Not at all. The account is entirely neutral as to what things are true. Such matters of substance are deferred elsewhere. In particular, we have:
(T <α> T <¬α>) ↔ (α ¬α)
Hence, the contradictories, <α> and <¬α>, are true iff α and ¬α. If we are looking for arguments against the possibility of this, we will have to look elsewhere. 5
As a matter of fact, the deflationist account is not simply neutral on the issue of dialetheism: it actually has a tendency to favour it. This is because of the well-known fact that the T-schema itself seems to give rise to contradictions. Given self-reference of any number of different kinds, it is easy enough to construct a truth-bearer, β, of the form ¬T <β>. The T-schema gives us:
T <β> ↔ β
i.e.:
T <β> ↔ ¬ T <β>
Whence, by the law of excluded middle or consequentia mirabilis (α → ¬α ¬α), we have T <β> and ¬T<β>. Writing γ for T <β>, the instances of the T-schema for γ and its negation, give us T <γ> and T <¬γ>.
end p.44
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
This is, of course, the Liar Paradox. Naturally, one may try to avoid it by rejecting the principles of logic employed. 6 Typically, however, contemporary deflationists have not taken this line. Here, for example, is Horwich (1990), 41:
Indeed—and for that reason [the Liar Paradox] we must conclude that permissible instantiations of the equivalence schema [sc. the T-schema] are restricted in some way to avoid paradoxical results.
Though this is, of course, a possible move, it is also clear that it goes completely against the spirit of deflationism, whose prime thought is, after all, that ‘α’ and ‘<α> is true’ just amount to the same thing. It is thus a quite ad hoc manoeuvre. Honest deflationism is not only compatible with dialetheism; it leads in its direction. 7
2.3 THE SEMANTIC THEORY OF TRUTH
Let us turn now to the semantic account of truth. This is the account which derives from Tarski's famous work on truth. (See Tarski (1956).) Tarski showed how to give a theory of a truth predicate (for a given language), or, more generally, a satisfaction predicate (truth being a special case of satisfaction—when there are no free variables in the formula concerned). The theory is a recursive one, in the sense that the truth of any compound sentence is specified in terms of the truth of its parts. The T-schema—or rather its slight generalization: T <α> ↔ α′, where α′ is a suitable translation of α—is not one of the axioms of the theory itself, but follows from them in a natural way. The exact details of the Tarskian construction are well known, and need no rehearsal here. The question at hand is whether there is anything in it which precludes contradictions from being true.
One immediate thought to this effect is as follows. Given that the Tarskian theory delivers the T-schema, if there were an α such that T <α> and T <¬α>, then it would follow that α′ and ¬α′. 8 explosive logic is used (in which a contradiction entails everything), then truth would be reduced to triviality. Of course, a similar argument can be deployed concerning any theory that delivers the T-schema. The answer would be the same in every case: use a paraconsistent logic—one in which contradictions don't entail everything. After all, no other logic makes much sense if you are a dialetheist. This answer might be thought to be unsatisfactory in the case of a Tarskian theory of truth, though. For it is sometimes said that a Tarskian theory must be based on classical logic: this logic is required for the construction to be performed. Such a claim is just plain false. It can be carried out in intuitionist logic, paraconsistent logic, and, in fact, most logics. A version of the construction based on a paraconsistent logic
end p.45
can be found in In Contradiction, chapter 9. In fact, very little propositional logic is required to implement the Tarskian construction: little more than the substitutivity of provable equivalents. 9
It might be thought that truth conditions for negation of the form:
• *
•
would rule out the possibility of true contradictions in a Tarskian theory. But, for a start, such conditions are not mandatory in the construction. For example, it is possible to give joint recursive conditions for the predicates T and F (‘is false’)—or, strictly speaking, their counterparts for satisfaction—where the conditions for negation are the pair:
T <¬α> ↔ F <α>
F <¬α> ↔ T <α>
Moreover, even a dialetheist may employ (*) as the truth conditions for negation. Such conditions merely mean that any contradiction shown to be true, T <α> T <¬α>, turns into an explicit contradiction of the form T <α> ¬T <α>. And whilst there may be arguments against the possibility or desirability of this, 10 there is nothing intrinsic to the Tarskian construction itself. (In In Contradiction, chapter 9, it is shown how to give a truth-theory in each of the above ways.)
There is nothing, then, in the semantic account of truth which, in itself, is inimical to the possibility of true contradictions. More: just as with the deflationist account, the semantic account actually leads in this direction. For truth-theories of a Tarskian kind deliver the T-schema, and this produces contradictions in a very simple way, as we have already seen. 11 Tarski blocked off this consequence by insisting that the truth predicate for a language not occur in that language, and hence that appropriate self-referential sentences cannot be formulated. But, first, this is not essential to the construction. It is quite possible to give a paraconsistent Tarskian truth-theory for a language which contains its own truth predicate. In Contradiction, chapter 9 does just that. 12 Worse, for a formal language that is supposed to behave anything like a natural language, Tarski's restriction seems an entirely implausible and ad hoc one, which it is difficult to defend successfully, as most would now agree. 13 Just as with the
end p.46
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
deflationist theory, then, the semantic account of truth is not just compatible with dialetheism; it actually leads in its direction.
2.4 THE TELEOLOGICAL THEORY OF TRUTH
The last modern account of truth that I will discuss is less well known than either of the preceding accounts. It arises out of some comments of Dummett (1959), and is advocated as an account of truth in In Contradiction, chapter 4. This is the teleological theory of truth.
The account starts by noting that ‘is true’ is a predicate with a point. Even if one knew the entire extension of the predicate, one would not understand what it was for something to be true, unless one knew that it is the truth that one aims at, in a certain sense. In the same way, one could know the extension of ‘is a winning position’, without understanding what winning is. One could know that a winning position in chess is when the opponent's king cannot avoid being taken on the next move, a winning position in bridge is when one scores more points than one's opponents, etc.; but one would not know what winning is unless one understood that it is winning that is, generically, what one aims to do in playing games: it is the telos of game-playing. In a similar way, truth is the telos of certain cognitive activities, notably, assertion (and related things such as belief). When one asserts, one aims, generically, to speak truly. If, for example, everyone started to aim to speak falsely, and this became common knowledge, then what was false would become the truth. It would be as if every utterance were prefixed with a tacit ‘it is not the case that’. 14
There is nothing in this account which tells against dialetheism. It is, in fact, quite neutral as to what is true. For what it is that fixes the extension of ‘is true’ we must, as in deflationist accounts, look elsewhere. Moreover, it may well be that there is no uniform answer to this question. 15 answer (the physical world) may be appropriate in dealing with assertions of the natural sciences; another (the existence of certain proofs) may be appropriate in dealing with statements of pure mathematics; another (certain kinds of social fiat) may be appropriate in dealing with discourse about legal rights, etc. And some of these possibilities also lead in the direction of dialetheism. Consider, for example, the last of these cases. If a duly constituted legislature passes legislation which makes it illegal for a certain person to do something (under one description) and makes it permissible for them (under another), this determines a true contradiction. 16 That arbitrary rules may eventuate in inconsistency is not news. 17
The teleological account of truth may well, then, be a friend of dialetheism; it is certainly no foe.
end p.47
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
2.5 THE PRAGMATIST THEORY OF TRUTH
The first of the traditional theories that I will look at is the pragmatist theory of truth. Important advocates of versions of this theory were the founding fathers of US philosophy, Peirce, James, and Dewey. According to pragmatism, something is true if it “works”. The variations in the theory arise because of the different ways one may interpret ‘works’. A central idea has always been that something works if it is verified in practice, and specifically by our sensory observations. This is the way I will interpret the notion here.
The first thing to note about this conception of truth is as follows. Only a very limited number of truth-bearers can be verified, directly, by observation. Though the early logical positivists claimed that all meaningful statements were, or were reducible to, ones that are directly verifiable, this idea ended in failure. We have to admit that there are meaningful statements that are not directly verifiable. How are we to understand talk of verification in these cases? The now familiar answer to this question is that a statement is verified if it has consequences that are, in fact, observed, and falsified if it has observable consequences that are not. An immediate consequence of this is that it makes little sense to talk of individual truth-bearers as being verified. The consequences of any one belief depend on what other things one believes. Our beliefs face the tribunal of empirical inquiry collectively, as Quine most famously observed. Hence it is better to talk of the truth of theories: deductively closed collections of truth-bearers.
Can inconsistent theories, i.e. theories that contain contradictory truth-bearers, be verified? The simple answer to this is ‘yes’. Many have been. Bohr's original theory of the atom was inconsistent; yet it had striking observational confirmations. Newtonian dynamics was, for a long time, based on an inconsistent theory of infinitesimals. All of this is well known (and we will return to discuss the matter further in Chapter 9). Of course, if a theory is inconsistent, its underlying logic had better not be explosive. For we certainly do not perceive that everything of an observable kind is the case. But provided that a paraconsistent logic is used, there is absolutely no reason why a theory should not contain inconsistent statements of a non-observable kind, yet have quite consistent observable consequences, verified in the standard fashion.
One might argue at this point that a theory is not a candidate for the truth unless all its observable consequences under standard, classical, logic are observed—which no inconsistent theory's could be. Whatever the force of this philosophical move, it is out of place here. For pragmatism has no criterion for the truth of a theory other than verification, i.e. confirmed observable consequences. How such consequences are determined is of no relevance. If the theory works in practice, then it's true.
One might also try to argue that no inconsistent theory can be a candidate for the real truth: its acceptance can only ever be a temporary expedient. Bohr's theory, for example, was later replaced by another theory, as was the inconsistent infinitesimal calculus. These theories certainly were replaced, 18 but it can hardly be claimed
end p.48
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
in this context that the acceptance of an inconsistent theory must be a temporary expedient. According to this account of truth, expedience is truth! And whilst an inconsistent theory—and indeed, any theory—may eventually be replaced, the claim that an inconsistent theory that has been verified is not a candidate for the truth, makes no sense unless one has criteria for the truth other than verification, which the pragmatist does not have.
The upshot of this discussion is that a pragmatic account of truth is not only quite compatible with the possibility of inconsistent theories being true, it actually protects inconsistent theories from important attacks to the effect that they cannot be true.
The preceding discussion relied on the fact that an inconsistent theory can have quite consistent observational consequences, if its underlying logic is paraconsistent. Before we leave the pragmatist theory of truth, let us ask what one would say of a theory whose observational consequences were themselves inconsistent. Could such a theory be verified? The answer, in fact, is still ‘yes’, provided that the consequences are of an appropriate kind. For inconsistent states of affairs are observed sometimes. For example, in visual illusions of certain kinds we perceive contradictory situations. Think for example, of perceiving “impossible objects”, like cubes whose struts assume impossible orientations. 19 Such perceptions are not veridical, which we can tell, for example, by shifting the angle of perception. But I see no reason why a theory might not predict that an object have inconsistent properties, which were verified by perception “from all angles”. And if so, it too would be verified ceteris paribus, and, assuming the pragmatic account of truth, true.
2.6 THE COHERENCE THEORY OF TRUTH
Let us turn now to the coherence theory of truth. Versions of this were endorsed by idealists such as Blanshard, and by some of the positivists, such as Neurath. 20 According to this theory, a truth-bearer is true if it belongs to a coherent, or perhaps better, maximally coherent, set of bearers. Just as with the pragmatist theory of truth, it is therefore more appropriate to talk of truth as applied to sets of truth-bearers in the first instance.
Of all theories of truth, the coherence theory is perhaps the most difficult to deal with here; but this difficulty arises because the notion of coherence itself has never really been satisfactorily spelled out. It is usually taken that consistency is a necessary condition for coherence. I will return to this later. No one has taken it to be a sufficient condition also. The reason is obvious; it is far too weak: any consistent truth-bearer is a member of some (maximally) consistent set of sentences. Some kind of deductive
end p.49
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
relationship is usually also required for coherence; for example, that each truth-bearer be entailed by others in the set. This is also a very weak condition, satisfied by any deductively closed set of sentences (theory). For if α and β are in a theory, so is α β—which entails each of them. There must, therefore, be stronger criteria than this.
To understand what they might be, one needs to look at the rationale for the coherence theory of truth. Typically, those who endorsed the theory have held that it makes no sense to define truth in terms of some objective reality, independent of our cognitive functioning: there is no such thing, or if there is, we have no access to it. If we are to have any meaningful notion of truth, this can be defined only in terms of what we are justified in believing (maybe in the ideal limit). The criteria of coherence are therefore the criteria of justification.
Now, an important part of justification of any overall theory is empirical adequacy, that is, consonance with observation. This does not mean that a coherent set must include a statement describing every observation we make. Observation, as all admit, is fallible. For example, we all see things that are not really there sometimes. Hence, individual observation statements need not be in a coherent set if they fit ill with other aspects of the theory. This notion of fitting ill requires yet other criteria. (Again, consistency is not enough, since consistency may be produced in many ways—as Quine has always emphasized—some ways including the obdurate observation statement, some not.) And here the floodgates open: simplicity, explanatory power, non-adhocness, unity—whatever these things amount to. The upshot of all this for the present concern is that coherence is to be determined by a number of features including those of the kind I have just listed, plus consistency (maybe) and empirical adequacy.
Now let us return to the question of whether consistency is a necessary condition for coherence. However one cashes out the above criteria, it is clear that they may not all issue in the same verdict. One theory may be consistent but complex and highly ad hoc; another may be inconsistent, but simple and unified. Which theory is the most coherent in this situation? The only plausible answer would seem to be that it is the theory that is overall most satisfactory. That is, the most coherent theory is that which comes out best on most of the criteria. This is vague, but sufficient for present purposes. 21 For it is clear that if this is the case, inconsistency may be trumped by other considerations. (This was exactly what happened in the case of the Bohr theory of the atom for example.) Hence, consistency is not a necessary condition for coherence, merely one of a number of desiderata that may be overridden by other factors.
One may even raise the question of why consistency ought to be on the list at all. Of course, other desiderata require certain amounts of consistency. A theory could hardly be empirically adequate if it contained everything. But this does not require consistency as a separate desideratum.
One might argue that consistency is required because an inconsistent theory entails everything, which would certainly violate empirical adequacy. But this is a poor reply. For a start, if inconsistency is ruled out because it would violate empirical adequacy, it is hardly required as a separate criterion. More importantly, there is no reason to
end p.50
suppose that the underlying logic of the theory must be explosive. If coherence is our only criterion of truth, the true logic is to be determined holistically with the rest of the package, and it may well be that a theory based on a paraconsistent logic is overall simplest, least ad hoc, etc. Similarly, trying to justify consistency by appealing to the law of non-contradiction will not work: there is no a priori reason why the most coherent theory must contain this law. 22
We see, then, that the coherence theory of truth does not mandate that the true (= most coherent) theory be consistent. Consistency may not even be a desideratum at all, except for such aspects of it as are required to fulfil other desiderata of coherence.
2.7 THE CORRESPONDENCE THEORY OF TRUTH
The final theory we will look at is also the most traditional: the correspondence theory of truth. It has its roots in Ancient and Medieval philosophy, but it flowered in early twentieth-century British philosophy. According to the correspondence theory, a true truth-bearer is one that corresponds to reality. A central problem in its articulation has always been how to spell out this notion of correspondence.
I have saved this theory till the last because it is the one, I think, that puts up the stiffest psychological resistance to the idea that there might be true contradictions. For it entails that reality itself is inconsistent in a certain sense, and how could that be? Reality is all there together: how could parts of it possibly contradict other parts?
One should note, for a start, that if one supposes reality to be constituted solely by (non-propositional) objects, like tables and chairs, it makes no sense to suppose that reality is inconsistent or consistent. This is simply a category mistake. But the most natural understanding of the correspondence theory of truth requires that there be more to reality than such objects. ‘Brisbane is in Australia’ is true because it corresponds to Brisbane's actually being in Australia. There must therefore be, in some sense, things like Brisbane's being in Australia, call them facts or states of affairs or whatnot. A correspondence theory of truth requires an account of things of this kind. One of the most sophisticated account ever given is undoubtedly that of Wittgenstein's Tractatus. And certainly, according to this, there are no contradictory facts.
But other accounts are possible. Here is one. The constituents of reality include a set of properties and relations, R, a set of objects, D, and a set of polarities, π = {0, 1}. Each property r R has an adicity, n. I will indicate this with a subscript, thus: r n . A (potential) atomic fact is a tuple
Given this account, it is a simple matter to articulate a correspondence notion of truth. Let be the language obtained by closing atomic predications under negation, conjunction and disjunction and let δ be an assignment of meanings to the predicates
end p.51
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
and constants of . Specifically, for each n-place predicate, P n , δ (P n ) is an n-place relation in R; and for every constant c, δ (c) is in D. We define what it is for a sentence, α, to be true in W (W T α) and false in W (W F α), by standard recursive clauses:
W T P n a 1 … a n iff <δ(P n ), δ(a 1 ), …, δ(a n ), 1> W
W F P n a 1 … a n iff <δ (P n ), δ(a 1 ), …, δ(a n ), 0> W
W T ¬α iff W F α
W F ¬α iff W T α
W T α V β iff W T α or W T β
W F α V β iff W F α and W F β
The truth and falsity conditions for are the obvious dual ones. T is a correspondence relation; it holds between just those sentences that are true and the world. Moreover, as should be clear, it is quite possible for contradictions to be true. Ultimately, this is because W may contain both a “positive” fact,
The theory is, in fact, a very well known one. It is essentially a simple part of situation-semantics, where atomic facts are more usually called ‘states of affairs’ or ‘situation types’ and sets of facts are called ‘situations’. 23 A pair,
The theory can be extended to handle more complex grammatical constructions, such as quantifiers and intensional operators, but the details of this are irrelevant to our present purpose, which is simply to illustrate what a correspondence theory of truth which allows for the possibility of true contradictions might be like. It is not metaphysically unproblematic. For example, one obvious question is what makes an atomic fact, such as
end p.52
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Are there any reasons for supposing that the above theory is metaphysically incoherent—or at least metaphysically incoherent in the way that the Tractatus is not? A correspondence theory of truth needs to suppose that there are, in some sense, facts in the world; it does not, though, have to suppose that there are facts corresponding to all true sentences: disjunctive facts, general facts, etc. (though it may 27 ). The truth of disjunctions, generalizations, etc., can simply be defined in terms of more basic facts. The account just given does not have disjunctive facts or conjunctive facts, but it does have negative facts of a certain kind (things of the form
Are there negative facts? Are there such facts as you might call ‘Socrates is not alive’? … One has a certain repugnance to negative facts, the same sort of feeling that makes you wish not to have a fact ‘p or q’ going about the world. You have a feeling that there are only positive facts, and that negative propositions have somehow or other got to be expressions of positive facts. 28
What is this repugnance? One source of it is, I suspect, the obvious truth that everything that exists is. Add to this the thought that negative facts are not, and it follows that no such facts exist. This is a confusion, however, as old as Parmenides: negative facts are not, in the sense that they ground truths of the form ‘it is not the case that so an so’, but they are in exactly the same way that all existent things are, viz. they are part of reality.
An explicit argument against the possibility of negative facts was given by one of the people who heard Russell's lectures. According to Demos (1917), 189, negative facts are not to be countenanced since they are ‘nowhere to be met with in experience’. Now it is not clear that we meet any facts in experience. We meet with people, stars, chairs, and other objects, but not facts or states of affairs. And if this is so, and the objection is cogent, it tells against all correspondence theories of truth. But, it might be argued, we do see facts: we see, e.g. that the sun is shining; but we never see negative facts, e.g. that it is not shining. This is a dubious argument, though: one can see negative facts. I can see, for example, that there is no one in the room when I walk through the door. Moreover, to be translucent is not to be opaque, and vice versa; yet I can see that something is translucent, and see that something is opaque. Whichever, then, is the negative fact, it can be seen. 29 In any case, Demos' objection is flawed by its simple empiricism. Why should one suppose that the mere fact that one cannot perceive a kind of entity entail that it does not exist? No reason that I can see, especially if one is a metaphysical realist of a kind that the correspondence theory is likely to appeal to.
end p.53
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Another objection to a theory with negative facts might focus on the polarity objects, 0 and 1. What strange beasts are these? Of course, the use of 0 and 1 themselves here is purely conventional. Nor does one have to think of these things as objects. They simply code the fact that there are two ways that r n , say, may relate to d 1 , …, d n : positively or negatively. It is certainly the case that this polarity is built into reality. But there are lots of polarities built into physical reality (think, for example, of being a left hand or a right hand, or of the spin of an atomic particle). I don't see why metaphysical polarities should be any worse than these.
A final objection to the theory is possible because there are two relations of correspondence, T and F . Given the symmetry between these, what makes one of these the truth? A first answer is that T is truth because of its relation to the positive polarity, 1. But this just defers the question. 1 and 0 are symmetric, too. What makes one of them the one relevant to truth? The only answer that a correspondence theorist can give, as far as I can see, is simply that it is a brute fact. But more importantly here, one should note that exactly the same problem besets a theory where there are only positive facts. There is still a symmetry between truth and falsity in this. What makes one of these the truth? A first answer is that one is truth because of its relation to existent facts; falsity relates to non-existent facts. But again, this just defers the question. Existence and non-existence are symmetric too. What makes existence relevant to truth? Again, the only answer that a correspondence theorist would appear to be able to give is simply that it is a brute fact.
Maybe there are other arguments against negative facts, 30 but as far as I can see, this notion is in no way more problematic that the notion of a fact in general.
Even given the legitimacy of the notion of negative facts, one might still object that the above account does not make true contradictions possible, simply because this requires the world, W, to contain a positive fact and its corresponding negative fact, which cannot happen. But why not? Notice that the theory of facts, itself, delivers a prima facie presumption that this is perfectly possible. All the atomic facts, whether positive or negative, are independent entities, and can be mixed and matched at will. 31 It is, of course, possible, that there are considerations that override this presumption (for example, considerations which motivate the law of non-contradiction for other reasons), but these are not arguments that derive from the theory of facts itself, or from the correspondence theory of truth, which, as we have seen, is quite compatible with the existence of true contradictions.
end p.54
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
2.8 CONCLUSION
We have now considered six accounts of truth. There are certainly others, but, at any rate, these exhaust the major views on the subject. As we have seen, none of them provides any reason for rejecting dialetheism; a number of them even point in its direction. If there are arguments against dialetheism, the friends of consistency will therefore have to look elsewhere to find them. Perhaps the obvious place to look is to the notion of negation. We will turn to this in the second part of the book. We have not finished with truth yet: there is more to be said about the view that everything is true—trivialism.
end p.55
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
3.1 INTRODUCTION
Is everything true? The answer, presumably, is no—and there is not likely to be much disagreement about that. But how do you know? As we saw in Chapter 1, Aristotle gave a number of arguments against trivialism. But as we also saw there, his arguments are less than successful. This chapter concerns whether it is possible to do better. The question of how you know that not everything is true may seem a rather arcane one, but it is certainly a significant one for a dialetheist. The trivialist believes that all contradictions are true. (Indeed, assuming that one believes a conjunction iff one believes both conjuncts, this is an alternative characterization of the trivialist.) It is often said by way of objection to dialetheism that if one is a dialetheist, one might just as well be a trivialist. This is a silly objection, as are most slides from ‘some’ to ‘all’. (Compare: you believe some things to be true; why don't you believe all things to be true? 1 ) Nonetheless, even though this is not a good objection, the question of why one should not accept everything is one that an honest dialetheist will ask themself. The question is also significant for a non-dialetheist; I will explain why later in the chapter.
The question of how to justify the common-sense view concerning trivialism is, in fact, one that can be interpreted in different ways. For a start, we need to distinguish between the following interpretations:
Interpretation 1: We do, presumably, have good grounds for supposing that trivialism is unacceptable. What are they?
Interpretation 2: Suppose that you met someone who took everything to be true. How could you justify your position vis à vis theirs?
If the distinction is not clear, just reflect on the following analogy. I know that I have some money in my bank account. You have no reason to doubt my word for this. But you may still ask how I know. I can then give you the following sort of explanation: ‘In my last bank statement there was a certain amount of money in the account.
end p.56
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
I haven't spent that much since then, and the bank virtually never makes accounting errors.’ This is a perfectly acceptable answer. But suppose that I meet a sceptic, who is convinced that I do not know that I have money in my account; then considerations of the kind just adduced will not be at all acceptable. They will fall to the usual range of sceptical considerations. In the same way, Interpretation 1 takes it for granted that not everything is true, and just asks for an explanation of how we know this. But in a confrontation with a trivialist, the situation is quite different. One cannot, in this case, take it for granted that not everything is true: that, after all, is exactly what is at issue. A much more robust justification is required. This is Interpretation 2, and is much harder. Given the fact that the trivialist will agree with everything that is said, it is not even clear how to conceptualize the issue adequately.
I return to Interpretation 2 in the second part of this chapter. In the first half we will look at Interpretation 1.
3.2 SEEING CONTRADICTIONS
Concerning Interpretation 1 of the request for justification, I will argue that we have good a posteriori grounds for supposing that trivialism is unacceptable. To make the case for this, however, we need to look carefully at the connection between perception and contradiction.
When it is suggested that contradictions might be true, many an analytic philosopher will screw up their face into a look of anguish, and say ‘But I just don't see what it could be for a contradiction to be true’. They might mean many things by this. ‘See’ might simply mean ‘understand’, in which case they might be complaining that classical two-valued semantics leaves no room, as it were, for something to be both true and false. Such a lack of understanding can be rectified by explaining to them the semantics of a suitable paraconsistent logic, which does allow for this possibility. But often, I think, the angst (real or imagined) is of another kind. What such a philosopher is trying to do is imagine what it would be like for a situation to be contradictory. They are trying to visualize how an inconsistent situation might look—and they fail. But it can be done.
3.3 PERCEPTUAL ILLUSIONS
How can one visualize a contradictory state? To visualize a situation (literally) it must, of course, be one of a visible kind. The Liar sentence might be true and false; and indeed one might be able to construct some quasi-visual mental representation of this fact; but the state of affairs itself is not of a kind that we can see, or, therefore, literally visually depict. Other things that have been suggested as true contradictions look more promising as candidates for a visible contradiction. It can be argued that some instantaneous transition states are contradictory; that, for example, at the instant a
end p.57
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
person leaves the room, they are both in it and not in it. 2 Now, being in a room and not being in it are the kinds of things that can be seen. Do we, in this situation, then, perceive a contradiction? No. For the contradictory state in question here is an instantaneous one, and one can perceive states only if they persist for some minimal time. (Suppose, for example, that you are watching something red which turns green for exactly an instant before becoming red again. You would perceive the thing as continuously red. The greenness has no duration.) Hence, though a moving object may realize instantaneous contradictory states, these are not such as can be seen.
But we are looking in the wrong place, and missing the obvious. The strategy pursued so far was: find a true contradictory state of affairs and look at it. If we were to succeed in this enterprise, the content of our visual experience would, by definition, be veridical. But the contents of our visual experiences are not always so: we experience many kinds of visual illusions. These are not necessarily of contradictory situations. Many, in fact, most, optical illusions are of quite consistent situations: they are just not veridical. Consider, for example, the well-known Hermann grid, a white grid on a black background. (See Fig. 1 3 ) When one looks at this, one sees dark patches at the interstices of the grid. The dark patches are not really there, but there is nothing contradictory about the way the figure appears. There are, however, visual illusions where what is perceived is not consistent. 4 I will give three here.
Fig. 1 Hermann grid
end p.58
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Fig. 2 Schuster figure
Example 1. The first kind of example concerns impossible figures: drawings of (or photographs which appear to be of) things which are physically impossible. These have become very familiar thanks to the art of Maurits Escher. 5 Not all impossible figures depict what we require, however. Consider, for example, the Schuster figure (Fig. 2). At the right end, the figure appears to have two protuberances; at the left, it appears to have three prongs. One can focus on either end of the figure and see that part coherently; but there is no way that one can focus on the whole and see it coherently. (This, at least, is my reaction.) One never, therefore, sees a contradictory situation. One can see parts of the situation, each of which is quite consistent.
But the situation is different with other impossible figures. Look, for example, at the Penroses' figure (Fig. 3). If one takes a corner, say the nearest one, one can see that, travelling continuously counter-clockwise, one can ascend to arrive back at the same place. The point, then, is higher than itself (but obviously, it is not higher than itself, as well). Moreover, one can take the whole figure in, visually parse it, all in one go. This is a case where we can see a contradictory situation. 6
There is an intriguing auditory analogue of the continuously ascending staircase. It is possible to produce a collection of musical tones which appear to be continuously ascending in pitch whilst, at the same time, never getting any higher—again, a contradictory situation. 7
end p.59
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Fig. 3 Penrose figure
Example 2. The second kind of example concerns motion. It is often called the ‘waterfall effect’, and depends on a certain kind of visual afterimage. 8 If the visual field is conditioned by viewing continuous motion of a certain kind, say a rotating spiral, when the viewer then looks at a stationary scene, it appears to be moving in the opposite direction. But a point at the top of the visual field, say, does not appear to change place. As Blakemore puts it:
Any object viewed after looking at movement seems to be drifting in the opposite direction. And yet the apparently moving object does not appear to change its position relative to its surroundings. 9
Yet to move is to change position. The situation is, therefore, contradictory.
An illusion of the same kind can be obtained with certain drugs. For example, if one consumes enough alcohol, then before one passes out, one's environment—the room—appears to “swim”, that is, move—usually, spin. If one now focuses one's attention on a point in the room, it appears to be stationary. The rest of the visual field, however, appears to continue moving. But the internal spatial relations between the fixed part and the moving part do not appear to change. Again, we have a perception of stationary motion.
Example 3. The third example concerns colour. It is well known that the brain will “fill in” details of the optical field that are not there; for example, to fill in the fovea (blind spot). Now suppose a subject is shown a field, half of which is red and half of which is green, the two halves being separated by a black line. If the line is then
end p.60
removed, the brain fills in colours in the vacated space, and some observers report seeing that the boundary is now red and green. 10
It may be possible to obtain a similar experience in a different way. It is easy to construct a pair of glasses that have a red filter on one lens and a green filter on the other. (It is possible to buy glasses of this kind for viewing old-fashioned 3-D films.) If I put such glasses on, and allow perception to stabilize, everything appears a uniform brownish hue. But for a short space of time, 11 until things stabilize, I have the very strange experience of seeing everything as red and as green, though the red and the green seem, somehow, to be at different depths in the visual field.
It might be said that being red and green is not a contradiction. But it is: red and green are complementary colours. It is, hence, a conceptual impossibility for something to be both colours. A feature of complementary colours is that they can't go together. There is no reddy-green, in the same way, for example, that there is a reddy-blue. Something that is red and green, is red and not red.
These examples show that it is quite possible to perceive a contradictory situation. What is it like to see something that is, at once, both higher than itself and no higher than itself? Consult the first example. What is it like to see something that is moving and not moving? Consult the second. What is it like to see something that is red all over and not red all over? Consult the third. Of course, the examples are not veridical; they are illusions; but they fulfil the required function nonetheless. They show that we may have perceptual experiences the contents of which are contradictory. 12
3.4 THE NON-TRIVIALITY OF THE WORLD
After this prolegomenon, we may now return to trivialism. The cup in front of me sits on a table, and these are at rest with respect to each other. How do I know that they are not also not at rest, that is, in motion? I can see it. I know what it is like for things to be both at rest and in motion with respect to each other. That is exactly how things appear in example number two, and it is not like that. This, at any rate, is one contradiction I can see not to obtain. Similarly, the grass before me is green. How do I know that it is not also red? Again, I can see it. I know what it is like for something
end p.61
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
to be both red and green. That is how things appear in example number three, and it is not like that. A similar argument can be based on example number one. I leave this as an exercise. We know, then, that the world is not trivial, since we can see that this is so.
One can put the point in an alternative way—I do so for the second example; it can be made the same way for the others. I see that the cup and table are at rest relative to each other. I do not see that they are in relative motion, which I would do if they were—even if they were at rest and in motion. There is something, then, about the world that fails to obtain.
These considerations, like all a posteriori considerations, are defeasible. Observation is as fallible matter, and what appears to be the case may not, in fact, be so. If it turned out, for example, that supposing grass in Australia to be red and green all over allowed us to explain and predict every fluctuation of the Australian dollar, but had no other untoward consequences, we would have strong evidence that our senses deceive us in this case, and that Australian grass is red and green. But there is no cogent evidence that our perception ever does deceive us by making an inconsistent situation appear consistent. Hence, the reasoning stands.
It is, in some sense, a priori possible that an inconsistent object appear in a consistent way. Indeed, a sceptic holds that our perceptions may always fail to reflect how things are. And a trivialist, who holds that everything is the case, will hold that all our consistent perceptions are misleading. Against such opponents, the above argument carries no weight—nor does any a posteriori argument. What to say about such a case we will turn to in the second half of the chapter.
3.5 THE CONSISTENCY OF THE EMPIRICAL WORLD
But before we look to this, let us pause to note one other fact. As I have argued, our perception tells us that not everything is true. We may establish something stronger and more interesting. Let us work our way up to it.
When we perceive, we can see that something is the case. Can we also perceive that something is not the case? Some have thought not. 13 We can perceive only that something is the case. For example, we cannot see that something is not green. We can see only that it is red. Any judgment to the effect that it is not green has to be added to what we see by inference. This, as we now see, is false. I can see directly that something is not green. Or consider another example: you enter a room; the whole room is visible from where you stand; there is no one there. You can see that Pierre is not in the room. No Pierre-shaped objects meet the eye. 14 Even the very distinction between seeing what is the case and what is not the case is a false one. Some seeings are both. With respect to actual physical objects, to be transparent is not to be opaque, and vice versa. But you can see that something is transparent and you can see that something is opaque.
end p.62
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Next: if α and β are actual observable states of affairs, then so is their conjunction. We can see that something is a cat; we can see that the thing is black. Hence, we can see that it is a black cat. The principle needs appropriate qualification. If α and β record states of affairs that arise only at different times, or at places such that one cannot see both at once, then the principle is obviously not true. More importantly, it can be the case that one of the conjuncts occludes the others. Thus, suppose that α is ‘the street sign is blue’ and β is ‘it is night’. Both of these states of affairs are observable, but if it is, indeed, night, then one cannot see that the street sign in question is blue. When we apply the principle, it must be with the appropriate qualifications.
To apply it: let α describe an observable state of affairs, that x is happening here and now. Then ¬α describes another observable state of affairs, that x is not happening here and now. But then the conjunction α ¬α also describes an observable state of affairs. That is, if such a state of affairs were ever to be the case, it could be seen. Clearly, in applying the principle we do not need to worry about α and ¬α being at different places or at different times. But what of the possibility of occlusion? (This is raised by Beall (2000). 15 ) Might it not be the case that our cognitive functioning makes it impossible for us to see inconsistent states of affairs? Maybe our perceptual mechanisms impose a “consistency filter” on what we see, so that we see only one of the conjuncts, which masks out the other? This is certainly an empirical possibility. But there is no empirical evidence that I know of to suppose that there is such a filter. Indeed, there is every reason to suppose that there is not. For as the examples of section 3.3 show, we can see both conjuncts of an inconsistent state of affairs. Beall raises the more general worry that we may just not know what contradictions look like. 16 If δ is an observable state of affairs, for example, what does (δ ¬δ) ¬ (δ ¬δ) look like? But we do know what contradictions look like. That, again, is what 3.3 showed us. In particular, (δ ¬δ) ¬ (δ ¬δ) is logically equivalent to δ ¬δ, and so would look like that. Still, it might be suggested, for all we know an inconsistent situation could appear quite consistent. But if one were to endorse a claim of this kind, we would collapse into total scepticism. Maybe, for all we know, people can appear like fried eggs; square things can appear round, etc. For such a worry to bite, there must be specific grounds for supposing things may not be as they appear to be. Are there such grounds? None that I am aware of. 17
And now to the point. Consider the observable world, i.e. all that is observably the case. If there were inconsistencies in this, it would follow from the above that we would perceive them. But apart from the odd visual illusion, we do not: our perceptions of the world are entirely consistent. Hence, the observable world is consistent. Again, we can put the point in another way. If I look out of the window at the grass, I can
end p.63
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
see that it is green. How do I know that it is not also red, that is, not green? If it were red, I would see that it is—even if it were red and green—which I do not. Similarly for other observable situations. 18
Here, again, it is possible to object. We do have perceptions with contradictory contents: the visual illusions themselves. Why, in this context, are we entitled to assume that they are illusions? The question is a good one, but there are simple answers. In the case of the first example, we are not actually perceiving a situation at all, simply a picture of one, which we interpret in a certain way. The second example is more complicated. How do I know that when I perceive the waterfall illusion, or am drunk, the room is not both moving and stationary? First, there is the testimony of my other senses. If I touch the spinning wall, it feels (consistently) stationary. Second, there is the testimony of others. Everyone else who perceives the room—or at least, everyone else who is capable of saying anything coherent to me—does not see it in the way that I do. Third, there are all sorts of other information. If the room were really spinning, the lamp would swing, things would fall off shelves, etc. Whilst all these facts could be explained away, the simplicity of the hypothesis that the apparent motion of the room is entirely subjective makes this overwhelming. The third example is very similar. How do I know that the room does not go red and green all over when I put on the glasses? Again, the facts that only I see it in this way, that the colouring comes and goes with the glasses, etc., clearly make the hypothesis that the colouring is subjective a superior one.
Could it be the case that, in the future, science will be able to conjure up actual situations that are inconsistent, perceivably so, and not illusions? Who knows?—science is a strange thing. But for the present, at least, there is good reason to suppose that the perceivable world is consistent.
Perhaps this is a major reason why a number of philosophers are not prepared to countenance dialetheism—unless that countenance be screwed up in anguish. Take the conclusion that the perceivable world is consistent, throw in a simple empiricism—what is so in the perceivable world is so everywhere—and the result follows. Given the state of modern science, the shortcomings of such empiricism, and so of this argument, require no further comment.
3.6 THE TRIVIALIST
Let us now turn to Interpretation 2 of the request for justification, the harder issue of how one can justify one's position vis à vis trivialism against the trivialist themself.
end p.64
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Note that this problem of justification is just as hard for the classical logician as it is for the dialetheist. In reply to Interpretation 1 of the question of justification, the classical logician can give a very simple answer. We know a priori that no contradictions are true. A fortiori, some things are not true. The dialetheist will, of course, contest the first claim: one cannot know a priori what is not true; they can, though, give an a posteriori justification as we have just seen (which is equally available to the classical logician).
But for Interpretation 2, the classical logician and the dialetheist are in exactly the same boat. No appeal to the properties of classical logic is going to help against the trivialist: the trivialist agrees with classical logic! Every principle of inference that a classical logician subscribes to, such as that a contradiction entails everything, the trivialist subscribes to too (indeed, this might be why they became a trivialist in the first place). Every semantic principle that the classical logician subscribes to, such as that nothing is both true and false, the trivialist subscribes to too. One cannot, therefore, simply appeal to classical logic when faced with the trivialist.
To get a feeling for the full difficulty of the problem, it will be helpful, for a start, to return to the analogy that I noted in section 3.1 between the trivialist and the sceptic. The trivialist will subscribe to everything; 19 the sceptic will subscribe to nothing. In this respect, they are dual figures. And both take such extreme positions that it is difficult for those who wish to hold the middle ground to know where to start the debate. It might be thought that, at least historically, scepticism and trivialism are not on a par. For there have certainly been sceptics, such as Sextus Empiricus; but there have been no trivialists. This is less than clear, though. As we saw in section 1.5, Aristotle takes Heraclitus and Protagoras to be trivialists—though whether this is fair to them may be a moot point. Perhaps more importantly, it is not at all clear that there have really been any sceptics. Sextus certainly claimed to be, but the fact that he navigated his way around his world may suggest that he did really believe some things to be true rather than others. At this point, we have already engaged in substantive philosophical issues; ones we will, in fact, return to later. 20
In any case, whether there have really been any sceptics or trivialists is unimportant. It is the positions themselves that are important. Both of them are extreme and incredible, but it is exactly by taking such extreme views as foils that we come to understand better our own views, as the generations of debate related to scepticism have demonstrated in epistemology.
So how does one justify one's view vis à vis the trivialist's? Before we can engage the issue, there is a problem even about how to conceptualize it. It is easy enough to show that trivialism is not true—indeed, necessarily so. For it is either true or it is not. But
end p.65
if it is true, it follows that it is not true (everything follows). Hence, in either case, it is not true. Does this settle the issue? Not at all: for the trivialist accepts that everything is not true! Quite generally, one cannot simply enter into debate with the trivialist as to whether or not everything is true. In fact, there is no point in engaging in debate with the trivialist about anything. We know that they are going to agree in advance! There is, as far as I can see, no way of getting the trivialist to change their position at all. Just because everything one says is something that they already believe, there is nothing one can say that will have the slightest purchase.
It makes more sense to ask how one would justify one's view to some independent third party, an arbitrator. The third party may be supposed to be neutral on the issue, at least initially. The job for each of the first two parties is to persuade them. The arbitrator will listen to arguments from each side, and, if convinced by either party, will judge in their favour. 21
Convincing the arbitrator is still not as easy as it might at first appear, though. In particular, one must still not beg the question, or the arbitrator will rule ‘foul’. For example, an obvious argument to use is that the trivialist's position is inconsistent, and so ought not to be believed. (Naturally, a dialetheist is not going to make this objection; but a classical logician might.) It is clear that this argument fails, though, since it begs the question. The trivialist affirms everything, including all contradictions. Simply to claim that these cannot be true is to take for granted part of what is at issue.
So what arguments can be used? There are a number of possible answers to this question, but I shall not attempt an exhaustive discussion here. What I will do is consider three answers that are interesting. The first two are less than completely successful, as I will argue; the third, I hope, is. 22
3.7 ARGUMENT ONE: EVIDENCE
The first argument is as follows. The trivialist believes many strange things. They believe, for example, that you are a scrambled egg. This is objectionable since there is just no evidence to justify it. The rational person should not believe anything for which there is no reason. The arbitrator must agree. And if the trivialist argues that there is a reason, namely that it follows from trivialism, then it is they who now beg the question. The trivialist will also claim that there is independent evidence. (They will claim everything.) But claiming does not, of course, make it so. The evidence needs to be produced for the arbitrator.
end p.66
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Unfortunately, the trivialist is not out yet. There is evidence independent of trivialism that you are a scrambled egg. Consider a sequence of objects obtained, at each step, by replacing a micron of you with a micron of scrambled egg. I start with an object that is you, and if I have an object that is you, and replace a micron with a micron of scrambled egg, it is still you. Hence, the final stage of the sequence is still you. And the final stage is a scrambled egg. Hence, you are a scrambled egg. This argument depends in no way at all on trivialism.
Naturally, we will take it that there is something wrong with the argument. If it were patently invalid, say it proceeded by asserting the consequent, it would be easy enough to justify this. But the only form of inference that the argument uses (albeit 106 times) is modus ponens, and this is valid, at least prima facie. Moreover, its only premises (that you are you, and that something that is you is still you if a micron is exchanged) are also prima facie true. Hence the argument does appear to be sound. The trivialist has shifted the onus of proof.
If the argument were produced by a non-trivialist, we could say that since the conclusion is absurd—one can see, after all, that the conclusion fails—there must be something wrong with it. But against the trivialist, this move is of no avail, for it clearly begs the question. One might try a stronger argument. Let a be any physical object, and let F be any property, tolerant in its applicability with respect to small changes, possessed by some possible physical object, b. Then if soritical reasoning were sound, we could construct (at least in thought) a sequence of objects, each barely different from its predecessor, running between a and b, and hence establish that Fa. 23 In other words, if soritical reasoning were legitimate, we could show that every physical object has every tolerant physically possible property. Hence, soritical reasoning cannot be legitimate. Against an ordinary opponent, this argument is conclusive. But against the trivialist, the argument begs the question just as before. The trivialist holds that every physical object does have all such properties.
Let us take stock. Sorites reasoning of the kind we have been looking at does not establish trivialism in general. There is no way (as far as I can see) to use such reasoning to show that physical objects have all sharp properties (or to show anything much at all about non-physical objects—if there are any). It does not, therefore, constitute a complete reply to objection number one. But such reasoning does appear to show that every physical object has every tolerant physically possible property. Since this encompasses virtually every object and property we commonly deal with, this is as near trivialism as makes no difference for practical purposes. Let us call a person who takes all physical objects to have all such properties a near-trivialist. The sorites argument does provide a defence of near-trivialism against objection number one. 24
end p.67
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
3.8 ARGUMENT TWO: MEANING
For the next argument we return to Aristotle. As discussed in Chapter 1, in Metaphysics Γ 4, Aristotle gave a number of arguments in support of the Law of Non-Triviality, LNT (not everything is true)—which Aristotle tends to confuse with the Law of Non-Contradiction, LNC. As we also saw in section 1.12, Aristotle's major argument (1006 a 24–7 b 18) has sometimes been interpreted as showing that violations of the LNC or LNT would render language meaningless. I argued there that this interpretation is not very plausible, and in any case, does not succeed in establishing its conclusion. Here is another argument for the same conclusion.
To start with, let us consider the family of colour words. Let us suppose that the LNT failed for these. Thus, every (visible) object is green, and blue, and red, and all other colours. If such were the case, colour words would have no public meaning. For the public meanings of words have to be learned; and they are learned by contrasting situations where they do apply with situations where they do not. The lack of such a distinction would therefore undercut the possibility of public, transmissible, meaning. One might be tempted to reply that the meanings of such words could still be grasped since things at least appear different colours. But of course, if trivialism were true, everything would appear red, and appear blue, and appear green, and so on, too.
The argument generalizes. To convey the sense of any predicate, one depends on situations where it applies, and situations where it does not. 25 If there are no such situations, public, communicable, sense is impossible. Hence, trivialism entails the meaninglessness of public language. The arbitrator, as long as they are to be an arbitrator, clearly cannot accept this conclusion. Hence, they must find against the trivialist.
One might take issue with this argument. For example, it depends on a contrastive account of the communicability of meaning, which one might contest. But there are quite general reasons why this, or any, argument to the effect that violation of the LNT entails the meaninglessness of language must fail to do what is required in the present context. For the trivialist may simply take to heart the thought that public language is meaningless, and so cease to assert anything. 26
It might be thought that this would take the trivialist out of the debate altogether. Since they say nothing, we are free to ignore them. Aristotle certainly thought this. This is too fast, however. They may not be able to assert anything, but that does not stop them uttering things. They may take themselves to be just “babbling”—to use a helpful phrase of Peter Unger. But we don't; and we still have to take into account the force of what they say, or at least, of what we take to be that force. The situation here is a very familiar one in the context of scepticism. 27 The sceptic claims that there is no ground for believing anything, as opposed to anything else. This means that they can assent to nothing. But they can still utter things; and we, who do assent to
end p.68
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
some of the things uttered, cannot ignore their consequences. The sceptic, in a word, utters ad hominem. The trivialist does the same. If the trivialist is prepared to go in this direction, then any argument from meaninglessness fails.
3.9 ARGUMENT THREE: THE PHENOMENOLOGY OF CHOICE
As we saw in Chapter 1, the other arguments of Metaphysics Γ 4 against the LNT are pretty hopeless. For example, the arguments between 1007 b 18 and 1008 a 7 point out a number of apparently absurd consequences of everything being true; but such arguments simply beg the question against the trivialist, as I have already observed.
Aristotle's most interesting argument comes at 1008 b 12–31, where he points out that if someone really believed everything to be true they would have no reason to act in any way, as opposed to any other. When they felt hungry they would have no reason to eat bread rather than broken glass, or for that matter, eat nothing at all, since they believe that each of these actions will satisfy their hunger. Such a person will, then, act erratically—and not for very long. Conversely, if we do see someone acting in a systematic and apparently purposeful fashion, we can infer that they do not really believe everything, whatever they say.
As we noted in 1.19, a similar argument can be urged against the sceptic who claims not to believe anything, as opposed to anything else. Indeed, Sextus considers just such an objection, and replies to it that, though the sceptic behaves systematically, they do not, in fact, behave on the basis of possessing reasons. They do what they do, simply because they are well trained. When they are hungry they reach for bread, not because they believe that it, rather than glass, will nourish them, but simply because they are conditioned to do so. 28 The same reply is available to the trivialist. They, too, may claim that when they reach for bread, it is not because they do not believe that glass would be just as good. It is simply because they have been trained to do so.
This does not get to the heart of the matter, though. Given some purpose, the trivialist can have no reason for behaving in one way rather than another to bring it about. But the situation is worse. The trivialist—at least whilst they remember that they are a trivialist—can have no purpose at all. One cannot intend to act in such a way as to bring about some state of affairs, s, if one believes s already to hold. Conversely, if one acts with the purpose of bringing s about, one cannot believe that s already obtains. 29 Hence, if one believes that everything is true, one cannot act purposefully. It might be retorted that if one believes that everything is the case, one believes that s
end p.69
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
is not the case, and so one can intend to bring it about. Normally, it is true, someone who believes ¬s does not believe that s; and so believing ¬s allows one to have the intention of bringing s about. But for the trivialist, ¬s does not rule out s; hence these considerations do not apply. The trivialist cannot aim to bring about s, because it is simply part of a situation that (they think) already obtains, viz. s ¬s.
But now: as long as a person is conscious, they make choices. They decide what to do, and what not to do. Even if they decide to “do nothing” they have still chosen a course of action. And the person who is running on Sextus' “autopilot” chooses to go along with it, and not override it. This observation that people must make choices may sound like an endorsement of free will. I do not intend it to be so. It may be that, at some level, our actions are determined by things that are quite beyond our consciousness, and so are unfree. I intend the observation purely as a phenomenological report of our consciousness. Having to choose is something phenomenologically unavoidable. The point was stressed, famously, by Sartre. 30
But to choose how to act is to have a purpose: to (try to) bring about this rather than that. Not even to act on the toss of a coin is a counter-example to this. For to act on the toss of a coin is simply to let one's next purpose be determined by the result—and one has that purpose in tossing the coin, to boot. Choosing is an irredeemably goal-directed activity. And as we have seen, such action is incompatible with believing everything. 31 It follows that I cannot but reject trivialism. Phenomenologically, it is not an option for me. This does not show that trivialism is untrue. As far as the above considerations go, it is quite possible that everything is the case; but not for me—or for any other person. 32
Notice that the argument is not vulnerable to the Ungerian trivialist of argument two. The meaning which they deny is a public phenomenon; I cannot guarantee that words are meaningful in this way simply by inspecting my own consciousness. But that I am (subjectively) a chooser, and cannot be otherwise, is something that is phenomenologically guaranteed. Notice, also, that this argument works just as much against the near-trivialist of argument one as it does against the trivialist. For one of the things about which we must choose is how to act in the physical world (at least, the phenomenological physical world). We cannot, therefore believe everything to be the case about this world.
As is clear, argument number three depends on an analysis of consciousness. Perhaps the best way to think of this is in Kantian terms. In the Transcendental Deduction of the Critique of Pure Reason, Kant tries to establish the applicability of the Categories from the nature of (the unity of) consciousness. Similarly, the argument I have
end p.70
deployed provides a transcendental deduction from certain features of consciousness to the impossibility of being a trivialist. In the context, this is the best kind of argument one can give. The arbitrator must rule in our favour if we can give them a transcendental proof that our opponent does not exist.
3.10 CONCLUSION
In this chapter, we have looked at two interpretations of the question: how do you know that not everything is true? In response to the first interpretation I gave an a posteriori answer: if everything really were true, we would see many contradictory states of affairs; but we do not. In reply to the second interpretation, I gave a transcendental argument against the possibility of any conscious being—at least, any being with a consciousness like mine—accepting trivialism. There are probably other ways in which one might interpret the request for justification of 3.1, and probably much more to said about the answers I have given. But enough for the present.
In the first part of the book we have looked at a number of issues concerned with truth and dialetheism. In the process, as the observant reader will have noted, a number of other topics of concern to the book have been broached. It is time to turn to the next of them: negation.
end p.71
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
end p.72
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
PART II NEGATION
end p.73
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
end p.74
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
4 Contradiction
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
4.1 INTRODUCTION
Dialetheism is the view that some sentences of the form α and ¬α are both true, where ¬ is negation. As we saw in Chapter 2, there is nothing in the notion of truth that prevents dialetheism from being acceptable. The other obvious place to look for an argument for the same conclusion is in the nature of negation. Now there certainly are accounts of negation that rule out dialetheism; thus, any account, such as those provided by “classical” and intuitionist logics, according to which negation is explosive (that is, such that α, ¬α β, for an arbitrary β), rules out dialetheism—unless, of course, one is a trivialist. But such an account is by no means mandatory. Arguably, the correct account of negation both allows for the truth of some contradictions and is not explosive. To argue this is the point of the present chapter. 1
Negation is one of a collection of notions central to logic. Nowadays, they are standardly called logical constants, though they have been called by different names at different times. For example, they were called syncategoremata by medieval logicians. Much of logic has therefore been devoted to an analysis of these notions. Thus, how to analyse conditionals and how to analyse what we would now call quantifier phrases have both been historically contentions issues. Consensus concerning the latter was achieved early in the twentieth century, due to the work of Frege and others. The debate concerning the former shows no similar sign of convergence.
Negation, too, has been the locus of very different views. Major theories of negation in the twentieth century included the “classical” (Frege/Russell) account and the intuitionist (Brouwer/Heyting) account. But rival accounts of negation have always existed. In section 1.13 I distinguished between three sorts of account of negation: those according to which contradictions entail:
1. nothing;
2. something (but not everything);
3. everything.
The classical and intuitionist accounts are accounts of kind 3; but later medieval accounts which endorsed both an extensional disjunction and the disjunctive syllogism also belong to this category. Connexivist accounts of the early Middle Ages, such as those endorsed by Boethius and Abelard (which we met in section 1.13) are of kind 1.
end p.75
The accounts provided by contemporary paraconsistent and most relevant logics are of kind 2; but so was that provided by Aristotle. 2
4.2 NEGATION OR NEGATIONS?
How, then, does negation behave? 3 There is a short way with this question. There is no such thing as negation; there are lots of different negations: Boolean negation, intuitionist negation, De Morgan negation. Each of these behaves according to a set of rules (proof-theoretic or semantic); each is perfectly legitimate; and we are free to use whichever notion we wish, as long as we are clear about what we are doing. If this is right, there is nothing left to say about the question, except what justifies us in categorizing a connective as in the negation family. And I doubt that there is anything very illuminating to be said about that. Virtually every negation-like property fails on some account of a connective that is recognizably negation-like: the law of excluded middle, the law of non-contradiction, double negation, De Morgan's laws, contraposition, and so on. All we are left with is a family resemblance whose fluid boundaries are largely historically determined.
I do not think that the answer is right, however. It makes a nonsense of too many important debates in the foundations of logic. Doubtless, philosophical debates do rest on confusion sometimes, but questions concerning the role of negation in discourses on infinity, self-reference, time, existence, etc., are not to be set aside so lightly.
At the root of this kind of answer is a simple confusion between a theory and what it is a theory of. 4 We have many well worked-out theories of negation, each with its own proof-theory, model-theory and so on. And if you call the theoretical object constituted by each theory a negation, then, so be it: there are many negations. But this does not mean that one can deploy each of these theoretical objects at will and come out with the correct answer. The theoretical object has to fit the real object; and how this behaves is not a matter of choice.
A comparison with geometry may be helpful here. There are, in a sense, many geometries. Each has its own well-defined structure; and, as an abstract mathematical structure, is worthy of investigation. But if we think of each geometry, not as an abstract mathematical structure, but, suitably interpreted, as a theory about the spatial (or spatio-temporal) structure of the cosmos, we are not free to choose at will. The theory must answer to the facts—or, if one is not a realist, at least cohere in the most satisfactory way with the rest of our theorizing.
end p.76
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
There is always an extreme conventionalist line to be run here. One might say, as Poincaré (n.d.) did, that we are free to choose our geometry at will, e.g. on the grounds of simplicity, and then fix everything else around it. Similarly, we might insist that we are free to employ a certain notion of negation and make everything else fit. But such a line is not only philosophically contentious, but foolhardy, at least in advance of a good deal of further investigation. The tail may end up wagging a dog of a considerable size. For example, as Prior (1960) pointed out a long time ago, we can determine to use a connective, * (tonk), according to the rules of inference α α * β and α * β β. But the cost of this is accepting that if anything is true, everything is!
4.3 CONTRADICTORIES
I will return to the issue of the nature of logical theories in a more general context in Chapter 10. But enough for the present. We have seen that a simple voluntarism with respect to negation is unsatisfactory. If it is to be applied, an account of negation must be considered not just as an abstract structure, but as a theory of something, just as a geometry is a theory of physical space. And this will put substantial constraints on what an acceptable account is.
The next, and obvious, question is what, exactly, an account of negation is a theory of. It is natural to suggest that negation is a theory of the way that the English particle ‘not’, and similar particles in other natural languages, behaves. This, however, is incorrect. For a start, ‘not’ has functions in English which do not concern negation. For example, it may be used to reject connotations of what is said, though not its truth, as in, for example, ‘I am not his wife: he is my husband’. (Some linguists call this ‘metalinguistic negation’, 5 though this is obviously not a happy appelation in the context of logic.)
More importantly, negation may not be expressed by simply inserting ‘not’. For example the negation of ‘Socrates was mortal’ may be ‘Socrates was not mortal’; but, as Aristotle pointed out (De Interepretatione, ch. 7), the negation of ‘Some man is mortal’ is not ‘Some man is not mortal’, but ‘No man is mortal’.
These examples show that we have a grasp of negation that is independent of the way that ‘not’ functions, and can use this to determine when “notting” negates. But what is it, then, of which we have a grasp? We see that there appears to be a relationship of a certain kind between pairs such as ‘Socrates is mortal’ and ‘Socrates in not mortal’; and ‘Some man is mortal’ and ‘No man is mortal’. The traditional way of expressing the relationship is that the pairs are contradictories, and so we may say that the relationship is that of contradiction. Theories of negation are theories about this relation.
As usual in theorization, we may reach a state where we have to reassess the situation. For example, it may turn out that there are several distinct relationships here, which need to be distinguished. But at least this is the data to which theorization must (and historically did) answer, at least initially.
end p.77
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Having got this far, the next obvious question is what the relationship of contradiction is a relationship between: sentences, propositions, some other kind of entity? There are substantial issues here; but, as far as I can see, they do not affect the question of negation substantially. For any issue that arises given one reasonable answer to this question, an equivalent one arises for the others. So I shall simply call the sorts of thing in question, non-committally, statements, and leave it at that.
4.4 THE LAWS OF EXCLUDED MIDDLE AND NON-CONTRADICTION
So if α is any statement, let ¬α represent its contradictory. (Contradictories, unlike contraries and sub-contraries are unique—at least up to logical equivalence.) What relationships hold between these? Traditional logic and common sense are both very clear about the most important one: we must have at least one of the pair, but not both. 6 It is precisely this which distinguish contradictories from their near cousins, contraries and sub-contraries. If we have two contraries, e.g. ‘Socrates was black’ and ‘Socrates was white’, it is necessarily false that (Socrates was black Socrates was white); but it is not necessarily true that (Socrates was black V Socrates was white). Dually, if we have two sub-contraries, e.g. ‘Socrates was under 2 m tall’ and ‘Socrates was over 1 m tall’, it is necessarily true that (Socrates was under 2 m tall V Socrates was over 1 m tall), but it may also be true that (Socrates was under 2 m tall Socrates was over 1 m tall).
This fact about contradictories obviously gives immediately two of the traditional laws of negation, the law of excluded middle:
LEM □ (α V ¬α)
and the law of non-contradiction:
LNC □ ¬ (α ¬α), i.e. ¬ (α ¬α)
in each case, for all α. Note that one may formulate the law of non-contradiction in a number of very different, and non-equivalent, ways. As I have formulated it here, the truth of the principle is quite compatible with dialetheism, as we will have occasion to observe in a moment. The question of how to formulate the law in such a way that a dialetheist cannot accept it is, in fact, a very sensitive one. 7 At any rate, the LEM and
end p.78
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the LNC as formulated here are central aspects of the traditional characterization of contradictories. Maybe the traditional characterization—and consequently these two laws—is wrong; but it would certainly seem to be the default position. The onus of proof is therefore on those who would dispute it.
Disputation comes from at least two directions. The first is that of some (though not all) paraconsistent logicians. The argument here is that some contradictories are both true, i.e. for some βs we have β ¬β. We do not, therefore, have ¬ (β ¬β) (and so □¬ (β ¬β)). We will look more closely at the first part of this argument later. For the moment, just note that if it is correct, it undercuts the second part of the argument (at least without some further considerations). For if some contradictions are true, we may well have both β ¬β and ¬ (β ¬β). Hence, the fact that some contradictions are true does not, of itself, refute the LNC (at least in the form in question here).
The second direction from which one might dispute the traditional characterization is that of those logicians who suppose there to be sentences that are neither true nor false. This includes intuitionist logicians (though they might not express their own position in this way). The argument here is that if α is neither true nor false, so is ¬α. Hence, assuming that disjunction behaves normally, α V ¬α does not hold. 8 The claim that certain statements are neither true nor false is clearly a substantial one. The claim that disjunction behaves normally is also challengeable. (If we give a supervaluationist account, α V ¬α may be true even though each disjunct fails to be so.) However, we need discuss neither of these issues here. For from the present perspective there is an obvious objection. If ¬ behaves as suggested, it is not a contradictory-forming operator at all—merely a contrary-forming one. This would seem particularly clear if we consider the intuitionist account of negation. According to this, ¬α is true (= assertible) just if there is a proof that there is no proof that α. This is obviously a contrary of α. 9
A genuine contradictory-forming operator will be one that when applied to a sentence, α, covers all the cases in which α is not true. Thus, it is an operator, ¬, such that ¬α is true iff α is not true, i.e. is either false or neither true nor false. (In English, such an operator might be something like: it is not the case that.) For this notion, which is the real contradictory-forming operator, the LEM holds.
Those who believe in simple truth-value gaps (and, in particular, use a classical metalanguage) would seem to have little reply to this objection. The intuitionist does have a reply to hand, however. First, they can (though they need not) endorse the claim that ¬α is true iff α is not true, T <¬α> ↔ ¬T <α>. 10 Given that T <α V β> ↔ (T <α> V T <β>), we can infer that T <α V ¬α> ↔ (T <α> V ¬ T <α>), but provided that
end p.79
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the logic of the truth-theory is intuitionistic we cannot infer the left-hand side of this, since we do not have the right. Thus, one can quite coherently give a homophonic intuitionist truth-theory for an intuitionist language.
Second, an intuitionist can argue that a contradictory-forming operator, conceived of as one that delivers the LEM, literally makes no sense. The argument is a familiar one from the writings, notably, of Dummett. 11 In nuce, it is as follows. If a notion is meaningful there must be something that it is to grasp its meaning. Whatever that is, this must be manifestable in behaviour (or, the argument sometimes continues, the notion would not be learnable). But there is no suitable behaviour for manifesting a grasp of a connective satisfying the conditions of a classical contradictory-forming operator. In particular, we cannot identify the behaviour as that of being prepared to assert ¬α when (and only when) α fails to be true. For this state of affairs may well obtain when there is no principled way for us to be able to recognize that it does. Meaning requires that a sentence, if true, can be recognized as such, in principle. This is often called the Verification Constraint.
There are subtle issues (and a substantial body of literature) here. And to deal with them satisfactorily would require taking up a disproportionate part of this chapter. But let me say at least something about the matter. For a start, I do not see why the grasp of a notion must be manifestable. There is no reason why, in general, certain notions should not be hard-wired in us. If, for example, there is a Fodor-style language of thought, 12 it is quite natural to suppose that single-bit toggling is a primitive operation. One might even tell an evolutionary story as to how the hard-wiring came about; for example, it is the simplest and most efficient mechanism for implementing dissent (which occurs in non-human animals as well as humans). In particular, then, a contradictory operator does not have to be learned; its use is merely triggered in us by certain linguistic contexts, in much the same way that the categories of universal grammar are, according to Chomsky. 13
But even granting that the grasp of a notion must be manifestable, I do not see why it must be manifestable by anything as strong as the argument requires (which is, I agree, impossible). In particular, it can be manifested by being prepared to assert ¬α when in a position to recognize that α fails to be true, and refusing to assert it when in a position to recognize that α is true. 14 It could well be suggested that such a manifestation would not be adequate. There will be many cases where we are not in a position to recognize either. People could therefore manifest the same behaviour whilst disagreeing about how to handle new cases when these become recognizable, and so meaning different things. This is true. But if people not only behave as suggested, but also manifest a disposition to agree on new cases, this is sufficient to show (if not, perhaps, conclusively, then at least beyond reasonable doubt) that they are operating with the notion in the same way. In just this way, the fact that we are all prepared to
end p.80
apply, or refrain from applying, the word ‘green’ to hitherto unseen objects when they come to light, shows that we all mean the same thing by the word. This is essentially what following an appropriate rule comes to, in Wittgensteinian terms. 15
There is much more to be said here. But if the onus of proof is on an intuitionist, as it would seem to be in the case of a contradictory-forming operator, I know of no argument against the LEM that I find persuasive. (None of this contests the claim that one can tell a coherent epistemological/metaphysical intuitionist story. One may certainly contest this claim as well, however. In particular there is a well-known argument which can be seen as a reductio of the Verification Constraint. We do not need to go into this here, however. 16 )
Before we leave the LEM it is worth noting that the fact that for every pair of contradictories one must be true (simpliciter) does not entail that for every world or situation one of each pair must be true at it. Thus, if □ (α V ¬α), one of α and ¬α holds at each possible worlds, but there may be impossible worlds where laws of logic such as the LEM break down, and so where neither α not ¬α obtains. 17 Alternatively, if one thinks of a situation as part of the world, then it may well be argued that neither of a pair of contradictories need be true of it. Thus, consider the situation concerning my bike. It may be the case that neither ‘Ghent is in Belgium’ nor ‘Ghent is not in Belgium’ is true of this situation. 18 The question of whether or not one needs to consider impossible worlds and partial situations is important in discussions of the semantics of conditionals, intentionality, and other areas; but we do not need to go into these matters here.
4.5 TRUTH AND FALSITY
So far, we have met two of the classical laws of negation, LEM and LNC. A third, the law of double negation (LDN) is simply derivable. The relationship of being contradictories is symmetric. That is, if β is the contradictory of α, then α is the contradictory of β. In particular, α is the contradictory of ¬α. Hence, ¬¬ α just is α.
We are now in a position to look at another important feature of negation: its truth conditions. To do this we will need a definition of falsity. As already explained in the Introduction to the book, let us take ‘α is false’ to mean that ¬α is true. This is not the only plausible definition; one might also define it to mean that α is not true. It may turn out that these two definitions are equivalent, of course. However, to assume so here would be to beg too many important questions. And the present definition is one that all parties can agree upon, classical, intuitionist, paraconsistent, or otherwise.
The definition of falsity assures us that ¬α is true iff α is false. Dually, ¬α is false iff ¬¬α is true (by the definition of falsity) iff α is true, by LDN. Hence, the
end p.81
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
traditional understanding of the relationship between truth and falsity falls out of the understanding of negation as contradiction, and the definition of falsity.
Two more of the classical laws of negation, the Laws of De Morgan (LDM), can also be dealt with. These involve conjunction and disjunction essentially; and so we need to make some assumption about how they behave. Since this chapter is not about conjunction and disjunction, this is not the place to discuss the matter at great length. For present purposes, let us suppose that they behave as tradition says they do: a conjunction is true iff both conjuncts are true, and false iff at least one conjunct is false. The conditions for disjunction are the obvious dual ones.
One of De Morgan's Laws is the equivalence of ¬(α β) and ¬α V ¬β. This can now be demonstrated thus: ¬(α β) is true iff α β is false iff α is false or β is false iff ¬α is true or ¬β is true iff ¬α V ¬β is true. Dually, ¬ (α β) is false iff α β is true (LDN) iff α and β are true iff ¬α and ¬β are false (LDN) iff ¬α V ¬β is false. The other of De Morgan's Laws is an equivalence between ¬ (α V β) and ¬α ¬β, and can be verified by a similar argument.
The connection between negation and the conditional is more difficult to deal with, but this is because the conditional is itself more contentious. Indeed, the claim that there are different kinds of conditional (entailments, causal conditionals, indicative conditionals, subjunctive conditionals) is well known. If such distinctions are well-motivated, negation may well interact with different conditionals in different ways. A minimal condition for a conditional of any kind would seem to be that it preserve truth in an appropriate way from antecedent to consequent. Hence we have modus ponens, α, α → β β. The most important question concerning a conditional in the present context is whether it preserves falsity in the reverse direction. For some conditionals, at least, this would seem to fail, as, e.g. Stalnaker and Lewis have argued. 19 And if α → β fails to preserve falsity backwards, ¬β → ¬α will fail to preserve truth forwards, and so will not be true. The law of contraposition (LC), α → β ¬β → ¬α is not, therefore to be expected to hold for an arbitrary conditional. Of course, there may well be conditionals which do preserve falsity in the appropriate way; in fact one can always define one, , in a simple fashion: α β is just (α → β) (¬β → ¬α). For such a conditional contraposition will hold.
In this section, I have talked of truth. I have said nothing about truth-in-an-interpretation, as required, for example, for a model-theoretic account of validity. It is important to distinguish these two notions, for they are often confused. The first is a property (or at least a monadic predicate); the second is a (set-theoretic) relation. It is natural enough to suppose that truth is at least coextensive with truth-in- , where is some one privileged interpretation (set). 20 And this may provide a constraint on the notion of truth-in-an-interpretation. But it, even together with an account of truth, is hardly sufficient to determine a theory of truth-in-an-interpretation. It does not even determine, for example, how to conceptualize an interpretation. So how are an
end p.82
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
account of truth-in-an-interpretation, appropriate for the connectives we have been discussing, and a corresponding model-theoretic notion of validity, to be formulated? A model-theory for a relevant paraconsistent logic provides what is required. 21 But this takes us into areas of formal logic that I have forsworn in this book, so I shall not pursue the matter here.
4.6 TRUTH AND CONTRADICTION
Starting with a conception of negation as a contradictory-forming operator, we have now validated five standard laws of negation (LEM, LNC, LDN, and the two LDM), and a sixth (LC) in certain contexts. We have hardly settled all the central issues concerning negation, however. As we observed in 4.4, the fact that ¬ suit (α ¬α) holds does not rule out α ¬α holding too. This does not mean that ¬ is not a contradictory-forming operator. It just means that there is more to negation than one might have thought. Let us call this more, for want of a better phrase, its surplus content. The classical view is to the effect that negation does not have surplus content: any such content would turn into the total content of everything since α ¬α β. But the classical view has been called into question by dialetheists. The case for dialetheism is a long one, and, like the intuitionist case against classical negation, is too long to take up in detail here; but let me say a little. 22
Many examples of dialetheias have been suggested, but perhaps the most impressive ones are those generated by the paradoxes of self-reference. Here we have a set of arguments that appear to be sound, and yet which end in contradiction. Prima facie, then, they establish that some contradictions are true. Some of these arguments are two and a half thousand years old. Yet despite intensive attempts to say what is wrong with them in a number of logical epochs, including our own, there are no adequate solutions. It is illuminating to compare these paradoxes with ones of equal antiquity: Zeno's. Zeno's paradoxes have also been the subject of intensive study over the years, and for these there is a well recognized and stable solution. 23 (Philosophers may still argue over some of the details, but then philosophers will argue over anything.) The fact that there is no similar thing in the case of the paradoxes of self-reference at least suggests that, in their case, trying to solve them is simply barking up the wrong tree: we should just accept them at face value, as showing that certain contradictions are true.
Because a major part of what is at issue in this chapter is the semantics of negation, the semantic paradoxes are particularly pertinent. Every consistent solution to these is generally acknowledged as wrong (except by the few who propound it). Moreover, there are general reasons why, it would seem, no consistent solution will be forthcoming. The reason is the following dilemma (actually, trilemma). 24
The paradoxes arise, in the first place, as arguments couched in natural language. One who would solve the paradoxes must show that the semantic concepts involved
end p.83
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
are not, despite appearances, inconsistent. And it is necessary to show this for all the concepts in the semantic family, for they are all deeply implicated in contradiction. Attempts to do this, given the resources of modern logic, all show how, given any language, L, in some class of languages, to construct a theory, T L , of the semantic notions for L, according to which they behave consistently.
The first horn of the dilemma is posed by asking the question of whether the theory T L is expressible in L. If the answer to this is ‘yes’ it always seems possible to use the resources of the theory to construct new semantic contradictions, often called extended or strengthened paradoxes. Nor is this an accident. For since T L is expressible, and since, according to T L , things are consistent, we should be able to prove the consistency of T L in T L . And provided T L is strong enough in other ways (for example, if it contains the resources of arithmetic, as it must if L is to be a candidate for English), then we know that T L is liable to be inconsistent by Gödel's second incompleteness theorem.
It would seem, then, that the answer to the original question must be ‘no’. In that case we ask a second question: is English, or at least, the relevant part of it, E, one of the languages in the family being considered? If the answer to this is ‘yes’ then it follows that T E is not expressible in English, which is self-refuting, since the theorist explained how to construct each T L in English (assuming the theorist to speak English—and if they do not, just change the language in question). If, on the other hand, the answer is ‘no’ then the original problem of showing that the semantic concepts of English are consistent has not been solved.
Hence, all attempts to solve the paradoxes must swing uncomfortably between inconsistency, incompleteness and inexpressibility, a pattern that is clear from the literature. 25
Let us take it, then, that truth and falsity overlap: for some αs we have both α and ¬α. For such αs we have α ¬α, and so (α ¬α). We can now deal with another law of negation: Explosion: α, ¬α β. Unlike the other laws we have already met, this one has always been contentious historically. And its presentation to a class of students before they have been taught a logic course, is sure to draw pretty universal dissent. Given the present discussion, it can clearly be seen to fail. For we can simply take an α which is both true and false, and a β that is not true. This instance of the inference is not truth-preserving, and hence the inference is not valid (truth-preservation being at least a necessary condition for validity). For good measure, the equally contentious inference of Antecedent Falsity (AF), ¬α α → β, must also be invalid, for exactly the same reason (modus ponens holding); as, again, and more contentiously, must be the disjunctive syllogism (DS): α, ¬α V β β. 26
In the case of someone who endorses the idea that there are truth-value gaps, so that α and ¬α may both fail, one may object, as I did, that ¬ is not playing the role of a contradictory-forming operator. A genuine such operator is one given by the truth
end p.84
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
conditions:
T <¬α> iff ¬ T<α>
It is therefore natural to suppose that a dual objection can be made if one takes it that α and ¬α may both me true. Dialetheic negation is merely a sub-contrary-forming operator. The displayed clause defines the genuine contrary-forming operator. 27
The situation is not the same, however. Given the notion of negation employed with gaps, the LEM and LNC fail. Given the conception of negation I have just described, they do not; so the negation is a contradictory-forming operator. It may just have surplus content as well.
There is more to the matter than this, though. In fact, a dialetheist may say much of what the intuitionist is inclined to say about the matter. First, they can (though they need not) endorse the claim that ¬α is true iff α is not true. Given this clause, and that T <α β> ↔ T <α> T <β>, we can infer T <α> ¬ T <α> from T <α ¬α>. But provided that the logic of the truth-theory is dialetheic, so that Explosion fails, we cannot infer an arbitrary β from T <α ¬α>. Thus, one can quite coherently give a homophonic dialetheic truth-theory for a dialetheic language. 28 Second, a dialetheist can argue that a contradictory-forming operator, conceived of as delivering Explosion, and so ruling out surplus content, literally makes no sense. In the next chapter, we will call negation, so conceived, Boolean negation, and I will argue just that. Before this chapter ends, however, there are a couple of other issues concerning negation which can be dealt with quickly.
4.7 ARROW FALSUM
Negation, as we have just seen, does not satisfy Explosion. But how can this be? There must be some sense of negation that satisfies it. For example, let (falsum) be a logical constant such that it is a logical truth that → α, for every α. For example, if we have a truth predicate, T, satisfying the T-schema (T <α> ↔ α, for some detachable conditional operator, →, and every α), can be defined as xTx. We can then define −α simply as α → to obtain the appropriate Explosion. For we have α, −α and β. 29 (−α is, of course, equivalent to ¬α in both classical and intuitionist logic.)
The point is well made. There is such a logical constant, and such a notion defined by employing it. But it is not negation. Its properties depend, of course, on the notion of conditionality employed. In particular, the LEM and LNC reduce to α V (α → ) and (α (α → )) → , respectively. There is no reason to accept the first of these. The only ground could be that the falsity of α suffices for the truth of α → , i.e. AF; and we have already seen that this is to be rejected. The latter may
end p.85
appear more plausible, but in fact fails in a number of accounts of the conditional, namely those that reject Contraction (Absorption): γ → (γ → δ) γ → δ. (One should not confuse (α (α → β)) → β with modus ponens, α, α → β β.) The other laws also depend on properties of the conditional, often of a dubious nature. 30
One might wonder whether a dialetheic solution to the semantic paradoxes can be sustained once the connective, -, is at our disposal, due to the reappearance of triviality-inducing extended paradoxes. It can. For example, the form of the Liar using, -, is just a sentence, β, such that T <β> ↔ (T <β> → ). If we could help ourselves to the principle of Contraction then we could infer T <β> → , and so T <β>, and so . This is just, in fact, a Curry paradox. But contraction fails in numerous accounts of the conditional, and there are reasons to suppose that the conditional of the T-schema does not satisfy it. 31
4.8 REDUCTIO AD ABSURDUM
Before we conclude, let me make one final comment on negation. This concerns reductio ad absurdum (RAA). 32 The purpose of a reductio argument is often to establish something of the form ¬α by deducing a contradiction from α. Dialetheism need not affect this enterprise. If the deduction establishes that α → (β ¬β), and the → in question contraposes, then we have (β V ¬β) → ¬α by contraposition LDM and LDN. ¬α follows by the LEM.
In a polemical context, the point of a reductio argument is not normally to establish something, but to try to force an opponent to give up a view. In a dialetheic context, establishing the negation of the view is not logically sufficient for this. However, whilst a contradiction may be logically possible, 33 it does not at all follow that it may be rational to believe it. That I will turn into a fried egg tomorrow is logically possible, but a belief in this is ground for certifiable insanity. (A fortiori that I both will and will not turn into a fried egg, since this entails it.) And an argument against an opponent who holds α to be true is rationally effective if it can be demonstrated that α entails something that ought, rationally, to be rejected, β. For it then follows that they ought to reject α. β may be a contradiction, or it may be the claim that I will turn in to a fried egg. Not all contradictions may work. For example, that the Liar sentence is both true and not true may be (in fact, is) perfectly rationally acceptable.
end p.86
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
4.9 CONCLUSION
The discussion of the previous section raised the question of when and why something is rationally acceptable. This question provides the topic of the third part of this book. But we are not finished with negation yet. For a start, the issue of Boolean Negation raised its head at the end of section 4.6. This is the topic to which we turn in the next chapter.
end p.87
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
5 Boolean Negation
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
5.1 INTRODUCTION
In the last chapter I argued for a certain account of negation. The account, whilst respecting traditional features of the notion of negation, such as its being a contradictory-forming operator, nonetheless allowed for contradictions to be true without triviality—that is, as I put it there, for negation to have surplus content.
It might be argued, however, that, though the account may, in fact, be correct, this is in many ways beside the point. Grant that negation behaves as I say it does. It remains the case that there is a logical operation that has all the properties of classical negation. Classical logic is therefore perfectly correct provided that we understand its negation sign as meaning this. In particular, a dialetheic solution to the paradoxes of semantic self-reference—as advocated in 4.6—is ruled out, since we may formulate the Liar sentence employing this notion and apply the unrestricted T-schema to it, deriving a corresponding contradiction, and hence infer triviality.
The point may be reinforced by an appeal to the semantics of certain paraconsistent logics themselves. The negation of relevant logics, often called ‘De Morgan negation’, is of the kind that I described in the last chapter. However, Meyer and others have investigated relevant logics with two negations. One is the standard De Morgan negation. The other is a negation that behaves exactly as the classical account of negation says it does. This they called ‘Boolean negation’. 1 Both negations are provided with a semantics and proof theory.
If one takes it that a dialetheic solution to the semantic paradoxes is correct, one must deny the coherence of Boolean negation. This I do, and the present chapter explains why (as promised at the end of section 4.6). As we will see, the coherence of an explosive notion of negation can be maintained, at best, only by question-begging.
5.2 INFORMATION AND AUGMENTATION
Let us write Boolean negation as $, and start with an argument to the effect that it is an incoherent logical connective. 2 The semantics of many logics, including relevant logics of the kind required for a plausible account of the conditional, involve the
end p.88
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
notion of worlds. 3 Suppose, for a moment, that these are to be thought of as states of information. States of information may well be incomplete. Thus, if w is a world then we may have neither Pa nor ¬Pa holding in w, for some a and P. Clearly, it should be possible to add missing information about a sensibly. Thus, in the case at hand, it should be possible to add Pa to the state of information coherently. But if Boolean negation is a legitimate connective, then since Pa w and ¬Pa w, $Pa w and $¬Pa w. Hence, the addition of either Pa or ¬Pa to w will result in triviality. $ can therefore have no sense; if it did, perfectly sensible pieces of information would be rendered impossible.
It might be suggested that when, say, Pa is added, $Pa ought to be removed. $, it may be suggested, functions something like negation-as-failure in PROLOG. 4 When a PROLOG data-base is asked whether Pa, it searches for a proof of this. If it cannot find one, it reports that $Pa. If Pa is added to the data base, it will then cease to report this. But this just shows that $Pa is not reporting a piece of information. It is simply reporting the absence of information. If it really did report information, then adding Pa and dropping $Pa would not be a case of augmenting incomplete information—some information would have been lost.
The point about the untoward consequences of $ may be made, though perhaps less obviously, if we think of worlds, not as states of information, but ontologically. If the conditional Qa → (Pa V ¬Pa) is to fail, as it will do in any relevant logic, then there must be a world, w, where Pa V ¬Pa, and so Pa and ¬Pa, fail. w may not be a possible world. Indeed, if Pa V ¬Pa is a necessary truth, it will not be: it will be an impossible world—but a world nonetheless. Now, there ought to be a perfectly sensible world which is just a little less incomplete, which is the same as w except that Pa holds as well. But as before, if Boolean negation makes sense, $Pa holds at w, and so a world that is the same as w, except that Pa also holds, collapses into triviality. If $ were legitimate, it would render a perfectly sensible state of affairs incoherent. And again, if it be suggested that we should not expect $Pa to be preserved at this world, this just shows that $Pa is not something that holds in w, but merely something about w. In other words, $ is not to be understood as an operator which acts on the content of Pa to give another content of the same kind. A fortiori, it is not a meaningful such operator.
The general situation, then, is this. Whatever worlds are, they may be incomplete with respect to a state of affairs/piece of information. If w is any such world, there must be a world that is essentially the same, except that this state/piece is added. Let us call this the Augmentation Constraint. Any connective which would produce worlds that violate the Augmentation Constraint generates violations of what makes sense; so it, itself, cannot make sense. Boolean negation—but not the standard connectives—fails this test. Compare the situation with intuitionism. An intuitionist subscribes to the Verification Constraint: any meaningful sentence is such that, if true, it can be recognized as such. As we noted in section 4.4, Boolean negation fails this test, too.
end p.89
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
5.3 BOOLEAN NEGATION BY PROOF THEORY
The foregoing argument is not definitive. Someone might deny the Augmentation Constraint, just as a classical logician might deny the Verification Constraint. But it at least shows why the meaningfulness of Boolean negation is not necessarily to be taken for granted, and so why arguments that it is are centrally important. What might such arguments be?
Let us start by asking how Boolean negation is to be characterized. Some kind of definition is required. Clearly, an explicit definition, of the form ‘dialetheia means true contradiction’, is not going to get us very far. Such definitions are eliminable without loss—or if they are not, they are creative, and so objectionable. We must appeal therefore to some notion of implicit definition. Two sorts of implicit definition (hopefully equivalent) recommend themselves here: proof-theoretic and model-theoretic. Let us consider the proof-theoretic characterization first.
This way of characterizing Boolean negation is to specify that it is governed by a set of rules which generate all (and only) those rules of inference that are valid according to the classical theory of negation. Unfortunately, proceeding in this way does not show that there is a coherent notion satisfying those conditions. If one is free to introduce a connective by stipulating that it satisfy a certain collection of rules, then there is any number of ways of getting into a mess with the T-schema. For example, we might just introduce a new zero-place connective, *, satisfying the rule α ↔ * β. Triviality then follows from the instance of the T-schema T <*> ↔ *.
The point is that stating that a connective be such as to satisfy a certain set of rules is no way to guarantee it a sense, that is, to ensure its intelligibility. The point is a familiar one thanks to Prior and tonk, as we saw in section 4.2. Thus, a dialetheist may simply deny the intelligibility of Boolean negation thus characterized.
It is worth noting a disanalogy at this point. Various consistent attempts to solve the Liar paradox run into trouble due to the fact that, though they may solve the pristine Liar, extended or strengthened versions are available which they do not solve. Thus, against someone who claims that the Liar sentence is a truth-value gap, one considers the sentence: this sentence is false or a truth-value gap. A contradiction is soon forthcoming. If someone is tempted by the thought that they can avoid the consequence by denying the intelligibility of the notion of a truth-value gap, they should not be tempted for very long. The coherence of this notion is required by the coherence of the very solution to the paradox: if the notion makes no sense, neither does the solution. Thus, a person who wishes to solve the Liar paradox by appealing to truth-value gaps, yet denies the coherence of that notion is open to a devastating ad hominem argument. By contrast, Boolean negation is nowhere needed by a dialetheist. In particular, dialetheic solutions to the Liar paradox do not make use of the notion at all. It is therefore impossible to mount an ad hominem argument for the coherence of Boolean negation. 5
end p.90
5.4 CONSERVATIVE EXTENSION
So what other arguments might be given as to the intelligibility of Boolean negation characterized proof-theoretically? The obvious ones are those provided by the standard replies to Prior on tonk. There are three of these, all of the form that some constraint must be applied to a set of rules if it is to determine sense. The first (that of Belnap 6 ) and, I think, least satisfactory, is that the addition of the new connective must produce a conservative extension. That is, the addition must not allow anything in the unaugmented language, that was unprovable before, to become provable. Does Boolean negation do this? Obviously not: semantically closed theories with a suitable underlying paraconsistent logic are known to be non-trivial. 7 The addition of Boolean negation produces triviality. By this test, Boolean negation is, indeed, senseless.
Things are a bit more complex than this, however. The notion of conservative extension is, of course, relative to a pre-existing notion of deducibility; and, in particular, to the logical apparatus already present (as Belnap was at pains to note). It might, therefore, be replied that though Boolean negation produces a non-conservative extension if added to a logic with an unrestricted truth predicate, it does not do so if there is no such predicate. It is therefore the truth predicate the addition of which produces triviality, and so lacks sense.
Now it is not always true that the addition of Boolean negation to a logic without a truth predicate produces a conservative extension. The addition of the rules for Boolean negation to classical positive logic is indeed conservative. But, as is well known, the addition of Boolean negation to positive intuitionist logic is not conservative: it gives rise to “Peirce's law” ( ((α → β) → α) → α), which is not intuitionistically valid. 8 However, it is intuitionistic logic that is at issue here, but a paraconsistent logic, and the addition of Boolean negation to standard relevant logics is conservative. 9 Unfortunately, it is equally true that adding an unrestricted truth predicate to many such logics produces a conservative extension—even when the machinery of self-reference is available. 10 Thus, it is the joint addition of Boolean negation and an unrestricted truth predicate that is non-conservative. Where, then,
end p.91
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
should the blame be laid? The conservative-extension test is silent on this issue, and we witness its limitation.
5.5 HARMONY
Given what we have seen, another shortcoming of the conservative-extension test is also clear. The mere fact that a notion does not produce a conservative extension of some pre-existent set of rules is absolutely no reason to suppose that the notion has no sense. The non-conservative nature of the extension may be attributed to the fact that the old set of rules was simply incomplete. This, for example, is what the classical logician will say about the addition of Boolean negation to positive intuitionist logic.
For the conservative-extension test to have any punch, the pre-existing rules must, in some sense, be complete. How is one to cash out the notion of completeness here? An obvious answer is that the rules should be complete with respect to an appropriate semantics. But let us set semantics aside for a little longer. Another way of trying to understand an appropriate notion of completeness is also proof-theoretic. The thought here is that the rules of proof say everything there is to be said about the meanings of the connectives employed. Thus, following Dummett and Prawitz, 11 suppose that our proof-theory is formulated in terms of some system of natural deduction or sequent calculus, and that each connective comes with an introduction and an elimination rule. The introduction rule can be arbitrary. It is exactly what characterizes the meaning of the connective. But if this is so, the elimination rule cannot be arbitrary. It can allow us to get out of a formula that is packed in by its meaning, but no more than this. In the jargon used, the introduction and elimination rules must be in harmony. How, exactly, to cash out the notion of harmony depends on the exact details of the kind of proof-theory employed. But a natural way is to require that consecutive applications of the introduction and elimination rules can be eliminated: they form a detour that takes us nowhere. To illustrate (and following Read (2000)), suppose that we formulate our proof theory in terms of a sequent calculus (sequents being of the form Γ: Δ, where Γ and Δ are finite sets of formulas), and that the introduction and elimination rules for a connective operate on the right-hand side of sequents. Then if the appropriate introduction rule for conjunction is:
•
the appropriate elimination rule is:
•
end p.92
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
These are in harmony, for if the second rule is applied immediately after the first, we have:
•
The detour via conjunction may now be eliminated thus:
•
The cut rule:
•
(which can be avoided if cut-elimination holds) is applied at each step.
Thus, it might be suggested that the conservative extension should be relative to a set of rules that are in harmony. In fact, once we have the notion of harmony to hand, an appeal to conservativeness becomes redundant. The rules for a coherent notion need to be in harmony; and this the rules for tonk fail to be—spectacularly. This is the second reply to Prior. 12 The introduction and elimination rules for tonk (*) are:
•
Successive applications gives:
•
But as is clear, there is no way that the tonk-introduction can be eliminated.
Now, what of the case for Boolean negation? Appropriate sequent calculi for relevant logics (which have De Morgan negations) are more complicated than those for classical and intuitionist logics. In particular, they require two distinct ways of grouping together antecedents and consequents:, (corresponding to extensional connectives) and; (corresponding to intentional). 13 The extra complexities are largely irrelevant to the present situation, however. The appropriate introduction and elimination rules for Boolean negation are:
•
end p.93
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
These are obviously in harmony, since successive applications give:
•
From which $α may be eliminated with a cut in the obvious way:
•
So it looks as though Boolean negation passes the test. Unfortunately, exactly the same phenomenon arises as with conservative extensibility. The truth predicate has the following introduction and elimination rules:
•
These are obviously in harmony, since successive applications give:
•
From which T <α> may be eliminated with a cut in the obvious way:
•
Thus, T also passes the harmony test. But if we have Boolean negation and the truth predicate (together with self-reference), triviality ensues. Here is the proof. 14 Suppose that we have a formula, λ, of the form $T <λ>. Then:
The harmony of the rules for each notion, does not, then, ensure coherence. In particular, though Boolean negation be harmonious, this is not enough. Of course, tonk does a lot more than violate conservativeness and harmony. It delivers triviality. This is what tells us that it is incoherent. The same, in the appropriate context, is true of Boolean negation.
end p.94
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
5.6 BOOLEAN NEGATION BY MODEL THEORY
We are still in search of a constraint that may be imposed on a set of rules, and which is sufficient for coherence. Two proof-theoretic constraints have failed. The third possible constraint, and, I think, the most adequate one, is that the rules of inference in question must answer to some satisfactory semantic account of the connective, in the sense that the rules are demonstrably sound (and, if we are lucky, complete) with respect to the semantics. 15 This brings us to the second way of characterizing Boolean negation: in model-theoretic terms.
What is an adequate semantic characterization to which the rules of Boolean negation must answer? This raises the prior question of what an adequate semantics should be like anyway. The question is an important one. For if one is an intuitionist or constructivist of some other stripe, no semantics can be adequate unless it gives semantic conditions which we can effectively recognize as obtaining when they do. If this view is correct then Boolean negation is an immediate casualty. For, as intuitionists have stressed, classical negation does not have such semantics. 16
However, the majority of people who have discussed Boolean negation in the present context are not intuitionists. And I have argued against the intuitionist critique of classical semantics in 4.4. So we may set this point aside. This only partly solves the problem of what a semantics for Boolean negation should be like, however; for even the defenders of Boolean negation have given different kinds of semantic account (four-valued ternary relational, two-valued ternary relational, algebraic, etc.). Fortunately, then, the differences between them are not relevant to the essential points that need to be made.
Suppose that we are giving a model theory in which connectives are given truth (and falsity) conditions. 17 Given an interpretation, , assume that the conditions for conjunction and disjunction in an interpretation are as usual.
α β is true in iff α is true in and β is true in
α β is false in iff α is false in or β is false in
α V β is true in iff α is true in or β is true in
α V β is false in α is false in and β is false in
end p.95
I will continue to use ¬ for (De Morgan) negation. 18 The model-theoretic generalization of the truth (simpliciter) conditions of 4.5 are:
¬α is true in iff α is false in
¬α is false in iff α is true in
Boolean negation can now by characterized semantically by
$ α is true in iff α is not true in
$ α is false in iff α is true in
Note that the truth conditions for Boolean negation (unlike those for De Morgan negation) actually use the notion of negation (boldfaced). 19
One might contest the claim that these truth conditions provide an adequate specification of meaning. 20 However, let us grant them (at least for the sake of argument). As can easily be checked, given that the negation employed in stating the truth conditions of $ has the properties of De Morgan negation that we established in the last chapter, we have the following. In every interpretation, :
1. α is true or $α is true
2. it is not the case that α and $α are both true
Hence, in every interpretation α V $α and $(α $α) are true. To this extent, $ behaves like ¬. The question is whether it satisfies the other principles of Boolean negation—in particular, whether it satisfies Explosion, so ruling out the possibility of surplus content.
5.7 EXPLOSION
In model-theoretic semantics, a valid inference is defined as one which is truth-preserving in all interpretations. That is, α β iff for every interpretation :
3. if α is true in , β is true in
end p.96
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
(Keep your eye on the boldfaced if here.) The validity of Explosion for $ now comes to this. For all :
4. if α $α is true in , β is true in
What to say about the matter now depends on how we interpret if. Suppose, first, that we interpret it as a genuine possible conditional. A moment's thought suffices to show that the inference from 2 to 4 is a quantified version of the inference AF (¬γ γ → δ). And this, as we saw in section 4.6, is fallacious. The argument to the effect that $ does not allow for surplus content therefore fails. 21
Another possibility is that the if of 4 is taken to be the material conditional, (where α β is ¬α V β). Then the inference from 2 to 4 is a quantified version of the inference ¬α ¬α V β, which is perfectly valid. On this definition of validity, Explosion is indeed valid. But now we hit another problem. The validity of Explosion in this sense does not rule out surplus content, since it does not generally licence detachment! Though we certainly have:
for all (γ is true in δ is true in ) (γ is true in 𝒥 δ is true in 𝒥)
we cannot, given the premise and the antecedent of the material conditional, detach to obtain the conclusion that δ is true in 𝒥. For this is to deploy the Disjunctive Syllogism, which, as we saw in 4.6 does not support detachment in inconsistent contexts. In particular, then, suppose we can, through techniques of diagonalization (or whatever) come up with a “Boolean contradiction”—a sentence, α, such that α $α is true—and an interpretation, 𝒥, which is in accord with the actual (in the sense that something is true in 𝒥 iff it is true simpliciter). Then we are obviously in
end p.97
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
an inconsistent context: we have α $α true in 𝒥, even though it is not. There is therefore no way to conclude that β is true in 𝒥, and so β, for an arbitrary β. The collapse of surplus content into total content does not eventuate.
At this point, it might be noted that the argument thus far has assumed that the negation in the truth conditions for Boolean negation, not, is De Morgan negation. If it is itself Boolean negation, then the crucial inferential moves that I have been discussing go through, as, therefore, does the argument to the effect that $ has no surplus content. But now recall the dialectic. The point of the argument was precisely to establish the coherence of a notion satisfying the properties of classical negation. If the only way we can do this is by appealing to such a notion, and so presupposing its coherence, then the argument is clearly question-begging.
I have sometimes heard it argued that taking negation to be Boolean in the present context is perfectly acceptable, since that context is metatheoretic, and metatheory is (must be?) classical. This is a short-sighted argument. Any intuitionist or dialetheist takes themself to be giving an account of the correct behaviour of certain logical particles. Is it to be supposed that their account of this behaviour is to be given in a way that they take to be incorrect? Clearly not. The same logic must be used in both “object theory” and “metatheory”. 22 Indeed, even this distinction is bogus for someone who espouses a dialetheic solution to the semantic paradoxes. The idea that the metatheory must be a distinct, more powerful, theory, is a response to the first horn of the dilemma of 4.6. It has nothing to recommend it once we give up trying to find a consistent solution to the semantic paradoxes. The distinction between a theory (say about numbers) and its metatheory makes perfectly good sense to a dialetheist. But there is no reason to insist that the metatheory must be stronger than, and therefore different from, the theory. Indeed, if the original theory deals with, say, numbers and truth, then the metatheory may be a sub-theory of the theory.
5.8 BOOLEAN NEGATION NEGATED
As we have now seen, the argument to the effect that there is an intelligible connective, $, which rules out the possibility of surplus content may be rejected. If Boolean negation is characterized proof-theoretically, it is a connective like tonk that leads to triviality, and so has no coherent sense. And the classical truth conditions can be shown to deliver a connective having the properties of Boolean negation only by fallacious, or, at least, question-begging, arguments. The dialetheist is therefore at liberty to maintain that Boolean negation has no coherent sense.
It may come of a shock to some that the rules of classical negation characterize a connective that lacks sense. (We have been talking gibberish since Boole!) But one should not forget that whether or not something makes sense is theory dependent. And the dialetheist lines up with the intuitionist in this battle: the classical theorist has got it wrong. Nor has a classical logician any reason to feel smug about this. As we have
end p.98
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
seen, if Boolean negation is meaningful, then a predicate satisfying the unrestricted T-schema cannot be—which seems no better. 23 Indeed, it is worse, since dialetheist accounts of negation are much simpler and more natural than any of the standard restricted accounts of truth.
It is always possible for someone to reply to all of this by agreeing that it is impossible to prove that $ behaves classically, but saying, nonetheless, that they intend to employ a connective, $, and let it be governed by the rules of classical negation, such as α $α β. But this proves nothing. Someone can equally say: I am going to act according to the rule that people do not fall when they walk out of tenth-floor windows. All may, indeed, go well until a semantic paradox or a tenth-floor window is encountered. Both theorists must then prevaricate or take the consequences.
Though not, perhaps, strictly relevant, this is a good place to say a word about the history of relevant logic. Modal logic was born of a dissatisfaction with classical logic, and in particular with the material conditional. Lewis took himself to be offering a rival to classical logic. History has changed this perception of the situation. Modal logic is not a rival to classical logic, but an extension. A modal logic can be seen as classical logic extended by an intentional functor or two. History has a strange way of repeating itself. Relevant logic, too, was born of a dissatisfaction with the material conditional, and was proposed as a rival to classical logic (and modal logic to the extent that a strict conditional is supposed to give an adequate account of a genuine conditional). However, later there came voices, even those of erstwhile relevant logicians, according to whom it, too, should be seen as a mere extension of classical logic. 24 Classical logic is the logic of conjunction, disjunction, and negation—Boolean of course. Relevant logic merely adds to this two funny functors: De Morgan negation and an intentional →. Whatever relevant logicians as such think about this proposal, it should be clear that from the perspective of dialetheism, this view is not only wrong, but highly misguided. De Morgan negation is not some funny add-on functor: it is the correct theoretical account of the negation that we commonly use and love. Boolean negation—to the extent that something can be characterized by this epithet—is either unintelligible, or else does not have the properties of classical negation. It is therefore the funny functor, if anything is. Dialetheic logic, unlike modal logic, does, therefore, provide a genuine rival theory to that provided by classical logic.
5.9 CURRY CONDITIONALS
To return from this digression: Perhaps the most fundamental lesson to be learned from the deliberations of section 5.7, and one that is already well known to most perceptive logic teachers, is that to show the validity of certain rules of inference, truth
end p.99
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
conditions alone are not enough. Inference must be made from those conditions; and those inferences may be just those whose validity one is trying to demonstrate. 25 (In fact, I suspect that one of the appeals of the view that the Boolean truth conditions are the correct ones for negation arises from the mistaken thought that these deliver the validity of the classical rules of proof for negation. As we have seen, they do not—at least on their own.)
A way of illustrating the same point is by considering another way in which a dialetheic semantics may be thought to fail in connection with the paradoxes of self-reference. It is not only Explosion that may cause triviality in a theory with an unrestricted truth predicate. As was demonstrated by Curry, amongst others, and is now well known, if the conditional employed in the T-schema (or the abstraction schema of naive set theory), →, satisfies the rules of modus ponens and Contraction (or Absorption), α → (α → β) α → β, then triviality ensues. 26 Modus ponens is hard to get away from for any genuine conditional; so what this is naturally taken to mean is that Contraction must fail. Contraction must fail not just for →, though. If there is any conditional, , such that α → β α β, and which satisfies modus ponens and Contraction, triviality follows. 27 Call any connective that satisfies these conditions a Curry conditional. Someone who subscribes to a dialetheic solution to the paradoxes of self-reference must hold that no connective, including →, is a Curry conditional. 28
In fact, it is not difficult to give an account of a conditional appropriate for the T-schema that is not a Curry conditional: any contraction-free relevant logic will provide one. 29 But it might, again, be suggested that there are other perfectly intelligible Curry conditionals. Making the case for this is prone to fail for exactly the same reasons that the corresponding case for Boolean negation fails, however. If such a connective is characterized proof-theoretically, then its intelligibility may simply be denied: it clearly fails to give a non-trivial extension. Rules for a contracting conditional, if they are harmonious, still give triviality when combined with the harmonious rules for the truth predicate (given self-reference). We merely reproduce Curry's paradox in this context.
Establishing that the rules for the conditional pass the semantic test is the same task as demonstrating that an appropriate semantics for the conditional shows it to validate Contraction and modus ponens. The details of the argument will depend on the form that one thinks the semantics of conditionals ought to take. However, essentially the same sort of situation will arise whatever this is. So let me illustrate with a simple
end p.100
case. We suppose the semantics of conditionals simply requires us to give truth (in-an-interpretation) conditions. (As before, adding the machinery of possible worlds, does nothing to change the situation essentially.)
Then to give the truth conditions for a conditional, , one might use clearly extensional connectives, such as disjunction and negation, as in:
α β is true in iff α is not true in or β is true in
or else one might use a conditional itself:
α β is true in iff if α is true in , β is true in
The notion, defined in either of these ways, is quite intelligible. But the first reduces modus ponens for to the DS, whence it is not semantically valid.
What happens in the second case depends, of course, on what properties the conditional used in stating the truth conditions, if, is taken to have. As is easy to see, inherits its properties from if. In particular, assuming that if satisfies modus ponens, so does . And the only way that can be shown to satisfy Contraction is to assume Contraction for if. But this is to assume, in effect, that there is an intelligible Curry conditional, which was just what was to be shown. The argument is therefore viciously circular. 30
5.10 CONCLUSION
To establish the validity of various inferences one needs more than truth conditions; one needs various inferences. And the inferences may be the ones whose validity we are trying to establish. In other words, the process requires a certain amount of bootstrapping. But this by no means implies that we must fall back on classical logic willy-nilly. Many logical theories can provide the relevant bootstrapping. Decision between them has to be made on other grounds. The grounds include the many criteria familiar from the philosophy of science: theoretical integrity (e.g. paucity of ad hoc hypotheses), adequacy to the data (explaining the data of inference—all inferences, not just those drawn from consistent domains!), and so on. This is a topic to which we will turn in the next part of the book. All that this chapter has attempted to demonstrate is that the dialetheist can legitimately reject that claim that there is a coherent connective satisfying the conditions of classical negation. And thus, in particular, a dialetheic solution to the paradoxes of self-reference is not impugned.
end p.101
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
We are still not finished with negation, however. This chapter and the last have been concerned largely with the semantics of the notion. But there is usually thought to be an intimate connection between negation and pragmatic notions—particularly the notions of denial and rejection. It is often claimed that this connection provides trouble for the dialetheist. To this subject we turn in the next chapter. 31
end p.102
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
6 Denial and Rejection
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
6.1 INTRODUCTION
This core topic of this chapter is rejection, of the propositional kind. The linguistic expression of rejection is denial, an act that has often been thought to be intimately connected with negation. In the first part of the chapter, we will investigate the connection and some of its consequences—or otherwise—for dialetheism.
Rejection is also connected with the core concept of Part I of the book: truth, or more precisely, untruth. The second part of this chapter explores the connection. This will raise the issue of rational dilemmas of certain kinds; and in the last part of the chapter we will look at that topic.
Let us start by getting some of these basic notions straight, starting with rejection and its correlative notion, acceptance. Acceptance and rejection (as I shall use the terms—of course they can be used in other ways) are cognitive states. To accept something is simply to believe it, to have it in one's “belief box”, as it were. 1 To reject something is to refuse to believe it: if it is in one's belief box one takes it out, but whether or not it was in there before, one resolves to keep it out. Accepting something and rejecting it would certainly seem to be exclusive; but they are not exhaustive. One might not believe something because one has never considered it, or because one has considered it but found no evidence, or found insufficient evidence. In such cases one is agnostic about the claim in question; the jury is still, as it were, out; and in such cases, one neither accepts nor rejects it. Rejecting something is a lot stronger than agnosticism of this kind: it is putting a bar on accepting it (although, of course, one can change one's mind about this in the light of new evidence, etc.). When justified, it is so because there is evidence against the claim: positive grounds for keeping it out of one's beliefs—rather than the mere absence of grounds for having it in. 2
Assertion and denial, unlike acceptance and rejection, are not cognitive states but kinds of linguistic acts. Equivalently, one may think of them as illocutory forces with which utterances may be made. They are not the only such forces: questioning and commanding are others. Thus, consider an utterance of the sentence ‘The window is open’. Given the appropriate context, this could be an assertion, a question, or a
end p.103
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
command (to shut it). It could even be a denial (consider an utterance heavily laden with irony).
Assertion and denial are closely connected with cognitive states, however. Crudely, they are the linguistic expression of acceptance and rejection, respectively. Less crudely, the typical aim of assertion is to indicate that the utterer accepts the thing asserted, and, it may well be added, has appropriate grounds for doing so. (Derivatively, then, it often aims at getting the listener to accept it too). The typical aim of denial is to indicate that the utterer rejects the thing denied, and, again, one may add, has appropriate grounds for doing so. (Derivatively, then, it often aims at getting the listener to reject it too.) 3
6.2 NEGATION AND DENIAL
So far, things are reasonably straightforward. Henceforth it is more contentious. Although assertion and denial are distinct linguistic acts, Frege argued 4 (and many now accept 5 ) that denial is not a sui generis linguistic act. Specifically:
to deny α is simply to assert ¬α
If negation is understood in the way that I suggested in Chapter 4, this identity is not at all plausible. If I assert a sentence that is the contradictory of α, I certainly do not, in general, expect a hearer to reject α or to believe that I do. The sentence may well be a complex one; and the fact that it expresses the contradictory of α may not be obvious—or even known—by either of us. The identity is more plausible if one takes ¬α to be something like α prefixed by ‘it is not the case that’. Even that is not quite right though. A sentence of this form can be used to make a “metalinguistic negation” (see sect. 4.3): just consider ‘It's not the case that Hitler was a bad man—he was downright evil.’ Still, it might be thought, once negation is purged of idiosyncracies of this kind, Frege was right enough.
He was not. For a start, there are many ways of denying something that do not involve the assertion of anything. Thus, I can shake my head, say ‘no’, or even stomp off in a rage. Perhaps more importantly, consider someone who supposes that some sentences are neither true nor false. Let α be a sentence that they take to be of this kind. They will then deny α; but their denial is certainly not to be taken as an assertion of ¬α. They take this to be neither true nor false as well, and so will not assert it. And it makes no difference if you think that there are no truth-value gaps. The point is
end p.104
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
that you understand someone who thinks that there are. You understand their denial, and do not understand them as asserting the negated sentence.
Conversely, a dialetheist who has ground for believing that α and ¬α are both true may assert ¬α without thereby denying α. Thus, for example, I will assert both ‘The Liar sentence is true’ and ‘The Liar sentence is not true’. In asserting the latter I most certainly do not intend you to come to believe that I reject the former: I don't. Nor does it matter if you are not a dialetheist yourself. You understand me perfectly well.
Even without taking dialetheism into account one can see that asserting a negation (in the Fregean sense) is not necessarily a denial. In explaining their views, people often assert contradictions unwittingly. In this way, they discover—or someone else points out to them—that their views are inconsistent. In virtue of this, they may wish to revise their views. In asserting ¬α in this context, they are not expressing a refusal to accept α and so denying it. It is precisely the fact that they accept both α and ¬α that tends to promote belief revision. But it may even be rational sometimes (completely independently of any consideration of dialetheism) to hang on to both beliefs and continue to assert them. Consider, for example, the Paradox of the Preface. In this, one has evidence (as good as you like) for the joint truth of a certain number of claims, say those in a book, α 1 , …, α n , and so their conjunction—call this α; but we also have very strong inductive evidence that at least one of these claims is false: ¬α. In such a situation, the rational thing to do is to accept both contradictories (whilst, possibly, noting that the situation is anomalous). If this is right—and philosophers as orthodox as Arthur Prior have thought so 6 —someone who asserts ¬α does not reject α, and rationally so. And even if Prior were wrong, this does not matter. If to assert ¬α were ipso facto to deny α, Prior's suggestion could not even make any sense—which it does.
Denying is not, therefore, the same thing as asserting something starting ‘it is not the case that’. Yet acts of denial can be performed by asserting negations. If, for example, I am in a discussion with someone who claims that the truth is consistent, it is natural for me to mark my rejection of the view by uttering ‘it is not’, thereby denying it.
This raises the question of when the uttering of a negated sentence is to be interpreted as an assertion, and when as a denial. There is, in general, no neat answer to this question. One has to assess the intentions of the utterer. The information provided by tone of voice, context, etc. will provide relevant clues here. Nor is there anything novel in this kind of situation. The utterance of the sentence ‘would you close the door’ can constitute linguistic acts with quite different illocutory forces (e.g. commanding, requesting). One often needs to be a very fluent speaker of a language (including having a knowledge of the social practices and relations in which the language is embedded), and have detailed knowledge of the context, to be able to determine which kind of linguistic act is, in fact, being performed.
In most contexts, an assertion of −α, that is, α → (see section 4.7), would constitute an act of denial. Assuming that the person is normal, they will reject ,
end p.105
and so, by implication, α. The qualifier ‘in most contexts’ is there because if one were ever to come across a trivialist who accepts , this would not be the case. For such a person, an assertion of −α would not constitute a denial: nothing would.
6.3 EXPRESSIBILITY
With this understanding in place, we can now confront a common objection to dialetheism. It is often charged that dialetheists cannot express their own views. 7 A dialetheist cannot say anything which rules something out, and so which expresses disagreement. For example, if you say α, I do not express disagreement when I say ¬α. For it is logically possible that both are true. More generally, whatever I say, there are models of both it and α (if only the trivial one, where everything is true). 8 As a special case of this, it is claimed, a dialetheist cannot even express the claim that a sentence is true but not false. They can, of course, use these words; but the fact that a sentence is not false does not rule out its being false as well; it may still be both true and false.
One particular reason why this charge is significant—though the objection is a quite general one—is that a standard objection to someone who espouses a truth-value gap solution to the semantic paradoxes is precisely that they cannot express their own theory—or the notions involved would engender extended paradoxes. (See section 7.5.) The dialetheist, it would seem, is no better off.
Note, first, however, that with respect to the paradoxes of self-reference, the problems for the gap theorist and the dialetheist are not the same. The gap theorist does have the words to express their views. Those words just turn out to be untrue—if consistency is to be maintained—and so we have a case of self-refutation. Thus, a gap theorist wants to say that the sentence ‘This sentence is untrue’, is neither true nor false. But in that case, it is untrue—and hence true. So they cannot say that (truly). The dialetheist also has the words to express their views: to say that something is true and not false, they can use those very words. There is nothing—even in the paradoxical case, that prevents them from being true. The words may turn out to be false as well sometimes too; but that's the nature of the beast. Thus, the dialetheist can express their views truly.
What the dialetheist cannot do, whether the topic is paradox or anything else, is ensure that views expressed are consistent. The problem, then, for a dialetheist—if it is a problem—is that they can say nothing that forces consistency. But once the matter is put this way, it is clear that a classical logician cannot do this either. Maybe they would like to; but that does not mean they succeed. Maybe they intend to; but intentions are not guaranteed fulfilment. Indeed, it may be logically impossible to fulfil them, as, for
end p.106
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
example, when I intend to square the circle. Even if dialetheism were unacceptable and Boolean negation, contra the argument of the last chapter, made sense, asserting $α does not rule out α, at least in this sense: someone who asserts $α may still assert α: the cost is that this occasions a collapse into triviality. But there is no logical guarantee against a person being a trivialist. Moreover, a dialetheist can assert something with exactly the same effect. Asserting ¬α will not do, but, as we noted in section 4.7, asserting -α, that is, α → , will.
It may be replied that asserting -α still does not express disagreement. After all, this would be asserted by someone who thinks that everything is true, and so who agrees with α. If we are searching for a mode of expressing disagreement that even a trivialist can use, then there is, indeed, nothing that can be asserted that will do the trick. But in the previous sections of this chapter, we have seen exactly how someone, be they a dialetheist or otherwise, can show that they reject something, and so express disagreement. They simply deny it (which is not the same as asserting any negation). The objection against dialetheism therefore lapses. A dialetheist can express disagreement. 9
I end this section with the following observation. In this part of the book we have met (at least) three distinct notions: negation, denial, and arrow falsum (–). We have seen that all of these notions make perfectly good sense from a dialetheic perspective. They are just not the same thing. The orthodox classical account of negation fuses them together. But this fusion is a confusion: they need to be kept distinct. It is common enough in the development of science for us to come to realize that we had run together different concepts (e.g. rest mass and inertial mass). The case at hand is another such. And in this case, an antipathy towards dialetheism may well be a product of this confusion. 10
6.4 DENIAL AND PARADOX
Now that we have clearly distinguished denial from negation, it might be wondered, in the context of paradoxes such as the Liar, whether this allows us to formulate damaging versions of the liar paradox in terms of denial. Let us see.
Let us write as a force-operator for denial. Thus, α represents an utterance of the sentence α with the force of a denial. In a similar way, one might write α, ?α, and !α, for utterances of the sentence α with the forces of assertion, question, and command, respectively.
It is natural to suppose that one may formulate a version of the Liar as an utterance u, of the form:
u is true
end p.107
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
This utterance does not even make sense, however. It is not utterances that are true or false, but their propositional contents (the αs inside the force-operators). Hence, the supposed Liar has to be an utterance, v, of the form:
the content of v is true
But what is the content of v? It is ‘the content of v is true’. Thus, the content of v is a standard truth-teller sentence. v is therefore simply a denial of the truth-teller. What to make of this, we might argue about. If one takes the truth-teller to be a truth-value gap, this is exactly what one should do to it. If, on the other hand, one takes it to be a truth-value glut (as in In Contradiction, section 4.7), and so true, then one should not deny it. But whatever one says about this matter, it is clear that there is nothing problematic about denying or asserting the truth-teller.
Similar comments apply to an utterance, w, of the form:
the content of w is false
except that this time the content of w is that the content of % w is false; so the content is a standard Liar sentence. If one takes this to be neither true nor false, then one should deny it; if one takes it to be both true or false, one should not: one should assert it. But again, there is no problem in either case.
What these examples illustrate is the fact that attempts to formulate distinctive Liar paradoxes in terms of denial fail, since , being a force-operator, has no interaction with the content of what is uttered.
It might be thought that there should be an operator on content that, in some sense, mimics force-operators. But there would appear to be no such thing. It is true that, for example, in English one can transform ‘The door is open’ into ‘Is the door open’. An utterance of the former would normally be an assertion, and an utterance of the latter would normally be a question. But as a little thought will certify, in the right context an utterance of either could be either—or other things as well, such as a command.
About the best one can do in this regard is employ prefixes such as ‘I am denying that’, ‘I am asking whether’, etc. So what should one say about someone who utters:
I am denying that this sentence is true
If this is uttered with the force of assertion, this is clearly false (and so should not have been uttered). If it is uttered with the force of denial, then what it appears to be denying is α, where α is ‘I am denying that α [is true]’, which is true, so it should not have been denied. Alternatively if the demonstrative refers differently, the utterance could be interpreted as a denial of ‘I am denying that β ’, where β is ‘β is true.’ This is simply false: what one is denying is that one is denying β. It is therefore correctly denied.
We see, then, formulating Liar sentences in terms of denial produces nothing much of interest. 11
end p.108
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
6.5 TRUTH AND REJECTION
Having explored the connection (or lack thereof) between rejection and negation, let us now explore the connection between rejection and truth—or untruth.
It is natural enough to claim that one ought to believe what is true. Put in this way, though, the claim is false. There is no onus to believe something simply because it is true. There is, however, if there is good evidence that it is true. Thus:
Accept. One ought to accept something if there is good evidence for its truth.
The ‘should’ here, note, has nothing to do with what morality or prudence require: it might well be the case that one ought not to believe things that are evidentially grounded, in those senses. (Having true beliefs can well get you into trouble!) The ‘ought’ here is one of rationality. It is rational to believe what is evidentially grounded (and irrational to believe what is not). This raises the question of what evidential grounding is, but we do not need to pursue that matter here; we will return to it in the next part of the book.
What we will pursue now is the possibility of a claim about rejection dual to Accept. Anyone who believes that there are truth-value gluts or gaps must distinguish between falsity (that is, truth of negation) and untruth—that is, between T <¬α> and ¬T <α>. If there are truth-value gaps, something can be untrue without being false. And if there are truth-value gluts, then, at least prima facia, something can be false without being untrue. 12
A natural dual to Accept is:
Reject(F). One ought to reject something if there is good evidence for its falsity.
There is nothing in the existence of truth-value gaps that threatens this principle, but it is unlikely to recommend itself to a dialetheist. Consider something taken to be both true and false, such as ‘this sentence is false’. Call this λ. Half of the paradox argument for this provides good evidence for ¬λ, but one would not, on this count, want to reject λ. For the other half of the paradox argument provides evidence for λ. Accept itself, therefore, counsels its acceptance. 13
end p.109
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
The lack of symmetry between Accept and Reject(F) might be felt to be uncomfortable. Why the one, but not the other? But one should recall that falsity is nothing other than truth of negation. So the correct principle concerning falsity is that delivered by Accept:
One ought to accept ¬α if there is good evidence for the falsity of α.
The dual of Accept looks more plausible if we formulate it in terms of untruth, not falsity. After all, if it is the truth that one is after, it is the untruth that one wishes to avoid. Thus we have:
Reject(U). One ought to reject something if there is good evidence for its untruth.
Even this dual may be resisted, however. Take the liar paradox in the form ‘this sentence is not true’. The standard argument establishes that it is both true and untrue. Should one reject it on this count? Not obviously. The principle concerning rejection might be something like: one ought to reject something if there is good evidence for its untruth, unless there is also good evidence for its truth. Thus, Reject(U) may be understood as an acceptable default rule, but not as an indefeasible one. 14
The failure of symmetry in this case is might be thought to be more troubling, however. Falsity is not the opposite of truth: it is a sub-species of it. Untruth, by definition, is. So does the acceptance of Reject(U) damage dialetheism? The answer is ‘no’. It is quite open to a dialetheist to accept both Accept and Reject(U). Suppose that we have some α for which there is good evidence that it is both true and untrue. Then, by these two principles, one ought to accept α and one ought to reject α. We have a dilemma, since we cannot do both. 15 (It should be noted that this is not the situation which we are in with respect to most paradoxes of self-reference. In these we have arguments for things of the form α and ¬α; but, in general, α is not of the
end p.110
form: β is true. It can be in special cases, such as the Liar Paradox in the form ‘this sentence is not true’—but not in the form ‘this sentence is false’. 16 )
6.6 RATIONAL DILEMMAS
A dilemma is not a contradiction. Let us use the operator O, ‘It is obligatory that’, from standard deontic logic. Paradigm dilemmas are of the form: Oα and O¬α, where α is a statement to the effect that something be done. 17 More generally, in a dilemma there are two such statements α, β, such that □ ¬ (α β), yet Oα and Oβ.
Now, arguably, the existence of dilemmas is simply a fact of life. The thought that there are irresolvable moral dilemmas is commonplace. 18 And in In Contradiction, chapter 13, I argued for similar legal dilemmas. The obligations in these cases are not the obligations of rationality. But if one sort of system of norms can produce dilemmas, why not other sorts? It might be thought that there is something special about rationality, though. Could people really be rationally required to do the impossible? How could rationality be so stupid? Well, arguably, it is. One can formulate apparent rational dilemmas that have nothing to do with Reject(U).
Consider a claim of the form, ‘It is irrational to believe this claim’, i.e. something of the form:
It is irrational to believe ρ
where it, itself, is ρ. Supposes that you believe ρ. Then you believe something, and at the same time believe that it is irrational to believe it. This, presumably, is irrational. Hence, you ought not to believe ρ: O¬Bρ. But we have just shown that ρ is true! Hence, you ought to believe ρOBρ (by Accept). This is a version of the Irrationalist's Paradox. 19 The argument may be formalized as follows. Let I be ‘It is irrational (to bring it about) that’. Let B be ‘you believe that’ and let ρ be of the form IBρ. There is one principle about rationality appealed to, namely: for any α, IB(α IBα). From this, it follows that IB(ρ ρ), and so IBρ, that is ρ. Given that IBα O ¬ Bα, and that if α then OBα. The dilemma follows. 20
It is, note, just as much a paradox for the non-dialetheist as for the dialetheist. In fact, it is worse for the classical logician. The Irrationalist's Paradox deploys the
end p.111
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
principle that if one believes that (α and to believe that α is irrational), then one's belief is irrational (IB(α IBα)). A dialetheist might contest this: it is not irrational if one also believes that to believe α is rational. The supposition that believing that α may be both rational and irrational is unlikely to appeal to a classical logician, however.
Actually, the situation is even worse for a classical logician, since not just dilemma but contradiction lurks in the wings. Let R be ‘It is rational (to bring it about) that.’ It would seem just as plausible a fact about rationality that, for any α, ¬RB(α ¬RBα). Now let ρ be ¬RBρ. Then, as before, we can show that ¬RBρ, that is, ρ. And assuming that if α then RBα, we have a flat contradiction. Note that since this paradox makes no use of semantic or set-theoretic notions, none of the standard solutions to paradoxes of self-reference gets a grip on it.
6.7 GAME-THEORETIC DILEMMAS
The Irrationalist's Paradox does not depend on Reject(U), but it still depends on Accept, and it also employs self-reference. It might therefore be wondered whether there are rational dilemmas that are independent of anything connected with dialetheism. There certainly appear to be, and in the last part of this chapter I will explain some of them. These are rational dilemmas that arise in connection with well-known situations in game theory. 21 Standardly, commentators accept one or other horn of the dilemma and try to dispose of the other. I want to set things up in such a way that if you accept one, you should accept both. 22
I will give two arguments of this kind. Both depend on a certain principle of rationality, which is as follows. Suppose that a person has to choose between two alternatives, and they know that if they choose one alternative, the benefit derived will be a certain amount; and if they choose the other alternative the benefit derived will be a lesser amount. Then they ought to choose the first alternative. Call this principle Rat. We may formalize it thus:
•
The premises are above the line; the conclusion is below. C(γ, δ) means that you have a choice between making γ true and making δ true; Mγ means that you make γ true; Gx means that you gain x. → denotes the indicative conditional. Strictly speaking,
end p.112
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the premises should be within the scope of an epistemic operator, K (it is known that), but in what follows this will go without saying.
Now to the first dilemma: Newcome's Paradox. This comes in different versions; let me spell out the one that I have in mind. There are two boxes, a and b, and you are to choose between taking either the contents of both boxes, or the contents of just one box, box a (the aim being to maximize your financial gain). b is transparent, and you can see a $10 note inside. You do not know what is in box a, but you do know that money has been put inside by someone who knows exactly what you are going to do, a perfect predictor, p. That is, if you are going to choose one box, p knew this; and if you are going to choose both boxes, p knew this too. If p predicted that you would choose one box, $100 was put inside a; if p predicted that you would choose two boxes, nothing was put inside a. Should you choose one box or two? 23
We may now show that you ought to choose one box, and that you ought to choose both boxes. Let α be ‘you choose just box a’ and β be ‘you choose both boxes’. Then we have C(α, β) and ¬(Mα Mβ). The two horns of the dilemma proceed as follows.
Horn 1: Let c be an abbreviation for the description ‘whatever is now in box a’, where this is to be understood as a rigid designator. If you make α true, then you get c: Mα → Gc. If you make β true, you will get c plus the extra $10: Mβ → G(10 + c). But 10 + c > c. By principle Rat, it follows that OMβ.
Horn 2: If you make β true, then p knew that you were going to choose both boxes. Hence, there is $10 in a, and nothing in b; so you get $10. That is: Mβ → G10. On the other hand, if you make α true, then p knew that you were going to choose one box. Hence, there is $100 in a, which is what you get. That is: Mα → G100. But clearly, 100 > 10. By principle Rat, it follows that OMα. 24
A second example of a rational dilemma is provided by a suitably symmetrized version of the prisoners' dilemma. The following will do. You are in a room with two buttons, and you have to choose between pressing them. If you press button a, you will receive $10. If you press button b you will receive nothing (by your own efforts), but the person next door will receive $100. That person is in exactly the symmetric situation. Moreover, you have known the person in the other room for a long time, and you know that they are just like you: in choice situations you always both choose the same thing. Which button should you press?
end p.113
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Let α be ‘you press button a’; let β be ‘you press button b’. Then we have C(α, β), and ¬(Mα Mβ). The dilemmatic argument proceeds as follows.
Horn 1: Let c be an abbreviation for the description ‘whatever is obtained as a result of the action of the other person’, where this is to be understood as a rigid designator. If you make α true, then you get 10 + c: Mα → G(10 + c). If you make β true, you will get just c: Mβ → Gc. But 10 + c > c. By principle Rat, it follows that OMα.
Horn 2: If you make α true, then so will the person in the other room. Hence, you will get 10 plus nothing else: Mα → G10. If you make β true, then so will the person in the other room. Hence, you will get 100: Mβ → G100. Since 100 > 10, by principle Rat, OMβ.
6.8 OBJECTIONS
The most plausible line of objection to the preceding arguments, it seems to me, concerns the nature of the conditionals employed. In particular, one may argue that the first horn of each dilemma is invalid as follows. I give the argument for the Newcome problem; the argument in the prisoners' dilemma example is similar.
Let us suppose that, as a matter of fact, you do choose the one box, a, and consequently that c = 100. Consider the conditional Mβ → G(10 + c). If you were to make β true then you would not get 10 + c, i.e. 110, but 10, since there would be 0 in box a. (Recall that c is a rigid designator, and so does not change its value in different hypothetical situations.) Hence this conditional is false. Similarly, let us suppose that you do, as a matter of fact, choose both boxes, and consequently that c = 0. Consider the conditional Mα → Gc. If you were to make α true, you would not get c, i.e. 0, but 100, since that is what would be in box a. Consequently one or other of these conditionals is false.
Assuming the standard distinction between indicative and subjunctive conditionals, this objection fails since what it establishes is that certain subjunctive conditionals are false (note the ‘were's). But the conditionals employed in the arguments are, in every case, indicative, not subjunctive. (Go back and check them!)
At this point, it might be argued that a correct formulation of the principle Rat requires the use of subjunctive conditionals. But why should one suppose this? After all: the reasoning, including the formulation of principle Rat, seems quite in order as it is.
Here are two bad reasons why the conditionals in Rat must be subjunctives. The first: we are reasoning about cases at least one of which will not arise. Hence, the conditionals are counterfactuals, and reasoning about counterfactual situations requires subjunctive conditionals. Not so. Counterfactuals are often expressed by indicative conditionals, not subjunctives. The bus is due. I tell you: if we don't leave now, we'll miss the bus; and so we leave, and catch the bus, which is on time. The conditional I uttered was a true indicative conditional, though it is also a counterfactual. Even in cases where the antecedent is not just false, but its truth would require the past to be
end p.114
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
different from what it was, we can still use indicative conditionals. Let us suppose that you do not know who won the Grand Final yesterday. You do know that if you read in a reputable paper today that the Broncos won the Final, then they won it yesterday. This is a perfectly legitimate indicative conditional, even though the Broncos did not, in fact, win yesterday.
The second, and more sophisticated, argument to the effect that condition Rat requires subjunctive conditionals goes as follows. 25 Again, I use the Newcome example to illustrate. Suppose that, as a matter of fact, you make β true; then you do not make α true, ¬Mα. Then by the properties of the material conditional, Mα → G1000. So if you know that you will make β true then you know that Mα → G1000. You ought, therefore, to make α true, since you know that 1000 > 10. And quite generally, whenever you have decided what to do, so that you know what you are going to do, you ought to do the opposite! So the principle, formulated with indicative conditionals, is incoherent.
The flaw in the argument is in identifying the indicative conditional with the material conditional (so that we can reason ¬γ γ → δ). Standard indicative conditionals are not material conditionals. There are just too many clear counter-examples to the identity for it to be credible. Of the many that could be given, here is just one. 26
There is an electrical circuit with two switches in series, both off. If both switches are on (and only if both switches are on) a light in the circuit will go on. Let α be ‘switch 1 is turned on’; let β be ‘switch 2 is turned on’; let γ be ‘the light goes on’. Then we have: if α and β then γ. If this conditional were material, , and since (α β) γ ((α ¬β) γ) V ((β ¬α) γ), it would follow that one or other of the following is true: ‘if only switch 1 is turned on, the light will go on’, ‘if only switch 2 is turned on, the light will go on’. Both are clearly false.
6.9 CONCLUSION
The view that there are rational dilemmas is, as we have now seen, a plausible one—even dilemmas that have nothing to do with dialetheism. If there are such things, the next obvious question is what one should do if one finds oneself in one. What one should do, is, of course, the impossible. But one can't do that. Rationally, one way or other, one is damned. C'est la vie.
We have now completed our tour of issues concerned with negation, and covered much ground in the process. The last issue to raise its head was rationality. But as far as that goes, we have only just scratched the surface. The next part of the book brings rationality to centre stage.
end p.115
end p.117
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
end p.118
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
7 Rational Belief
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
7.1 INTRODUCTION
When we describe someone as being inconsistent, we may mean many different things. If someone is fickle, changing their views or emotions from day to day, we call them inconsistent. If someone, such as a judge, fails to treat like cases alike, we call them inconsistent. When someone refuses to accept an obvious logical consequence of something they believe, we call them inconsistent. And of course, when someone believes things of the form α and ¬α we call them inconsistent. In all such cases, calling a person inconsistent would normally be some form of criticism. The first three cases will not concern us in this chapter; only the last.
The sense of belief that I have in mind here is explicit, unqualified, belief. Sometimes one might describe a person as having inconsistent beliefs if their views, unbeknownst to them, entail a contradiction. This is not the sort of belief at issue here. Or, as we noted in section 6.2, situations of the kind involved in the preface paradox appear to make it rational for a person to believe a contradiction explicitly. But in such cases, the person would be inclined to admit that their beliefs have gone wrong somewhere, though they knows not where. 1 This is not the sort of belief I have in mind either. The beliefs in question are ones that a person will knowingly endorse without any reservation—other than, perhaps, one of general fallibility. The topic of this chapter is precisely whether criticism of a person with inconsistent beliefs of this kind, on the ground of the inconsistency, is perforce justified. I shall argue that it is not. In the process, I will give an account of rational belief.
7.2 THE HISTORY OF THE BELIEF IN CONSISTENCY
Let us start with the history of the view that criticism on the ground of inconsistency (in the appropriate sense) is justified. This is a curious one, not to say a little puzzling. There were certainly Presocratic philosophers who, presumably, thought it was perfectly legitimate to believe contradictions, since they did so. Heraclitus, for example, held that ‘We step and do not step into the same rivers; we are and we are not.’ 2 Even
end p.119
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Plato seems to have been prepared to countenance the possibility that ordinary things might have contradictory properties—though not the forms:
Even if all things come to partake of both [the form of like and the form of unlike], and by having a share of both are both like and unlike one another, what is there surprising in that? … when things have a share in both or are shown to have both characteristics, I see nothing strange in that, Zeno, nor yet in a proof that all things are one by having a share in unity and at the same time many by sharing in plurality. But if anyone can prove that what is simple unity itself is many or that plurality itself is one, then shall I begin to be surprised. 3
Of course, interpreting texts such as these, especially the Presocratices, may be a contentious matter. But at least Aristotle interpreted his Presocratic precursors in this way. For as we saw in Chapter 1, in book 4 of the Metaphysics, he launched a sustained attack on them. He argued that nothing is more obvious or certain than that a contradiction cannot be true (the LNC)—so much so, he says, that one cannot give a proof of this fact. 4 This is odd for two reasons. The first is this: if the LNC is so obvious, how come it wasn't obvious to the people he was attacking? The second is that straightaway he goes on to give seven arguments for the LNC. He says that these are elenchic arguments, not proofs; but what, exactly, this comes to, and whether all the arguments are of this kind is not at all clear.
But let that pass. Reading the arguments makes it clear that they do not succeed in rendering obvious the LNC. The first, and longest, argument is so opaque, it is entirely unclear what it is. I defy any sensible person to read it and claim that it makes its conclusion obvious. (It is not, of course, irrational to believe something merely because it happens to be untrue. It is the evidential status that is important.) The other arguments are even less successful: for their conclusion is patently that it is not the case that all contradictions are true (or the even weaker: it is impossible for someone to believe that all contradictions are true). Even if this is true, and were manifestly so, nothing at all follows about the possibility that some contradictions are true. Again, we saw all this in Chapter 1.
It is true that, at various times since Aristotle, various thinkers have wittingly endorsed contradictions. This is particularly true of thinkers in the Neoplatonist tradition. 5 Plotinus himself, for example, denies that anything positive can be said of the One (Enneads. v . 6); but also describes it as a simplex, something which is beyond being, the source and generator of all else. Cusanus goes even further, saying that (Of Learned Ignorance. i . 3):
end p.120
in no way do they [distinctions] exist in the absolute maximum [sc. God]. The absolute maximum … is all things, and whilst being all, is none of them …
But it is fair to say that, at least since the Middle Ages, Aristotle's views concerning contradiction have been orthodoxy. (This is so obvious, that it is hardly worth documenting.) They are taken for granted so much that, as far as I know, there is no sustained defence of the LNC in Western philosophy other than Aristotle's. Why? I really don't know. It is certainly not because of the rational persuasiveness of Aristotle's arguments. I suspect (unhappily) that the view was accepted simply on the basis of the magisterial authority of Aristotle's texts in the Middle Ages. In general, that authority disappeared long ago, of course. In logic it hung on till the twentieth century; most of it there has been swept out since then, but the views about contradiction have hung on doggedly.
Before we leave the history of philosophy, and to emphasize the parochial nature of Western philosophy on this issue, let us consider briefly the situation in Indian philosophy. The LNC certainly had its defenders here. Logicians of the Nyaya school, for example, adhered to it. But the classical orthodoxy in this case was exactly opposite from that in the West. The standard view in classical Indian logic, going back to about the same time as Aristotle, was that on any issue there are always four possibilities to be considered: that a view is true (and true only), that it is false (and false only), that it is both, and that it is neither. This is the catushkoti (“four corners”). 6 The difference in the two traditions could not be more acute.
So far, then, we may conclude that if figures in the history of Western philosophy held that it was irrational to believe a contradiction, this view was itself irrationally held. The LNC is not at all obvious. And wise people, as Hume put it, 7 proportion their beliefs according to the evidence. Those who subscribed to the orthodox view, were not, then, wise.
A caveat: no one is an expert in many areas, and we all have to accept things on the say-so of experts in some areas. If someone believes that it is irrational to believe contradictions on the basis of experts in logic and philosophy, their belief is not irrational. The charge of irrationality is levelled against people who ought to have known better.
7.3 IS CONSISTENCY MANDATORY?
Is the situation essentially any different in contemporary philosophy? Not as far as I can see. The knee-jerk reaction of most modern philosophers will be that it is patent that no contradiction can be true, since, if it were, everything would be true, which it is patently not—everything follows from a contradiction. This is so in standard logic, of course; but it is so because of the fact that in the semantics of this logic, truth, and falsity are taken to be exclusive—something that may fail in a paraconsistent logic,
end p.121
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
where contradictions do not entail everything. The assumption that truth and falsity are exclusive is simply packed into nearly all presentations of standard logic without comment. In other words, it has the same dogmatic status as the LNC itself. This does not provide a justification.
It might be thought that the exclusivity of truth and falsity holds by definition. Falsity just is the lack of truth. It does not. The relevant sense of falsity here is truth of negation; and the claim that ¬α is true when α fails to be true is not a definition. As we saw in Chapter 4, it is a substantial theory about the way that negation works. It is denied in modern logic by logicians who subscribe to truth-value gaps or gluts. How negation behaves is, in fact, a highly contentious view historically. At least until the later Middle Ages, a common view of negation, with which Aristotle had some sympathies, was that ¬α simply cancels out α. (See section 4.1.) So contradictions do not entail everything: they entail nothing.
What other relevant arguments are there? We could spend much time talking about this, but I will not pursue the issue here. None that I know work. 8 It is hard enough to produce arguments for the LNC itself, but what we need in this context is not just an argument for the LNC, but one of a very strong kind. Consider, as an analogy, the arguments given by the early heliocentric astronomers for the centrality of the sun. These had force, but there were countervailing arguments; for example, the motion of the earth seemed to fly in the face of the accepted dynamics. In such a context, it was not irrational for someone not to accept the geocentric view. It would have been if the arguments for heliocentrism had rationally mandated their conclusion, something that later arguments were to do. In the same way, and returning to the issue of inconsistency, what we need in this case is not simply an argument for the LNC, but one that makes its conclusion rationally mandatory. This sets the bar very high, and I know of no argument, or raft of arguments, that even comes close to clearing it.
It might well be pointed out, quite correctly, that there is a possibility that I have been ignoring until now. There are certain things that one can believe rationally, even though one can give no reason for them. This is the case with perceptual reports, for example. I see a hand before my face. It is rational for me to believe this claim to be true, but I can give no reasons. My experience in some sense justifies my belief, but experience cannot be a premise in an argument. The Law of Non-Contradiction is obviously not a perceptual report, but could there be other sorts of examples where one may have a rational belief but no reason? Some have certainly thought so. For example, Wright (2004) suggests that we may have a rational entitlement to certain things, such as fundamental logical principles like modus ponens, even though no reason can be provided in any meaningful sense. Of course, certain conditions have to be satisfied—or we could take ourselves to be rationally entitled to the most absurd of views. How to formulate these conditions precisely we might argue about,
end p.122
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
but crucial are that (a) justifications for the principle employ nothing better than that very principle, and (b) there are no grounds for doubting the principle. 9 Note that clause (b) is essential. Consider the principle modus morons: α → β, β α. One may justify this as follows: Suppose that <β> and <α → β> are true. Then by the truth conditions for →, if <α> is true, so is <β>. Hence, <α> is true (by modus morons). 10 Now, whatever one thinks about the notion of ungrounded entitlement in general, the Law of Non-Contradiction does not satisfy clause (b). As I observed in the last section, various historical philosophers provided grounds for doubting the “Law”; and modern philosophers have added others—we will note one of these in section 7.5. 11
In the end, I doubt that the rationality of individual logical principles is to be sought in this kind of entitlement. Individual principles are always part of whole logical theories. And a theory—and so the parts that comprise it—is the kind of thing for which it is possible to give meaningful reasons, as we are about to see.
7.4 RATIONALITY AND CONSISTENCY
Those who take consistency to be a sine qua non of rationality may well feel discomforted by all this. If the inconsistency of a view does not show it to be irrational, what does?! In fact, even if consistency is a constraint on rationality, it is a relatively weak one. It is possible to work many rationally bizarre views into consistent ones by a little massage. For example, the view that the earth is flat can be held quite consistently, by invoking suitable auxiliary hypotheses about the behaviour of light, the mendacity of the world's media, etc. It is irrational for all that. So how does rationality work? Anyone who expects a simple algorithm to determine whether a belief is rational or not is bound to be disappointed. This is a lesson of post-positivist philosophy of science, if, indeed, it was not already to be learned from Aristotle's account of phronesis. But this does not mean that one can say nothing.
Given a theory (in any area of human cognition—science, philosophy, logic, or whatever), there are many cognitive virtues, and, correlatively, vices, that it may have. Perhaps the most important is the adequacy of the theory to the data which it was proposed to handle. Does it really account for these? Any other criteria may be contentions, but familiar candidates include, for example:
• Simplicity. Is the theory clean and elegant, or it is complex and contrived?
• Unity. Does the theory have to invoke numerous ad hoc hypotheses, coming in from left field?
• Explanatory Power. Can the theory be used to explain other things in the same domain, or is it of limited applicability?
• Parsimony. Does the theory multiply entities beyond necessity?
end p.123
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
It is clear that all these criteria may come by degrees. And as even a quick perusal of some intellectual history demonstrates, they may pull in different directions. The early Copernican theory was simpler than the Ptolemaic theory, but it could deal with the dynamic problems of the motion of the earth only in ad hoc ways, at least until the invention of a new dynamics. By contrast, the Ptolemaic theory, though more complex, was not ad hoc in this way.
When is one theory rationally preferable to another? When it is sufficiently better than its rivals on sufficiently many of these criteria. That is, of course, vague. It can be tightened up in various ways. I will give one way in the next chapter. But in the end, I think, it is essentially so. Indeed, it is precisely this vagueness that allows for rational people to disagree. For legitimate disagreement is precisely a feature of the borderland of application of any vague predicate. People may legitimately disagree, for example, over whether the application of ‘child’ to a 14-year old is correct. This does not mean that there are no determinate facts concerning rationality and similar vague notions. Someone who calls a normal 30-year old a child is clearly mistaken. And one theory can be manifestly superior to its rivals, all things considered.
Is consistency a cognitive virtue? The natural answer is ‘yes’, but things are not as obvious as on might have supposed. For a start, certain sorts of consistency may be required by other virtues. For example, if the theory is an empirical one, adequacy to the empirical data is certainly a virtue. But at least for the most part, such data is consistent: as I argued in sections 3.4 and 3.5, contradictions are rarely perceived in the empirical world, and where they are, they are illusions. Hence, adequacy to the data requires consistency of observational content. The question, then, is whether consistency is a virtue when it is not required by other virtues.
One reason to suppose that it is, is that the very notion of a paradox seems to presuppose it. Paradoxes can usually be formulated as arguments that end in a contradiction of some kind. Maybe the view that contradictions are not acceptable at any price is wrong. But the very fact that the conclusion of a paradox is a contradiction gives us some reason to suppose that it is unacceptable, or it would not be a paradox. But again, things are not so clear. In many well-known paradoxes, the contradiction involved is an empirical one. In Zeno's paradoxes, the conclusions are things like ‘Achilles never overtakes the tortoise’, when we can see that he does; or in the sorites paradox, conclusions may be of the form ‘This patch is red’, when we can see that it is not. As I have just noted, there are independent reasons for rejecting contradictions of this kind. There are, of course, paradoxes which are not like this: the Liar is an obvious one such. And we do, in this case, take the contradiction to indicate that something has gone wrong. This, therefore, provides some reason to suppose that consistency, per se, is a cognitive virtue. 12
Another reason can be found by considering another virtue, simplicity. Why simplicity—whatever, in fact, that comes to in the end—is a virtue is a tough question
end p.124
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
(to which I know no completely satisfactory answer). But an index of the fact that it is a virtue is that we tend to assign simpler hypotheses higher prior probabilities—for example, when curve-fitting. Something similar would appear to be the case with consistency. As I argued in In Contradiction, section 8.4, given an arbitrary situation about which we have little information, we would seem to be justified in taking its consistency to be a better bet than its inconsistency. This, therefore, indicates that consistency is a virtue: ‘contradictions should not be multiplied beyond necessity’. 13
Whether or not consistency is a virtue is, however, not centrally important for the present matter. If it is, it is simply one virtue amongst many. It is clear, then, how an inconsistent theory may be rationally acceptable; conversely, it is clear how an inconsistent theory may be rationally rejectable. It may simply be trumped by another theory.
Since this latter point is something of a stumbling block in discussions of paraconsistency, let me just illustrate it. Let us suppose that you, an atheist, argue against the existence of (a Christian) god on the basis of the existence of suffering. God is omniscient, omnipotent, perfectly good, etc. Take some event which we know to have occurred, and which caused much gratuitous suffering, for example the torture and murder of a young child. The properties of God entail that had such a being existed, this event would not have occurred. 14 But it did; hence there is no God. Now consider the person who is prepared to accept a contradiction in this context: God exists and prevented the event; hence it did not occur; but it occurred too. This move is certainly out there in logical space. Is it a rational one? No: that the event both did and did not occur is an observable contradiction, and as we saw in section 3.5, there are good reasons to reject a contradiction of this kind. Once could take on this claim as well, of course, but good luck to someone who does! I see no way of defending the view that the event both did and did not occur without invoking countless ad hoc hypotheses, and turning the situation into one of the flat-earth kind.
7.5 THE RATIONALITY OF INCONSISTENCY
So far, I have argued that the mere fact that someone's beliefs are inconsistent is not a definitive ground for rational criticism—and more, that to hold that it is, is itself irrational. I want to finish the chapter by arguing something stronger: that it is irrational to be consistent. I do not mean that it is irrational to be consistent about everything: just that there are some topics about which it is irrational to be consistent. Nor do I mean that it always has been and always will be irrational to be consistent about such topics: simply that in the present state of our knowledge, the rational belief is an inconsistent one.
The topic I have in mind here is that of truth. It seems to me that anyone weighing up the state of play concerning this notion, ought rationally to be inconsistent.
end p.125
A detailed argument for this would require its own book-length treatment, requiring as it would a systematic survey of all the theories of the semantic paradoxes with a detailed evaluation of their properties. So let me make the basic case and consider a few objections. I think it clear how one would go about filling in the details. 15
A dialetheic account of truth accepts the T-schema: <α> is true iff α, for every proposition, α. Assuming some basic principles of logic and self-reference, this entails contradictions in the shape, for example, of the Liar paradox. But because the underlying logic of the theory is paraconsistent, these contradictions are quarantined into singularities. Let us call this, or anything that endorses at least this much, the naive account of truth.
The most rational account at any time is, as I argued in the last section, the account which fares best according to the various epistemic virtues. So what other accounts are currently on the table? As most people who have even a passing acquaintance with the area will hardly need to be told, there is a plethora. Twentieth-century logic provided us with accounts by Tarski, Kripke, Gupta and Herzberger, Barwise and Etchemendy, and McGee, to name some of the major ones. So how do these fare in comparison to the naive account? It would be wrong to suppose that exactly the same considerations apply to all the theories; and in the end there is no substitute for examining the particular details of each one. In the present context, some broad brush strokes will have to suffice. 16
Let us start with consistency. A dialetheic account is inconsistent; the other accounts are consistent—or at least, they appear to be; I will return to this in a moment. Here, then, and assuming consistency to be a virtue, the other accounts would appear to come out better. But what of all the other criteria?
Adequacy to the data is a singularly important virtue. And concerning this, the situation is reversed. Consistent accounts do not, generally speaking, endorse the T-schema in unrestricted form. 17 The schema is an overwhelmingly natural principle concerning truth. It is as ancient as Aristotle, and as modern as deflationist accounts of truth. The most celebrated truth-theorist of the twentieth century, Alfred Tarski, even called it a criterion of adequacy on any account of truth. Of course, one can have theoretical grounds for rejecting it—indeed, any solution to the paradoxes that rejects it must try to develop such an account. But the intuitive pull of the T-schema can hardly be denied. Even more telling, an important piece of data is that we find the paradoxical arguments intuitively compelling. If we did not, they would not be paradoxical. A dialetheic account explains this in the most natural way: the arguments are sound, and obviously so. Consistent accounts do not respect the data in this way.
So what of the other virtues? Take simplicity. The dialetheic account is simplicity itself. None of the other theories is in the same league. All involve hierarchical constructions, of various degrees of complexity, going into the transfinite.
end p.126
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
How about explanatory power? The paradoxes of self-reference appear to form a single family. They would seem to be manifestations of a single underlying phenomenon, in which case they ought to have a common solution: same kind of paradox, same kind of solution (the Principle of Uniform Solution). A naive account of truth obviously does not apply to all of these paradoxes: it can be readily extended to the other semantic paradoxes, but it has nothing to say about the set-theoretic paradoxes. However, it is a special case of a more general theory that applies to all the paradoxes of self-reference: one based on the Inclosure Schema. 18 This is decidedly not true of the other accounts of truth. At best, according to them, the set-theoretic and semantic paradoxes are a completely different phenomenon. At worst, what they have to say about truth is in tension with the usual treatment of the set-theoretic paradoxes. 19
Of the other possible criteria, there is one more that is singularly relevant. What motivates all the consistent theories is an attempt to provide a consistent account of the semantic paradoxes of self-reference. This is the problem that they are meant to solve. If they do not solve this problem, this is a prime methodological failing. Do they? Not really; for all of them, the machinery deployed allows one to construct Liar-type arguments ending in contradiction (extended paradoxes). This is the reason that I said that the theories only appear to fare better on the virtue of consistency. In all cases, consistency can be enforced by quarantining the new machinery in a metalanguage, so making it illegitimate to deploy it in the paradox-producing way. But in all cases, the move (apart from being ad hoc) succeeds only in showing that the problem of consistency has not been solved, merely displaced. We may have a consistent account of truth, but a consistent account of the other semantic notions involved is still wanting. 20
Now for the objections.
Objection 1: The comparison between the naive account and consistent accounts is unfair. For one needs different underlying logics for the two accounts, a paraconsistent logic for the dialetheic account and classical logic for the consistent account. One needs to evaluate the package deal (truth plus logic) in each case. And this changes the evaluation, since classical logic is much simpler than any paraconsistent logic, making the consistent package over-all simpler.
Reply. The point about the package deal is quite correct. It is also true that classical logic is simpler than all paraconsistent logics—but it is not that much simpler in many cases. For classical logic must be taken to include modal logic. We reason with modal sentences all the time, and we certainly need a theory of truth that applies to these. Now the simplest relevant logic is not much more complex than modal logic. 21 Its semantics diverges from that of S5 in two ways. First, the classical assumption that truth and falsity (at a world) are exclusive and exhaustive is dropped. This is a very minor change. Next, the class of worlds is extended to include not just possible
end p.127
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
worlds, but impossible worlds too (and conditionals are given suitable truth conditions as these). But again, impossible worlds are things we need to countenance anyway: they are needed, for example, to provide a suitable semantics for counterfactuals with logically false antecedents. Or, to put it another way, if modal logic is not extended in this way, a classical account of such conditionals must be given, and since this will give highly counter-intuitive results, extra complexities will have to be invoked to handle these. It is not clear, then, that the move to a relevant logic does increase complexity, all things considered. If it does, it is not of a kind that changes the over-all simplicity assessment.
Objection 2: Even if a certain amount of inconsistency is acceptable, too much is not. Thus, if inconsistencies spread into the empirical consequences of the theory of truth, the theory is not rationally acceptable. The consequences of a naive theory do spread into this area, due to Curry paradoxes. Specifically, given the inference of Contraction (α → (α → β) α → β), everything follows using the T-schema.
Reply. The point about the unacceptability of widespread contradiction is correct. However, with the appropriate logic, contradictions are appropriately restricted. Contraction is not valid in the simplest relevant logics. (Nor is its failure ad hoc. It is a simple consequence of the nature of impossible worlds.) Indeed, it can be shown that the inconsistencies in a naive theory of truth do not spread into the empirical realm, but are quarantined within those sentences that are not grounded (in the sense of Kripke). 22
Objection 3: The employment of a non-classical logic means that the paraconsistent position concerning truth suffers from an important cognitive vice. For such a logic is weaker than classical logic. Hence, much classical reasoning must be accepted as invalid. This means that many of the important applications of classical logic must be given up, producing a significant loss of overall explanatory power.
Reply. It is true that many inferences that are classically acceptable must be acknowledged as invalid. This does not occasion an explanatory loss, however. For the only situations about which it makes sense to reason classically are consistent ones; and even paraconsistent logicians may employ classical logic in consistent situations (just as intuitionists may employ classical logic when reasoning about finite situations): it is a special case. The whole idea can be made formally rigorous and precise by the construction of an appropriate non-monotonic logic. 23
Objection 4: A naive account of truth is just as susceptible to extended paradoxes as consistent accounts. For we may define an operator, $, by the conditions of classical negation (whether or not one calls this negation), and then use the T-schema to infer something of the form A $A, giving rise to triviality. In the end, then, the naive account is no more acceptable than a consistent account.
Reply: No logical theory—of any kind—can allow unbridled licence in postulating connectives satisfying arbitrary conditions. The point was forcibly brought home by Arthur Prior in his discussion of tonk. From the point of view of the naive theory, any attempt to define a connective obeying the laws of classical negation will either
end p.128
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
produce a connective that does not satisfy these laws or will not succeed in specifying a meaningful connective at all. All this we saw in Chapter 5.
7.6 CONCLUSION
Whether or not one is persuaded by the details of the example of the last section, they illustrate at least the possibility that rationality may itself require inconsistency. It is a truism to point out that one of the greatest opponents of rationality has always been superstition, often of a religious nature. It is not a truism to note that one of the greatest superstitions in the history of Western thought has been that concerning consistency—as Wittgenstein put it, the ‘superstitious dread and veneration in face of contradiction’. 24 Consistency has been taken to be the very cornerstone of rationality. But this view has itself no rational ground: it would seem to be simply the legacy of Aristotle. Indeed an inconsistent view may be the very embodiment of rationality.
So much for belief itself. What about change of belief? Does this provide more trouble for dialetheism? Let us see.
end p.129
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
8 Belief Revision
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
8.1 INTRODUCTION
The rational person proportions their beliefs according to the evidence. This is, no doubt, a fundamental feature of rationality. But there is more to rationality concerning belief than that. The rational person is also one who changes their beliefs in the light of new information in an appropriate way. Sometimes, this will merely require the addition of the new information to the belief-set. At other times, this may require more drastic change, since old beliefs may have to be dropped. Though the topic of how one ought, rationally, to change one's beliefs is a venerable one in the history of Western philosophy, the application of formal methods to it is relatively recent. Yet these applications suffice to throw into relief various fundamental problems concerning belief-revision. It is one of these, concerning the role of consistency in the process, which I will address in this chapter. I will explain the problem in a moment. Let me first set the scene by saying something about one of the most common formal representations of belief-revision, the AGM account.
8.2 BACKGROUND: THE AGM CONDITIONS
In the AGM account, a set of beliefs is represented by a set of sentences, K, of some fixed language, . K is closed under deducibility. Hence, it is perhaps better to look on it as representing the set of commitments of an agent, rather than as their set of beliefs, since a real agent's explicit beliefs are not, as a matter of fact, closed under deduction. Given some information, α, expressed in , the AGM theory concerns three operations that may be effected on K in virtue of this.
The first is adding α to K. The result, written as K + α (and usually called ‘expansion’), is easily defined. We simply put α together with K, and then form their logical closure (since belief-sets are closed under deducibility). Thus, K + α = {β; K {α} CL β} (where CL denotes classical logical consequence).
The second operation is the result of taking α away from K, written as K − α (and usually called ‘contraction’). Of course, α may not be in K at all, in which case, K − α is just K. But if it is, we cannot simply delete α from K. Since the result must be logically closed, we may have to delete other things as well. Depending on K, there may be no unique way of doing this. (Example: if K is the logical closure of {p, p → q}, then if we remove q from K, one of p and p → q will have to be
end p.130
removed. But removing either of them will do the trick.) Possibly, other considerations will determine which deletions ought to be made, but it is not at all clear that simply being rational does define the result uniquely: prima facie, at least, rational people may well disagree about what ought to be given up. At any rate, the AGM account does not give an explicit definition of contraction; instead, it gives a set of axioms that K − α must satisfy. The conditions are well known, and I will not reproduce them here. 1
A third operation on K, the one we are really interested in, revision, is the general result of revising K, given new information, α. This may be written as K * α. Since revision may involve rejecting things, the same sort of considerations that apply to contraction apply to revision. Thus, AGM gives a set of conditions that K * α must satisfy. These are as follows:
K*1 K * α is logically closed
K* 2 α K * α
K*3 K * α K + α
K*4 ¬α K K + α K * α
K*5 K * α is inconsistent α is a logical contradiction
K*6 α and β are logically equivalent K * α = K * β
K*7 K * (α β) (K * α) + β
K*8 ¬β K * α (K * α) + β K * (α β)
K*2 is usually called the success condition: revising by α produces a belief-set that contains α. K * 5, which will be of particular importance in what follows, may be called the consistency condition. (Its converse is given by the success condition.)
A natural thought at this point is that revision may be defined in terms of the other two operations: K * α = (K − ¬α) + α. This is often called the Levi identity. We will come back to it later, but at any rate, if revision is defined in this way, it can be shown that the AGM conditions for + and − entail those for *.
Note that one feature of the AGM account of revision is that K * α is a set of sentences of the same language as that with which we started. Thus, it provides no account of revision that involves conceptual innovation. This is a highly important form of revision, not perhaps a common form, but arguably the most profound. At any rate, AGM has nothing to tell us about this.
The AGM axioms themselves are rather abstract. There are many concrete models of them; these characterize revision (uniquely) in terms of other things. One of the nicest models is in terms of “spheres”. A simple way of thinking of this is as follows. A set of beliefs, K, can be identified with the set of its models (in the model-theoretic sense). Let us write K + for the set of all interpretations that make K true (its models). I will write α+ for {α}+. K + α is then simply the theory whose models are K + ∩ α+. To define the other two operations, we suppose that K comes furnished with a set of “spheres”: K + = S 0 S 1 … S n = I (the set of all interpretations of ). To keep things simple, we suppose that there is a finite number of spheres, though this is inessential. Each sphere may be thought of as a “fallback” theory. Thus, if any
end p.131
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
S i is shown to be untenable, the agent's next choice is S i+1 . For any non-empty set, X I, there must be a smallest sphere, S X , that intersects it (and if X is empty, let S X = I). K * α may be defined as the theory whose models are . That is, it is the largest portion of the most preferred fallback theory that entails α. The definition of contraction need not concern here. 2 Let us just note that it can be shown that the operations, defined in terms of spheres, satisfy the AGM conditions.
8.3 INCONSISTENT BELIEF
Now to the problem posed by inconsistency. In the standard AGM account, as we saw, it is assumed that the logical consequence relation employed is classical. In particular, then, an inconsistent belief-set is trivial. Thus, adding to a belief-set something that is inconsistent with it produces triviality. Moreover, revising a belief-set with something inconsistent also gives triviality, because of the success condition.
Now, here is the problem. People often have inconsistent commitments. The persons whose beliefs are consistent is, in fact, a rarity. Yet it is absurd to suppose that a person who has inconsistent commitments is thereby committed to everything. If, by oversight, I believe both that I will give a talk on campus at noon, but also that I will be in town at noon, this hardly commits me to believing that the Battle of Hastings was in 1939.
It might be suggested that the theory of belief revision is one of an ideally rational agent, and that such an agent never has inconsistent beliefs. But this is a confusion. The theory of belief revision is a theory of how an ideally rational agent changes their beliefs. It is quite possible that such an agent should find themself with inconsistent beliefs in the first place (maybe through their education). Indeed, one thing we should expect of a theory of belief revision is an account of what it is rational to do if we do find ourselves in this situation.
Worse, it is not at all clear that an ideally rational agent must have consistent beliefs. Sometimes there is overwhelming evidence for inconsistent beliefs. The evidence points mercilessly to the fact that either a committed the crime or that b did. But I have known both a and b for years. Both are of impeccable moral character; neither is the kind of person who would do such a thing. This sort of situation seems to be a not uncommon one in science. For example, in the late nineteenth century, the evidence that evolution had occurred, and that the age of the earth was hundreds of millions of years, was overwhelming. But the thermodynamic evidence concerning the age of the sun showed that the earth could not possibly be that old. Nor is this simply a matter of history: it is well known that presently the general theory of relativity and quantum theory are mutually inconsistent. Indeed, as many philosophers of science have noted, most accepted scientific theories face contradictions and anomalies. 3 For
end p.132
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
this reason, we cannot simply suspend belief in theories facing inconsistency. If we did, we would have no science left.
Further, as we saw in section 6.2, there are, in fact, good grounds for supposing that an ideally rational agent must have inconsistent beliefs. Such an agent would not believe something unless the evidence supported its truth. Hence, every one of their beliefs, α 1 , …, α n , is rationally grounded. But the rational agent also knows that no one is perfect, and that the evidence is overwhelming that everyone has false beliefs (rational agents included: rationality does not entail infallibility). Hence, they believe ¬ (α 1 … α n ). So their beliefs are inconsistent. 4 And even if, for some reason, this argument fails, one would hardly want to rule out a priori the possibility that rational belief is fallible in this way.
Of course, a rational agent can hardly believe everything. Hence, whether one takes the agent of the theory to be simply a rational reviser, or to be a rational believer as well, a theory of belief revision must allow for the possibility of agents having inconsistent but non-trivial beliefs.
8.4 PARACONSISTENCY
Possibly, there are a number of ways that one might try to solve this problem, but the most obvious is simply to employ a relation of logical consequence that is paraconsistent, that is, in which inconsistencies do not entail everything. An agent's commitments may then be inconsistent without being trivial. 5 There are many paraconsistent logics, and we do not need to go into details here. 6 Using such a logic, pretty much the whole of the AGM theory goes over intact. Just replace CL with the appropriate paraconsistent notion of consequence. The sphere modelling, too, works the same way. We just replace classical models with the models of the paraconsistent logic employed. There is only one major and inevitable casualty amongst the AGM postulates: the consistency postulate. 7 This must fail: K * α may be inconsistent, even if α is quite consistent, and for reasons that have nothing to do with α. But this is entirely what one should expect in the current context. 8
end p.133
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
But now we face another problem. If we are allowing for the possibility of inconsistent beliefs, why should revising our beliefs with new information ever cause us to reject anything from our belief-set at all? Why not simply add the belief to our belief-set, and leave it at that? 9
The problem comes out sharply in the sphere modelling of the AGM postulates. In many standard paraconsistent logics, there is a trivial interpretation, ∞, one that makes everything true. A fortiori, it is a model of every theory. Given such an interpretation, for any α and K, ∞ K + ∩ α+. Thus, is just K + itself, and so . Revision and expansion are exactly the same thing! 10 ∞ is just a technical way of saying that when you can believe any contradiction, revision requires nothing but addition. Note, however, that in a paraconsistent context there is no reason to suppose the Levi identity holds (unless, of course, one uses it to define revision). For example, let K be the set of logical consequences of ¬p. Then ¬p is not in (K − ¬p) + p, but it is in K + p.
8.5 MULTIPLE CRITERIA
But this is too fast. Suppose that we use a paraconsistent logic. If we revise our beliefs in the light of new information, α, there is nothing now in logic that will force us to delete ¬α (and some of the things that entail it) from our beliefs. But just because this is a logical possibility, it does not follow that it is a rational possibility. There is a lot more to rationality than consistency. Many quite consistent beliefs are irrational; for example, the belief that I am a fried egg. I may even hold this belief consistently with the rest of my beliefs, if I make suitable adjustments elsewhere (by jettisoning, e.g. the belief that I was born, and not laid). This makes it no more rational.
As I discussed in section 7.4, and as epistemologists have often noted, there are many features of a set of beliefs that may speak against its rational acceptability. Inconsistency
end p.134
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
may perhaps do this, but so, for example, does a high level of ad-hocness—which is presumably what goes wrong in the case of the person who gerrymanders their beliefs into a consistent set containing one to the effect that they are a fried egg. Conversely, there are many features of a set of beliefs that speak in favour of its rational acceptability. Simplicity is a traditional such virtue; so are: a low degree of ad-hocness, fruitfulness, explanatory power, unifying power. 11 What, exactly, all these criteria—except consistency—amount to, is a thorny issue: they are all notoriously difficult to spell out. It may even be the case that some of them, when properly understood, come to the same thing, but certainly not all of them. Moreover, how one is to justify these epistemic virtues is a very difficult question. 12 But it is also a different one: as with moral virtues, people can agree that something, say kindness, is a virtue, whilst disagreeing in their theoretical accounts of why this is so. Indeed, that kindness is a virtue is a datum much firmer than any moral theory will ever be.
We need not go into any of these questions further here, though. For, again as noted in section 7.4, it is simply the multiplicity of the criteria that is presently relevant. As long as there is a multiplicity, there is the possibility of conflict between them. 13 One set of beliefs may be the simplest, have the highest explanatory power, but be inconsistent. Another may be consistent and fruitful, but highly ad hoc. What is it rational to believe in such circumstances? There may be no determinate answer to this question. But, as argued in section 7.4, it would seem clear that (vaguely) if one set of beliefs scores sufficiently better on a sufficient number of these criteria than another, it is rationally preferable. This is how an inconsistent set of beliefs can be rationally acceptable: it scores highly on many of the other criteria. Conversely, and to return to the problem at hand, this is why an inconsistent set of beliefs may not be rationally acceptable. Its inconsistency may speak against it; and so may many other criteria. In particular, if we always revise simply by adding on the new information, we are likely to lose in simplicity, economy, unity. So the result may be quite irrational.
Looking at things in this way, consistency is no longer a necessary condition for rational belief; at best, it is merely one of a list of (potentially conflicting) desiderata. But we conceded this once we conceded the possibility that it might be rational to have inconsistent beliefs anyway. The important point is that simply tacking on new information to our beliefs, rendering the whole belief-set inconsistent, though it may be a logical option, may not be the rational course of action. 14
8.6 FORMAL MODELS
Let me now outline a formal model of belief revision incorporating this insight. Suppose, as before, that our belief-set is K, and that new information, α, arrives. What is the new rational set of beliefs? There are a number of possibilities. One is
end p.135
simply the addition, K + α. Another is something obtained by the Levi operation: (K − ¬α) + α, where – is some suitable notion of contraction. 15 Another is a theory obtained by reversing the operations involved: (K + α) − ¬α. (There is no reason to suppose that, in general, this gives the same result. For example, it may not satisfy the AGM success condition.) Recall that the AGM conditions do not specify contraction uniquely; there may therefore even be many theories of the forms specified by the Levi identity and its reversal. And there may well be other possibilities as well. For example, one rational course of action may simply be to reject the new information, and write it off on the grounds of experimental error or whatever—though in this case one will presumably add to the belief-set something like ‘it seemed that α’, or ‘our instruments reported that α’—in which case, again, the success condition will clearly fail. 16 More profoundly, the new information may occasion conceptual innovation, and thus there may be sets of beliefs in a new language. 17 Let us call the collection of all options, whatever they are, K α. 18
We know that there is a set of criteria, C, which can be used to evaluate sets of belief. These criteria are not all-or-nothing matters. (Given a paraconsistent logic, even inconsistency comes by degrees.) So let us suppose that for each c C = {c 1 , …, c n } there is a scale of how well a belief-set fares according to that criterion. Specifically, for each belief-set, k K α, there is a real number, μ c (k), measuring how good that set is. (The higher the number, the better the set.) One might prefer some other scale, perhaps some subset of the real numbers, such as [0, 1]. But since the matter is largely conventional, this is not a crucial issue.
Next, we need a way of amalgamating the various criteria. A simple way of doing this is by taking their weighted average. 19 Let ρ(k), the “rationality index” of k, be defined thus:
•
The weights, w i , reflect the relative importance of each of the criteria. I will have a little more to say about these later. K * α can now be defined as the member of K α
end p.136
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
with the largest rationality index. 20 If there is more than one, K * α is one of these, non-deterministically. (Believers have a free choice.)
The model I have just described is simple, but obviously unrealistic in a number of ways. One of these is that, given a criterion, c, the assignment of a unique real number, μ c (k), to a theory, k, is not really to be expected. About the best we can hope for is to assign k a range of numbers. (And probably a vague range, at that. But how to handle vagueness poses a whole new set of problems that would take us far away from the present problem.)
A second feature of the model is that, according to it, all theories are comparable as to the rationality of their acceptability. The ordering is a linear one. 21 This seems quite unrealistic. Rational people may well disagree about the best thing to believe when different criteria rank theories radically differently. 22 Thus, suppose that there are two criteria, c 1 and c 2 ; c 1 ranks k 1 high, but k 2 low, and c 2 vice versa. Then there may just be no comparing the two theories. The best we can hope for is a determinate answer when most of the criteria give similar results (and that's vague too). Thus, the ordering ought to be a partial one.
By a small modification of the construction, we can solve both of these problems at once. We now take each μ c to assign each k, not a single value, but a non-empty range of values, . (The first figure is the lower bound; the second figure is the upper bound.) When we amalgamate these, the result is also going to be a range of values. The highest value a belief-set can hope for is clearly realized when it obtains the highest value under each component, and similarly for the lowest. So we can take ρ(k) to be the range [ρ−(k), ρ+(k)], where:
•
An overall ranking, ⊏, may now be defined on belief-sets. One belief-set, k 1 , is rationally preferable to another, k 2 , if it is clearly better, that is, if any value that k 1 can have is better than any value k 2 can have:
•
It is easy to check that this is a partial ordering. It is not, in general, a total ordering. For example, if ρ(k 1 )=[3, 7] and ρ(k 2 ) = [2, 4], then neither k 1 ⊏ k 2 nor k 2 ⊏ k 1 .
end p.137
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Thus, it may be that the rationality ordering, ⊏, provides a clear judgment concerning two theories sometimes, but not others. Rational disagreement is possible.
We can now define K * α to be any belief-set in K α maximal in the ordering. If there is more than one, this is non-deterministic. For it is easy to see that there may be more than one distinct maximum. For example, suppose that ρ(k 1 )=[3, 7] and ρ(k 2 )=[4, 6], but for every other k K α, ρ(k)=[1, 2]. Then k 1 and k 2 are rationally preferable to all the other ks, except each other, and neither is preferable to the other.
Let me illustrate this definition of revision with a toy example. Suppose that K is the set of logical consequences of {p, q} (p and q being logically independent of each other) and that α is ¬p. We suppose that the only members of K α are k 1 =K + α, the set of logical consequences of {p, q, ¬p}, and the Levi revision k 2 =(K−p)+¬p, the set of logical consequences of {q, ¬p}. Suppose, now, that there are two criteria, consistency, c 1 , and explanatory power, c 2 , and that the values of μ are (for the sake of illustration) as follows:
•
There is no room for disagreement about which theory is the more consistent! Finally, suppose that w 1 = 1, and w 2 = 2. Thus, we think that explanatory power is more important than consistency. Computing, we determine that:
•
Thus, k 1 ⊏ k 2 , and K * α = k 2 . In particular, note, revision has occasioned the dropping of a belief.
This more complex model is clearly more realistic, but it is still a very idealized model. For a start, is there any reason to suppose that the criterion-weights, w i , are uniquely and precisely determined? None that I can see. Moreover, it might well be the case that there could be rational disagreement about what their values ought to be. 23 A solution here is to treat weights as we treated the single-criterion values. That is, instead of taking them to be single values, we take them to be a range of values, broad enough to encompass differences of opinion. When we amalgamate, ρ+(k) is computed using the maxima of the weight-values; ρ−(k) is computed using the minima.
The account, thus modified, is still just a model. If it were ever to be applied in practice, the question of how to determine the values of the ws and the μ -values of each theory would become an important one. Of course, the absolute values are unimportant: the scales are almost completely arbitrary. It is the relative values (within scales and across scales) that are important. It can be hoped that a community of investigators could reach consensus on appropriate figures, particularly since it is only a range of values that has to be agreed upon. We may be able to set limits that keep everybody happy.
end p.138
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Note, also, that there might well be grounds for revising the weights themselves in the light of philosophical and scientific developments. Clearly, weight-revision cannot be done arbitrarily. Someone cannot declare, for example, that empirical adequacy is to be weighted 0. The result—on pretty much most other assumptions—would be that the trivial theory would come out as the most rational. However, there is a rational procedure for changing weights. The theory of rationality that I have laid out in this chapter, including the weights of the different criteria, is just that: a theory. It can therefore be revised in the way that I have described. In particular, the sentence to the effect that the weight of such and such a criterion is so and so might well be revised in the process. Maybe we come to believe, for example, that the world is just not the sort of place that is likely to be simple—in which case, the weight for simplicity would be decreased. (Indeed, the project of this book might be seen as part of an argument for lowering the weight traditionally assigned to consistency.) This raises the spectre that the revised theory might not be rationally acceptable given the revised weights, and so would need to be revised again. There is nothing, per se, wrong with this possibility. Revision, after all, should be expected to be an ongoing process. 24 There clearly might be a problem if the process went round in some kind of vicious loop. However, this fact itself, once realized, would be new information which would, presumably, provide the impetus for a revision that takes things out of the loop.
8.7 THE AGM CONDITIONS REVISITED
Since we started the chapter with a discussion of the AGM axioms, let me finish by briefly discussing to what extent K * α, as defined here, satisfies them. Take them one by one.
K* 1: K * α is logically closed.
K * α is a theory, and so logically closed; but under which logic? As argued, we may expect it to be a paraconsistent logic. But note that the construction does not require the logic to be any particular logic. Indeed, the different members of K α may well be closed under logics that are different from each other, and from that of K too. For, as I will discuss in Chapter 10, one thing that we may wish to revise under the influence of recalcitrant information is exactly our logical theory. Thus, e.g. under the weight of counter-examples to the material conditional, it might be (indeed it is!) rational to move to a relevant logic. Quite unlike standard AGM constructions,
end p.139
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
then, this account shows us how, and in what way, our logic may be revised too. The rational choice, K * α, simply has a different logic from K. 25
K* 2: α K * α.
The success postulate need not be satisfied by K * α, as has already been discussed.
K* 3: K * α K + α.
This postulate may not be satisfied either. For α may occasion conceptual revision. And if it does, there may be beliefs in K*α that cannot even be expressed in the language of K, and so are not in K+ α at all. For this reason, the revision of phlogiston chemistry under the recalcitrant results obtained by weighing, occasioned beliefs employing quite new concepts, like oxygen.
K* 4: ¬α K K+α K * α.
For similar reasons, this postulate fails. Old beliefs, such as that there is a substance called phlogiston, may get ditched in the process of revision.
K* 5: K * α is inconsistent α is a logical contradiction.
The failure of this postulate needs no further discussion.
K* 6: α and β are logically equivalent K * α = K * β.
There is no reason to suppose that this postulate holds either. Nor should it. For α and β may be logically equivalent under the logic of K, but they may occasion different revisions of logic. For example, the existence of a counter-example to the law of excluded middle is equivalent to the existence of one to distribution (of over V) in classical logic, since these are both tautologies. Yet the first may occasion a rational move to a three-valued logic (in which distribution, but not the law of excluded middle, holds), whilst the second might occasion a rational move to a quantum logic (in which the law of excluded middle, but not distribution, holds).
K* 7: K * (α β) (K * α) + β.
This postulate fails too. Suppose that, α K, and that, given K, α is incredible. So if α arrives as new (prima facie) information, the best thing to do is write it off to experimental error. Then α K * α. Now suppose that β is something that does not entail α (even in conjunction with K * α), so α (K * α)+β, but that in the light of β, α is acceptable. In particular, given α and β together, α does become acceptable. That is, α K * (α β).
K* 8: ¬β K * α (K * α) + β K*(α β).
This postulate fails for similar reasons. Choose some β such that neither β nor ¬β is in K. Suppose that, given K, α, β and—a fortiori—α β, are incredible. Their
end p.140
arrival as prima facie new information then have no substantial effect on K. Then ¬β K = K * α, β K = K * (α β), but β (K * α) + β.
In summary, then, given the construction of this section, all the AGM postulates may fail. They do so because the model shows how to operate in a much more general class of revisions than the simple ones envisaged by AGM. This, indeed, is one of its strengths. 26
8.8 CONCLUSION
The model of belief revision give here is, as we have seen, a much more general and flexible one than AGM. I do not think that, in real life, disputes over what belief-set (theory) it is rational to adopt proceed explicitly in the way prescribed by the model. Rational belief change is a much less cleanly articulated business than this. But I do think that the model captures, at least roughly, the qualitative features of what goes on implicitly in such contexts. And it suffices to give the lie to an objection often raised against those who think that it may be rational to believe a contradiction. 27 This is to the effect that there can be no rational debate with a person who will accept contradictions, since they may accept anything. They may do nothing of the kind.
end p.141
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
9 Consistency and the Empirical Sciences
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
9.1 INTRODUCTION
What role does, or should, inconsistency play in the empirical sciences? This is the question that is to be addressed in this chapter. The question is hardly a new one, but the development of modern formal paraconsistent logics has a profound impact on the subject. Paraconsistent logicians have realized that their subject has important implications for the empirical sciences and the philosophy thereof, 1 but discussions of the applications of paraconsistent logic have focused largely on non-empirical areas, such as semantics and metaphysics. It therefore seems appropriate to address the question directly.
I will first address the issue of the specificity of the empirical sciences: observation. Next, we will look at the role that inconsistency has played in science (henceforth in this chapter, I will often take the qualifier ‘empirical’ for granted), and the relation of this to paraconsistent logic. This will raise the question of how inconsistent information ought to be treated in science: what criteria, for example, may lead us to accepting an inconsistent theory? And should such an acceptance ever be more than provisional? These topics will be addressed in the next sections. An outcome of this discussion will be that, in the light of developments in paraconsistent logic, we may well have to change our attitude to inconsistencies of certain kinds; such a change would open whole new possibilities in science itself.
9.2 INCONSISTENCY AND OBSERVATION
Before we can address the issue of the role of inconsistency in empirical sciences, it will be important to discuss how such sciences differ from similar inquiries (which are often intricately connected with them), such as mathematics and metaphysics. The standard answer to this question is that in the empirical sciences, but not the others, observation plays a role. In the more developed sciences such observations are obtained through active experimentation. The observations serve to provide the ultimate explananda for science, as well as providing important inputs into the evaluation of scientific theories. This standard answer is not entirely unproblematic: the role of empirical data in science has often been overrated, and its role in areas such as philosophy
end p.142
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
underrated—for example, by positivists and empiricists. However, that it plays a much more central role in science than in other inquiries can hardly be denied.
The distinction between what is observable and what is not, is not, itself, entirely unproblematic. Some states of affairs are certainly observable (such as the colour of an insect) and some are certainly not (such as the colour of a quark). But it is impossible to draw a sharp boundary between what is, and what is not. What is observable may depend on what aids to perception (such as a microscope) are used; or on how we interpret our sensory input in the light of accepted theories (such as theories that tell us that when we witness the tracks in a bubble chamber, we are seeing the tracks of charged particles). However, as with all vague or context-dependant distinctions, the fact that it is impossible to draw a neat line between its two sides does not mean that there is not an important distinction to be drawn. (Compare: being a child and being an adult.)
Let us now revisit the discussion of observation in section 3.5 with this new context in mind. What can one see? 2 For a start, seeing, in the sense relevant to explanation and confirmation, is propositional. We see that something is the case. We see that the stars appear to be in a certain position and ask for an explanation, or use this against a theory according to which they should appear somewhere else. As we noted in section 3.5, some have thought that one cannot see that something is not the case. We always see that something is the case, and then infer that something else is not the case. For example, we see that something is red and infer that it is not black. Whilst one may certainly do this, one need not: one can see directly that something is not the case. Try a thought experiment. I show you an ordinary apple and ask: is this black? You compare its colour with a mental paradigm of blackness, and it does not match; you say no. 3 Or again, you enter a room; the whole room is visible from where you stand; there is no one there. You can see that Pierre is not in the room. You do not have to say: the things in the room are a chair, a table …; Pierre is not a chair, Pierre is not a table …; therefore, etc. 4 The very distinction between seeing what is the case and what is not the case is a false one. Some seeings are both. When talking of actual physical objects, to be transparent is not to be opaque and vice versa. But you can see that something is transparent and you can see that something is opaque.
It should be remembered that seeing is not simply a matter of light rays hitting the retina. Certainly, the eyes are involved in seeing; but one needs more than eyes to see. If what one sees is to play a role in one's cognitive functioning, one must also understand one's visual input. Hence, the categories of the understanding play a role. To see is to interpret ones visual stimuli, by applying these categories, either consciously or preconsciously; and there is no reason why truth functions such as negation or disjunction should not enter into the process of interpretation directly. For example, I can see that this is a photograph of either Ned or Ted, identical twins such that I cannot tell the difference, without seeing that it is a photograph of Ned, and inferring the disjunction.
end p.143
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Now, suppose that α and β are states of affairs observable at the same time and place, from the same perspective, etc., as I discussed in section 3.5; then, as argued there, so is their conjunction. We can see that something is a unicorn; we can see that it is green. Hence, we can see that it is a green unicorn. Thus, if α and ¬α are observable states of affairs, so is α ¬α. Of course, this is not to say that the conjunction is observed; merely that it is observable—that is, it is of a kind such that if it were to be the case, it could be seen.
In reply to the suggestion that it might be impossible to see inconsistent states of affairs because our perceptual apparatus imposes some “consistency filter”, it suffices to note, as we saw in section 3.5, that we perceive impossible situations in various visual illusions. Thus, there are many well-known impossible figures (of the kind, for example, employed by Escher in his drawings); there are perceptual sets where people report seeing things as simultaneously red and green; there are situations where things appear to be moving and not moving. Let me just remind the reader of one of these, the last. 5 This is commonly known as the waterfall illusion. After conditioning the visual field by showing it constant motion of a certain kind, say a spinning spiral, one then looks at a stationary scene. The scene appears to move in the opposite direction, but nothing in the scene changes its position; for example, an object at the top of the visual field does not move round to the bottom. What we see appears to be both moving and stationary; this is the natural way of describing one's visual sensations. Of course, perception in the cases I have described is not veridical; 6 these are illusions, and things are not really thus; but that is how they would appear if things were thus. There is therefore nothing about our visual system that requires perception to be consistent.
9.3 TYPES OF INCONSISTENCY
Let us now turn to the issue of inconsistency in science. The inconsistencies in question here are inconsistencies in what are accepted scientific beliefs. Many historians and philosophers of science have observed that there are such inconsistencies—indeed that they are common—even in contemporary science. 7 If we distinguish between observation and theory (what cannot be observed), then three different types of contradiction are particularly noteworthy for our purposes: between theory and observation, between theory and theory, and internal to a theory itself. 8 Let us look at these three in more detail.
Inconsistency between theory and observation is the most obvious example. We have a well-received theory, T, with a certain observable consequence, α. We then run
end p.144
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
an experiment and observe that ¬α. Simple-minded falsificationists would suggest that this shows that T is wrong and is to be ditched. But many philosophers of science have pointed out that this does not necessarily happen. 9 The contradiction may be treated as the site of an anomaly: both T and ¬α may be accepted pro tem. T will not be jettisoned until we have a better alternative; ¬α will not be jettisoned until we have an explanation of why our observation was incorrect. Examples of this kind in the history of science are many. The precession of the perihelion of Mercury, at odds with received facts about the heavenly bodies and Newtonian dynamics, was known for a long time before the rejection of Newtonian dynamics in favour of special relativity. Prout's hypothesis was widely accepted by many chemists even though it was known to be at odds with empirical data (much of which was subsequently rejected). And so on. 10
The second kind of inconsistency, between theory and theory, is less frequently noted, but certainly occurs. This is when we have two well-accepted theories, T 1 and T 2 , which have inconsistent consequences. Again, though this may be noted as the site of a problem, both T 1 and T 2 are retained until a suitable replacement theory for one or both is found. An example of this concerns the age of the earth in late nineteenth-century science. According to evolutionary theory (which was, by that time, well accepted), the age of the earth had to be hundreds of millions of years; but according to received thermodynamics, the sun—and so the earth—could not possibly be that old. (The issue was resolved only in the twentieth century with the discovery of radioactivity, a hitherto unsuspected form of heat-generation.) Another, contemporary, example: the theories of relativity and quantum theory are known to be mutually inconsistent. 11
The third example of an inconsistency is when a theory is self-inconsistent. This could arise because a theory has inconsistent observational consequences, though I know of no interesting cases of this in the history of science. What there certainly have been are inconsistent theories where the inconsistencies are located internally, away from observational consequences. For example, for over a hundred years Newtonian dynamics was based on the old calculus, in which infinitesimals had inconsistent properties (being both non-zero, at one point in computations, and zero at others). Another, particularly striking, example is Bohr's theory of the atom, which included both classical electrodynamic principles and quantum principles that were quite inconsistent with them. Though the theory was certainly considered as problematic, its empirical
end p.145
predications were so much better than those of any other theory at the time that it had no real competitor. 12
9.4 HANDLING INCONSISTENCY
As we have seen, the corpus of scientific beliefs may, or is even likely to be, inconsistent at any time. But from things accepted, scientists infer other things to be accepted, and they do not infer arbitrary conclusions. It follows that the inference procedure employed here must be a paraconsistent one (where an arbitrary α and ¬α do not entail an arbitrary β). What paraconsistent inference procedure is employed in inconsistent cases is another question; there is no a priori reason to suppose that it must be one of the standard monotonic paraconsistent logics, or even that it must be monotonic at all. Nor should one suppose that it must be the same in every case. What is guaranteed is that there must be some systematic procedure for drawing conclusions, and which does not permit drawing an arbitrary conclusion from a contradiction. What procedure is or was employed in any given case is a matter for detailed investigation.
I do not intend to discuss detailed examples here; but for what follows, it will be important for us to distinguish between two different kinds of paraconsistent logic: adjunctive and non-adjunctive. In adjunctive paraconsistent logics, such as standard relevant logics and da Costa's C systems, the rule of adjunction, α, β α β, is valid. In non-adjunctive systems, such as discussive logics, it is not. Non-adjunctive paraconsistent logics often employ some chunking procedure. Because of the failure of adjunction, one cannot simply put arbitrary premises together and draw conclusions. If the inference procedure is not to be impoverished, it is usually necessary to be able to put some premises together (into a chunk). A simple procedure is to put together any bunch of premises that are mutually consistent; but there are more sophisticated ones. 13
One thing that is clear from a fairly cursory consideration of the scientific handling of inconsistent information is that the inference procedure employed is often of a non-adjunctive, chunking, variety. For example, given the dispute about the age of the earth at the end of the nineteenth century, no one conjoined the views that the earth was hundreds of millions of years old, and that it was not, to infer that the earth
end p.146
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
really had a contradictory age. 14 Similarly, in the Bohr theory of the atom, the drawing of conclusions was restricted to well-defined consistent chunks, possibly in accordance with some pragmatically determined—but still determinate—considerations. 15 A conclusion drawn may then have been fed into another consistent chunk which contained information inconsistent with that in the first chunk. 16 Whether adjunctive paraconsistent logics have, historically, ever been used in handling inconsistencies is a different matter, and is rather doubtful. At any rate, I know of no examples where this is clearly the case.
9.5 ACCEPTING INCONSISTENT INFORMATION
We have seen that inconsistent information has sometimes been accepted in the history of science; we have also seen, at least in outline, why this does not lead to disaster. None of this shows that the inconsistent information ought to have been accepted (even provisionally). But the situation seems to have arisen so frequently that it is implausible to level at the scientific community charges of blatant and frequent irrationality. This, therefore, raises the question of the conditions under which it is reasonable to accept an inconsistent theory or other body of information.
The question of what makes it reasonable to accept any theory is a familiar one in contemporary philosophy of science, and a very difficult one. Neither do I intend to advance a detailed answer to that question here. But let us recall the account of rational belief developed in the previous two chapters.
The doxastic goodness of a theory may be evaluated under a number of orthogonal criteria. A major criterion is empirical adequacy. For any observational consequence, α, that a theory entails, if α is observed, this is a good mark. If it is not, this is a black mark. Empirical adequacy is, perhaps, the most important criterion in science. It is, at least, the one that mainly distinguishes the empirical sciences from similar investigations. But it is certainly not the only one; nor can it be. It is at least theoretically possible to have different theories that are both empirically adequate; more commonly, it happens that no theory in the field is entirely empirically adequate. Hence other criteria have also to be employed. There is a multitude of these; 17 philosophers may disagree both about what they are, and about how, exactly, to understand them, but the following have certainly been suggested, and are very plausible. Good-making features include: simplicity, ontological leanness (Occam's razor), explanatory power, a low degree of ad hocness, unity, fruitfulness. The converse features are bad: complexity,
end p.147
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
ontological extravagance, explanatory poverty, much ad hocness, fragmentation, barrenness. 18
The exact number of, and details concerning, criteria of these kinds, though a highly important and interesting question, need not detain us here. The important points are (a) that there is a multitude, and (b) that the criteria do not necessarily hang together. One theory, say Bohr's theory of the atom, may have a high degree of empirical adequacy, be very fruitful, but inconsistent. Another may be consistent, have a lesser degree of empirical adequacy, and be rather ad hoc. As I have argued in the previous two chapters, one theory is rationally preferable to another if it is sufficiently better on a sufficient number of criteria.
This account, rough as it may be, is sufficient to demonstrate a number of things. It shows why theory choice is a messy business: there is no simple algorithm. It shows why, within certain limits, there may be room to disagree over which theory is better: if no theory is overall best, people may reasonably disagree. All this is familiar from standard philosophy of science. As we noted in previous chapters, it shows two further things. First, it shows how and when it may be rational to accept an inconsistent theory: when, despite its inconsistency, it is markedly better than its rivals on sufficiently many other criteria. Secondly, it shows when it may be right to reject an inconsistent theory, even when inconsistency may be rationally tolerable: when a rival theory scores higher on sufficiently many criteria. That is, it shows how theories may be “falsified”, even if inconsistencies are sometimes tolerable.
9.6 INCONSISTENCY AND TRUTH
But should an inconsistent corpus of belief be accepted only provisionally, until a better one can be found; or can it be accepted as a candidate for the final truth? Several comments are pertinent here. First, there is no such thing as certainty about anything in science. Any theory or set of theories, whether consistent or inconsistent, should be endorsed fallibly. All theories go beyond the data—which is itself, in any case, “theory laden”. In this sense, the acceptance of anything is only ever provisional.
But is there something special about inconsistency in this regard? Here, it seems to me, the nature of the inconsistency is relevant. Note, first, that if a theory is empirically inadequate, however acceptable it is, the received information is not a candidate for the truth, at least as things stand. If a theory entails an observable consequence α, and α is not perceived, something is wrong, either with our theory or with our perceptions; something needs to be fixed. In particular, then, if a theory entails β ¬β, where β is some observation statement, then if such a contradiction is not observed, something is wrong. As I have already argued, β ¬β is a perfectly observable state of affairs. 19 Moreover, if the inconsistency in the scientific corpus is between a theory and an
end p.148
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
observation, something needs to be revised. For if theory T entails α, but ¬α is observed, not α, we again have an empirical inadequacy. It may be retorted that if inconsistencies are acceptable, maybe α ¬α is true after all. But again, since α ¬α is an observable state of affairs, and one that is not observed in the situation described, we have an empirical inadequacy: if α ¬α were true, so would α be, and this is precisely what is not observed.
What if a contradiction is one between theory and theory, or internal to a theory, not spilling over into observation? Here, the situation is more complicated. Suppose, first, that one is an instrumentalist; then all one cares about is the empirical adequacy of a theory; if a contradiction is located deep in the heart of theory, this is of no moment. But if, as I think correct—though I shall not argue it here—one should, in general, be a realist about scientific theories, the matter is different, and depends crucially on how this inconsistency is handled. If it is handled by a chunking strategy, then the theory is not a candidate for the truth. If α is true and ¬α is true, then so is their conjunction. If a theory refuses to allow this move then the theory cannot be correct, and we know this. 20
If, on the other hand, the inconsistency is handled with an adjunctive paraconsistent logic, there is no reason, as far as I can see, why we should not suppose the theory or theories in question to be correct. In particular, then, a theoretical inconsistency that is handled adjunctively is not, in itself, a reason to suppose that the theory cannot, in the long run, be allowed to stand. Any argument to the effect that such inconsistencies are ultimately unacceptable must be a quite general and a priori defence of the Law of Non-Contradiction in the appropriate form: any contradiction is known, in advance, to rule a theory out as a candidate for the truth. I know of no such argument that works. All fail, usually by simply begging the question in some way. 21 Thus, if we are realists, we will let our best theory, provided that it is not ruled out as a candidate for truth on other grounds, inform us as to what reality is like; and if our theory is inconsistent, there is no reason to suppose that the theory does not get it right: reality itself is inconsistent. In other words, inconsistencies of this kind in science do not mandate that the acceptance of the theory or theories in question be provisional in any special way. 22
9.7 INCONSISTENT MATHEMATICS
It is here that the impact of paraconsistent logic is revisionary—indeed, revolutionary. The Law of Non-Contradiction has been well entrenched in Western thought—and
end p.149
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
so science—since the canonization of Aristotle, whose defence of the law has rarely been challenged. Hence, scientists and philosophers have not been prepared to brook the thought that an inconsistent theory of any kind might be true. But subscribing to the Law is not rationally mandatory, as I argued in Chapter 7, and as the development of paraconsistent logics has played a large role in showing. Once this fact is digested, scientists may well—justifiably—take a different attitude to inconsistent theories of the appropriate kind. Indeed, they may even develop inconsistent theories, if these have the right empirical consequences, just as paraconsistent logicians have articulated inconsistent theories of semantics to handle the paradoxes of self-reference.
In modern science, the inferentially sophisticated part is nearly always mathematical. An appropriate mathematical theory is found, and its theorems are applied. Hence, a likely way for an inconsistent theory to arise now in science is via the application of an inconsistent mathematical theory. Though the construction of inconsistent mathematical theories (based on adjunctive paraconsistent logics) is relatively new, there are already a number: inconsistent number theories, linear algebras, category theories; and it is clear that there is much more scope in this area. 23 These theories have not been developed with an eye to their applicability in science—just as classical group theory was not. But once the paraconsistent revolution has been digested, it is by no means implausible to suppose that these theories, or ones like them, may find physical application—just as group theory did. For example, we might determine that certain physical magnitudes appear to be governed by the laws of some inconsistent arithmetic, where, for example, if n and m are magnitudes no smaller than some constant k, n + m = k (as well as its being the case that n + m ≠ k). 24 There are, after all, plenty of episodes in the history of science in which we came to accept that certain physical magnitudes have somewhat surprising mathematical properties (being imaginary, non-commuting, etc.). Why not inconsistency? 25
9.8 CONCLUSION
I believe that the development of modern formal paraconsistent logics is one of the most significant intellectual developments of the twentieth century. In challenging entrenched Western attitudes to inconsistency that are over 2,000 years old, it has the potential to ricochet through all of our intellectual life—and empirical science wears no bullet-proof vest. As we have seen, empirical scientists have always tolerated, and operated within, inconsistency in certain ways. One of the liberating effects of paraconsistency should be to allow us to understand better exactly how this proceeded.
end p.150
Such an understanding is bound to reflect into our understanding of the rationality of theory-choice, in the ways that I have indicated. Perhaps most importantly of all, paraconsistency may open the gate to important new kinds of theory within science itself. Where this will all lead, one cannot even begin to speculate.
One branch of science—though perhaps not empirical science—as traditionally conceived, is logic. The nature of this brings us to the fourth part of the book. And about the nature of logic, one can do a lot more than speculate.
end p.151
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
end p.152
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
PART IV LOGIC
end p.153
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
end p.154
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
10 Logic and Revisability
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
10.1 INTRODUCTION
In the first three parts of this book, we have looked at the relevance of dialetheism to three core notions of philosophy—and vice versa—truth, negation, and rationality. In this last part of the book we take on a fourth: logic itself.
If dialetheism is correct, then—unless one is a trivialist—one must subscribe to some form of paraconsistent logic. The orthodox logic of our time is not a paraconsistent logic, and so dialetheism requires us to revise our beliefs about which logic is correct. This is often described as revising our logic; but this is a very misleading way of putting the matter. The process is one of belief-revision of a kind discussed in Chapter 8. I indicated in that chapter how belief-revision in general operates; and this account applies to our beliefs and theories about logic, as much as anything else.
Still, it might be thought that there is something special about logic which makes it different. In this chapter I will argue that there is not. Logic is revisable in just the same way that any other theory is. For comparison, I will also discuss the situation concerning geometry and arithmetic. The chapter also discusses some ramifications and implications of the revisability of logic. In particular, we will look at what the rivalry of logical theories amounts to, and at Quine's remarks concerning rivalry and the possibility of “deviant logics”.
Paraconsistent logic is not, of course, the only “deviant” logic. Many formal non-classical logics were developed in the twentieth century, including intuitionist logic, various many-valued logics, quantum logics, etc. All of these provide potential rivals for classical logic, and many of the considerations posed by paraconsistent logics are quite general. I will therefore discuss the question of logical revision in the more general context of non-classical logics. The central concern, however, will remain paraconsistent logic.
10.2 LOGICS, GEOMETRIES, AND ARITHMETICS
The earliest formal non-classical logics to be developed in the twentieth century was many-valued logic. Principia Mathematica was hardly off the press before Łukasiewicz
end p.155
was formulating various logics of this kind. 1 In producing these, Łukasiewicz had the model of non-Euclidean geometries very much in mind. For example, he wrote: 2
It would perhaps not be right to call the many-valued systems of propositional logic established by me ‘non-Aristotelian’ logic, as Aristotle himself was the first to have thought that the law of bivalence could not be true for certain propositions. Our new-found logic might rather be termed ‘non-Chrysippean’, since Chrysippus appears to have been the first logician to consciously set up and stubbornly defend the theorem that every proposition is either true or false. The Chrysippean theorem has to the present day formed the most basic foundation of our entire logic. It is not easy to foresee what influence the discovery of non-Chrysippean systems of logic will exercise on philosophical speculation. However, it seems to me that the philosophical significance of the systems of logic treated here might be at least as great as the significance of non-Euclidean geometry.
The analogy between non-standard logics and non-Euclidean geometries is an important and interesting one, though several people have denied it. For example, the Kneales (1962), 575, say:
Even from a purely formal point of view the ordinary two-valued system has a unique status among deductive systems which can plausibly be called logic, since it contains all the others as fragments of itself. In short, they are not alternatives to classical logic in the sense in which Lobachevski's geometry is an alternative to Euclid's.
Now this is just factually wrong. There are, for example, connexivist systems of logic that contain logical truths that are not classical logical truths, such as (α → β) → ¬ (α → ¬β). 3 But even if it were not, the conclusion would hardly follow. From the fact that a logic is maximal in some sense, it does not follow that another cannot be an alternative. The maximal logic might entail too much.
Some have insisted that classical logic has some kind of priority over other systems, since metatheoretic reasoning must be classical. 4 Even if this were to entail some disanalogy, the claim is, again, just false. Intuitionist metatheory for intuitionist logic makes perfectly good sense. 5 Even Rescher (1969), 219, who has a good deal more sympathy for the claim than many others, denies the analogy, on the ground that the articulation of a logical system requires the employment of a metalogic (normally an informal one, and not necessarily classical), whereas the articulation of a geometric system does not require the employment of a metageometry. Rescher's observation seems correct. But again, it is difficult to see it as having significant import for the question. The formulation of a grammar for a language requires the employment of a metalanguage, and so a metagrammar. But this hardly entails that there cannot be rival grammars for a language, or that the question of which is correct is not a fallible and revisable matter. The same could be true of logic.
end p.156
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
In fact, the analogy between non-Euclidean geometries and non-classical logics is a close one, as we will see. To facilitate the comparison of geometry and logic, we will also look at an important half-way house, arithmetic. Logic, arithmetic, and geometry are the three great a priori sciences of Kant's Critique of Pure Reason. According to Kant, the mind has certain cognitive structures which, when imposed on our “raw sensations” produce our experiences. The first two, space and time, are dealt with in the Transcendental Aesthetic. The third, the categories, is dealt with in the Transcendental Analytic. In the case of all three, a certain body of truths holds good in virtue of these a priori structures; and these constitute the three corresponding sciences; geometry in the case of space, arithmetic in the case of time, and logic in the case of the categories. As the difference in location in the Critique indicates, the sciences are not entirely on a par: geometry and arithmetic are synthetic; logic is analytic. Nonetheless, each, as a science, is certain and, essentially, complete. This gives us Euclidean geometry, (standard) arithmetic, and Aristotelian logic.
10.3 NON-EUCLIDEAN GEOMETRY
There are few now who would agree with the Kantian picture of these three sciences—at least in its entirety. It has disintegrated, not just under the pressure of philosophical criticism, but under the pressure of developments in science itself. This is clearest in the case of geometry.
Until the nineteenth century, ‘geometry’ just meant Euclidean geometry; but in the first part of that century some different geometries were developed. Initially, these were obtained by Lobachevski and Bolyi simply by negating one of the postulates of Euclidean geometry in order to try to find a reductio proof of it. But under Riemann, the subject developed into one of great generality and sophistication. In particular, he developed a highly elegant theory concerning the curvature of spaces in various geometries. Whether non-Euclidean geometries were to be called geometries in stricto sensu might have been a moot point; after all, they did not describe the structure of physical space. But they were at least theories about objects called ‘points’, ‘lines’, etc., whose behaviour bore important analogies to that of the corresponding objects in Euclidean geometries. Moreover, Riemann realized that it might well be an empirical question as to which geometry should be applied in physics. 6
Within another fifty years, and even more shocking to Kantian sensibilities, Riemann had been vindicated. The General Theory of Relativity postulated a connection between mass and the curvature of space (or space-time) which implied that space may have non-zero curvature, and so be non-Euclidean. Predictions of this theory were borne out by subsequent experimentation, and the Theory is now generally accepted.
These developments have forced us to draw a crucial distinction. We must distinguish between geometry as a pure mathematical theory and geometry as an applied
end p.157
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
theory. As pure mathematical theories, there are many geometries. Each is perfectly well-defined by some axiomatic or model-theoretic structure. What holds in it may be a priori. By contrast, which to apply to the cosmos as a physical geometry is neither a priori nor incorrigible. Each pure geometry, when applied, provides, in effect, a theory about spatial (or spatio-temporal) relations; and which is correct is to be determined by the usual fallible scientific criteria.
This by no means resolves all pertinent philosophical issues. In particular, how to understand the status of applied geometry is still contentious. The simplest interpretation is a realist one. 7 Physical geometry is a theory about how certain things in physical space, i.e. points, lines, etc., behave; and a non-Euclidean geometry gets it right. The alternative to realism is non-realism, of which there are many kinds. One is reductionism: talk of geometric points and lines is to be translated without loss into talk of relationships between physical objects. This is the view traditionally associated with Leibniz, but has found few modern adherents. Another kind of non-realism is instrumentalism. 8 Geometry has no descriptive content, literal or reductive. It is merely auxiliary machinery for the rest of physics. As such, we may choose whatever geometric machinery makes life easiest elsewhere; and a non-Euclidean geometry does just that. One notable version of instrumentalism is that according to which, once we have chosen a geometry, its claims become true by convention. Such conventionalism is often associated with the name of Poincaré. 9
10.4 NON-STANDARD ARITHMETICS
Is the situation concerning different logics the same as that concerning different geometries, at least in principle? Prima facie, it might well be thought not. A crucial difference would appear to be that logic, unlike geometry, is analytic, not synthetic. Thus, logic is not about the world in any interesting sense; consequently, the question of changing logic to provide a better description of the world, as geometry was changed, does not arise.
Before we consider this issue, it will be illuminating to consider the case of arithmetic, which is a halfway house. Kant, it is true, considered both geometry and arithmetic to be synthetic. The received view in the twentieth century was more influenced by Frege, however. 10 Geometry may be synthetic, but arithmetic is analytic.
Let us start by asking if there could be alternative pure arithmetics in the sense that there are alternative pure geometries. Normal arithmetic is the set of sentences of the usual first-order language that are true in the standard model, the natural numbers, 0, 1, 2, …, as subject to the usual arithmetic operations. We may take an alternative arithmetic simply to be one that is inconsistent with this. In other words,
end p.158
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
we form an alternative arithmetic by throwing in something false in the standard model. Naturally, if consistency is to be preserved, other things must be thrown out. There are two possibilities here.
The first is that we retain all the axioms of Peano Arithmetic, but add the negation of something independent of Peano Arithmetic but true in the standard model. We then have a theory that has a classical non-standard model. 11 As a rival arithmetic, such theories are a little disappointing, however. For as is well known, any model of such a theory must have an initial section that is isomorphic to the standard model. In a sense, then, such theories are not rivals to standard arithmetic, but extensions.
The second, and more radical, way of obtaining a non-standard arithmetic is to add something inconsistent with the Peano axioms, and jettison some of these. (This is the analogue of how non-Euclidean geometries were initially produced.) In principle, this could produce many different systems, but I know of only one to be found in the literature. This jettisons the axiom which says that numbers always have a successor, and adds its negation, producing a finite arithmetic. 12 Although there are such systems, then, there is no well worked-out theory of their general structure.
10.5 MODELS OF THE INCONSISTENT
A more radical way still of producing a non-standard arithmetic, for which there is now a general theory, is to drop the consistency requirement. We may then add the negation of something true in the standard model and jettison nothing. Let us approach the issue via a little history.
A driving force behind the development of mathematics can be seen as the extension of the number system in such a way as to provide solutions to equations that had no solution. Thus, for example, the equation x + 3 = 2 has no solution in the natural numbers. Negative numbers began to be used for this purpose around the fifteenth century. Or consider the equation x 2 = −1, which has no solution in the domain of real numbers. This occasioned the introduction of complex numbers a little later. In each case, the old number system was embedded in a new number system in which hitherto insoluble equations found roots.
Now consider Boolean equations. A Boolean expression is a term constructed from some of an infinite number of variables, p, q, r, … by means of the functors , V and - (complementation). A Boolean equation is simply an equation between two Boolean expressions. The simplest interpretation for this language is the two-element
end p.159
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Boolean algebra, 2 , whose Hasse diagram is:
•
( is interpreted as meet, V as join, and - as order inversion). Within this interpretation many Boolean equations have solutions. For example, the equation p V −p = q is solved by q = ⊺ and p = ⊺ (or ). But many equations have no solutions, e.g. p = −p. It is natural, then, to extend the algebra to one in which all equations have solutions. The simplest such one is that whose Hasse diagram is as follows:
•
(where operations are interpreted in the same way; in particular, μ is a fixed-point for −). In this structure the equation p = −p is solved by p = μ. More generally, it is not difficult to check that if every variable is assigned μ, any Boolean expression evaluates to μ. Hence every Boolean equation has a solution.
The construction here is a special case of a more general one. The three-element algebra just given, 𝒟 3 , is a De Morgan lattice with a fixed point for negation. By the same argument as before, every Boolean equation has a solution in an algebra of this kind. The general result is that every De Morgan algebra (and a fortiori, every Boolean algebra) can be embedded in such an algebra. 13 The result is analogous to one of the fundamental theorems of algebra, that every field can be extended to one over which all equations have solutions (i.e. all non-constant polynomials have roots), an algebraic closure. (The algebraic closure of the reals is, of course, the field of complex numbers.)
Algebras have many applications. The most obvious application of 𝒟 3 in a philosophical context is to provide a structure of truth-values for a propositional language. In this context, the fact that every Boolean expression can be evaluated to μ means that, provided we take the designated values of our logic to be those ≥ μ, every
end p.160
sentence is satisfiable; as, more generally, is every set of sentences. (The interpretation where everything takes the value μ is, of course, a rather uninteresting one. But there will be many others. The point is simply to demonstrate the existence of models.)
In terms of logics, the above algebraic structure is exactly the logic LP, 14 and the value μ is normally thought of as both true and false. The truth-value μ might seem a rather odd one. One might be tempted to call it an imaginary truth-value, for the same reason that √ − 1 was called imaginary. But there is nothing really imaginary about imaginary numbers. Mathematically speaking they are just as bona fide as real numbers. Indeed, they even have applications in physics. Leave quantum mechanics out of this; even in classical physics, magnitudes such as impedance are given by complex numbers. Similarly, mathematically speaking, there is nothing imaginary about μ. And like imaginary numbers, μ may even have important applications, for example, as the truth-value of paradoxical self-referential sentences, α (for which we often appear to have α ↔ ¬α).
Anyway, as we have now seen, every set of sentences, even inconsistent ones, has a model in the logic LP (at least the trivial one, where everything takes the value μ, but in general many non-trivial models too). 15 In particular, and to return to the main issue, so do inconsistent arithmetics.
10.6 INCONSISTENT ARITHMETICS
The fact that there are logical models of inconsistent arithmetics does not tell us anything about their arithmetical structure. In fact, they have an interesting common structure. What is this? I will restrict myself here to describing the situation for the simplest finite models. The general case (even the general finite case) is more complex. 16
In every such model there is a tail, T, of numbers, 0, …, n − 1, that behave consistently. The numbers from n onwards form a cycle with some period, p (> 0), so that for any i ≥ n, i + p = i. (But since this is a model of arithmetic i + p ≠ i also.) These models may therefore be characterized by two parameters: the tail, T, and the period, p. The trivial model is the special case in which T = φ and p = 1.
Every structure of the kind described is a non-standard arithmetic, and the numbers in it are analogous to the points and lines in a non-standard geometry. Since each of the non-standard arithmetics extends ordinary arithmetic, every equation that has a solution in standard number theory has a solution in it. But, as might be expected, there are equations that have no standard solution but which have solutions in non-standard arithmetics. Let a numerical expression be any term formed from variables,
end p.161
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
x, y, …, and the constant 0, using the functors +, × and ′ (successor). 17 (I will use m for the numeral of m.) An equation is any identity of the form s = t, where s and t are arithmetic expressions. It is not difficult to establish that in the trivial model, every arithmetic equation has a solution (viz. x = y = … = 0.) But many families of equations have general solutions in non-trivial models. For example, any equation of the form m = n has a solution in a model where m, n T and m = n (mod p). And any equation of the form s(x) = t(x), where x occurs in s and t, has a solution in every model where p = 1 (namely the least inconsistent number). 18
10.7 EMPIRICAL APPLICATIONS
We have seen that there are alternative pure arithmetics, in the same sense that there are alternative pure geometries: abstract mathematical structures dealing with objects (numbers or points) that behave in a way recognizably similar to the corresponding objects of the standard theories. But, it might be suggested, this is as far as the similarity goes. There is no possibility of applying a non-standard arithmetic to an empirical situation in the same way. There could be no such thing as applied non-standard arithmetic in the same way that there is applied non-Euclidean geometry. Is this so? I take the answer to be ‘no’. There are theoretical arguments for this; we will come to these when I turn to logic. For the present I will demonstrate the point by telling a story to show how we might come to replace a standard application of ordinary arithmetic with a different one. There are a few such stories already in the literature, notably, those of Gasking (1940). But I will give one which might motivate the adoption of one of the inconsistent arithmetics that we met in the last section. 19 I do not want to claim that the situation described is a possibility in any real sense: it is merely a thought experiment. But if it succeeds it will show the required conceptual possibility. 20
Let us suppose that we come to predict a collision between an enormous star and a huge planet. Using a standard technique, we compute their masses as x 1 and y 1 , respectively. Since masses of this kind are, to within experimental error, the sum of the masses of the baryons (protons and neutrons) in them, it will be convenient to take a unit of measurement according to which a baryon has mass 1. In effect, therefore, these figures measure the numbers of baryons in the masses. After the collision, we
end p.162
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
measure the mass of the resulting (fused) body, and obtain the figure z, where z is much less than x 1 + y 1 . Naturally, our results are subject to experimental error. But the difference is so large that it cannot possibly be explained by this. We check our instruments, suspecting a fault, but cannot find one; we check our computations for an error, but cannot find one. We have a puzzle. Some days later, we have the chance to record another collision. We record the masses before the collision. This time they are x 2 and y 2 . Again, after the collision, the mass appears to be z (the same as before), less than x 2 + y 2 . The first result was no aberration. We have an anomaly.
We investigate various ways of solving the anomaly. We might revise the theories on which our measuring devices depend, but there is no obvious way of doing this. We could say that some baryons disappeared in the collision; alternatively, we could suppose that under certain conditions the mass of a baryon decreases. But either of these options seems to amount to a rejection of the law of conservation of mass(-energy), which would seem to be a rather unattractive course of action.
Then someone, call them Einquine, fixes on the fact that the resultant masses of the two collisions were the same in both cases, z. This is odd. If mass has gone missing, why should this produce the same result in both cases? An idea occurs to Einquine. Maybe our arithmetic for counting baryons is wrong. 21 Maybe the appropriate arithmetic is one where z is the least inconsistent number, and p (the period of the cycle) = 1. For in such an arithmetic x 1 + y 1 = x 2 + y 2 = z, and our observations are explained without having to assume that the mass of baryons has changed, or that any are lost in the collisions! Einquine hypothesizes that z is a fundamental constant of the universe, just like the speed of light, or Planck's constant. 22
While she is thus hypothesizing, reports of the collisions start to come in from other parts of the galaxy. (The human race had colonized other planets some centuries before.) These reports all give the masses of the two new objects as the same, but all are different from each other. Some even measure them as greater than the sum of their parts. Einquine is about to give up her hypothesis, when she realizes that this is quite compatible with it. Even if the observer measures the mass as z′, provided only that z′ > z then z = z′, and their results are the same!
But this does leave a problem. Why do observers consistently record result that differ from each other? Analysing the data, Einquine sees that values of z (hers included), are related to the distance of the observer from the collision, d, by the (classical) equation z = z 0 + kd (where z 0 and k are constants). In virtue of this, she revises her estimate of the fundamental constant to z 0 , and hypothesizes that the effect of an inconsistent mass of baryons on a measuring device is a function of its distance from the mass. Further observational reports bear this hypothesis out; and Einquine starts to consider the mechanism involved in the distance effect.
We could continue the story indefinitely, but it has gone far enough. For familiar reasons, there are likely to be theories other than Einquine's that could be offered for the
end p.163
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
data. Some of them might preserve orthodox arithmetic by jettisoning conservation laws, or by keeping these but varying some physical auxiliary hypotheses. Others might modify arithmetic in some other, but consistent, way. 23 And each of these theories might become more or less plausible in the light of further experimentation, etc. But the point is made: it is quite possible that we might vary our arithmetic for empirical reasons. There can be alternative applied arithmetics, just as there are alternative applied geometries. 24
An applied arithmetic is, in effect, a theory concerning certain relations of magnitude. As with geometry, there are various ways in which one might understand such a theory. One can tell a realist story: collections of physical objects have a certain physical property, namely size; and sizes, together with the operations on them, have the same objective structure as the numbers and corresponding operations in the non-standard arithmetic. Hence, facts about the mathematical structure transfer directly to the physical structure, and this is why it works. Or one could tell instrumentalist stories of various obvious kinds. For example, Gasking (1940), runs a conventionalist line on arithmetic that is similar to Poincaré's on geometry. We need not pursue these matters further here, though.
10.8 REVISING LOGIC: THE CASE OF SYLLOGISTIC
We have seen that there are numerous pure geometries and arithmetics. We have also seen that, when applied, a geometry or arithmetic is a corrigible theory of some kind. Let us now turn to the question of whether the same is true of logic. I will argue that it is. Indeed, it may fairly be pointed out that I have already assumed this, since the argument concerning arithmetic was partly based on arithmetics whose underlying logic is non-classical.
First, there are numerous pure logics. This point I take to be relatively uncontentious. There are the many-valued logics that Łukasiewicz invented, not to mention others such as intuitionism, quantum logic, and paraconsistent logic (one of which, LP, we met in the preceding sections). Possibly, a purist might say that they are not logics since they are not the real logic. But that would be like saying that non-Euclidean geometries are not geometries since they are not the real geometry. In both
end p.164
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
cases we have a family of structures (logics or geometries) that are perfectly well-defined mathematical structures; and, as far as that goes, all on a par.
The more contentious question is whether in logic, as in geometry and arithmetic, which one we apply may be a corrigible matter. Of course, a pure logic can have many applications (as may pure geometries and arithmetics). For example, standard propositional logic may be used to test inferences or simplify the design of electrical circuits; and for some of these applications (e.g. the latter) the question of which logic is correct may well be a theory-laden and corrigible matter. But when talking of application here, what we are talking about is what one might call the canonical application. The canonical application of geometry is in physical geometry; the canonical application of arithmetic is to counting and measuring; the canonical application of logic is to reasoning.
Now, there is a well known and very famous argument to the effect that all applied theories are revisable in principle, and so corrigible. I refer, of course, to that contained in Quine's celebrated paper ‘Two Dogmas of Empiricism’ (1951). The argument is to the effect that there is no principled way of drawing a line between the revisable and the unrevisable theories; and hence the sciences traditionally thought of as a priori, such as geometry and arithmetic are just as revisable as psychology and physics. 25 Quine also mentions logic explicitly in this context, but does not pursue the matter at great length. 26 Different arguments for similar conclusions were put forward by Putnam in (1962). Moreover, Putnam went on to develop the conclusion for logic. In (1969), drawing on the example of non-Euclidean geometry explicitly, Putnam argued that the situation in quantum physics might well occasion the replacement of classical logic by quantum logic—at least in reasoning about the micro-world.
We need not work through the details of Putnam's example here. To show that the received logic may change it is not necessary to argue for the possibility of this. One can point to a place where it has actually happened. 27 When Kant defended the incorrigibility of logic in the Critique of Pure Reason, 28 he defended Aristotelian logic—by which I mean, here, not Aristotle's logic, as found in the Organon, but the somewhat pathetic remnants of medieval logic, based on the theory of the syllogism, which was orthodox in the eighteenth and nineteenth centuries. 29 Within 150 years, this had been replaced by the entirely inappropriately called ‘classical logic’ of Frege and Russell.
That Aristotelian and classical logic are distinct will hardly be denied. But it might well be suggested that the adoption of classical logic did not revise Aristotelian logic in any interesting sense: Aristotelian logic was perfectly correct as far as it went; it was
end p.165
just incomplete. Classical logic simply extended it to a more complete theory. Such a suggestion would be false. It is a well-known fact, often ignored by philosophers (though not, perhaps, historians of philosophy) that Aristotelian logic is incompatible with classical logic in just the same way that non-Euclidean geometries are incompatible with Euclidean geometry. A central part of Aristotelian logic is syllogistic, and the most natural translation of the syllogistic forms into classical logic is as follows:
AaB All As are Bs x(Ax Bx)
AeB No As are Bs ¬ x(Ax Bx)
AiB Some As are Bs x(Ax Bx)
AoB Some As are not Bs x(Ax ¬ Bx)
Given this translation, Aristotelian syllogistic gives verdicts concerning the validity of some syllogisms that are inconsistent with classical logic. Consider the inferences called by the medievals Darapti and Camestros, which are, respectively:
All Bs are Cs
All Bs are As
Hence some As are Cs
All Cs are Bs
No As are Bs
Hence some As are not Cs
Both of these are valid syllogisms. Both are invalid in classical logic.
The problem is, of course, one of existential import. Some syllogisms seem to presuppose that various categories are instantiated. 30 It is sometimes suggested that the problem can be repaired by adding the import to the translations explicitly. Specifically, we add the clause xAx to each of the a and e forms. (It would be redundant in the other two.) This is, indeed, sufficient to render all the syllogistic forms classically valid, but the problem with this is that it invalidates other central parts of Aristotelian logic, notably, the square of opposition. The square is: edskip
AaB AeB
AiB AoB
where the claims on the top line are contraries; on the bottom line are sub-contraries; and on both diagonals are contradictories. Now it is clear that, once the a form is augmented with existential import, a and o are not contradictories: both are false if there are no As. For the same reason, neither are e and i.
Another suggested repair is to add existential import to the a form (but not the e), and take the o form to be its negation ( x(Ax ¬Bx) V ¬ xAx). This validates all the syllogisms and the square of opposition. The oddity of taking ‘some As are not Bs’ to
end p.166
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
be true if there are no As is clear enough. But more importantly, this repair invalidates another part of the traditional logic: the inferences of obversion. Specifically, obversion permits the inference from ‘no As are Bs’ to ‘all As are non-Bs’; which fails if the e form is not existentially loaded. Obversion is not in Aristotle, but it is a perfectly standard part of traditional logic.
It is sometimes suggested that, rather than adding existential import to the translations explicitly, we should take the instantiation of all the categories involved to be a global presupposition. This is a move of desperation. If it is correct, then we cannot use syllogistic to reason, e.g. in mathematics, where we certainly do not make such presuppositions. I don't think that the traditional logicians who endorsed syllogistic believed this. Moreover, if we were to allow validity to have contingent presuppositions, pretty much anything could be made to be valid. 31
More importantly, the suggestion really will not save syllogistic. 32 All winged horses are horses, and all winged horses have wings. Applying Darapti, we may infer that there are some winged horses. The argument clearly generalizes. All As are As. 33 A fortiori, all ABs are As; and symmetrically, all ABs are Bs. By Darapti it follows that there are some ABs. Thus syllogistic allows us to prove that any two categories intersect. And if it be replied that this is just one of the global presuppositions, take B to be Ā, the complement of A (non-A). It can hardly be maintained that Aristotelian logic globally presupposes contradictions. This argument requires the use of compound terms. Again, these are not in Aristotle, but are an established part of traditional logic. 34
What we have seen is that, however one interprets traditional logic in classical logic, something has to be given up. Moreover, this is quite essential. For as the last argument shows, traditional logic is, in fact, inconsistent. At any rate, classical logic is not (just) a more generous framework subsuming traditional logic. Prevarication aside, modern logic has given the thumbs-down to Darapti and its ilk.
10.9 QUINE AND MEANING-VARIANCE
As Quine insisted, then, there can be rival logics for canonical application, just as there can be rival geometries and arithmetics for their canonical applications. Some years after writing ‘Two Dogmas’, Quine developed his views on radical translation, with some corollaries for logic. And many commentators have viewed these as a partial reneging on his earlier radical views concerning logic. 35 The views are spelled out first
end p.167
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
in Word and Object (1960), section 13; but they are put most pithily in Philosophy of Logic, thus:
To turn to a popular extravaganza, what if someone were to reject the law of non-contradiction and so accept an occasional sentence and its negation as both true? An answer one hears is that this would vitiate all science. Any conjunction of the form ‘p. p’ logically implies every sentence whatever; therefore acceptance of one sentence and its negation as true would commit us to accepting every sentence as true, and thus as forfeiting all distinction between true and false.
In answer to this answer, one hears that such full-width trivialisation could perhaps be staved off by making compensatory adjustments to block this indiscriminate deducibility of all sentences from an inconsistency. Perhaps, it is suggested, we can so rig our new logic that it will isolate its contradictions and contain them.
My view of the dialogue is that neither party knows what he is talking about. They think that they are talking about negation, ‘ ’, ‘not’; but surely the notion ceased to be recognisable as negation when they took to regarding some conjunctions of the form ‘p. p’ as true, and stopped regarding such sentences as implying all others. Here, evidently, is the deviant logician's predicament: when he tries to deny the doctrine he only changes the subject. 36
Quine has paraconsistent logic in his sights here, but the context makes it plain that the point is meant to be a general one; and the point would seem to be that a change of logic occasions a change of meaning of the connectives concerned (though Quine might not put it this way because of his scepticism about meaning).
The claim has been denied by some, 37 and I think that it would be wrong to suppose that the connectives of different logics must have different meanings. However, I think that it must be agreed that the connectives of different pure logics often do have different meanings. Consider, for example, negation in intuitionist and classical logics. Given a possible-world interpretation for a propositional language, the truth conditions of classical and intuitionist negation are, respectively:
¬α is true at world w iff α is not true at world w
¬α is true at world w iff for all w′ such that w ≤ w′, α is not true at w′
It is clear that the truth conditions are quite distinct. Since identity of meaning would certainly seem to entail identity of truth conditions, it follows that the two negations mean something different. An intuitionist would say that the first truth conditions do not succeed in defining a meaningful connective at all. I do not think that this is right; but even if it were, it would still follow that the two connectives do not mean the same: one is meaningful; the other is not.
It would be wrong, however, to suppose that Quine takes the meaning variance of connectives across different logics to be an argument against his earlier view concerning the revisability of logic. A few pages after the passage just quoted, he says (p. 83):
By the reasoning of a couple of pages back, whoever denies the law of excluded middle changes the subject. This is not to say that he is wrong in so doing. In repudiating ‘p or p’ he is indeed giving up classical negation, or perhaps alternation, or both; and he may have his reasons.
end p.168
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
And summing up some of the lessons learned in the book he reiterates explicitly the revisability of logic (p. 100):
… I am committed to urge the empirical nature of logic and mathematics no more than the unempirical character of theoretical physics; it is rather their kinship I am urging, and a doctrine of gradualism … A case in point was seen … in the proposal to change logic to help quantum mechanics. The merits of the proposal may be dubious, but what is relevant now is that such proposals have been made. Logic is in principle no less open to revision than quantum mechanics or the theory of relativity.
Moreover, Quine is absolutely correct to insist that the views concerning meaning-change do not render rivalry and revision impossible. One way to see this is to recall that a number of influential writers in the philosophy of science, such as Kuhn (1962) and Feyerabend (1975), argued for a version of meaning variation for scientific theories. According to them, the meanings of terms in scientific theories are defined by the scientific principles in the theory, and thus, e.g. ‘mass’ in Newtonian mechanics means something different from ‘mass’ in relativistic mechanics. They concluded from this that the theories are incommensurable, i.e. that no direct comparison of content is possible between them. But they did not infer that the theories are not rivals. They obviously are: they give different and incompatible accounts of, e.g. motion, to the point of making inconsistent predictions. 38
So, if Quine is not arguing against the possibility of rivalry in logic, what point is he making? Let us return to that in a moment. It will pay us, first, to look more closely at how rivalry in logic manifests itself.
10.10 RIVALRY
As pure logics, no logic is a rival of any other. They are all perfectly good abstract theories. It is only when we apply them that a question of rivalry occurs. To give a theory canonical application, it is crucial that we forge some link between it and the practices that give the application life. 39 In the case of geometry, we need to relate the objects of the theory to our practices of measuring lines and angles; in the case of arithmetic, we need to relate the objects of the theory to our practices of counting. And in the case of logic, we need to relate the theory to our practices of inferring.
Inferring is a practice carried out in the vernacular; maybe the vernacular augmented with a technical vocabulary (such as that of chess, physics or whatever); maybe the
end p.169
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
vernacular augmented with mathematical apparatus; but the vernacular nonetheless. 40 Hence, to apply a logic, we must have some way of identifying structures in the formal language with claims in the vernacular. (Whether these are sentences, propositions, thoughts or whatnot, we need not go into here.) Notoriously, in the case of logic, the mode of identification is largely tacit. It is a skill that good logicians acquire, but no one has ever spelled out the details in general. It is simple-minded, for example, to suppose that every sentence with a ‘not’ in is adequately represented in the language of a logical theory by whatever represents the sentence with ‘not’ deleted, and prefixed with ‘¬’. For a start, the negation of ‘All cows are black’ is not ‘All cows are not black’. And as we have had a couple of occasions to observe earlier in the book, English uses ‘not’ for functions other than negation (as in ‘I'm not British; I'm Scottish’). Similar points can be made concerning ‘ ’ and ‘and’, and ‘V’ and ‘or’. 41
Rivalry in geometry, arithmetic or logic, occurs when different theories, each relative to the way it is applied, deliver different, incompatible, verdicts. In the case of logic, this happens when incompatible verdicts are given concerning the validity of some inference. For example, consider the inference ‘If there is not a greatest prime number then there is a greatest prime number. Hence there is a greatest prime number.’ Given the usual identifications, classical logic says that it is correct to draw this conclusion; intuitionist logic that it is not. Or take the inference ‘107 both is and is not the greatest prime number. Hence Fermat's Last Theorem is true.’ Classical logic says that this is a correct conclusion to draw; paraconsistent logic that it is not.
10.11 QUINE IN DEFENCE OF CLASSICAL LOGIC
Let us now return to Quine's observations and ask, assuming them to be correct, what their point is supposed to be. The deviant's logician's ‘predicament’ is that if they change the logic they change the subject; but if this does not rule out rivalry in logic, why, exactly, is it a predicament? I take the answer to be this. The deviant logician may want to change logics and meanings—and may be justified in doing so—but what they cannot do is say that the principles of (the received) logic are false. If someone denies one of the principles, then either their words mean something different, or what they say is (logically) false.
The observation is clearly made in defence of classical logic against its rivals. It is, after all, supposed to be a predicament for the deviant logician, not the classical one. 42 But the first thing to note about the above point is that it is entirely symmetrical. The
end p.170
classical logician is in exactly the same situation with respect to the rival logic: when they deny the principles of that, they, too, mean something different, or what they say is (logically) false.
So why does Quine think that the observation tells against non-classical logic? The answer is that he assumes that the logical constructions of the vernacular are those of classical logic. This is demonstrated by the passage in the original quotation where he lists negation, ‘ ’, and ‘not’ as the same thing, without further comment. Even granting that negation and ‘not’ are the same (which they are not, as I have already observed), ‘ ’ is a sign of a formal language with a certain semantics (classical for Quine), whereas negation is a notion from vernacular reasoning. 43
But Quine's assumption is exactly what a partisan of a logic other than classical is likely to take issue with. Someone who rejects classical logic, say a paraconsistent logician, need not deny that the (classical) meaning of ‘ ’ is sufficient to guarantee the validity of inference p. p q in classical logic (the pure abstract theory); 44 what they will certainly deny is that this is the meaning of negation, as it occurs in vernacular reasoning, about, say, the claim ‘This sentence is not true’. According to them, the semantics of their pure logic is the correct semantics for vernacular negation. Seen in this way, a dispute between rival logics is, then, exactly a dispute over meanings. 45 This may surface, for example, as a dispute over the truth conditions of vernacular negation. It is therefore entirely open to someone who holds that our theory of logic is revisable to hold that the correct logical principles are analytic, that is, true solely in virtue of the meanings of the logical connectives employed; 46 it is just that which principles are analytic is a corrigible and theory-laden issue.
At any rate, as we see, Quine's defence of classical logic completely begs the question, as does any logician who claims that classical logic is right by definition. 47 The question is: which definition? And this is to be settled by investigations of the kind standardly engaged in in trans-logical debates.
10.12 LOGIC AND TRANSLATION
As I observed earlier, what underlies Quine's views about the meaning-variance of connectives in different logics are certain of his views concerning radical translation. These are brought to bear on the issue explicitly in the section in Philosophy of Logic immediately following the one I have just been discussing. Here, Quine appeals to aspects of radical translation to try to establish that one cannot attribute to a speaker any view incompatible with classical logic.
end p.171
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
The argument is to the effect that when we translate the utterances of another, it is a constraint on correct translation that we cannot take them to be denying things that we take to be obvious (since they must be equally obvious to them). But logic is obvious. Hence we cannot attribute to them beliefs we take to be logically false. As he sums up the argument ((1970), 83):
… logical truth is guaranteed under translation. The canon ‘Save the obvious’ bans any manual of translation that would represent the foreigners as contradicting our logic (apart perhaps from corrigible confusions in complex sentences).
In the present context, several comments on this argument are pertinent. First, even if the argument is right, the situation concerning classical and non-classical logicians is entirely symmetrical. If a classical logician (“we”) cannot impute the denial of a classical logical truth to an intuitionist or paraconsistent logician (a “foreigner”), then they cannot impute a denial of what they take to be a logical truth to a classical logician. The argument cannot, therefore, be used as an argument for classical logic. 48
Secondly, the argument is, in any case, badly deficient. There are two reasons for this. The first is that the claim that logic is obvious is mind-numbingly false. That certain principles hold in various pure logics may well be obvious. That they are so in the vernacular is not at all true—or what is the debate between rival logics all about? From a modern perspective, it may seem obvious that Darapti is invalid. It was not obvious to logicians for the previous 2,000 years that this was so—or if it was, they kept very quiet about it. History has taught us that what seems obvious may well not be so. Indeed, it may be false. Quine can assume that logic is obvious only because he has tacitly identified logic with the received theory 49 —an error I have already commented on.
The second failure of the argument is due to the fact that the rule that the obvious should be built into translation is only a defeasible rule, and may be defeated by other aspects of the context. Someone standing in the pouring rain may yet deny that it is raining, since they may believe that they are in a virtual-reality machine. Less fancifully, people in the fifteenth century took it to be obvious that the earth is stationary (apart from occasional quakes and tremors). When Galileo said otherwise, they did not translate his beliefs away: they showed him the thumbscrews. Similarly, and closer to home, when writers such as Hegel and Engels explicitly endorse contradictions, we do not automatically say that they cannot mean, or believe, what they say. In the light of the rest of what they say, we may well conclude that this is exactly what they believe. 50
end p.172
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
For all these reasons, appeal to radical translation will not help, one iota, in defending classical logic.
10.13 REALISM VS. INSTRUMENTALISM IN LOGIC
We are now finished with Quine. Let us return to the analogy between logic, geometry, and arithmetic. An applied geometry is, in effect, a theory of spatial relations; an applied arithmetic is a theory of relations of magnitude; and an applied logic is a theory of logical relations. In particular, then, when we change our logical theory, it does not follow that what principles of logic are valid also changes, any more than when we change our theory of geometry, the geometry of the cosmos changes. The fact that we may use the word ‘logic’ to apply both to our theory and the concerns of that theory may encourage this confusion. 51 The word ‘geometry’ is similarly ambiguous; but in a post-Kantian age only the naive would suffer from the corresponding confusion.
What, though, makes a theory of logic correct? Just as with geometry and arithmetic, one may tell both realist and non-realist stories here. In the next chapter I will argue for a realist account. But for the present, I will just chart some of the positions in conceptual space.
According to a simple instrumentalist story, logic is a tool that we use for inferring; and what makes a logical theory right is that, combined with our other theories, it produces the right results. ‘Right’ here is to be understood as meaning that the things that we deduce which are subject to relatively direct testing, test positive. Thus, for example, if I deduce the existence of something satisfying a certain condition (an atomic particle, or a number), and investigation fails to find such a thing, then the prediction would appear to be dubious. Of course, as Quine (of ‘Two Dogmas’) would have pointed out, we can decide to hold on to any predictions our theory makes if we make suitable changes elsewhere—which we may well determine to do. What settles the most appropriate theory, then, is simply that which is overall simplest, most adequate to the data, least ad hoc, and so on. 52
As in geometry and arithmetic, this simple instrumentalism may be augmented by some kind of conventionalism. Even though the correct logic is a matter of choice, one may still regard claims of logic as true or false. What makes them so is the convention of our choice. This is conventionalism in logic, of the positivist variety.
The realist story is different. What makes a theory the right theory is that it correctly describes an objective, theory-independent, reality. In the case of logic, these are logical relationships, notably the relationship of validity, that hold between propositions (sentences, statements, or whatever one takes truth-bearers to be). But what are these logical relationships? Several answers are possible here. Perhaps the simplest is one according to which logical truths are analytic, that is, true solely in virtue of the meanings of the connectives, where these meanings are Fregean and objective. Logical relationships are therefore platonic relationships of a certain kind.
end p.173
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
A different, and more common story, which avoids reification of meanings, is Tarski's. 53 Interpretations are set-theoretic structures of a certain kind; truth (or maybe satisfaction) in an interpretation is given a recursive definition; and validity is defined as truth-preservation in all interpretations. The facts of validity are then defined by general facts about what interpretations there are. Depending on one's views, this may or may not be a partly empirical question—and if it is, logic turns out to be an a posteriori subject (in the traditional sense), as well as corrigible.
If a realist account of the nature of logical relations is given, then, whatever that is, the question arises as to the criteria one should use to determine which theory is correct. The answer to this, as explained in the previous part of the book, is that one decides on the basis of which theory is overall simplest, most adequate to the data, least ad hoc, and so on. These are, of course, exactly the same criteria as are used by the instrumentalist. Thus, although there are profound ontological differences between instrumentalists and realists, when it comes to the decision as to which logic is correct, all parties may appeal to the similar considerations. This is why the debates between various logical theories may well be, and often are, conducted largely independently of the ontological issue.
10.14 LOCALISM VS. GLOBALISM
Let me finish by commenting on another issue, that of whether the correct logic is global or local. Is the same logical theory to be applied in all domains, or do different domains require different logics? This question is orthogonal to the realist/instrumentalist issue. Since instrumentalists are guided purely by pragmatic concerns, localism sits well with it; but it is quite possible for an instrumentalist to insist that logic is, by its very nature, domain-neutral, and so be a globalist. For example, Ryle (1954) argues that topic-neutrality is a necessary condition for a construction to be the concern of formal logic. Globalism is probably the more natural position for a realist, but it is quite possible for a realist to insist that the nature of the logical relationships between statements about one domain is different from that between statements concerning another. Thus, it is quite possible for a realist to be a localist.
In the end, I doubt that there is a serious issue here. Even if modes of legitimate inference do vary from domain to domain, there must be a common core determined by the syntactic intersection of all these. 54 In virtue of the tradition of logic as being domain-neutral, this has good reason to be called the correct logic. But if this claim is rejected, even the localist must recognise the significance of this core. Despite the fact that there are relatively independent domains about which we reason, given any two domains, it is always possible that we may be required to reason across domains, for
end p.174
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
example about the relationship between the macro-domain and the micro-domain, or between mathematical objects and physical objects. Now, if α is a statement about one domain, and β about another, the only logical relationships that we can count on, e.g. α β or α → β satisfying are those that are common to the two domains.
But conversely, even a globalist may admit that when applying logic to certain domains, the generally valid inferences may legitimately be augmented by others. An intuitionist, for example, may agree that it is permissible to use all of classical logic to reason about the finite (or at least, the decidable); and a paraconsistent logician may hold that it is permissible to use all of classical logic when reasoning about the consistent. Thus, there would seem to be no significant disagreement between localists and globalists about the facts; only about how to describe them.
10.15 CONCLUSION
We started our discussion of geometry, arithmetic, and logic with Kant. For him, these were the three great a priori sciences. The appearance of non-Euclidean geometries, and, in particular, the adoption of one of them as a physical geometry early in the twentieth century made it impossible to subscribe to the Kantian picture. There are many pure geometries; each, when applied, provides a theory of corrigible status. We have seen that exactly the same is true of arithmetic and logic. There are many pure arithmetics and logics; any one of these may be applied to provide a theory of some appropriate kind; and no theory can claim incorrigibility in this matter. Łukasiewicz' analogy between non-Euclidean geometries and non-standard logics is therefore a highly apt one.
Logics, then—that is, our theories of logic—are fallible theories. As we noted in 10.13, this raises the question of how such theories should be interpreted. Should one understand them instrumentally, or realistically, or perhaps in some other way? And if realistically what is the reality in question? This is the topic to which we turn in the next chapter.
end p.175
11 Validity
show chapter abstract and keywords
hide chapter abstract and keywords
Graham Priest
11.1 INTRODUCTION
What is logic? Uncontroversially, logic is the study of reasoning. Not all the things that might fall under that rubric are logic, however. For example, the way that people actually reason may, in some profound sense, be part of the ultimate answer to the question of the nature of logic (think of Wittgenstein in the Investigations), but logic is not about the way that people actually think. The reason for this is simple: as a rich literature—if not common sense—now attests, people frequently reason illogically. 1 Logic does not tell us how people do reason, but how they ought to reason. We should cede the question of how people actually reason to psychology (and I will return to the question of normativity later).
The study of reasoning, in the sense in which logic is interested, concerns the issue of what follows from what. Less cryptically, some things—call them premises—provide reasons for others—call them conclusions. Thus, people may provide others with certain premises when they wish to persuade them of certain conclusions; or they may draw certain conclusions from premises that they themselves already believe. The relationship between premise and conclusion in each case is, colloquially, an argument, implication, or inference. Logic is the investigation of that relationship. A good inference may be called a valid one. Hence, logic is, in a nutshell, the study of validity.
But what is validity? Beyond a few platitudes, it is not at all clear how one should go about answering this question. It is not even clear what notions may be invoked in an answer: truth, meaning, possibility, something else? In this chapter I will provide an answer. In a nutshell, I will argue that validity is the relationship of truth-preservation-in-all-situations. Exactly how that relationship behaves is a question that can be answered only by the articulation of a logical theory of the kind discussed in the last chapter. As I argued there, each pure logic, when given its canonical interpretation, can be thought of as a theory concerning the behaviour of certain notions; specifically, those notions that are standardly deployed in logic. Validity is undoubtedly the most important of these—to which all the others must relate in the end. (We have looked at another of these, negation, in Chapter 4.) Though the account leaves many details to be filled in, it is at least sufficient to answer the question of what logical theories are theories of—truth preservation of the appropriate kind—and so, as we will see, to show that they sustain a realistic interpretation.
end p.176
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
11.2 VALIDITY: A FIRST PASS
What, then, is validity? Doubtlessly, a valid inference is one where the premises provide some genuine ground for the conclusion. But what does that mean? Traditionally, logic has distinguished between two notions of validity: deductive and non-deductive (inductive). A valid deductive argument is one where, in some sense, the conclusion cannot but be true, given the premises; a valid inductive argument is one where there is some lesser degree of support. Standard examples illustrate the distinction clearly enough. One might well ask why the notion of validity falls apart in this way, and what the relationship is between the two parts. I will come back to the whole issue of inductive inference later in the chapter, and give a uniform account of validity, both deductive and inductive. For the present, let us simply accept the distinction between the two notions of validity as a given, and focus on deductive validity.
11.3 PROOF-THEORETIC CHARACTERIZATION
What, then, is a deductively valid inference? Modern logic standardly gives two, very different, sorts of answer to this question: proof-theoretic and model-theoretic (semantic). In the proof-theoretic answer, one specifies some basic rules of inference syntactically. A valid inference is then one that can be obtained by chaining together, in some syntactically characterizable fashion, any of the basic rules. The whole process might take the form of a Gentzen system, a system of natural deduction, or even (God help us) an axiom system.
Such a characterization may undoubtedly be very useful. But as an answer to the main question it is of limited use, for several reasons. The first is that there seem to be languages for which the notion of deductive validity is provably uncharacterizable in this way. Second-order logic is the obvious example. This has no complete proof-theoretic characterization. Given certain assumptions, the same is true of intuitionistic logic too. 2
Possibly in these cases—especially the intuitionistic one—one might simply reject the semantic notion of validity with respect to which the proof-theory is incomplete. But even if one does this, there is a more profound reason why a proof-theoretic characterization is unsatisfactory as an ultimate characterization of validity. We can clearly give any number of systems of rules. Some may have nothing to do with logic at all; and those that do may give different answers to the question of which inferences are valid. The crucial question is: which system of rules is the right one? The natural answer at this point is to say that the rules are those which hold in virtue of the meanings of certain notions that occur in the premises and conclusions. This appears to take us into the second characterization of validity, the semantic one. Some independent account of those meanings is given, and the appropriate proof-theory must answer to
end p.177
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
the semantics by way of a suitable soundness proof (and perhaps, also, completeness proof). And indeed, I think this way of proceeding is correct.
One may, however, resist this move for a while. One may suggest that it is a mistake to understand meaning in some independent way, but that the rules themselves specify the meanings of certain crucial notions involved. For example, one might say that the introduction and elimination rules for a connective in a system of natural deduction, specify its meaning. The problem with this was pointed out by Prior (1960). One cannot claim that an arbitrary set of rules specifies the meaning of a connective. Suppose, for example, that one could characterize a connective, * (tonk), by the rules: α α * β and α * β β. Then everything would follow from everything—hardly a satisfactory outcome.
Some constraints must therefore be put on what rules are acceptable. One might attempt some purely syntactic constraint. For example, as we noted in section 5.3, it has been suggested 3 that the rules in question must give a conservative extension. This, however, will not solve the problem. Conservativeness is always relative to some underlying proof theory. For example, as we also noted Chapter 5, adding the rules for Boolean negation to a complete proof theory for positive “classical” logic is conservative; adding them to one for positive intuitionist logic is not. One needs, therefore, to justify the underlying proof theory. Possibly, one might try to justify the rules for one of the connectives of this by considerations of conservativeness. We are obviously then in a regress situation. Given that there is a finite number of connectives, this will ground out. If the ground itself has rules of inference, nothing has been gained. Perhaps, though, the ground will contain no rules of inference. Maybe the ground is just constituted by a vocabulary of atoms, with no rules of proof.
Even in this case, however, the problem of justification does not vanish. This is for two reasons. The first is that, as we noted in section 5.4, conservativeness is not invariant under the order in which notions are added. Each of classical negation and truth is conservative with respect to an appropriate positive relevant logic (and self-reference), but once one is added, the addition of the other is no longer conservative. This makes the order in which notions are added crucial; but there would seem to be no way of justifying a non-arbitrary order.
Less obviously, perhaps, justifying the individual rules is not enough. A proof-theory allows us to put together rules in a certain way, e.g. by chaining together. There are various possibilities here, which themselves need justification. The point comes out most clearly when the proof-theory is formulated as a sequent-calculus. I will return to the issue in a moment.
A different attempt to state justificatory constraints on a proof-theory (which rules out tonk-like connectives) takes us back to the discussion of section 5.5. Suppose that we are working in a natural deduction system, and suppose that we take the introduction rule for a connective to provide a direct account of its meaning. This needs no justification: any introduction rule may serve to do this. The corresponding elimination rule is then justified by the fact that it is conservative with respect to the introduction rule. In the words of Dummett, one of the people to whom this idea is
end p.178
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
due, the introduction and elimination rules are in harmony. 4 The idea can be cashed out in terms of a suitable normal-form theorem: whenever we have an introduction rule followed by the corresponding elimination rule, both can be eliminated.
Even if this strategy can be made to work, it can provide only a partial justification for the proof-theory. For the introduction and elimination rules are superimposed on structural inferential rules; for example, the transitivity of deducibility (deductions may be chained together to make longer deductions). Such structural rules are not inevitable, 5 and the question therefore arises as to how these rules are to be justified. This becomes patently obvious if the proof-theory is formulated as a Gentzen system, where the structural rules are quite explicit, and for which there is now a well-advanced study of logics with different structural rules: sub-structural logics. 6 One needs to justify which structural rules one accepts (and which one does not), and there is no evident purely proof-theoretic way of doing this.
If, as the foregoing discussion suggests, one cannot justify every feature of a proof-theory syntactically, the only other possibility would seem to be some semantic constraint to which the rules must answer. 7 We are thrown back to the other kind of characterization of validity, the model-theoretic one. So let us turn to this.
11.4 MODEL-THEORETIC CHARACTERIZATION
A deductively valid inference is, we said, one where the premises cannot be true without the conclusion also being true. A crucial question here is how to understand the ‘cannot’. Modern logic has produced a very particular but very general way of understanding this. When we reason, we reason about many different situations; some are actual—what things are like at the centre of the sun; some are merely possible—what things would have been like had the German Axis won the Second World War; and maybe even some that are impossible—what things would be like if, per impossibile, someone squared the circle. We also have a notion of what it is to hold in, or be true in, a situation. In talking of validity, necessity is to be explicated in terms of holding in all situations. Let us use lower-case Greek letters for premises and conclusions, 8 upper-case Greeks for sets thereof, and write to indicate valid inference. Then we may define ∏ κ as:
for every situation in which all the members of ∏ hold, κ, holds
There is also a corresponding notion of logical truth: κ is logically true iff it holds in all situations.
end p.179
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
So far so good. But what is a situation, and what is it to be true in it? One could, I suppose, take it that these notions are indefinable, but this is not likely to get us very far; nor is it the characteristic way of modern logic. Using mathematical techniques, both notions are normally defined. A situation is taken to be a mathematical structure of a certain kind, and holding in it is defined as a relationship between truth-bearers and structures. Both structures and relation are normally defined set-theoretically.
No one would suppose that situations are mathematical entities, such as ordered n-tuples—at the very least, in the case of actual situations. Strictly speaking, then, set-theoretic structures represent situations. They do this, presumably, because the situations have a structure (or, at least, a pertinent structure) that is isomorphic to that of the mathematical structure. How to understand this is a question in the philosophy of mathematics. Indeed, how to understand this sort of question is the central question of the nature of applied mathematics. I shall not pursue the matter here. 9
In the case of the standard account of validity for (classical) first order logic (without free variables), a structure is a pair:
• (Q)
•
where D is the non-empty domain of quantification, and I is the denotation function. Truth in is defined in the usual recursive fashion. Or in a Kripke semantics for modal logics, a structure is a 4-tuple:
• (K)
•
where W is a set of worlds, @ is a distinguished member of W (the “base world”), R is a binary relation on W satisfying certain properties (to be employed in stating the truth conditions of □), and I is the denotation function, which assigns each atomic formula a subset of W (namely, those worlds at which it is true). Truth at a world w W is again defined in a recursive fashion. And truth in is defined as truth at @.
Once the notions of a structure and holding-in are made precise, the definition of validity can be spelled out exactly. Let us write α to mean that α holds in structure . We may also say that is a model of α. is a model of a set of truth bearers if it is a model of every member of the set. Then α is a logical truth iff every structure is a model of α. And:
• (DV)
•
I take this to be the best answer to the question of when an inference is deductive validity presently on offer.
More needs to be said about this, however. Given any proof-theory of a certain kind, there is a well-known way that a semantics of the kind just described can be obtained. We simply construct a sound and complete many-valued semantics from the Lindenbaum algebra of the logic. A situation is an assignment of values
end p.180
in the algebra to sentences, and a sentence holds in a situation just if its value is a designated one. This semantics is clearly too cheap. More generally, it reminds us of the fact that we need to distinguish those semantics that are of a kind to be genuinely informative about the meanings of the notions involved, and those that are not. Truth-functional semantics for first-order logic are of this kind, as are Kripke semantics for modal logics. The semantics obtained from the Lindenbaum algebra are not; neither are these the only ones: many logics have topological, algebraic, or many-valued semantics which, though they are useful in establishing various things about the proof-theory—especially various independence results—are of a purely “technical” nature.
The distinction between “informative” semantics and “technical” semantics is obvious enough in principle, though it is not an easy one to draw cleanly. However, for an informative semantics, we need something like this: the notions employed in the semantics must have some intuitive meaning; and it should be clear why these notions, with the properties that the semantics attributes to them, may naturally be involved with a specification of the meanings of the logical notions that occur in the proof-theory. Thus, the notion of a possible world—though contentious enough—clearly has intuitive purchase, and it is quite clear why such a notion might be used to state the truth conditions of modal operators. By contrast, given, say, a many-valued interpretation of the kind used to prove the independence of some axiom in an axiom system, the values involved may have no intuitive meaning whatever; nor, therefore, can it be clear why giving recursive conditions for a connective in terms of these values can tell us anything about the meaning of the connective.
However one finally cashes out the details, it is essential that the semantics involved in the project of giving an account of validity be of the informative kind. If they are not, then they are vulnerable to exactly the same charge of arbitrariness that I levelled against a simple proof-theoretic characterization of validity. If they are, however, they can be taken to provide an account of the meanings of the logical notions involved, of the kind required. Given an appropriate soundness proof, the semantics will provide a justification of the proof-theory—in a way that the proof-theory does not provide a justification of the semantics.
As Dummett puts it:
When soundness … fails, the remedy … must of course be sought in a modification of either the syntactic or the semantic notion of logical consequence, and, on occasion, it might be the semantic notion rather than the syntactic one that is altered. Nevertheless, the semantic notion always has a certain priority: the definition of the syntactic notion is required to be responsible to the semantic notion, rather than the other way about. The syntactic relation is defined by devising a set of primitive rules of inference, and a corresponding notion of formal deduction. If a semantic notion can be defined with respect to which a soundness proof can be given, we then have a reason for regarding the primitive rules of inference as valid: until then, we have only an intuitive impression of their validity. 10
end p.181
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
11.5 FILLING IN THE DETAILS
( DV ) is the form of a definition of validity. It leaves many details to be filled in. These depend, for a start, on the language employed in formulating the premises and conclusions. More importantly, as I indicated in section 11.1, the details cannot be filled in without constructing an adequate logical theory for that language, and hence without resolving philosophical issues of a very substantial kind. This is not the place to go into these, but let me just point out some of the things at issue in the process.
A major question to be answered is: how, exactly, are situations structured? This question cannot be divorced from that of how to define the relation of being true in a structure. 11 For example, should the truth conditions of the conditional employ an ordering relation on worlds, as occurs in Kripke semantics for intuitionist logic, a ternary relation, as occurs in many relevant logics, or none of these, as in classical logic? If either of the first two of these is correct, then the relation will have to be a part of the structure. Another example: assuming that the truth/falsity conditions for a predicate are to be given in terms of its extension (those things it is true of) and anti-extension (those things it is false of), are we to suppose that these are exclusive and exhaustive of the relevant domains (as classical logic assumes) or not (as more liberal logics may allow)? Issues of the above kind pose deep metaphysical/semantical issues of a highly contentious nature.
Many of the relevant considerations here are familiar from the literature debating intuitionist, paraconsistent, and other non-classical logics. Theoretical issues concerning meaning, truth, and many other notions, certainly enter the debate. There is also a question of adequacy to the data. We have intuitions about the validity of particular inferences. (We may well have intuitions about the validity of various forms of inference as well, though because of the universality implicit in these, they are much less reliable.) These act like the data in an empirical science: if the theory gives the wrong results about them, this is a black mark against it. But as with all theorization, the fact that a theory has desirable theoretical properties (e.g. simplicity, non-ad-hocness), may well cast doubt on any data that goes against it—especially if we can explain how we come to be mistaken about the data. (We will see an example of this concerning
end p.182
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
enthymemes a little later.) The dialectical juggling of theory against data is always a matter of good judgment (which is not to say that all judgments are equally good), and always fallible. 12
The situation is, in fact, even more prone to dispute than I have so far said. This is because ( DV ) itself is couched in terms whose behaviour is theory-laden, such as the logical constant ‘every’; and if one parses the restricted quantifier ‘every A is B’ in the usual way as ‘everything is such that if it is an A it is a B’, then the conditional is getting in on the act too. The meanings of such notions, especially ‘if’, are philosophically contentious. 13
Another way of looking at the matter is this: it is not just the definition ( DV ) itself that is at issue, but—as I observed with respect to truth conditions in section 5.9—what does and what does not follow from it. This depends on the behaviour of the “logical constants”; that is, the valid principles that such constants satisfy; which is part of what is at issue in an account of validity; which is what ( DV ) gives. The issue is therefore a circular one. This does not mean that it is impossible to come up with a solution to the whole set of matters. It just means that there is no privileged point of entry: we are going to have to proceed by bootstrapping. Certainly, one can do this with “classical logic”; one can equally well do it with intuitionist logic or a paraconsistent logic. In the end, we want a theory that, as a total package, comes out best under “reflective equilibrium”. There is no short way with this.
11.6 THE TARSKIAN ACCOUNT
Before we leave deductive validity and turn to inductive validity, it will be illuminating to compare the account of validity I have given with the celebrated account given by Tarski in his essay (1956). 14 In a nutshell, this is as follows. Certain words of the language are designated logical constants. Given a sentence, its form is the result of replacing each non-logical-constant with a parameter (variable). An interpretation is
end p.183
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
a function that assigns each parameter a denotation of the appropriate type (objects for names, extensions for predicates, etc.). A relationship of satisfaction is defined between interpretations and forms, standardly by recursion. An inference is valid iff every interpretation that satisfies the form of each premise satisfies the form of the conclusion. (Correspondingly, a sentence is logically true if its form is satisfied by all interpretations.)
If we identify interpretations with what I have been calling structures, and write the satisfaction relationship as odels, then the form of the Tarskian definition of validity is exactly ( DV ). It is therefore tempting to think of the two accounts as the same. And indeed they may, in some cases, amount to the same thing; but as accounts, they are quite distinct. For a start, an interpretation, as employed in the definition, is normally only part of a structure, e.g. the I of ( Q ) or ( K ). In all but the simplest cases, structures carry more information than that, e.g. the domain, D, in ( Q ), and the binary relation, R, in ( K ). This will, in general, make an important difference as to what inferences are valid. For example, consider the sentence x yx ≠ y. At least as standardly understood, this contains only logical constants. According to the Tarskian account it is therefore either logically true or logically false. In fact, it is the former, since it is true (simpliciter). But given standard-model theory, it holds in some structures and not others, hence it is not logically true. 15
In (1990) Etchemendy provides an important critique of the Tarskian account. It will be further illuminating to see to what extent the account offered here is subject to the same problems. Etchemendy provides two sorts of counter-examples to the Tarskian account. These concern under-generation and over-generation.
According to the Tarskian account, any valid inference is, by definition, formally valid; that is, any inference of the same form is valid. This seems to render certain intuitively valid inferences invalid. For example, given the usual understanding of logical form:
This is red;
hence this is coloured
is invalid, since its form is:
x is P;
hence x is Q
which has invalid instances.
This argument is not conclusive. One may simply agree that the original inference is invalid, but explain the counter-intuitiveness of this by pointing out that the inference is an enthymeme. It is an instance of the valid form:
x is P;
end p.184
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
everything that is P is Q;
hence x is Q
and the instance of the suppressed premise here is ‘everything that is red is coloured’, which is obviously true. 16
I shall not discuss the adequacy of this move here. I wish only to use the example to contrast the account of validity given here with the Tarskian one. For, modulo an appropriate account of logical form, the present account may, but need not, make validity a formal matter. This just depends on what structures there are. In the case at point, for example, there may be no structures where there is something in the extension of ‘red’ that is not in the extension of ‘coloured’. We might, for example, eliminate such structures from a more general class with “meaning postulates”, in the fashion of Carnap and Montague. 17 If this is the case, the inference in question is valid, though not formally so. Of course, this path is not pursued in standard-model theory, where the more general notion of structure is employed; the result of this is that the notion of validity produced is a formal one. If one takes this line, then the model-theoretic account of validity also has to employ the enthymematic strategy.
Etchemendy's second sort of counter-example to the Tarskian approach concerns over-generation. In such examples, Tarski's account gives as valid, inferences that are not so; or if perchance this does not happen, this is so only by “luck”. Consider the sentence: there are at most two cats: x x z((Cx Cy Cz) (x = y V y = z V z = x)). Provided that there are at least three objects in toto, this is not a logical truth; but if not, it is. In this case, presumably, the account gets the answer right. But if the universe is finite, with, say, objects, the account is going to give the wrong answer for: there are at most cats. Moreover, whether or not something is a logical truth ought not to depend on such accidental things as the size of the universe.
One may meet Etchemendy's criticism by pointing out that the totality of all objects is not restricted to the totality of all physical (actual) objects, but comprises all objects, including all mathematical objects, all possible, and maybe all impossible, objects too. Not only is this so large as to make every statement of the form ‘there are at most i cats’ (where i is any size, finite or infinite) not a logical truth; but this result does not arise because of some “lucky contingency”. There is nothing contingent about the totality of all objects. 18
An objection similar to Etchemendy's might be made against the account of validity offered here. An inference is valid if it is truth-preserving in all situations. Couldn't this give the wrong answer if there are “not enough situations to go around”; and even if there are, should such a contingency determine validity? The answer to this is the
end p.185
same, and even more evident. The totality of situations is the totality of all situations: actual, possible, and maybe impossible too. It doesn't make sense to suppose that there might not be enough. Nor is the result contingent in any way. The totality of all situations is no contingent totality.
But the point may be pressed. The official definition is given, not in terms of situations, but in terms of the mathematical structures that represent them. Might there not be enough structures to “go around”. One might doubt this. Structures need not be “pure sets”. Any situation is made up of components; and it is natural to suppose that these can be employed to construct an isomorphic set-theoretic structure. Yet the worry is a real one. One of the situations we reason about is the situation concerning sets. Yet in Zermelo Fraenkel set theory, there is no structure with the totality of sets as domain. Hence this set theory cannot represent that situation. This does not show that ( DV ) is wrong, however. It merely shows that ZF is an inadequate vehicle for representation. A set theory with at least a universal set is required. There are may other reasons for being dissatisfied with ZF as a most general account of set. 19
Before we turn to the matter of inductive validity, let me end this section by returning to the question of logical realism. The account of validity offered here is a realistic one. Its realism operates on two levels. First, if one takes a logic to be specified by an appropriate proof-theory (axiom system, natural deduction system, or whatever), one can take this as an attempt to characterize the appropriate notion of truth-preservation. That is what the theory is about. A soundness proof establishes partial success in the enterprise; a completeness proof the other half. But at a deeper level, the model-theoretic account of validity can itself be thought of as an applied mathematical theory which attempts to characterize a certain relationship between situations themselves. That is what it is about. The ontological status of these situations, we might argue about. Are they physical objects of a Lewisian kind, abstract objects such as sets of propositions, or non-existent objects of a Meinongian kind? We need not go into this issue here. 20 Whatever kind of thing they are, the situations themselves are what the model theory is supposed to represent.
11.7 INDUCTIVE VALIDITY
We have seen that the model-theoretic account of validity that I have offered is different from the Tarskian account. But, so far, we have not seen any definitive reason to prefer it to that account. The model-theoretic account is more powerful and more flexible, for sure, but as long as the enthymematic move is acceptable, we have seen no cases where this extra power and flexibility must be used. However, an argument for this may be found by looking at the question of inductive validity. In due course, I will also give a model-theoretic definition of inductive validity. But let us approach these issues via a different question.
Theories of deductive validity took off with Aristotle, and are now highly articulated. Theorization about inductive validity is, by comparison, completely underdeveloped.
end p.186
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Why? A standard answer, with a certain plausibility, is as follows. Deductive validity is a purely formal matter. Hence it is relatively easy to apply syntactic methods to the issue. By contrast, if an inference is inductively valid, this is not due to its form, but to the contents of the claims involved, a matter which is susceptible to no such simple method.
The issue is, however, not that straightforward. For a start, as we have seen, it may not be the case that deductive validity is a matter of form. That just depends on how other details of the account pan out. Moreover, it is not immediately clear that inductive validity is not a matter of form either. The inference:
x is P;
most Ps are Qs;
hence x is Q
is a pretty good candidate for a valid inductive form. (It is certainly not deductively valid.) It is true that an inference such as:
Abdul lives in Kuwait;
hence Abdul is a Moslem
is frequently cited as a valid inductive inference, and one that is not formally valid on the usual understanding of what the logical constants are. But it is not at all clear that this is so. Just as in the deductive case, we may take it to be an enthymeme of the above form with suppressed premise: most people who live in Kuwait are Moslems.
Despite this, inductive validity is not, in fact, always a formal matter. This, I take it, is one of the lessons of Goodman's new “riddle of induction”. 21 Consider an inference of the following form (plausibly, one for enumerative induction):
•
If E is ‘is an emerald’ and G is ‘is green’, this inference seems quite valid. If, on the other hand, G is ‘is grue’ (that is, a predicate which, before some (future) time, t, is truly applicable to green things, and truly applicable to blue things thereafter), it is not. It is well known that there is no syntactic way of distinguishing between the two inferences. For though ‘grue’ is, intuitively, a defined predicate, ‘green’ can be defined in terms of grue and bleen (a predicate which, before t, is truly applicable to blue things, and truly applicable to green things thereafter). Hence any syntactic construction may be dualized. What breaks the symmetry is that ‘emerald’ and ‘green’ are natural properties (are projectible, in Goodman's terminology), whereas ‘grue’ is not. But there is no syntactic characterization of this. Hence, inductive validity is not, in general, a formal matter. It follows, then, that no version of the Tarskian account of validity is going to be applicable to inductive validity. For as we have seen, Tarskian validity is formal validity.
end p.187
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
11.8 PROBABILITY
How, then, are we to get a grip on the notion of inductive validity? A natural suggestion is that we should appeal to a suitable notion of probability. (Probability assignments are not, except in a very few cases, a formal matter.) Now there certainly seem to be intimate links between inductive validity and probability, but it is not clear that one can use the notion to formulate a satisfactory theory of inductive validity.
Let us restrict ourselves, for simplicity, to the one-premise case, and let us write the conditional probability of α given β in the usual way, as p(α/β). Then a first suggestion for defining inductive validity is as follows: an inference from premise β to conclusion α is valid if β raises the probability of α, i.e. if p(α/β) > p(α). This will not do, however. Just consider a case where β raises the probability of α though this is still small, as in, e.g.:
John used to be a boxer;
so John has had a broken nose
This seems quite invalid. (Most boxers have never had broken noses.)
A more plausible suggestion is that the inference is valid if p(α/β) is “sufficiently high”—where this is to be cashed out in some suitable fashion. But this, too, has highly counter-intuitive results. Consider the case, for example, where β decreases the probability of α, even though the conditional probability is still high, as in:
John used be a boxer;
so John has not had his nose broken
This inference is of dubious validity. 22
Could an appeal to Bayesianism solve the problem at this point? According to Bayesians, the relevance of a premise, β, is simply that if we learn—or were to learn—that β, we (would) revise our evaluation of the probability of any statement, α, to p(α/β), where p is our current probability function. I do not wish to deny that we sometimes revise in this way, but this cannot provide a satisfactory account of inductive inference for two reasons. First, if there is nothing more to the story than this, it is tantamount to giving up on the notion of inductive inference altogether. Inference is concerned with the question of when, given certain premises, it is reasonable to accept certain conclusions. In other words, we want to be able to detach the conclusion of the argument, given the premises. Conditionalization does not, on its own, give an answer to the question of when this is possible. 23 More conclusively, even Bayesians concede
end p.188
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
that there is information upon which one cannot conditionalize. 24 For example, if α is anything such that p(α) = 1, and if the probability function that will result from our next revision is q, then we cannot conditionalize coherently on, e.g. q(α) ≤ 0.5. For the result of conditionalization is q(α) = 1 (still). Yet information of this form can certainly be the premise of an inductive argument; for example, one whose conclusion is that q(α) < 0.5. Hence inference outstrips conditionalization. Even if there were some way of analysing inductive validity satisfactorily in terms of probability, a more fundamental problem awaits us. Deductive validity and inductive validity would seem to have something to do with each other. They are both species of validity. If deductive validity is to be analysed in terms of preservation of truth in a structure, and inductive validity is to be analysed as something to do with probability, they would seem to be as different as chalk from cheese. Maybe there is some deeper connection here, but it is not at all obvious what this could be. One might try defining a deductively valid inference from β to α, as one where p(α/β) = 1. This has nothing to do with model theory, but one might hope that, by making suitable connections (for example, by considering a probability measure on the space of models), the definition could be shown to be equivalent to ( DV ). One might even jettison the model-theoretic definition of deductive validity entirely, and attempt a uniform account of validity in terms of probability. Such moves face further problems, however. For example, consider: John chose a natural number at random; hence John did not choose 173 On the usual understanding of probability theory, this satisfies the probabilistic account of deductive validity, but it is hardly deductively valid: the conclusion might turn out to be false. There are ways that one might try to get around this problem too, but is would certainly seem much more satisfactory if an account of inductive validity could be found which made the connection with deductive validity obvious, and which did not did not depend upon probabilistic jiggery-pokery. 11.9 NON-MONOTONIC LOGIC Such an account is now available, thanks to recent developments in non-monotonic logic. An inference relation, , is monotonic if ∏ κ entails ∏ σ κ. Deductively valid inferences are monotonic. For suppose that ∏ κ. If all the premises in ∏ σ hold in a structure, then certainly all those in ∏ do. In which case, so does κ. On the other hand, inferences traditionally accounted inductively valid are well known not to be monotonic. Consider only: Abdul lives in Kuwait; Abdul went to mass last Sunday; hence Abdul is a Moslem end p.189 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved The study of non-monotonic inferences, quite independently of probability theory, is one that has seen rapid and exciting developments in the last 25 years, mainly from amongst logicians in computer-science departments. There are many distinctive approaches to non-monotonic inference, of various degrees of mathematical sophistication. But it is now becoming clear that at the core of theories of non-monotonicity there is a canonical construction. 25 Let me illustrate. Consider Abdul again. What makes it plausible to infer that he is a Muslim, given only that he lives in Kuwait, is that, if he were not, he would be rather abnormal. (Qua inhabitant of Kuwait, and not, of course, in any evaluative sense.) Although the inference may not be truth-preserving, it is certainly truth-preserving in all “normal situations”. We can formulate this more precisely as follows. Let us suppose that we can compare situations (or the structures that represent them) with respect to their normality. Normality, of course, comes by degrees. Let us write the comparison as ≻ ( is more normal than ). ≻ is certainly a partial order (transitive and anti-symmetric); but it is not a linear order. There is no guarantee that we can compare any two structures with respect to their normality. Let us say that is a most normal model of a set of sentences, ∏, n ∏, iff: ∏ and for all such that ≻ , ∏ Then if we write inductive validity as i , we can define: • (IV) • This captures the idea that an inference is inductively valid if the conclusion holds in all models of ∏ that are as normal as ∏ will let them be. Notice how this gives rise to non-monotonicity in a very natural way. κ may well be true in all situations most normal with respect to ∏, but throw in Σ, and the most normal situations may be more abnormal, and not necessarily ones where κ holds. 26 11.10 CONSEQUENCES OF THIS ACCOUNT The definition ( IV ) is the one I wish to offer. It is exactly the same as ( DV ), except that where ( DV ) has ‘model’ ( IV ) has ‘most normal model’. The connection between the two notions is therefore patent. Where deductive validity requires truth preservation over all structures, inductive validity requires truth preservation over all normal structures (with respect to the premise set). Deductive validity is the same as inductive validity except that we don't care about normality. This is equivalent to taking ≻ to be the minimal ordering that relates nothing to anything. We therefore end p.190 have a generic account of validity. Validity is truth-preservation in all structures of a certain kind. Deductive validity is the limit case where we are talking about all structures, period. Otherwise put: validity is about truth preservation in all normal structures. Deductive validity is the limit case where every situation is to be considered normal. 27 This account of inductive validity extends the model-theoretic account of deductive validity. Hence, all the comments that I made about filling in its details carry over to it, also. In this respect they are the same. The major technical difference between the two notions, is the fact that inductive validity makes use of the ordering ≻, whilst deductive validity does not. This, therefore, focuses the major conceptual difference between the two. Although the question of what structures there are is open to theoretical debate, whatever they are, they are invariant across all contexts in which we reason. The definition ( IV ) also gives us a core of universally inductively valid inferences, namely those that hold for all orderings. 28 But when we employ the notion of inductive validity in practice, we do not argue relative to all orderings, but with respect to one particular ordering. The ordering is not a priori, but is fixed by external factors, such as the world and the context. For example, the fact that to be green is more normal (natural) than to be grue is determined by nature. Or what is to count as normal may depend on our interests. In the context of discussing a religious question, a resident of Kuwait who goes to mass is not normal. In the context of discussing biological appendages, such as noses, they are (normally) quite normal. 29 This is, I think, the heart of the difference that people have sensed between inductive and deductive validity. It is not really a matter of the difference between form and content—or if it is, what this comes down to is exactly the dependence of inductive validity on an a posteriori feature, the ordering ≻. 11.11 NORMATIVITY AND INFORMATION PRESERVATION Let me conclude by tying up a couple of loose ends; and let us start by returning to the question of the normativity of logic. Validly, I said, is how people ought to reason. Why? The answer is simple. We reason about all kinds of situations. We want to know what sorts of things hold in them, given that we know other things; or what sorts of things don't hold, given that we know other things that don't. If we reason validly then, by the definition of validity, we can be assured that reasoning forward end p.191 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved preserves the first property, and that reasoning backwards preserves the second. Validly is how one ought to reason if one wants to achieve these goals. The obligation is, then, hypothetical rather than categorical. In less enlightened days, I argued against a model-theoretic account of validity of the kind given here. 30 One argument was to the effect that a model-theoretic account validates the inference ex contradictione quodlibet: α, ¬α β, which is not a norm of reasoning. This argument is just fallacious. A model-theoretic account may validate this argument; but equally, it may not. It depends, crucially, on what situations there are. If there are non-trivial but inconsistent situations, as there are in many logics, this will not be the case. 31 A second argument was to the effect that a model-theoretic account cannot be right since, if it were, to know that an inference is valid we would have to know, in advance of using it, that it preserves truth in all situations; and hence it would be impossible for us to learn something new about a situation by applying a valid inference. This argument is equally, though less obviously, invalid. It confuses matters definitional with matters epistemological. We may well, and in fact do, have ways of knowing that a particular inference is valid, other than that it simply satisfies the definition. Particular cases are usually more certain than general truths. Compare: The Church/Turing Thesis provides, in effect, a definition of algorithmaticity. But we can often know that a process is algorithmic without having to write a program for a Turing machine. If we could not, the Thesis would not be refutable, which it certainly is. 32 Finally, and further to the subject of truth-preservation: it is sometimes objected that a definition of validity on the basis of truth-preservation is too weak. What we often need of a notion of validity is that it preserve not truth, but something else, like information. Consider an inference engine for a computational database, for example. We want one which extracts the information that is implicit in the data. Truth has nothing to do with it. This argument is also far too swift. Valid inferences preserve not just truth, but truth in a structure. Given the right way of setting things up, the situation as described in the database may well be (represented by) an appropriate structure. Structures do not have to be “large” like classical possible worlds, but may be “small”, as in situation semantics. (In particular, the set of things true in a situation may be both inconsistent and complete.) A valid inference is, then, by definition, just what we want to extract the juice from the information provided by the database. Provided we choose our situations carefully, a logic of truth-preservation is also a logic of information preservation. 33 end p.192 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved 11.12 CONCLUSION This last claim is, of course, just a promissory note, and needs to be redeemed by a lot of hard work, specifying all the details that have only been hinted at above. The aim of this chapter has not been to present all the details of an account of validity. That is probably the work of several lifetimes! The aim has just been to provide the general form of an answer to the question of what validity is. To this extent, the slack in the definition is to its advantage. The details are to be filled in, in the most appropriate way. But despite the slack, the form of the answer is far from vacuous. There are certainly other possibilities (some of which we have briefly traversed); and the fact that this answer is possible at all is a tribute to the developments in modern model theory—of both classical and non-classical logics—of which this account can be thought of as the distilled essence. end p.193 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved 12 Logical Pluralism show chapter abstract and keywords hide chapter abstract and keywords Graham Priest 12.1 INTRODUCTION The previous chapters in this part of the book have given us an account of the epistemology and ontology of logic. But this still leaves an important question. I said ‘logic’; maybe I should have said ‘logics’. Is there one logic or are there many? Traditionally, there was only one—the logic of Aristotle and the Stoics, as melded together in the Middle Ages—and the question never arose. This century, we have seen a plethora of logics: Frege/Russell (classical) logic, intuitionism, paraconsistent logic, quantum logic. Usually, the advocates of these logics were still logical monists, in the sense that they took it that other logics were wrong. But the weight of the plethora has been too much for some people, who have decided that there is no one true logic: there are lots of things which may, with equal justification, be called logic. In logic, as in a multicultural society, pluralism must be endorsed. Is this right? 1 The answer, as one might expect, is that it depends on what one means. There are certain senses in which the answer is clearly ‘yes’. But in the case of central importance—I will explain what I take that to be in due course—I think that the answer is ‘no’; and in this chapter I will defend a monist position. I think that much of the initial plausibility of pluralism in this central case arises from a confusion between the various senses of the question. The first topic on the agenda is, therefore, an appropriate disambiguation. One of the important notions that requires clarification is that of rivalry between logics. For pluralism arises in a serious form when there is such a rivalry, and we are called upon to adjudicate. Can we, in such circumstances, decide for more than one candidate? The notion of rivalry will therefore be of central concern, and I will revisit the discussion of the notion in section 10.10. A consideration of these issues in the first main part of the chapter will clear the ground for discussing serious objections to monism, which we will do in the second. This will leave a few words to be said about inductive inference in the final brief section. end p.194 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved 12.2 PURE AND APPLIED LOGIC We should start by distinguishing, as we did in section 10.8, between pure and applied logic. As I noted there, there is an important analogy between logic and geometry. 2 It is now an uncontentious fact that there are many pure geometries: Euclidean, Riemannian, spherical, etc. Each is a perfectly good mathematical structure, and can be formulated as an axiom system, with standard models, etc. There is no question of rivalry between geometries at this level. The question of rivalry occurs when one applies geometries for some purpose, say to provide an account of the physical geometry of the cosmos. Then the question of which geometry is right must be faced. In the same way, there are many pure logics. I enumerated several of these in the previous section. Each is a well-defined mathematical structure with a proof-theory, model theory, etc. There is no question of rivalry between them at this level. This can occur only when one requires a logic for application to some end. Then the question of which logic is right arises. If one is asking about pure logics, then, pluralism is uncontentiously correct. Plurality is an issue of substance only if one is asking about applied logics. In this case, there is the potential for rivalry, and whether one should be a monist or a pluralist about this rivalry is a question that must be faced. 12.3 THEORETICAL PLURALISM Let us turn, then, to applied logics. The first thing to note here is that pure logics can be applied for many purposes, such as simplifying electronic circuits, or analysing certain grammatical structures. And again, it is clear and uncontentious that different pure logics may be appropriate for each application. In the two examples given, for example, the appropriate logics are Boolean logic and the Lambek calculus. 3 We will have further examples, closer to home, later. Plurality is, then, an interesting issue only when we have one particular application in mind. Fix, then, on some one application. A pure logic is applied by interpreting it in some way or other. As I argued in Chapter 10, it then becomes a theory of how the domain in which it is interpreted behaves; just as a pure geometry, when interpreted as a physical geometry, is a theory about space. In such a situation there may well be disputes about which theory is correct. This has certainly happened in geometry. It also happens in logic, as we shall have several occasions to observe below. Such disputes are to be resolved in logic, as in geometry and elsewhere, by the usual criteria of theoretical evaluation, such as adequacy to the data, and theoretical virtues, such as simplicity, unity, no ad hocness, etc. We saw how in Chapter 8. The end p.195 present relevance of all this is that we have the source of another kind of pluralism. Given a fixed application to some domain, there may be many different applied logics which constitute theories about the behaviour of that domain—and correspondingly, disputes about which theory is right. I will call this, for want of a better term, theoretical pluralism. 12.4 THE CANONICAL APPLICATION OF LOGIC Let us now turn to the most important and traditional application of a pure logic. This is what in section 10.10 I called the canonical application: the application of a logic in the analysis of reasoning—which was also traditionally called ‘logic’, of course (just as ‘geometry’ was used ambiguously before Euclidean geometry and its canonical application were distinguished, to describe both). The central purpose of an analysis of reasoning is to determine what follows from what—what premises support what conclusions—and why. An argument where this is, in fact, the case is valid. As I discussed in Chapter 11, it is traditional to distinguish between two notions of validity, deductive and inductive. So immediately it would appear that we have a pluralism here. As to whether this is indeed the case, I will return at the end of this chapter. For most of this chapter I will focus just on deductive validity. Before we can discuss the issue of plurality for this application, we need to recall how pure logics are applied in an analysis of reasoning. As explained in section 10.10, we reason, in the first place in the vernacular. We establish by observation that a planet moves in a certain way, and wish to know if the description of its trajectory follows from our theory of motion; or we infer what would be the case if the Butler did it, to see whether these consequences do, in fact, obtain. Premises and conclusions are formulated in a natural language. By all means, this may be a natural language augmented with technical vocabulary, such as that of mathematics. The language employed, though, is still the language of use and communication. Pure logics, at least as standardly conceived, do not concern the vernacular at all, but are couched in terms of formal languages. A definition of validity is provided for inferences expressed in one of these. Applying this to provide an account of validity for the vernacular, as in the application of any pure mathematical structure, requires an interpretation. And in the case of logic, the interpretation is quite literally so: a translation procedure between the formal language in question and the vernacular. Such a procedure is rarely articulated at great length in logic texts. Rather, students are given hints about treating ‘or’ as V (most of the time), etc., and then set loose on a bunch of exercises, through which they develop procedural knowledge. But given such a translation procedure, a pure logic provides an account of validity for vernacular inferences. A vernacular inference is valid iff its translation into the formal language is valid in the pure logic. The result, moreover, often produces disagreement. According to some logics, ‘there is an infinitude of primes’ follows from ‘it is not the case that there is not an infinitude of primes’; according to others, it does not. According to some logics, ‘the sky is green’ end p.196 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved follows from ‘the liar sentence is both true and false’; according to others, it does not. Such disagreements mean that we clearly have a case of theoretical pluralism. Is there a more serious kind of pluralism here? To address this question, let us turn to accounts of validity for formal languages. 12.5 VALIDITY There is a plurality of formal languages employed in logic. I am not talking here of the fact that different symbols are used in different languages. The fact that ‘¬’ is used for negation in one language, whilst ‘ ’ is used in another, is neither here nor there: as long as each plays the role of vernacular negation (in the sense of Chapter 4) according to the appropriate translation manual, the two are, in effect, the same symbol. Rather, what is important is that one language may lack a symbol, in this sense, that another has. Thus, a modal language contains modal operators, which an extensional language lacks. Strictly speaking, then, the accounts of validity for two such languages must be different, and we have a pluralism. But this, again, is not a very interesting pluralism. For as far as the common set of symbols goes, the two accounts may be in perfect agreement; it is simply that one (or each) extends the other in some direction. Given, then, a formal language, and a translation procedure between it and the vernacular, what should a formal theory of validity be like? I take the answer to be as explained in Chapter 11. When we reason, we reason about various situations or states of affairs. These may be actual or hypothetical. We reason to establish what holds in these situations given what we know, or assume, about them. This is truth-preservation (forward), 4 though it is not actually truth that is in question unless the situation we are reasoning about is itself actual. The point of deduction, then, is to give us a set of canons that preserve truth in this sense. A valid inference is therefore one such that in all the situations where the premises hold, the conclusion holds. As noted in Chapter 11, this answer is, of course, nothing more than a gesture in the direction of an account of validity. Much remains to be done by way of spelling out what situations there are, and what it is to hold in one. And there are certainly different possibilities here: according to paraconsistent semantics, but not classical or intuitionist semantics, there are inconsistent situations. According to intuitionist semantics (and some paraconsistent semantics), but not classical semantics (and some other paraconsistent semantics), the definition of truth in a situation requires the truth conditions of negation to make reference to more than one possible world. There may therefore be different views on how to fill in the details required. But the familiar disputes about this, such as the intuitionist critique of classical logic, or the dialetheist critique of explosive logics, would seem to be cases of theoretical pluralism. Is there a more serious kind of pluralism here? So far, we have no reason to believe so. end p.197 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved 12.6 DOMAIN VARIATION So let us turn to some possible reasons. These have been advocated by a number of people, and we will look at six. The first is provided by da Costa (1997). 5 Da Costa's major argument for pluralism is to the effect that reasoning about different kinds of things may require different logics. The kinds in question are kinds like macro-objects, quantum objects, platonic objects, mental constructions, etc. What to say about this depends on how, exactly, it is envisaged that logic will vary. One thing that da Costa envisages is that different kinds of objects have different properties. Thus, for example, he moots the possibility that quantum objects do not have the property of self-identity, which macro-objects have. Hence, micro-objects require a non-classical logic of identity, one where x x = x is absent. In response to this, the obvious monist reply is that since validity is truth-preservation in all situations, if there are situations in which objects may not be self-identical, then self-identity is not a logical law at all. This does not mean that when reasoning about, e.g. macro-objects one may not use the law, though. It is just a “contingent” property of certain domains, and may thus be invoked when reasoning about them. In a similar way, the intuitionist may invoke the law of excluded middle when reasoning about finite (or at least decidable) domains: this, plus intuitionist logic, gives classical logic. But the intuitionist has not changed logical allegiances. It is simply that classical validity can be recovered enthymematically, given the extra domain-specific premises. Da Costa's pluralism is more radical than I have so far indicated, though. He envisages not only that objects of different kinds may have different logical properties, but that different logical operators may also need to be used in reasoning about different kinds of objects. Thus, for example, classical negation is appropriate for dealing with platonic objects, and intuitionist negation is appropriate for dealing with mental constructions. But, this cannot be right. Classical and intuitionist connectives have different meanings. (Indeed, some intuitionists, such as Dummett, even claim that some classical connectives have no meaning at all). Thus, take negation as an example. Classical and intuitionist negations have different truth conditions. 6 But difference in truth conditions entails difference in meaning. Hence, the two connectives have different meanings. 7 Now, either vernacular negation is ambiguous or it is not. If it is not, then, since the different theories attribute it different meanings, they cannot both be right. We have a simple theoretical pluralism. The other possibility is that vernacular negation is ambiguous. Thus, it may be argued that vernacular negation sometimes end p.198 PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com) © Copyright Oxford University Press, 2006. All Rights Reserved means classical negation and sometimes means intuitionist negation. But if this were right, we would have two legitimate meanings of negation, and the correct way to treat this formally would be to have two corresponding negation signs in the formal language, the translation manual telling us how to disambiguate when translating into formalese. In exactly this way, it is often argued that the English conditional is ambiguous, between the subjunctive and indicative. This does not cause us to change logics, we simply have a formal language with two conditional symbols, say and >, and use both. This is pluralism in a sense, but the sense is just one of ambiguity.
In fact, I see no cogent reason to suppose that negation is ambiguous in this sense. If the account of negation of Chapter 4 is correct, negation is unique. Indeed, negation cannot have different meanings when reasoning about different kinds of things. This is because we can reason about different kinds simultaneously. We can, for example, reason from the claim that it is not the case that a and b have some particular property in common, where a and b are of different kinds. Moreover, there are other reasons as to why vernacular negation cannot be ambiguous between intuitionist and classical negation. If it were, as I have argued, we could have two formal negations. But it is well known that in the presence of classical negation, many other important intuitionist distinctions collapse. For example, the intuitionist conditional collapses into the classical conditional. 8
Of course, as I have already noted, the intuitionist logician can, in effect, use the full force of classical argument when reasoning about finite domains. But this is not because negation has changed its meaning. It is because they are entitled to various extra premises, such as the law of excluded middle.
12.7 CONTEXT DEPENDENCE
The next notable advocate of pluralism that we look at is Batens (1985), (1990). According to him, different logics are required, not for different domains, but in different contexts, where a context is, in a nutshell, a problem-solving situation. Two of Batens' arguments for pluralism are particularly notable.
The first, given mainly in (1990), takes it for granted that a paraconsistent logic is correct in some contexts, but argues that it cannot be correct in others (e.g. a metatheoretic one), for which classical logic is required. Hence, pluralism. Now, just as an intuitionist may use what amounts to classical logic when reasoning about finite situations, so a paraconsistent logician may use what amounts to classical logic given appropriate information about the domain. For example, sufficient information is that for all α, (α ¬α) → , where → is an appropriate detachable conditional,
end p.199
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
and is a logical constant that entails everything. This extra information, together with the base paraconsistent logic, generates classical logic. 9
This reply might not satisfy Batens, for the following reason. As I have stressed, the meaning of a logical operator, such as negation, does not change simply in virtue of the extra information. But, Batens argues, a paraconsistent negation just cannot express certain things that need to be expressed in talking about such situations. In particular, ¬α cannot express the fact that α is rejected. What to say about this argument depends on how one interprets ‘rejection’, which may, in fact, mean several different things. I discussed this matter in section 6.3, so do not need to go into details here. The major sense of rejection that Batens has in mind is that to reject α is to commit oneself to something, which, in conjunction with α, produces triviality (p. 222). But a paraconsistent logician can do just that, by endorsing (α ¬α) → .
The second of Batens' arguments (given mainly in (1985)) is that, notwithstanding the above, different contexts result in different meanings for logical operators, and hence deliver different logics (p. 338 f.). The argument for meaning variance is a familiar one, of a kind employed by Kuhn and Feyerabend in the philosophy of science. The first premise is that what is accepted may vary from context to context. The second is that what is accepted (partially) determines the meanings of the words involved. It follows from these that meanings change from context to context.
Now one might say a good deal about this argument, but its central failing is that a change of what is accepted does not (necessarily) result in a change of meaning. When a Christian loses their certitude, and comes to believe that God does not exist, the word ‘God’ has not changed its meaning. What they come to believe is the very opposite of what they believed before. Similarly, the classical logician believes that it is not the case that it is not the case that α entails α; the intuitionist believes otherwise. It does not follow that ‘it is not the case that’ means something different for the two of them. If it did, they would not be in disagreement, which they most certainly are. Intuitionist and classical negation do mean different things; 10 as I have argued, the point of disagreement between the two logicians is precisely which of those two negations it is that correctly characterises vernacular negation.
12.8 LOGICAL CONSTANTS AND VARIABLES
The next three arguments we will look at are all to be found in Beall and Restall (2000), who give what is, perhaps, the most sustained defence of pluralism. 11 B&R
end p.200
endorse an account of validity similar to that which I gave in Chapter 11; 12 but they argue for pluralism, nonetheless. The three arguments are all to the effect that the details of the account given there may be filled in in different, equally legitimate, ways.
B&R's first argument proceeds as follows. There is a strong tradition in logic of treating certain words in premises and conclusions as parameters. This is done in the semantics of first-order logic, e.g. where one treats predicates and constants as semantically variable. B&R observe that the account of validity can be parameterized in this way to give the usual model-theoretic (Tarskian) account of validity. 13 What are the consequences of this? A simple example illustrates. Consider the inference: a is red; hence a is coloured. This is valid on the unparameterized account. Any situation in which a is red, it is coloured (which is not to say that there might not be inconsistent situations, in which it is not coloured too). But the inference is not valid according to a parameterized definition of validity—at least if the parameters include predicates—for it is of the form: Pa Qa. What is at issue here is the question of what situations are to be admitted into our semantics. Are we to include, in our sweep, situations in which things can be in the extension of ‘red’ but not of ‘coloured’, etc., or are we not?
Now, it is true that the model-theoretic construction can be done in either way, parameterized or unparameterized, so giving different notions of validity. But as far as I can see, this is just another case of theoretical pluralism—or at least, B&R provide no reasons for thinking otherwise. They say that they do not think it ‘fruitful to debate which of …[the two accounts] is logic’ (p. 480). I am really not sure what this means. It certainly cannot mean that there are no considerations that might persuade one to adjudicate for one theory over the other. 14 Many of the moves in this theoretical dispute are, in fact, well known.
The inference ‘a is red; hence a is coloured’ is certainly prima facie valid. Hence, if the parametric account is to survive, this validity must be explained away. The standard move is to claim that the inference is, in fact, invalid, but that it appears to be valid because we confuse it with a valid enthymeme, with suppressed premise ‘All red things are coloured’, taken for granted. But this move is not very plausible when scrutinized. For if something about ‘All red things are coloured’ gave it the power to make an enthymeme, whose suppressed premise it was, appear valid, then the inference: ‘Snow is white; hence snow is white and all red things are coloured’ should equally appear valid—which it does not.
There are other reasons for being dissatisfied with the parametric account. For example, the parameterized definition of validity in effect allows the meanings of some words, but not others, to vary. But when we ask what follows from something's
end p.201
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
being red, we are not asking about what might be the case if ‘red’ meant something different. That's just changing the subject. It is essential, then, that meaning change is not allowed—even of so-called logical variables. 15
This is no place to pursue the ramifications of the dispute further. The main point for present purposes is just that parameterized validity is not real validity at all—just a false theory thereof. 16 This is not to say that the parameterized notion of validity is a useless notion. It is still true that every parametrically valid inference is valid. Hence, a conclusion that follows according to such a notion, still follows, and this may well allow us, in reasoning, to show what we need. But there is more to validity than formal validity.
12.9 CLASSES OF SITUATION
B&R's next argument is to the effect that we can obtain equally legitimate notions of logical consequence by defining validity as truth-preservation over different classes of situations. Thus, if we restrict ourselves to consistent and complete domains, classical logical consequence will result; if we take a broader class, it will not. If we include situations with empty domains we will have a free logic; otherwise not. We therefore have a plurality of validities: one for each appropriate class of situations—and all equally legitimate notions of validity.
The obvious reply to this argument is that it is only truth-preservation over all situations that is, strictly speaking, validity. One of the points about deductive logic is that it will work come what may: we do not have to worry about anything except the premises. As I have already observed, this is not to say that in practice one may not reason as if one were using a different, stronger, notion of validity, one appropriate to a more limited class of situations. But this is not because one has changed logical allegiances; it is because one is allowed to invoke contingent properties of the domain in question.
A possible reply here 17 is that the class of situations is so large and variegated that there are no inferences that are truth-preserving over all of them, except, possibly, the trivial α α. Hence monism is vacuous. I have never seen any persuasive argument as to why one should suppose this to be the case, however. It may well be that the intersection of the valid principles in all pure logics is empty, but this is beside the
end p.202
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
point. We are not talking about pure logics here; we are talking about applied logics, and indeed, logics applied for one particular purpose.
Moreover, I think it just false that all principles of inference fail in some situation. For example, any situation in which a conjunction holds, the conjuncts hold, simply in virtue of the meaning of . Naturally, if one were allowed to vary the meanings of words as the class of situations is broadened, then this would not be the case. But when we ask about what is the case in situations where the sky is blue and the sun is shining, we are interested in just that: allowing ‘and’ to take on some other meaning is beside the point. It is certainly open to someone to claim that what I have just said gets the meaning of ‘and’ wrong, and that the standard truth conditions are not, in fact, delivered by its meaning. That is a substantial issue, and certainly gives rise to theoretical pluralism, but it is not germane here. As long as meanings are fixed, one can't vary them to dispose of valid inferences. 18
There is another possible move that might be made at this point (of a kind that came up in another context in section 10.14). What, exactly, is the disagreement between the monist and the pluralist about here? The monist accepts that there is a core of universally correct inferences; but this may be augmented if we are reasoning about certain kinds of situations. The pluralist holds that different sorts of situations require different validities, but may accept—will accept, if the above argument is correct—that there is a core of inferences that are acceptable in all kinds of situations. What is the difference? None, it might be thought. The facts are agreed upon. What is at issue is only how to describe them.
One might argue against this duck/rabbit pluralism as follows. 19 Suppose that one is a pluralist of the kind in question. Let s be some situation about which we are reasoning; suppose that s is in different classes of situations, say, K 1 and K 2 . Should one use the notion of validity appropriate for K 1 or for K 2 ? We cannot give the answer ‘both’ here. Take some inference that is valid in K 1 but not K 2 , α β, and suppose that we know (or assume) α holds in s; are we, or are we not entitled to accept that β does? Either we are or we are not: there can be no pluralism about this. In fact, the answer is that we are. Since s is in K 1 , and the inference is truth-preserving in all situations in K 1 . In other words, if we know that a situation about which we are reasoning is in class K, we are justified in reasoning with validity defined over the restricted class of situations K. 20
end p.203
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
Thus, differences in validity due to variation of the class of situations in question can be viewed from a pluralist perspective; but as I have observed, one can view it equally from a monist perspective.
12.10 INSTRUMENTALISM
B&R's final argument is that we may generate different logics by giving the truth conditions of the connectives in different ways. Thus, for example, we may give either intuitionist truth conditions or classical truth conditions. If we do the former, the result is a notion of validity that is constructive, that is, tighter than classical validity, but which it is perfectly legitimate to use for certain ends.
There are two aspects of this position that need concern us. The first was covered essentially in section 12.6. If we give different truth conditions for the connectives, we are giving the formal connectives different meanings. When we apply the logics to vernacular reasoning we are, therefore, giving different theories of the meanings of the vernacular connectives. We have a case of theoretical pluralism; and the theories cannot both be right—or if they are, we simply have a case of ambiguity, as we have already seen.
One might dispute this argument (as Restall did in discussion). Classical and intuitionist connectives do not have different meanings. Truth conditions are general and uniform. Apparently different truth conditions are just special cases. Thus, for example, the truth conditions for the connectives in a Kripke interpretation for intuitionist logic collapse into classical conditions at “classical” worlds, namely, those which access no worlds other than themselves. We do not, therefore, have connectives with different meanings. It is just that the intuitionist countenances more situations than their classical cousin.
If this move is right, then this third example of pluralism reduces to the previous one, which I have already dealt with. But I do not think it is right. For a dispute about meanings can be cashed out just as much in terms of what situations there are, as in terms of truth conditions. For example, to determine whether it is part of the meaning of ‘to see’ that the eyes be employed, we might consider whether there are any (maybe hypothetical) situations where someone could be described as seeing, even though they had no (functioning) eyes. One who holds that seeing involves having eyes will deny the possibility of such situations.
Exactly the same is true of the case at hand. For someone who takes classical truth conditions to govern an account of the connectives, say negation, for example, what it means to say that ¬α holds in a situation is just that α fails to hold there. It follows that there can be no situations where neither α nor ¬α holds. Suppose, for example, 21 that one thinks of a situation as a part of a possible world. Moreover, suppose that we have a situation of which JC is not a part. It follows that ‘JC is reading’ is not true of that situation. For classical negation, it will follow that ‘JC is not reading’ is true of that situation. It is not an option to say that neither of these two is true. The situation
end p.204
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
cannot be “incomplete”. As B&R themselves put it (p. 482), ‘the classical account of negation fails for situations’. 22
The second aspect of B&R's position here that we need to look at is the claim that, even if one is, say, a classical logician, one may legitimately accept the policy of restricting one's reasoning to the use of intuitionistically valid inferences, and so reason constructively, for certain ends. 23 Now this policy is certainly not intuitionism. For intuitionism justifies many mathematical principles that are classically inconsistent, and so intuitionist mathematics is not a proper part of classical mathematics. Neither is it a policy of reasoning only in accordance with principles that are constructive from a classical perspective. For this extends intuitionist logic. For example, it legitimates Markov's Principle in arithmetic: ( x(A V ¬A) ¬ x¬A) → xA (where the quantifiers are constructive). This is not intuitionistically valid. 24
Nonetheless, it is certainly the case that one could decide to operate with just intuitionist logic, or, more generally, with a constructive part of one's preferred logic (assuming that that logic is not itself constructive). One could do this simply as an exercise; but more importantly, there might be a perfectly sensible point to doing so. A point might be provided by the fact that conclusions obtained constructively contain more information, or more computational content, than conclusions proved non-constructively. Reasoning in this way may therefore be a useful instrumental technique. I noted that a pure logic may be applied for many purposes, and instrumental purposes are purposes. Hence this is simply a case of applying a logic for a different end, and we have already seen that different applications may require different logics. Note, though, that it does not follow from this that conclusions obtained using non-constructive principles of inference are themselves defective in any way. The things so proved are guaranteed, in fact, to hold in the situation about which we are reasoning, by the definition of validity.
Whilst we are on the instrumental use of logics, let me give one more example of this. Inference procedures are used in science. And sometimes in science, we have a theory that we know to be incorrect, but which we use, nonetheless, for instrumental purposes. If we are doing this, then we are clearly also at liberty to use whatever inferential processes we like if they satisfy those purposes.
An example of this is as follows. The Bohr theory of the atom was a very useful theory, which made strikingly accurate empirical predictions. Its inference procedures were highly non-standard, though, involving the chunking of principles, with limited flow of information between the chunks. This can be understood in terms of a certain non-adjunctive paraconsistent reasoning strategy. 25 Now, no one ever thought that
end p.205
Bohr's theory was correct. It is commonly suggested that this is because it was inconsistent. I do not think that this is a good reason. What is a good reason, as explained in section 9.6, is that if a truth-preserving inference mechanism had been used, the theory would have had all sorts of empirical consequences that are not observed. The theory, then, cannot be true. Despite this, it was an important stepping stone en route to a more adequate theory. It was necessary to obtain insight into various principles and their interaction, so that more adequate principles could be formulated. The use of a logic that is not one of simple truth-preservation was perfectly legitimate in this context.
A number of writers, working in the field of non-adjunctive paraconsistent logic in particular, have argued that it may be important to have an inference engine that preserves something other than (or as well as) truth, for certain purposes. 26 As I have already stressed, different applications may well require different logics. State your purpose and how it is to be achieved; then use a logic the application of which does this.
12.11 UNDER-DETERMINATION BY THE DATA
Let us finish with one more argument for pluralism. In effect, an applied logic is, as I have said, a theory about how its domain of application behaves. Hence, a pure logic, when given the canonical application, provides a theory about the norms of correct (deductive) reasoning. It may be thought that there is insufficient information to determine which logic is the correct applied logic here; in particular, that our intuitions about the goodness and badness of particular inferences —which provide the data in this case—are insufficient to determine which theory is correct. Hence, there may be a plurality of equally correct answers.
Pluralism of this kind may be argued on very general grounds, familiar from the philosophy of science. The correct theory, it may be said, is always under-determined by the data, which is, in any case, soft. In virtue of this, the argument sometimes continues, the correct theory is a purely conventional matter. And we can make whichever conventions are most convenient.
What is to be said about this? The first point is that indeterminism of the kind I have just described is, in general, quite overrated. Adequacy to the data is just one of the criteria to be employed in assessing theories. Others, such as simplicity, unity, a low degree of ad hocness, etc., are familiar from the literature. Whilst it is clearly a possibility of some sort that different theories may yet tie—and not just pro tem, but in the long term—it is hard to come up with concrete examples of this in the history of intellectual inquiry. At any rate, I know of no argument to suppose that this is a real possibility in logic.
And even if it were, this is not an end of the matter. For this plurality is merely epistemic: we just cannot tell which of the theories is the correct one. Given competing
end p.206
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
theories, as long as one is a realist about the subject matter, at most one of them can be correct, even if we cannot tell which. There is therefore no alethic pluralism.
But should we be a realist about logic? The answer, as I argued in Chapter 11, is ‘yes’. Validity is determined by the class of situations involved in truth-preservation, quite independently of our theory of the matter. This answer has a certain ontological sting, of course. For, as I observed, the situations about which we reason are not all actual: many are purely hypothetical. And one must be a realist about these too. There are numerous different sorts of realism that one might endorse here, many of which are familiar from debates about the nature of possible worlds. One may take hypothetical situations to be concrete non-actual situations; abstract objects, like sets of propositions or combinations of actual components; real but non-existent objects. 27 I will not address the question of which of these account is correct here. 28 Any of them will do, as long as they provide for an independent realm of situations; and hence a determinate answer to the question of which theory is correct (even if our theories do tie, epistemically).
12.12 INDUCTIVE INFERENCE
We have now dealt with a number of important arguments for pluralism about deductive validity. Let me conclude the chapter by returning to the issue of inductive validity, to see what it adds to the picture. The situation here is complicated by the fact that there is no theoretical agreement about the nature of inductive inference, even of the limited kind that exists for deductive inference. But in section 11.9 I gave an account of what inductive validity amounts to, and I assume here that something like that is essentially right.
According to this account, situations are structured with an ordering, taken to represent normality. A most normal situation where the premises are true is then a situation where the premises are true, such that there is no situation higher in the ordering where this is so. The important point to note here is that when we reason inductively, what counts as normal is context-dependent, as pointed out in section 11.10. If we are reasoning about biological appendages, a dog with three legs is not normal; if we are reasoning about creatures' genotypes, the dog may be perfectly normal, provided that its lost leg is caused by a road accident, and not a genetic abnormality.
Now, how does this affect the issue of pluralism? First, the fact that we have two canons of inference, inductive and deductive, is not a case of pluralism. This is because, according to the above definition, all deductively valid inferences are inductively valid. (If truth is preserved in all interpretations, it is preserved in all most normal ones.) Hence, we do not have two canons of inference: we have one set of valid inferences, inside which there is a distinguished subset.
end p.207
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
But the fact that normality is context-relative does give rise to a pluralism. What is inductively valid will depend on the normality ordering, which may vary from context to context. One might be tempted to reply here as I replied to the pluralist in section 12.9. It is still the case that there is a core of inductive inferences that are valid in all contexts, namely those that hold whatever the normality ordering. Why not call those the valid inductive inferences, the others being reducible to those enthymematically? A main answer to this is that if one conceptualizes the valid inductive inferences in this way, then there is no difference between inductive and deductive validity. The valid inductive inferences turn out to be indistinguishable from the valid deductive ones. 29 One can maintain a monism about inductive inference, but if one does, it is an entirely degenerate one.
12.13 CONCLUSION
We have now reviewed several reasons for logical pluralism, and several things that pluralism might mean. The issue of monism vs. pluralism has turned out to be a delicate one. As we have seen, there are many senses in which pluralism is uncontentiously correct. No one, for example, is going to argue about pluralism in pure logic, or theoretical pluralism in applied logic. The important, and hard, question concerns the situation with respect to the correct logic for canonical application; and I have argued that one can quite coherently view this situation from a monist perspective.
12.14 TO BE AND NOT TO BE—THAT IS THE ANSWER
The last part of this book has been concerned with the nature of logic, its status as a corrigible theory with a certain subject matter, and its claims to uniqueness. With this, our review of the natures of four central philosophical notions—truth, negation, rationality, and logic—is completed.
Many of the lessons learned are quite general; they have nothing specifically to do with dialetheism, and stand quite independently of the truth of this view. From this perspective, the upshot of the investigations is an interesting one. Consistency has often been taken to be so central to the notions in question that it has been felt that they could not operate without it. As we have seen, they can. But perhaps more importantly, the accounts of these core notions that have emerged in the discussion are not so different from the traditional accounts. Any of the traditional theories of truth may be allowed to stand, negation is a standard contradictory-forming operator, the view of rationality is familiar from contemporary philosophy of science, and the nature of logic has not collapsed into some faddish relativism. Perhaps the most surprising
end p.208
PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com)
© Copyright Oxford University Press, 2006. All Rights Reserved
thing, then, is how easily considerations of consistency can be detached from these notions, and so how non-integral they are to them. This makes the traditional view of the centrality of consistency to these notions even more surprising. The dead hand of Aristotle has, it would seem, weighed on the topics, preventing philosophers from applying to them the critical spirit which is their due. Even were it to be the case, then, that dialetheism turns out to be a view that, for some reason, cannot be sustained, this, at least, has come to an end.
Clearly, though, the investigations here have not been polemically innocent: the defence of dialetheism has always been running, if not in the foreground, then at least in the background.
I chose the title of this book, not simply because of its natural application to the book's contents, but because of the irony of the passage from which it comes:
Doubt that the sun doth move;
Doubt thou the stars are fire;
Doubt that the sun doth move;
Doubt truth to be a liar;
But never doubt I love.
(Hamlet, ii . ii . 115)
The irony is twofold. That something true cannot be false is one of the things that Hamlet is holding up as certain. Unfortunately, the other things that are cited as of the same kind have turned out to be, not just uncertain, but actually false: the sun does not move—not, in the Ptolemaic sense in which Hamlet intends it; and the stars are not fire, since fire is rapid oxidation, not nuclear fusion. This is an external irony of the text. There is also an internal irony. Hamlet is saying that his love for Ophelia is even more certain than any of these certitudes. Yet, as becomes clear as the plot unfolds, his claim to love, too, is false.
Even though the quotation from which the title of this book comes appears to endorse the consistency of truth, the double irony completely reverses this. If I cannot claim Aristotle on my side, at least I can claim Shakespeare.
END
No comments:
Post a Comment