First-order logic, semantically speaking

To save space and time, we will immediately formulate our notions for sets of formulas. As noted above, this is not a problem since single formulas are just special sets of formulas. Namely, for any formula XX, there is a singleton set {X}\{X\}.

Satisfiability

As before, we will start with the notion of satisfiability for sets of formulas (including singleton sets), and define validity using the notion of satisfiability. Thus, we have:

Definition 8.1 (Satisfiability). A set SS of formulas of β„’1\mathcal{L}_1 is first-order satisfiable or semantically consistent iff there is a structure 𝐒\mathbf{S} and an assignment 𝐚\mathbf{a} such that π’βŠ¨X[𝐚]\mathbf{S} \models X[\mathbf{a}] for each X∈SX \in S. We say SS is first-order unsatisfiable or semantically inconsistent provided it is not semantically consistent.

Exercise 8.1. By the above, it follows that a set of formulas is semantically inconsistent provided there is no structure 𝐒\mathbf{S} and assignment 𝐚\mathbf{a} under which every formula in the set is simultaneously satisfiable. Explain why this is the case in your own words using the definition above.

Checking whether a set of formulas is first-order satisfiable is checking whether there is a structure and an assignment that assigns a member of the domain of that structure to the variables of the language such that every formula in the set comes out satisfied. This sounds very complicated, but as usual, the underlying idea is very simple.

Take, for example, the following formula (considered as a singleton set): P(x)βˆ§βˆ€yP(y)P(x) \wedge \forall y P(y)

Is this formula satisfiable? To answer this in the positive, we need to find a structure and a variable assignment in which it is satisfied. In fact, there are many structures and assignments that satisfy this formula, so it is satisfiable. Here is one.

Let 𝐒\mathbf{S} be a structure such that 𝐃={𝖺}\mathbf{D}=\{\mathsf{a}\}, and 𝐈(P)={𝖺}\mathbf{I}(P)=\{\mathsf{a}\}. For any variable xx, let 𝐚(x)=𝖺\mathbf{a}(x)=\mathsf{a}. So there is a single object aa in the structure, and that single object is in the set of things that are PP. Moreover, every variable is assigned 𝖺\mathsf{a} under the variable assignment 𝐚\mathbf{a}.

Based on this, we have that π’βŠ¨P(x)[𝐚]\mathbf{S} \models P(x)[\mathbf{a}], since 𝐚(x)∈𝐈(P)\mathbf{a}(x) \in \mathbf{I}(P). Moreover, we also have that π’βŠ¨βˆ€yP(y)[𝐚]\mathbf{S} \models \forall y P(y)[\mathbf{a}], since under every yy-variant assignment to 𝐚\mathbf{a}, P(y)P(y) is satisfied. In other words (and some symbols), π’βŠ¨P(y)[πšβ€²]\mathbf{S} \models P(y)[\mathbf{a}'] for every yy-variant assignment πšβ€²\mathbf{a}', simply because every assignment points to aa (as there is nothing else in the domain). So since both are satisfied under 𝐚\mathbf{a} in 𝐒\mathbf{S}, their conjunction P(x)βˆ§βˆ€yP(y)P(x) \wedge \forall y P(y) is also satisfied in the same structure under the same assignment. So the formula P(x)βˆ§βˆ€yP(y)P(x) \wedge \forall y P(y) is satisfiable.

There are many other formulas that are satisfiable because they are satisfied in the above structure 𝐒\mathbf{S} and assignment 𝐚\mathbf{a}.

Exercise 8.2. Show that each of the following formulas are first-order satisfiable by showing that they are satisfied in 𝐒\mathbf{S} and 𝐚\mathbf{a}, as defined above.

  1. βˆƒx1P(x)βˆ§Β¬βˆ€yΒ¬P(y)\exists x_1 P(x) \wedge \neg \forall y \neg P(y)

  2. βˆ€x(P(x)∨¬P(x))\forall x(P(x) \vee \neg P(x))

  3. βˆ€xP(x)∨¬P(x)\forall x P(x) \vee \neg P(x)

  4. Β¬P(y)β†’Β¬P(y)\neg P(y) \rightarrow \neg P(y)

  5. βˆ€x(P(x)β†’βˆƒyP(y))\forall x (P(x) \rightarrow \exists y P(y))

Remark 8.1. Note that this entails that the set of formulas {βˆƒx1P(x)βˆ§Β¬βˆ€yΒ¬P(y),βˆ€x(P(x)∨¬P(x)),βˆ€xP(x)∨¬P(x),Β¬P(y)β†’Β¬P(y),βˆ€x(P(x)β†’βˆƒyP(y))}\{\exists x_1 P(x) \wedge \neg \forall y \neg P(y), \forall x(P(x) \vee \neg P(x)), \forall x P(x) \vee \neg P(x), \neg P(y) \rightarrow \neg P(y), \forall x (P(x) \rightarrow \exists y P(y))\} is satisfiable, since you just showed that they are each satisfied in the same structure under the same assignment.

Exercise 8.3. Write five formulas using only the predicate PP and no constants such that they are each satisfied in 𝐒\mathbf{S} under 𝐚\mathbf{a}, and thus satisfiable in first-order logic.

Of course, some sets of formulas require more elaborate structures and assignments to be satisfied, and hence be satisfiable. And in general, just because a set of formulas is not satisfied in a given structure under a given assignment does not entail that it is not satisfiable. For the latter, what you would have to show is that there is no structure and assignment whatsoever under which the set of formulas are satisfied.

Take, for example, the formula P(x)∧¬P(y)P(x) \wedge \neg P(y). This is not satisfied in 𝐒\mathbf{S} under any variable assignment 𝐚*\mathbf{a}^*, simply because it requires an object to not be PP. So π’βŠ¨ΜΈP(x)∧¬P(y)[𝐚*]\mathbf{S} \not\models P(x) \wedge \neg P(y)[\mathbf{a}^*] for any 𝐚*\mathbf{a}^*. On the other hand, this does not mean that P(x)∧¬P(y)P(x) \wedge \neg P(y) is unsatisfiable. Indeed, it is easy to think of a structure and an assignment that satisfies it.

Let 𝐖\mathbf{W} be the structure such that 𝐃={𝖺,𝖻}\mathbf{D}=\{\textsf{a}, \textsf{b}\}, 𝐈(P)={𝖺}\mathbf{I}(P)=\{\textsf{a}\}. Moreover, let 𝖻\mathbf{\textsf{b}} be 𝐛(x)=𝖺\mathbf{b}(x)=\textsf{a} and 𝐛(y)=𝖻\mathbf{b}(y)=\textsf{b}. Now π–βŠ¨P(x)∧¬P(y)[𝐛]\mathbf{W} \models P(x) \wedge \neg P(y)[\mathbf{b}]. Why? It is because π–βŠ¨P(x)[𝐛]\mathbf{W} \models P(x)[\mathbf{b}], for 𝐛(x)∈𝐈(P)\mathbf{b}(x) \in \mathbf{I}(P), and π–βŠ¨Β¬P(y)[𝐛]\mathbf{W} \models \neg P(y)[\mathbf{b}], since 𝐛(y)βˆ‰πˆ(P)\mathbf{b}(y) \notin \mathbf{I}(P). So P(x)∧¬P(y)P(x) \wedge \neg P(y) is satisfiable, since there is a structure and an assignment that satisfy it, as just demonstrated.

Note that when it comes to open formulas, assignments matter! On the other hand, when it comes to sentences, we have an easier time. You may remember that a sentence is either satisfied under every assignment in a structure or none, and therefore, is true or false in that structure. So the only things you need to care about in these cases is whether the sentence is true in the structure or not. This is because if the sentence is true in a structure, it is satisfied under every assignment in that structure by definition, so it is satisfiable, as per our definition of satisfiability. Not coincidentally, this is exactly how zeroth-order satisfiability was defined, as zeroth-order languages do not have open formulas.

Here is a simple example. Suppose you have a first-order sentence P(b)∨¬P(a)P(b) \vee \neg P(a). Extend the interpretation 𝐈\mathbf{I} of 𝐒\mathbf{S} so that 𝐈(b)=𝐈(a)=𝖺\mathbf{I}(b)=\mathbf{I}(a)=\mathsf{a} (for there is nothing else in the domain of 𝐒\mathbf{S}). Now since π–Ίβˆˆπˆ(P)\mathsf{a} \in \mathbf{I}(P), π’βŠ¨P(b)\mathbf{S} \models P(b), so π’βŠ¨P(b)∨¬P(a)\mathbf{S} \models P(b) \vee \neg P(a), and the sentence is true in 𝐒\mathbf{S}. So the sentence is satisfiable in 𝐒\mathbf{S}.

Remember: for a set of formulas to be satisfiable, it is enough to find a single structure and variable assignment under which each formula is satisfied. Moreover, if a sentence (closed formula) is true in a structure, by definition, it is satisfied under every assignment in that structure, and thus it is satisfied in a structure under an assignment, and thus it is satisfiable full stop. So if a set of sentences are all true in a structure, they are satisfied in that structure, so they are first-order satisfiable.

So some sets of formulas are satisfiable, and for each set of satisfiable formulas, there needs to be only one structure and assignment under which each formula in the set is satisfied (in the same structure, under the same assignment). Now let’s inspect some examples of sets of formulas that are not satisfiable. In other words, sets of formulas for which a structure and assignment cannot be found under which they are all satisfied. As noted, these formulas we will call (first-order) unsatisfiable or semantically inconsistent.

As a simple example, take the sentence P(a)∧¬P(a)P(a) \wedge \neg P(a). You may already see that this is a sentence of form X∧¬XX \wedge \neg X, which is the most basic form of a contradiction. As we have demonstrated previously, there is no structure 𝐒\mathbf{S} in which it could be true, simply by the way ∧\wedge and Β¬\neg is defined. But this also means that no matter what structure 𝐒*\mathbf{S}^* and assignment 𝐚*\mathbf{a}^* we choose, 𝐒*⊭P(a)∧¬P(a)[𝐚*]\mathbf{S}^* \not\models P(a) \wedge \neg P(a)[\mathbf{a}^*], so indeed, this formula is first-order unsatisfiable.

Note that the same goes for first-order sentences that include quantifiers, like βˆƒxP(x)βˆ§Β¬βˆƒxP(x)\exists x P(x) \wedge \neg \exists x P(x). This sentence is unsatisfiable precisely for the same reason as P(a)∧¬P(a)P(a) \wedge \neg P(a) is unsatisfiable. Simply because if βˆƒxP(x)βˆ§Β¬βˆƒxP(x)\exists x P(x) \wedge \neg \exists x P(x) were satisfied in a structure under an assignment, then βˆƒxP(x)\exists x P(x) would have to be satisfied, and Β¬βˆƒxP(x)\neg \exists x P(x) would also have to be satisfied. But by definition, if βˆƒxP(x)\exists x P(x) is satisfied, Β¬βˆƒxP(x)\neg \exists x P(x) is not, and if Β¬βˆƒxP(x)\neg \exists x P(x) is satisfied, βˆƒxP(x)\exists x P(x) is not. So there is no way to satisfy this formula. In general, any formula of form X∧¬XX \wedge \neg X, where XX is any formula, is first-order unsatisfiable unsatisfiable for this reason.

Note: just as any set of sentences is first-order satisfiable if it is true in a structure, any set of sentences is first-order unsatisfiable if it is false in every structure. So in both cases, variable assignments may be disregarded, as there are no free variables.

It is important to note that some first-order unsatisfiable sets of formulas are unsatisfiable not solely because of how the connectives are defined (which coincides with zeroth-order unsatisfiability), but essentially because of how the variables and quantifiers function. For example, take P(x)βˆ§βˆ€yΒ¬P(y)P(x) \wedge \forall y \neg P(y). This formula is not of the form X∧¬XX \wedge \neg X, or any zeroth-order contradiction for that matter, yet it is first-order unsatisfiable. It is not hard to see why this is the case.

Suppose P(x)βˆ§βˆ€yΒ¬P(y)P(x) \wedge \forall y \neg P(y) were first-order satisfiable. Then, there would be a structure 𝐒\mathbf{S} and assignment 𝐚\mathbf{a} such that π’βŠ¨P(x)βˆ§βˆ€yΒ¬P(y)[𝐚]\mathbf{S} \models P(x) \wedge \forall y \neg P(y)[\mathbf{a}]. In turn, this would mean that π’βŠ¨P(x)[𝐚]\mathbf{S} \models P(x)[\mathbf{a}], that is, that 𝐚(x)∈𝐈(P)\mathbf{a}(x) \in \mathbf{I}(P). On the other hand, we would also have that π’βŠ¨βˆ€yΒ¬P(y)[𝐚]\mathbf{S} \models \forall y \neg P(y)[\mathbf{a}], which would mean that for every yy-variant πšβ€²\mathbf{a}', π’βŠ¨Β¬P(y)[πšβ€²]\mathbf{S} \models \neg P(y)[\mathbf{a}'], so π’βŠ¨ΜΈP(y)[πšβ€²]\mathbf{S} \not\models P(y)[\mathbf{a}'] for every yy-variant πšβ€²\mathbf{a}', and so πšβ€²(y)βˆ‰πˆ(P)\mathbf{a}'(y) \notin \mathbf{I}(P) for every yy-variant πšβ€²\mathbf{a}'. But this is only true if 𝐈(P)\mathbf{I}(P) is empty, otherwise we would be able to find a yy-variant assignment πšβ€²\mathbf{a}' such that πšβ€²(y)∈𝐈(P)\mathbf{a}'(y) \in \mathbf{I}(P). On the other hand, we also showed that 𝐚(x)∈𝐈(P)\mathbf{a}(x) \in \mathbf{I}(P), so it should not be empty! In other words, these two conjuncts cannot be satisfied at the same time, since one requires 𝐈(P)\mathbf{I}(P) to have at least one member, and the other requires it to be empty. So P(x)βˆ§βˆ€yΒ¬P(y)P(x) \wedge \forall y \neg P(y) cannot be satisfied, and so is not first-order satisfiable.

Exercise 8.4. Show that the first-order sentence βˆƒxP(x)βˆ§βˆ€yΒ¬P(y)\exists x P(x) \wedge \forall y \neg P(y) is unsatisfiable. Hint: it is unsatisfiable for essentially the same reasons as P(x)βˆ§βˆ€yΒ¬P(y)P(x) \wedge \forall y \neg P(y), with a slight twist.

Validity

Now that we have a grasp on first-order satisfiability, we are in the position to formulate what it means for an argument to be first-order valid, and in connection, what it means for a formula to be a first-order validity. As we did with zeroth-order validity, we will define first-order validity using the notion of first-order satisfiability. In particular, we shall say:

Definition 8.2 (Validity, first-order). Let A={X1,X2,...}A=\{X_1, X_2, …\} be any set of formulas and let YY be any formula of β„’1\mathcal{L}_1. Then, the argument from premises AA to the conclusion YY is first-order valid iff the set {X1,X2,...,Β¬Y}\{X_1, X_2, …, \neg Y\} is not first-order satisfiable. If an argument from AA to YY is first-order valid, we write A⊨YA \models Y, or say that YY is a (first-order) semantic consequence of, or (first-order) semantically entailed by AA. If the argument is not valid, we say it is invalid, and write A⊭YA \not\models Y. If A=βˆ…A=\emptyset and A⊨YA \models Y, we say YY is a first-order validity, and simply write ⊨Y\models Y.

Similar to our zeroth-order definition, the reasoning behind this is as follows. An argument is valid if whenever its premises are satisfied, its conclusion is also satisfied. This is equivalent to saying that it is impossible for an argument’s premises to be satisfied but its conclusion to be unsatisfied. This latter means that if an argument is valid, we should not be able to find a structure and a variable assignment which satisfies all its premises but does not satisfy its conclusion. Put another way, we should not be able to find a structure and variable assignment under which the premises and the negation of the conclusion is satisfied. So put another way, we should not find that the set of the premises and the negation of the conclusion together is satisfiable. Which is exactly what the definition says.

As far as first-order sentences are concerned, the definition of validity essentially reduces to that of zeroth-order validity. In particular, a set of sentences A={X1,X2,...}A=\{X_1, X_2, …\} entails a sentence YY provided in every structure in which AA is true, YY is also true, which is once again equivalent to {X1,X2,...,Β¬Y}\{X_1, X_2, …, \neg Y\} not being true in any structure. Keep in mind, however, that these sentences may include quantifiers, as long as every variable is bound. The only additional complication when including open formulas is that we need to include into our calculations the assignments. But again, this just means that if an argument is valid, then whenever its premises are satisfied in a structure under an assignment, its conclusion will also always be satisfied in that structure under that assignment.

Note that by definition, an argument is invalid if it not valid. And validity for an argument with premise set {X1,X2,...}\{X_1, X_2, …\} and conclusion YY is defined by the set {X1,X2,...,Β¬Y}\{X_1, X_2, …, \neg Y\} being unsatisfiable. From this, it readily follows that an argument is not valid (or simply, invalid) if {X1,X2,...,Β¬Y}\{X_1, X_2, …, \neg Y\} is satisfiable. In other words, if you want to show that an argument is invalid, you need to provide a structure and an assignment under which the premises are satisfied, but the conclusion is not (or equivalently, its negation is).

Exercise 8.5. Show that the following facts hold in first-order logic:

  1. βˆƒxΒ¬P(x)βŠ¨ΜΈβˆ€yΒ¬P(y)\exists x \neg P(x) \not\models \forall y \neg P(y)

  2. Β¬βˆ€xΒ¬P(x)βŠ¨ΜΈΒ¬βˆƒxΒ¬P(x)\neg \forall x \neg P(x) \not\models \neg \exists x\neg P(x)

  3. Β¬βˆ€xP(x)βŠ¨ΜΈΒ¬βˆƒxP(x)\neg \forall x P(x) \not\models \neg \exists x P(x)

  4. βˆ€xP(x)β†’βˆ€xQ(x)βŠ¨ΜΈβˆ€x(P(x)β†’Q(x))\forall x P(x) \rightarrow \forall x Q(x) \not\models \forall x (P(x) \rightarrow Q(x))

  5. βˆƒx(P(x)∧Q(x))βŠ¨ΜΈβˆƒxP(x)βˆ§βˆƒxQ(x)\exists x (P(x) \wedge Q(x)) \not\models \exists x P(x) \wedge \exists x Q(x)

  6. βˆ€x(P(x)∨Q(x))βŠ¨ΜΈβˆ€xP(x)βˆ¨βˆ€xQ(x)\forall x(P(x) \vee Q(x)) \not\models \forall x P(x) \vee \forall x Q(x)

  7. βˆƒx(P(x)β†’Q(a))βŠ¨ΜΈβˆƒxP(x)β†’Q(a)\exists x (P(x) \rightarrow Q(a)) \not \models \exists x P(x) \rightarrow Q(a)

  8. {βˆƒx(P(x)β†’Q(x)),βˆƒxP(x)}βŠ¨ΜΈβˆƒxQ(x)\{\exists x (P(x) \rightarrow Q(x)), \exists x P(x)\} \not \models \exists x Q(x)

In general, every zeroth-order valid argument is first-order valid. In fact, zeroth-order satisfiability, unsatisfiability and validity entail first-order satisfiability, unsatisfiability and validity, respectively (for zeroth-order arguments). But once again, there are first-order valid arguments whose validity essentially depends on the particular use of variables and quantifiers. Most importantly, there are two fundamental properties of quantifiers, connected to negation:

Proposition 8.1 (Negation push-through; βˆƒ\exists). Β¬βˆƒxYβŠ¨βˆ€xΒ¬Y\neg \exists x Y \models \forall x \neg Y for any formula YY of β„’1\mathcal{L}_1.

Proof. Suppose π’βŠ¨Β¬βˆƒxY[𝐚]\mathbf{S} \models \neg \exists x Y[\mathbf{a}]. By definition, π’βŠ¨ΜΈβˆƒxY[𝐚]\mathbf{S} \not \models \exists x Y[\mathbf{a}]. Since π’βŠ¨βˆƒxY[𝐚]\mathbf{S} \models \exists x Y[\mathbf{a}] means that for some xx-variant πšβ€²\mathbf{a}', π’βŠ¨Y[πšβ€²]\mathbf{S} \models Y[\mathbf{a}'], its negation means that there is no xx-variant πšβ€²\mathbf{a}' for which we have π’βŠ¨Y[πšβ€²]\mathbf{S} \models Y[\mathbf{a}']. That is, for every xx-variant πšβ€²\mathbf{a}', π’βŠ¨ΜΈY[πšβ€²]\mathbf{S} \not\models Y[\mathbf{a}']. That is, for every xx-variant πšβ€²\mathbf{a}', π’βŠ¨Β¬Y[πšβ€²]\mathbf{S} \models \neg Y [\mathbf{a}']. But this just means that π’βŠ¨βˆ€xΒ¬Y[𝐚]\mathbf{S} \models \forall x \neg Y[\mathbf{a}].Β β—»

It is easy to see intuitively why this holds. Simply, if there is no xx such that YY, then obviously, every xx must not be YY, for otherwise, there would be an xx that is YY. For example, if among a group of students, there is no student who received a C on their exam, then for every student, it is true that they did not receive a C on their exam. That is, if Β¬βˆƒxC(x)\neg \exists x C(x), then βˆ€xΒ¬C(x)\forall x \neg C(x).

Note also that YY may be any formula above, not just an atomic one. So it is also the case, for example, that if there is no person who has a car and and has a sister, then for every person, it is not the case that they have a car and a sister. So if Β¬βˆƒx(H(x)∧S(x))\neg \exists x (H(x) \wedge S(x)), then βˆ€xΒ¬(H(x)∧S(x))\forall x \neg (H(x) \wedge S(x)). It is important here where the parentheses are. For no one having a car and a sister does not mean that no one has a car or a sister. It only means that they cannot have both. And indeed, this follows, for by DeMorgan’s law, βˆ€xΒ¬(H(x)∧S(x))\forall x \neg (H(x) \wedge S(x)) entails βˆ€x(Β¬H(x)∨¬S(x))\forall x (\neg H(x) \vee \neg S(x)). That is, it is true of everyone that they either do not have a car, or they do not have a sister, or they neither have a car nor a sister. The only possible configuration excluded by this sentence is that they have both have a car and a sister.

Next, we also have the converse of the above. Namely, that if it is not true for all xx that YY, then there must be at least one xx for which YY is the case.

Proposition 8.2 (Negation push-through; βˆ€\forall). Β¬βˆ€xYβŠ¨βˆƒxΒ¬Y\neg \forall x Y \models \exists x \neg Y for any formula YY of β„’1\mathcal{L}_1.

Proof. Suppose π’βŠ¨Β¬βˆ€xY[𝐚]\mathbf{S} \models \neg \forall x Y[\mathbf{a}]. By definition, π’βŠ¨Β¬βˆ€xY[𝐚]\mathbf{S} \models \neg \forall x Y[\mathbf{a}] iff π’βŠ¨ΜΈβˆ€xY[𝐚]\mathbf{S} \not \models \forall x Y[\mathbf{a}]. Now π’βŠ¨βˆ€xY[𝐚]\mathbf{S}\models \forall x Y[\mathbf{a}] provided for every xx-variant assignment πšβ€²\mathbf{a}', we have π’βŠ¨Y[πšβ€²]\mathbf{S}\models Y[\mathbf{a}']. So on the contrary, π’βŠ¨ΜΈβˆ€xY[𝐚]\mathbf{S} \not \models \forall x Y[\mathbf{a}] provided there is some xx-variant assignment πšβ€²\mathbf{a}' for which we have π’βŠ¨ΜΈY[πšβ€²]\mathbf{S} \not \models Y[\mathbf{a}']. So there is some xx-variant assignment πšβ€²\mathbf{a}' such that π’βŠ¨Β¬Y[πšβ€²]\mathbf{S} \models \neg Y[\mathbf{a}']. But then by definition, π’βŠ¨βˆƒxΒ¬Y[𝐚]\mathbf{S} \models \exists x \neg Y[\mathbf{a}].Β β—»

Now again, the intuitive content of this is pretty straightforward. For example, if not everyone got an A on their exam, then there is someone who did not get an A on their exam. That is, if Β¬βˆ€xA(x)\neg \forall x A(x), then βˆƒxΒ¬A(x)\exists x \neg A(x). And of course, once again, this holds for any formula whatsoever. So for example, if not everyone has a car and a sister, then there is someone who does not have a car and a sister. That is, if Β¬βˆ€x(H(x)∧S(x))\neg \forall x (H(x) \wedge S(x)), then βˆƒxΒ¬(H(x)∧S(x))\exists x \neg (H(x) \wedge S(x)). This then entails, by DeMorgan’s law, βˆƒxΒ¬H(x)∨¬S(x))\exists x \neg H(x) \vee \neg S(x)). That is, if not everyone has both a car and a sister, then there must be someone who either does not have a car, or does not have a sister, or neither has a car nor a sister.

The above two propositions also work backwards. In particular:

Proposition 8.3 (Negation pull-through; βˆƒ\exists). βˆ€xΒ¬YβŠ¨Β¬βˆƒxY\forall x \neg Y \models \neg \exists x Y for any formula YY of β„’1\mathcal{L}_1.

Proposition 8.4 (Negation pull-through; βˆ€\forall). βˆƒxΒ¬YβŠ¨Β¬βˆ€xY\exists x \neg Y \models \neg \forall x Y for any formula YY of β„’1\mathcal{L}_1.

Exercise 8.6. Prove the two propositions above using the proofs for their push-through variants as blueprint. Hint: think backwards.

Note that the use of DeMorgan’s law above was not accidental. The connectives are defined in first-order logic exactly the same way as they are in zeroth-order logic. Accordingly, the valid argument forms of zeroth-order logic carry over to first-order logic. For example, here is a clever utilization of the above propositions and some zeroth-order logic.

Proposition 8.5. βˆƒxYβŠ¨Β¬βˆ€xΒ¬Y\exists x Y \models \neg \forall x \neg Y and Β¬βˆ€xΒ¬YβŠ¨βˆƒxY\neg \forall x \neg Y \models \exists x Y

Proof. For the first, suppose π’βŠ¨βˆƒxY[𝐚]\mathbf{S} \models \exists x Y[\mathbf{a}]. Then by zeroth-order reasoning, π’βŠ¨Β¬Β¬βˆƒxY[𝐚]\mathbf{S} \models \neg \neg \exists x Y[\mathbf{a}]. Then, by pushing the negation through βˆƒ\exists, we have π’βŠ¨Β¬βˆ€Β¬xY[𝐚]\mathbf{S} \models \neg \forall \neg x Y[\mathbf{a}]. So βˆƒxYβŠ¨Β¬βˆ€xΒ¬Y\exists x Y \models \neg \forall x \neg Y.

For the second, suppose π’βŠ¨Β¬βˆ€xΒ¬Y[𝐚]\mathbf{S} \models \neg \forall x \neg Y[\mathbf{a}]. Then, pulling the negation through βˆ€\forall, we get π’βŠ¨Β¬Β¬βˆƒxY[𝐚]\mathbf{S} \models \neg\neg \exists x Y[\mathbf{a}]. Then, by zeroth-order reasoning, we get π’βŠ¨βˆƒxY[𝐚]\mathbf{S} \models \exists x Y[\mathbf{a}]. So Β¬βˆ€xΒ¬YβŠ¨βˆƒxY\neg \forall x \neg Y \models \exists x Y.Β β—»

Proposition 8.6. βˆ€xYβŠ¨Β¬βˆƒxΒ¬Y\forall x Y \models \neg \exists x \neg Y and Β¬βˆƒxΒ¬YβŠ¨βˆ€xY\neg \exists x \neg Y \models \forall x Y

Exercise 8.7. Prove the above proposition. Hint: take the proof of its inverted version above as blueprint.

Here is another example of how one can reason about validity in first-order logic:

Proposition 8.7. {βˆ€x(P(x)β†’Q(x)),βˆƒyP(y)}βŠ¨βˆƒyQ(y)\{\forall x (P (x) \rightarrow Q(x)), \exists y P(y)\} \models \exists y Q(y)

Proof. Suppose π’βŠ¨βˆ€x(P(x)β†’Q(x))[𝐚]\mathbf{S} \models \forall x (P (x) \rightarrow Q(x))[\mathbf{a}], and π’βŠ¨βˆƒyQ(y)[𝐚]\mathbf{S} \models \exists y Q(y)[\mathbf{a}]. By the latter, we know that there is a yy-variant assignment πšβ€²\mathbf{a}' such that π’βŠ¨Q(y)[πšβ€²]\mathbf{S} \models Q(y)[\mathbf{a}']. In other words, πšβ€²(y)∈𝐈(Q)\mathbf{a}'(y) \in \mathbf{I}(Q). Let’s denote the object πšβ€²(y)\mathbf{a}'(y) by 𝖼\mathsf{c}, so that we know π–Όβˆˆπˆ(Q)\mathsf{c} \in \mathbf{I}(Q). Now because π’βŠ¨βˆ€x(P(x)β†’Q(x))[𝐚]\mathbf{S} \models \forall x (P (x) \rightarrow Q(x))[\mathbf{a}], we know that for every xx-variant assignment πšβ€³\mathbf{a}'', π’βŠ¨P(x)β†’Q(x)[πšβ€³]\mathbf{S} \models P (x) \rightarrow Q(x)[\mathbf{a}'']. So we also have that π’βŠ¨P(x)β†’Q(x)[πšπ–Όx]\mathbf{S} \models P (x) \rightarrow Q(x)[\mathbf{a}^x_\mathsf{c}] (the xx-variant assignment where xx is sent to 𝖼\mathsf{c}). Now we already know that π’βŠ¨P(x)[πšπ–Όx]\mathbf{S} \models P(x)[\mathbf{a}^x_\mathsf{c}], since we established that π–Όβˆˆπˆ(P)\mathsf{c} \in \mathbf{I}(P). But then π’βŠ¨Q(x)[πšπ–Όx]\mathbf{S} \models Q(x)[\mathbf{a}^x_\mathsf{c}] as well, by the conditional. So π–Όβˆˆπˆ(Q)\mathsf{c} \in \mathbf{I}(Q). This, in turn, means that π’βŠ¨Q(y)[πšπ–Όy]\mathbf{S} \models Q(y)[\mathbf{a}^y_\mathsf{c}]. So by definition, π’βŠ¨βˆƒyQ(y)[𝐚]\mathbf{S} \models \exists y Q(y)[\mathbf{a}].Β β—»

Now as you can see, reasoning about validity is rather painful when done semantically. So it is time to extend our tableau system with new rules so that it can handle arguments in first-order logic.

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Learning Logic Backwards Copyright © by Peter Susanszky is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book