Chapter 4  Calculus of Inductive Constructions

The underlying formal language of Coq is a Calculus of Inductive Constructions (Cic) whose inference rules are presented in this chapter. The history of this formalism as well as pointers to related work are provided in a separate chapter; see Credits.

4.1  The terms

The expressions of the Cic are terms and all terms have a type. There are types for functions (or programs), there are atomic types (especially datatypes)... but also types for proofs and types for the types themselves. Especially, any object handled in the formalism must belong to a type. For instance, universal quantification is relative to a type and takes the form “for all x of type T, P”. The expression “x of type T” is written “x:T”. Informally, “x:T” can be thought as “x belongs to T”.

The types of types are sorts. Types and sorts are themselves terms so that terms, types and sorts are all components of a common syntactic language of terms which is described in Section 4.1.2 but, first, we describe sorts.

4.1.1  Sorts

All sorts have a type and there is an infinite well-founded typing hierarchy of sorts whose base sorts are Prop and Set.

The sort Prop intends to be the type of logical propositions. If M is a logical proposition then it denotes the class of terms representing proofs of M. An object m belonging to M witnesses the fact that M is provable. An object of type Prop is called a proposition.

The sort Set intends to be the type of small sets. This includes data types such as booleans and naturals, but also products, subsets, and function types over these data types.

Prop and Set themselves can be manipulated as ordinary terms. Consequently they also have a type. Because assuming simply that Set has type Set leads to an inconsistent theory [25], the language of Cic has infinitely many sorts. There are, in addition to Set and Prop a hierarchy of universes Type(i) for any integer i.

Like Set, all of the sorts Type(i) contain small sets such as booleans, natural numbers, as well as products, subsets and function types over small sets. But, unlike Set, they also contain large sets, namely the sorts Set and Type(j) for j<i, and all products, subsets and function types over these sorts.

Formally, we call S the set of sorts which is defined by:

S ≡ {Prop,Set,Type(i) |  i ∈ ℕ} 

Their properties, such as: Prop:Type(1), Set:Type(1), and Type(i):Type(i+1), are defined in Section 4.4.

The user does not have to mention explicitly the index i when referring to the universe Type(i). One only writes Type. The system itself generates for each instance of Type a new index for the universe and checks that the constraints between these indexes can be solved. From the user point of view we consequently have Type:Type. We shall make precise in the typing rules the constraints between the indexes.

Implementation issues

In practice, the Type hierarchy is implemented using algebraic universes. An algebraic universe u is either a variable (a qualified identifier with a number) or a successor of an algebraic universe (an expression u+1), or an upper bound of algebraic universes (an expression max(u1,...,un)), or the base universe (the expression 0) which corresponds, in the arity of sort-polymorphic inductive types (see Section 4.5.2), to the predicative sort Set. A graph of constraints between the universe variables is maintained globally. To ensure the existence of a mapping of the universes to the positive integers, the graph of constraints must remain acyclic. Typing expressions that violate the acyclicity of the graph of constraints results in a Universe inconsistency error (see also Section 2.10).

4.1.2  Terms

Terms are built from sorts, variables, constants, abstractions, applications, local definitions, and products. From a syntactic point of view, types cannot be distinguished from terms, except that they cannot start by an abstraction or a constructor. More precisely the language of the Calculus of Inductive Constructions is built from the following rules.

  1. the sorts Set, Prop, Type(i) are terms.
  2. variables, hereafter ranged over by letters x, y, etc., are terms
  3. constants, hereafter ranged over by letters c, d, etc., are terms.
  4. if x is a variable and T, U are terms then ∀ x:T,U (forall x:TU in Coq concrete syntax) is a term. If x occurs in U, ∀ x:T,U reads as “for all x of type T, U”. As U depends on x, one says that ∀ x:T,U is a dependent product. If x does not occur in U then ∀ x:T,U reads as “if T then U”. A non dependent product can be written: TU.
  5. if x is a variable and T, u are terms then λ x:Tu (fun x:T  =>  u in Coq concrete syntax) is a term. This is a notation for the λ-abstraction of λ-calculus [8]. The term λ x:Tu is a function which maps elements of T to the expression u.
  6. if t and u are terms then (t u) is a term (t u in Coq concrete syntax). The term (t u) reads as “t applied to u”.
  7. if x is a variable, and t, T and u are terms then let x:=t:T in u is a term which denotes the term u where the variable x is locally bound to t of type T. This stands for the common “let-in” construction of functional programs such as ML or Scheme.
Free variables.

The notion of free variables is defined as usual. In the expressions λ x:TU and ∀ x:T, U the occurrences of x in U are bound.

Substitution.

The notion of substituting a term t to free occurrences of a variable x in a term u is defined as usual. The resulting term is written u{x/t}.

The logical vs programming readings.

The constructions of the Cic can be used to express both logical and programming notions, accordingly to the Curry-Howard correspondence between proofs and programs, and between propositions and types [38, 81, 39].

For instance, let us assume that nat is the type of natural numbers with zero element written 0 and that True is the always true proposition. Then → is used both to denote natnat which is the type of functions from nat to nat, to denote TrueTrue which is an implicative proposition, to denote natProp which is the type of unary predicates over the natural numbers, etc.

Let us assume that mult is a function of type natnatnat and eqnat a predicate of type natnatProp. The λ-abstraction can serve to build “ordinary” functions as in λ x:nat.(mult x x) (i.e. fun x:nat  =>  mult  x x in Coq notation) but may build also predicates over the natural numbers. For instance λ x:nat.(eqnat  x 0) (i.e. fun x:nat  =>  eqnat  x 0 in Coq notation) will represent the predicate of one variable x which asserts the equality of x with 0. This predicate has type natProp and it can be applied to any expression of type nat, say t, to give an object P t of type Prop, namely a proposition.

Furthermore forall x:natP x will represent the type of functions which associate to each natural number n an object of type (P n) and consequently represent the type of proofs of the formula “∀ xP(x)”.

4.2  Typing rules

As objects of type theory, terms are subjected to type discipline. The well typing of a term depends on a global environment and a local context.

Local context.

A local context is an ordered list of local declarations of names which we call variables. The declaration of some variable x is either a local assumption, written x:T (T is a type) or a local definition, written x:=t:T. We use brackets to write local contexts. A typical example is [x:T;y:=u:U;z:V]. Notice that the variables declared in a local context must be distinct. If Γ declares some x, we write x ∈ Γ. By writing (x:T) ∈ Γ we mean that either x:T is an assumption in Γ or that there exists some t such that x:=t:T is a definition in Γ. If Γ defines some x:=t:T, we also write (x:=t:T) ∈ Γ. For the rest of the chapter, the Γ::(y:T) denotes the local context Γ enriched with the local assumption y:T. Similarly, Γ::(y:=t:T) denotes the local context Γ enriched with the local definition (y:=t:T). The notation [] denotes the empty local context. By Γ1; Γ2 we mean concatenation of the local context Γ1 and the local context Γ2.

Global environment.

A global environment is an ordered list of global declarations. Global declarations are either global assumptions or global definitions, but also declarations of inductive objects. Inductive objects themselves declare both inductive or coinductive types and constructors (see Section 4.5).

A global assumption will be represented in the global environment as (c:T) which assumes the name c to be of some type T. A global definition will be represented in the global environment as c:=t:T which defines the name c to have value t and type T. We shall call such names constants. For the rest of the chapter, the E;c:T denotes the global environment E enriched with the global assumption c:T. Similarly, E;c:=t:T denotes the global environment E enriched with the global definition (c:=t:T).

The rules for inductive definitions (see Section 4.5) have to be considered as assumption rules to which the following definitions apply: if the name c is declared in E, we write cE and if c:T or c:=t:T is declared in E, we write (c : T) ∈ E.

Typing rules.

In the following, we define simultaneously two judgments. The first one E[Γ] ⊢ t : T means the term t is well-typed and has type T in the global environment E and local context Γ. The second judgment WF(E)[Γ] means that the global environment E is well-formed and the local context Γ is a valid local context in this global environment.

A term t is well typed in a global environment E iff there exists a local context Γ and a term T such that the judgment E[Γ] ⊢ t : T can be derived from the following rules.

W-Empty
 WF([])[]
W-Local-Assum
E[Γ] ⊢ T : s    s ∈  S    x ∉Γ 
 WF(E)[Γ::(x:T)]
W-Local-Def
E[Γ] ⊢ t : T    x ∉Γ 
 WF(E)[Γ::(x:=t:T)]
W-Global-Assum
E[] ⊢ T : s    s ∈  S    c ∉ E
 WF(E;c:T)[]
W-Global-Def
E[] ⊢ t : T   c ∉ E
 WF(E;c:=t:T)[]
Ax-Prop
 WF(E)[Γ]
E[Γ] ⊢ Prop : Type(1)
Ax-Set
 WF(E)[Γ]
E[Γ] ⊢ Set : Type(1)
Ax-Type
 WF(E)[Γ]
E[Γ] ⊢ Type(i) : Type(i+1)
Var
  WF(E)[Γ]     (x:T) ∈ Γ  or  (x:=t:T) ∈ Γ for some t
E[Γ] ⊢ x : T
Const
 WF(E)[Γ]    (c:T) ∈ E  or  (c:=t:T) ∈ E for some t 
E[Γ] ⊢ c : T
Prod-Prop
E[Γ] ⊢ T : s    s ∈  S    E[Γ::(x:T)] ⊢ U : Prop
 E[Γ] ⊢ ∀ x:T,U : Prop
Prod-Set
E[Γ] ⊢ T : s    s ∈{PropSet}       E[Γ::(x:T)] ⊢ U : Set
 E[Γ] ⊢ ∀ x:T,U : Set
Prod-Type
E[Γ] ⊢ T : Type(i)     E[Γ::(x:T)] ⊢ U : Type(i)
E[Γ] ⊢ ∀ x:T,U : Type(i)
Lam
E[Γ] ⊢ ∀ x:T,U : s     E[Γ::(x:T)] ⊢ t : U
E[Γ] ⊢ λ x:T.  t : ∀ x:TU
App
E[Γ] ⊢ t : ∀ x:U,T    E[Γ] ⊢ u : U
E[Γ] ⊢ (t u) : T{x/u}
Let
E[Γ] ⊢ t : T     E[Γ::(x:=t:T)] ⊢ u : U
E[Γ] ⊢ let x:=t:T in u : U{x/t}


Remark: Prod1 and Prod2 typing-rules make sense if we consider the semantic difference between Prop and Set:


Remark: We may have let x:=t:T in u well-typed without having ((λ x:Tut) well-typed (where T is a type of t). This is because the value t associated to x may be used in a conversion rule (see Section 4.3).

4.3  Conversion rules

In Cic, there is an internal reduction mechanism. In particular, it can decide if two programs are intentionally equal (one says convertible). Convertibility is described in this section.

β-reduction.

We want to be able to identify some terms as we can identify the application of a function to a given argument with its result. For instance the identity function over a given type T can be written λ x:Tx. In any global environment E and local context Γ, we want to identify any object a (of type T) with the application ((λ x:Txa). We define for this a reduction (or a conversion) rule we call β:

E[Γ] ⊢ ((λ x:Ttu) ▷β t{x/u

We say that t{x/u} is the β-contraction of ((λ x:Ttu) and, conversely, that ((λ x:Ttu) is the β-expansion of t{x/u}.

According to β-reduction, terms of the Calculus of Inductive Constructions enjoy some fundamental properties such as confluence, strong normalization, subject reduction. These results are theoretically of great importance but we will not detail them here and refer the interested reader to [24].

ι-reduction.

A specific conversion rule is associated to the inductive objects in the global environment. We shall give later on (see Section 4.5.3) the precise rules but it just says that a destructor applied to an object built from a constructor behaves as expected. This reduction is called ι-reduction and is more precisely studied in [126, 145].

δ-reduction.

We may have variables defined in local contexts or constants defined in the global environment. It is legal to identify such a reference with its value, that is to expand (or unfold) it into its value. This reduction is called δ-reduction and shows as follows.

E[Γ] ⊢ xδ t     if (x:=t:T) ∈ Γ         E[Γ] ⊢ cδ t     if (c:=t:T) ∈ E
ζ-reduction.

Coq allows also to remove local definitions occurring in terms by replacing the defined variable by its value. The declaration being destroyed, this reduction differs from δ-reduction. It is called ζ-reduction and shows as follows.

E[Γ] ⊢ let x:=u in tζ t{x/u}
η-expansion.

Another important concept is η-expansion. It is legal to identify any term t of functional type ∀ x:T, U with its so-called η-expansion λ x:T.  (t x) for x an arbitrary variable name fresh in t.


Remark: We deliberately do not define η-reduction:

λ x:T.  (t x) ⋫η t

This is because, in general, the type of t need not to be convertible to the type of λ x:T.  (t x). E.g., if we take f such that:

f : ∀ x:Type(2),Type(1)

then

λ x:Type(1),(f  x) : ∀ x:Type(1),Type(1)

We could not allow

λ x:Type(1),(f x) ⋫η f

because the type of the reduced term ∀ x:Type(2),Type(1) would not be convertible to the type of the original term ∀ x:Type(1),Type(1).

Convertibility.

Let us write E[Γ] ⊢ tu for the contextual closure of the relation t reduces to u in the global environment E and local context Γ with one of the previous reduction β, ι, δ or ζ.

We say that two terms t1 and t2 are βιδζη-convertible, or simply convertible, or equivalent, in the global environment E and local context Γ iff there exist terms u1 and u2 such that E[Γ] ⊢ t1 ▷ … ▷ u1 and E[Γ] ⊢ t2 ▷ … ▷ u2 and either u1 and u2 are identical, or they are convertible up to η-expansion, i.e. u1 is λ x:Tu1 and u2 x is recursively convertible to u1, or, symmetrically, u2 is λ x:Tu2 and u1 x is recursively convertible to u2. We then write E[Γ] ⊢ t1 =βδιζη t2.

The convertibility relation allows introducing a new typing rule which says that two convertible well-formed types have the same inhabitants.

4.4  Subtyping rules

At the moment, we did not take into account one rule between universes which says that any term in a universe of index i is also a term in the universe of index i+1 (this is the cumulativity rule of Cic). This property extends the equivalence relation of convertibility into a subtyping relation inductively defined by:

  1. if E[Γ] ⊢ t =βδιζη u then E[Γ] ⊢ tβδιζη u,
  2. if ij then E[Γ] ⊢ Type(i) ≤βδιζη Type(j),
  3. for any i, E[Γ] ⊢ Setβδιζη Type(i),
  4. E[Γ] ⊢ Propβδιζη Set, hence, by transitivity, E[Γ] ⊢ Propβδιζη Type(i), for any i
  5. if E[Γ] ⊢ T =βδιζη U and E[Γ::(x:T)] ⊢ T′ ≤βδιζη U′ then E[Γ] ⊢ ∀ x:T, T′ ≤βδιζη ∀ x:U, U′.

The conversion rule up to subtyping is now exactly:

Conv
E[Γ] ⊢ U : s    E[Γ] ⊢ t : T    E[Γ] ⊢ Tβδιζη U
E[Γ] ⊢ t : U
Normal form.

A term which cannot be any more reduced is said to be in normal form. There are several ways (or strategies) to apply the reduction rules. Among them, we have to mention the head reduction which will play an important role (see Chapter 8). Any term can be written as λ x1:T1.  … λ xk:Tk .  (t0 t1tn) where t0 is not an application. We say then that t0 is the head of t. If we assume that t0 is λ x:Tu0 then one step of β-head reduction of t is:

λ x1:T1.  … λ xk:Tk.  (λ x:T.  u0 t1… tn)  ▷   λ (x1:T1)…(xk:Tk).  (u0{x/t1t2 … tn)

Iterating the process of head reduction until the head of the reduced term is no more an abstraction leads to the β-head normal form of t:

t ▷ … ▷ λ x1:T1.  …λ xk:Tk.  (v u1 … um)

where v is not an abstraction (nor an application). Note that the head normal form must not be confused with the normal form since some ui can be reducible. Similar notions of head-normal forms involving δ, ι and ζ reductions or any combination of those can also be defined.

4.5  Inductive Definitions

Formally, we can represent any inductive definition as Ind[p](ΓI := ΓC) where:

These inductive definitions, together with global assumptions and global definitions, then form the global environment. Additionally, for any p there always exists ΓP=[a1:A1;…;ap:Ap] such that each (t:T)∈ΓI∪ΓC can be written as: ∀ΓP, T where ΓP is called the context of parameters.

Examples

The declaration for parameterized lists is:

Ind [1]
[ list : Set → Set ] :=
nil := ∀A : Set, list A
cons := ∀A : Set, A → list A → list A


which corresponds to the result of the Coq declaration:

Coq < Inductive list (A:Set) : Set :=
        | nil : list A
        | cons : A -> list A -> list A.

The declaration for a mutual inductive definition of tree and forest is:

Ind [1]


tree : Set
forest : Set

:=

node : forest → tree
emptyf : forest
consf : tree → forest → forest




which corresponds to the result of the Coq declaration:

Coq < Inductive tree : Set :=
          node : forest -> tree
      with forest : Set :=
        | emptyf : forest
        | consf : tree -> forest -> forest.

The declaration for a mutual inductive definition of even and odd is:

Ind [1]


even : nat → Prop
odd : nat → Prop

:=

even_O : even O
even_S : ∀n : nat, odd n → even (S n)
odd_S : ∀n : nat, even n → odd (S n)




which corresponds to the result of the Coq declaration:

Coq < Inductive even : nat -> Prop :=
        | even_O : even 0
        | even_S : forall n, odd n -> even (S n)
      with odd : nat -> Prop :=
        | odd_S : forall n, even n -> odd (S n).

4.5.1  Types of inductive objects

We have to give the type of constants in a global environment E which contains an inductive declaration.

Ind
 WF(E)[Γ]        Ind[p](ΓI := ΓC) ∈ E        (a:A)∈ΓI
E[Γ] ⊢ a : A
Constr
 WF(E)[Γ]        Ind[p](ΓI := ΓC) ∈ E        (c:C)∈ΓC
E[Γ] ⊢ c : C

4.5.2  Well-formed inductive definitions

We cannot accept any inductive declaration because some of them lead to inconsistent systems. We restrict ourselves to definitions which satisfy a syntactic criterion of positivity. Before giving the formal rules, we need a few definitions:

Definition

A type T is an arity of sort s if it converts to the sort s or to a product ∀ x:T,U with U an arity of sort s.

Examples

ASet is an arity of sort Set. ∀ A:Prop,AProp is an arity of sort Prop.

Definition

A type T is an arity if there is a s S such that T is an arity of sort s.

Examples

ASet and ∀ A:Prop,AProp are arities.

Definition

We say that T is a type of constructor of I in one of the following two cases:

Examples

nat and natnat are types of constructors of nat.
A:Type,list A and ∀ A:Type,Alist Alist A are constructors of list.

Definition

The type of constructor T will be said to satisfy the positivity condition for a constant X in the following cases:

The constant X occurs strictly positively in T in the following cases:

The type of constructor T of I satisfies the nested positivity condition for a constant X in the following cases:

For instance, if one considers the type

Inductive tree (A:Type) : Type :=
 | leaf : list A
 | node : A -> (nat -> tree A) -> tree A

Then every instantiated constructor of list A satisfies the nested positivity condition for list
  │
  ├─ concerning type list A of constructor nil:Type list A of constructor nil satisfies the positivity condition for listbecause list does not appear in any (real) arguments of the type of that constructor(primarily because list does not have any (real) arguments) ... (bullet 1)
  │
  ╰─ concerning type ∀ A → list A → list A of constructor cons:
       Type ∀ A : Type, A → list A → list A of constructor cons
       satisfies the positivity condition for list because:
       │
       ├─ list occurs only strictly positively in Type ... (bullet 3)
       │
       ├─ list occurs only strictly positively in A ... (bullet 3)
       │
       ├─ list occurs only strictly positively in list A ... (bullet 4)
       │
       ╰─ list satisfies the positivity condition for list A ... (bullet 1)

Correctness rules.

We shall now describe the rules allowing the introduction of a new inductive definition.

W-Ind
Let E be a global environment and ΓPIC are contexts such that ΓI is [I1:∀ ΓP,A1;…;Ik:∀ ΓP,Ak] and ΓC is [c1:∀ ΓP,C1;…;cn:∀ ΓP,Cn].
(EP] ⊢ Aj : sj)j=1…  k          (EIP] ⊢ Ci : sqi)i=1…  n
 WF(E;Ind[p](ΓI := ΓC))[Γ]
provided that the following side conditions hold:
  • k>0 and all of Ij and ci are distinct names for j=1… k and i=1… n,
  • p is the number of parameters of IndI := ΓC) and ΓP is the context of parameters,
  • for j=1… k we have that Aj is an arity of sort sj and IjE,
  • for i=1… n we have that Ci is a type of constructor of Iqi which satisfies the positivity condition for I1Ik and ci ∉ Γ ∪ E.

One can remark that there is a constraint between the sort of the arity of the inductive type and the sort of the type of its constructors which will always be satisfied for the impredicative sort Prop but may fail to define inductive definition on sort Set and generate constraints between universes for inductive definitions in the Type hierarchy.

Examples.

It is well known that existential quantifier can be encoded as an inductive definition. The following declaration introduces the second-order existential quantifier ∃ X.P(X).

Coq < Inductive exProp (P:Prop->Prop) : Prop :=
        exP_intro : forall X:Prop, P X -> exProp P.

The same definition on Set is not allowed and fails:

Coq < Fail Inductive exSet (P:Set->Prop) : Set :=
        exS_intro : forall X:Set, P X -> exSet P.
The command has indeed failed with message:
Large non-propositional inductive types must be in Type.

It is possible to declare the same inductive definition in the universe Type. The exType inductive definition has type (TypeiProp)→ Typej with the constraint that the parameter X of exT_intro has type Typek with k<j and ki.

Coq < Inductive exType (P:Type->Prop) : Type :=
        exT_intro : forall X:Type, P X -> exType P.
Sort-polymorphism of inductive types.

Inductive types declared in Type are polymorphic over their arguments in Type. If A is an arity of some sort and s is a sort, we write A/s for the arity obtained from A by replacing its sort with s. Especially, if A is well-typed in some global environment and local context, then A/s is typable by typability of all products in the Calculus of Inductive Constructions. The following typing rule is added to the theory.

Ind-Family
Let Ind[p](ΓI := ΓC) be an inductive definition. Let ΓP = [p1:P1;…;pp:Pp] be its context of parameters, ΓI = [I1:∀ ΓP,A1;…;Ik:∀ ΓP,Ak] its context of definitions and ΓC = [c1:∀ ΓP,C1;…;cn:∀ ΓP,Cn] its context of constructors, with ci a constructor of Iqi.

Let mp be the length of the longest prefix of parameters such that the m first arguments of all occurrences of all Ij in all Ck (even the occurrences in the hypotheses of Ck) are exactly applied to p1 … pm (m is the number of recursively uniform parameters and the pm remaining parameters are the recursively non-uniform parameters). Let q1, …, qr, with 0≤ rm, be a (possibly) partial instantiation of the recursively uniform parameters of ΓP. We have:





Ind[p](ΓI := ΓC) ∈ E
(E[] ⊢ ql : Pl)l=1… r
(E[] ⊢ Plβδιζη Pl{pu/qu}u=1… l−1)l=1… r
1 ≤ j ≤ k
E[] ⊢ Ij q1 … qr:∀ [pr+1:Pr+1;…;pp:Pp], (Aj)/sj

provided that the following side conditions hold:

  • ΓP is the context obtained from ΓP by replacing each Pl that is an arity with Pl for 1≤ lr (notice that Pl arity implies Pl arity since E[] ⊢ Plβδιζη Pl{pu/qu}u=1… l−1);
  • there are sorts si, for 1 ≤ ik such that, for ΓI = [I1:∀ ΓP,(A1)/s1;…;Ik:∀ ΓP,(Ak)/sk] we have (EIP] ⊢ Ci : sqi)i=1… n;
  • the sorts si are such that all eliminations, to Prop, Set and Type(j), are allowed (see Section 4.5.3).

Notice that if Ij q1 … qr is typable using the rules Ind-Const and App, then it is typable using the rule Ind-Family. Conversely, the extended theory is not stronger than the theory without Ind-Family. We get an equiconsistency result by mapping each Ind[p](ΓI := ΓC) occurring into a given derivation into as many different inductive types and constructors as the number of different (partial) replacements of sorts, needed for this derivation, in the parameters that are arities (this is possible because Ind[p](ΓI := ΓC) well-formed implies that Ind[p](ΓI := ΓC) is well-formed and has the same allowed eliminations, where ΓI is defined as above and ΓC = [c1:∀ ΓP,C1;…;cn:∀ ΓP,Cn]). That is, the changes in the types of each partial instance q1 … qr can be characterized by the ordered sets of arity sorts among the types of parameters, and to each signature is associated a new inductive definition with fresh names. Conversion is preserved as any (partial) instance Ij q1 … qr or Ci q1 … qr is mapped to the names chosen in the specific instance of Ind[p](ΓI := ΓC).

In practice, the rule Ind-Family is used by Coq only when all the inductive types of the inductive definition are declared with an arity whose sort is in the Type hierarchy. Then, the polymorphism is over the parameters whose type is an arity of sort in the Type hierarchy. The sort sj are chosen canonically so that each sj is minimal with respect to the hierarchy PropSetpType where Setp is predicative Set. More precisely, an empty or small singleton inductive definition (i.e. an inductive definition of which all inductive types are singleton – see paragraph 4.5.3) is set in Prop, a small non-singleton inductive type is set in Set (even in case Set is impredicative – see Section 4.8), and otherwise in the Type hierarchy.

Note that the side-condition about allowed elimination sorts in the rule Ind-Family is just to avoid to recompute the allowed elimination sorts at each instance of a pattern-matching (see section 4.5.3). As an example, let us consider the following definition:

Coq < Inductive option (A:Type) : Type := 
      | None : option A 
      | Some : A -> option A.

As the definition is set in the Type hierarchy, it is used polymorphically over its parameters whose types are arities of a sort in the Type hierarchy. Here, the parameter A has this property, hence, if option is applied to a type in Set, the result is in Set. Note that if option is applied to a type in Prop, then, the result is not set in Prop but in Set still. This is because option is not a singleton type (see section 4.5.3) and it would lose the elimination to Set and Type if set in Prop.

Coq < Check (fun A:Set => option A).
fun A : Set => option A
     : Set -> Set

Coq < Check (fun A:Prop => option A).
fun A : Prop => option A
     : Prop -> Set

Here is another example.

Coq < Inductive prod (A B:Type) : Type := pair : A -> B -> prod A B.

As prod is a singleton type, it will be in Prop if applied twice to propositions, in Set if applied twice to at least one type in Set and none in Type, and in Type otherwise. In all cases, the three kind of eliminations schemes are allowed.

Coq < Check (fun A:Set => prod A).
fun A : Set => prod A
     : Set -> Type -> Type

Coq < Check (fun A:Prop => prod A A).
fun A : Prop => prod A A
     : Prop -> Prop

Coq < Check (fun (A:Prop) (B:Set) => prod A B).
fun (A : Prop) (B : Set) => prod A B
     : Prop -> Set -> Set

Coq < Check (fun (A:Type) (B:Prop) => prod A B).
fun (A : Type) (B : Prop) => prod A B
     : Type -> Prop -> Type

4.5.3  Destructors

The specification of inductive definitions with arities and constructors is quite natural. But we still have to say how to use an object in an inductive type.

This problem is rather delicate. There are actually several different ways to do that. Some of them are logically equivalent but not always equivalent from the computational point of view or from the user point of view.

From the computational point of view, we want to be able to define a function whose domain is an inductively defined type by using a combination of case analysis over the possible constructors of the object and recursion.

Because we need to keep a consistent theory and also we prefer to keep a strongly normalizing reduction, we cannot accept any sort of recursion (even terminating). So the basic idea is to restrict ourselves to primitive recursive functions and functionals.

For instance, assuming a parameter A:Set exists in the local context, we want to build a function length of type list Anat which computes the length of the list, so such that (length (nil A)) = O and (length (cons A a l)) = (S (length l)). We want these equalities to be recognized implicitly and taken into account in the conversion rule.

From the logical point of view, we have built a type family by giving a set of constructors. We want to capture the fact that we do not have any other way to build an object in this type. So when trying to prove a property about an object m in an inductive definition it is enough to enumerate all the cases where m starts with a different constructor.

In case the inductive definition is effectively a recursive one, we want to capture the extra property that we have built the smallest fixed point of this recursive equation. This says that we are only manipulating finite objects. This analysis provides induction principles. For instance, in order to prove ∀ l:list A,(has_length A l (length l)) it is enough to prove:

which given the conversion equalities satisfied by length is the same as proving:

One conceptually simple way to do that, following the basic scheme proposed by Martin-Löf in his Intuitionistic Type Theory, is to introduce for each inductive definition an elimination operator. At the logical level it is a proof of the usual induction principle and at the computational level it implements a generic operator for doing primitive recursion over the structure.

But this operator is rather tedious to implement and use. We choose in this version of Coq to factorize the operator for primitive recursion into two more primitive operations as was first suggested by Th. Coquand in [28]. One is the definition by pattern-matching. The second one is a definition by guarded fixpoints.

The match…with …end construction.

The basic idea of this operator is that we have an object m in an inductive type I and we want to prove a property which possibly depends on m. For this, it is enough to prove the property for m = (ci u1upi) for each constructor of I. The Coq term for this proof will be written:

match m with  (c1 x11 ... x1p1) ⇒ f1  | … |  (cn xn1 ... xnpn) ⇒ fn  end

In this expression, if m eventually happens to evaluate to (ci u1upi) then the expression will behave as specified in its i-th branch and it will reduce to fi where the xi1xipi are replaced by the u1upi according to the ι-reduction.

Actually, for type-checking a match…with…end expression we also need to know the predicate P to be proved by case analysis. In the general case where I is an inductively defined n-ary relation, P is a predicate over n+1 arguments: the n first ones correspond to the arguments of I (parameters excluded), and the last one corresponds to object m. Coq can sometimes infer this predicate but sometimes not. The concrete syntax for describing this predicate uses the as…in…return construction. For instance, let us assume that I is an unary predicate with one parameter and one argument. The predicate is made explicit using the syntax:

match m as  x  in  I _ a  return  P  with  (c1 x11 ... x1p1) ⇒ f1  | … |  (cn xn1 ... xnpn) ⇒ fn end

The as part can be omitted if either the result type does not depend on m (non-dependent elimination) or m is a variable (in this case, m can occur in P where it is considered a bound variable). The in part can be omitted if the result type does not depend on the arguments of I. Note that the arguments of I corresponding to parameters must be _, because the result type is not generalized to all possible values of the parameters. The other arguments of I (sometimes called indices in the literature) have to be variables (a above) and these variables can occur in P. The expression after in must be seen as an inductive type pattern. Notice that expansion of implicit arguments and notations apply to this pattern. For the purpose of presenting the inference rules, we use a more compact notation:

case(m,(λ a xP), λ x11 ... x1p1f1  | … |  λ xn1...xnpnfn)
Allowed elimination sorts.

An important question for building the typing rule for match is what can be the type of λ a xP with respect to the type of m. If m:I and I:A and λ a xP : B then by [I:A|B] we mean that one can use λ a xP with m in the above match-construct.

Notations.

The [I:A|B] is defined as the smallest relation satisfying the following rules: We write [I|B] for [I:A|B] where A is the type of I.

The case of inductive definitions in sorts Set or Type is simple. There is no restriction on the sort of the predicate to be eliminated.

Prod
[(I x):A′|B′]
[I:∀ x:A, A′|∀ x:A, B′]
Set & Type
s1 ∈ {Set,Type(j)}        s2 ∈  S
[I:s1|Is2]

The case of Inductive definitions of sort Prop is a bit more complicated, because of our interpretation of this sort. The only harmless allowed elimination, is the one when predicate P is also of sort Prop.

Prop
[I:Prop|IProp]

Prop is the type of logical propositions, the proofs of properties P in Prop could not be used for computation and are consequently ignored by the extraction mechanism. Assume A and B are two propositions, and the logical disjunction AB is defined inductively by:

Coq < Inductive or (A B:Prop) : Prop :=
        or_introl : A -> or A B | or_intror : B -> or A B.

The following definition which computes a boolean value by case over the proof of or A B is not accepted:

Coq < Fail Definition choice (A B: Prop) (x:or A B) :=
        match x with or_introl _ _ a => true | or_intror _ _ b => false end.
The command has indeed failed with message:
Incorrect elimination of "x" in the inductive type "or":
the return type has sort "Set" while it should be "Prop".
Elimination of an inductive object of sort Prop
is not allowed on a predicate in sort Set
because proofs can be eliminated only to build proofs.

From the computational point of view, the structure of the proof of (or A B) in this term is needed for computing the boolean value.

In general, if I has type Prop then P cannot have type ISet, because it will mean to build an informative proof of type (P m) doing a case analysis over a non-computational object that will disappear in the extracted program. But the other way is safe with respect to our interpretation we can have I a computational object and P a non-computational one, it just corresponds to proving a logical property of a computational object.

In the same spirit, elimination on P of type IType cannot be allowed because it trivially implies the elimination on P of type ISet by cumulativity. It also implies that there are two proofs of the same property which are provably different, contradicting the proof-irrelevance property which is sometimes a useful axiom:

Coq < Axiom proof_irrelevance : forall (P : Prop) (x y : P), x=y.
proof_irrelevance is assumed

The elimination of an inductive definition of type Prop on a predicate P of type IType leads to a paradox when applied to impredicative inductive definition like the second-order existential quantifier exProp defined above, because it give access to the two projections on this type.

Empty and singleton elimination

There are special inductive definitions in Prop for which more eliminations are allowed.

Prop-extended
I  is an empty or singleton definition   s ∈  S
[I:Prop|Is]

A singleton definition has only one constructor and all the arguments of this constructor have type Prop. In that case, there is a canonical way to interpret the informative extraction on an object in that type, such that the elimination on any sort s is legal. Typical examples are the conjunction of non-informative propositions and the equality. If there is an hypothesis h:a=b in the local context, it can be used for rewriting not only in logical propositions but also in any type.

Coq < Print eq_rec.
eq_rec = 
fun (A : Type) (x : A) (P : A -> Set) => eq_rect x P
     : forall (A : Type) (x : A) (P : A -> Set),
       P x -> forall y : A, x = y -> P y
Argument A is implicit
Argument scopes are [type_scope _ _ _ _ _]

Coq < Extraction eq_rec.
(** val eq_rec : 'a1 -> 'a2 -> 'a1 -> 'a2 **)
let eq_rec _ f _ =
  f

An empty definition has no constructors, in that case also, elimination on any sort is allowed.

Type of branches.

Let c be a term of type C, we assume C is a type of constructor for an inductive type I. Let P be a term that represents the property to be proved. We assume r is the number of parameters and p is the number of arguments.

We define a new type {c:C}P which represents the type of the branch corresponding to the c:C constructor.

{c:(I p1pr t1tp)}P≡ (P t1…  tp c
{c:∀ x:T,C}P≡ ∀ x:T,{(c x):C}P 

We write {c}P for {c:C}P with C the type of c.

Example.

The following term in concrete syntax:

match t as l return P' with
| nil _ => t1
| cons _ hd tl => t2
end

can be represented in abstract syntax as

case(t,P,f1 | f2)

where

  P=λ l . P
  f1=t1
  f2=λ (hd:nat) . λ (tl:list nat) . t2

According to the definition:

{(nil nat)}P ≡ {(nil nat) : (list nat)}P ≡ (P (nil nat))

{(cons nat)}P ≡{(cons nat) : (natlist natlist nat)}P
≡∀ n:nat, {(cons nat n) : list natlist nat)}P
≡∀ n:nat, ∀ l:list nat, {(cons nat n l) : list nat)}P
≡∀ n:nat, ∀ l:list nat,(P (cons nat n l)).

Given some P, then {(nil nat)}P represents the expected type of f1, and {(cons nat)}P represents the expected type of f2.

Typing rule.

Our very general destructor for inductive definition enjoys the following typing rule

match
E[Γ] ⊢ c : (I q1…  qr t1…  ts)   E[Γ] ⊢ P : B  [(I q1qr)|B]    (E[Γ] ⊢ fi : {(cpi q1qr)}P)i=1…  l
E[Γ] ⊢ case(c,P,f1|… |fl) : (P t1…  ts c)

provided I is an inductive type in a definition Ind[r](ΓI := ΓC) with ΓC = [c1:C1;…;cn:Cn] and cp1cpl are the only constructors of I.

Example.

Below is a typing rule for the term shown in the previous example:

E[Γ] ⊢ t : (list nat)    E[Γ] ⊢ P : B    [(list nat)|B]    E[Γ] ⊢ f1 : {(nil nat)}P    E[Γ] ⊢ f2 : {(cons nat)}P
E[Γ] ⊢ case(t,P,f1|f2) : (P t)
Definition of ι-reduction.

We still have to define the ι-reduction in the general case.

A ι-redex is a term of the following form:

case((cpi q1qr a1am),P,f1|… | fl)

with cpi the i-th constructor of the inductive type I with r parameters.

The ι-contraction of this term is (fi a1am) leading to the general reduction rule:

case((cpi q1qr a1am),P,f1|… | fn) ▷ι (fi a1… am

4.5.4  Fixpoint definitions

The second operator for elimination is fixpoint definition. This fixpoint may involve several mutually recursive definitions. The basic concrete syntax for a recursive set of mutually recursive declarations is (with Γi contexts):

fix f1 (Γ1) :A1:=t1 with … with  fnn) :An:=tn

The terms are obtained by projections from this set of declarations and are written

fix f1 (Γ1) :A1:=t1 with … with  fnn) :An:=tn for fi

In the inference rules, we represent such a term by

Fix fi{f1:A1′:=t1′ … fn:An′:=tn′}

with ti′ (resp. Ai′) representing the term ti abstracted (resp. generalized) with respect to the bindings in the context Γi, namely ti′=λ Γiti and Ai′=∀ Γi, Ai.

Typing rule

The typing rule is the expected one for a fixpoint.

Fix
(E[Γ] ⊢ Ai : si)i=1… n     (E[Γ,f1:A1,…,fn:An] ⊢ ti : Ai)i=1… n
E[Γ] ⊢ Fix fi{f1:A1:=t1 … fn:An:=tn} : Ai

Any fixpoint definition cannot be accepted because non-normalizing terms allow proofs of absurdity. The basic scheme of recursion that should be allowed is the one needed for defining primitive recursive functionals. In that case the fixpoint enjoys a special syntactic restriction, namely one of the arguments belongs to an inductive type, the function starts with a case analysis and recursive calls are done on variables coming from patterns and representing subterms. For instance in the case of natural numbers, a proof of the induction principle of type

∀ P:natProp, (P O)→(∀ n:nat, (P n)→(P (S n)))→ ∀ n:nat, (P n)

can be represented by the term:

λ P:natProp. λ f:(P O).  λ g:(∀ n:nat, (P n)→(P (S n))) . 
Fix h{h:∀ n:nat, (P n):=λ n:nat.  case(n,P,f | λ p:nat.  (g p (h p)))}

Before accepting a fixpoint definition as being correctly typed, we check that the definition is “guarded”. A precise analysis of this notion can be found in [67]. The first stage is to precise on which argument the fixpoint will be decreasing. The type of this argument should be an inductive definition. For doing this, the syntax of fixpoints is extended and becomes

Fix fi{f1/k1:A1:=t1 … fn/kn:An:=tn}

where ki are positive integers. Each ki represents the index of pararameter of fi, on which fi is decreasing. Each Ai should be a type (reducible to a term) starting with at least ki products ∀ y1:B1,… ∀ yki:Bki, Ai and Bki an is unductive type.

Now in the definition ti, if fj occurs then it should be applied to at least kj arguments and the kj-th argument should be syntactically recognized as structurally smaller than yki

The definition of being structurally smaller is a bit technical. One needs first to define the notion of recursive arguments of a constructor. For an inductive definition Ind[r](ΓI := ΓC), if the type of a constructor c has the form ∀ p1:P1,… ∀ pr:Pr, ∀ x1:T1, … ∀ xr:Tr, (Ij p1pr t1ts), then the recursive arguments will correspond to Ti in which one of the Il occurs.

The main rules for being structurally smaller are the following:
Given a variable y of type an inductive definition in a declaration Ind[r](ΓI := ΓC) where ΓI is [I1:A1;…;Ik:Ak], and ΓC is [c1:C1;…;cn:Cn]. The terms structurally smaller than y are:

The following definitions are correct, we enter them using the Fixpoint command as described in Section 1.3.4 and show the internal representation.

Coq < Fixpoint plus (n m:nat) {struct n} : nat :=
        match n with
        | O => m
        | S p => S (plus p m)
        end.
plus is defined
plus is recursively defined (decreasing on 1st argument)

Coq < Print plus.
plus = 
fix plus (n m : nat) {struct n} : nat :=
  match n with
  | 0 => m
  | S p => S (plus p m)
  end
     : nat -> nat -> nat
Argument scopes are [nat_scope nat_scope]

Coq < Fixpoint lgth (A:Set) (l:list A) {struct l} : nat :=
        match l with
        | nil _ => O
        | cons _ a l' => S (lgth A l')
        end.
lgth is defined
lgth is recursively defined (decreasing on 2nd argument)

Coq < Print lgth.
lgth = 
fix lgth (A : Set) (l : list A) {struct l} : nat :=
  match l with
  | nil _ => 0
  | cons _ _ l' => S (lgth A l')
  end
     : forall A : Set, list A -> nat
Argument scopes are [type_scope _]

Coq < Fixpoint sizet (t:tree) : nat := let (f) := t in S (sizef f)
       with sizef (f:forest) : nat :=
        match f with
        | emptyf => O
        | consf t f => plus (sizet t) (sizef f)
        end.
sizet is defined
sizef is defined
sizet, sizef are recursively defined (decreasing respectively on 1st,
1st arguments)

Coq < Print sizet.
sizet = 
fix sizet (t : tree) : nat := let (f) := t in S (sizef f)
with sizef (f : forest) : nat :=
  match f with
  | emptyf => 0
  | consf t f0 => plus (sizet t) (sizef f0)
  end
for sizet
     : tree -> nat

Reduction rule

Let F be the set of declarations: f1/k1:A1:=t1fn/kn:An:=tn. The reduction for fixpoints is:

(Fix fi{Fa1aki) ▷ι ti{(fk/Fix fk{F})k=1… n}  a1… aki

when aki starts with a constructor. This last restriction is needed in order to keep strong normalization and corresponds to the reduction for primitive recursive operators. The following reductions are now possible:

  plus (S (S O)) (S O)ιS (plus (S O) (S O))
 ιS (S (plus O (S O)))
 ιS (S (S O))

Mutual induction

The principles of mutual induction can be automatically generated using the Scheme command described in Section 13.1.

4.6  Admissible rules for global environments

From the original rules of the type system, one can show the admissibility of rules which change the local context of definition of objects in the global environment. We show here the admissible rules that are used used in the discharge mechanism at the end of a section.

Abstraction.

One can modify a global declaration by generalizing it over a previously assumed constant c. For doing that, we need to modify the reference to the global declaration in the subsequent global environment and local context by explicitly applying this constant to the constant c′.

Below, if Γ is a context of the form [y1:A1;…;yn:An], we write ∀ x:U,Γ{c/x} to mean [y1:∀ x:U,A1{c/x};…;yn:∀ x:U,An{c/x}] and E{|Γ|/|Γ|c}. to mean the parallel substitution E{y1/(y1 c)}…{yn/(yn c)}.

First abstracting property:
 WF(E;c:U;E′;c′:=t:T;E″)[Γ]
 WF(E;c:U;E′;c′:=λ x:U.  t{c/x}:∀ x:U,T{c/x}; E″{c′/(c′ c)})[Γ{c/(c c′)}]
 WF(E;c:U;E′;c′:T;E″)[Γ]
 WF(E;c:U;E′;c′:∀ x:U,T{c/x}; E″{c′/(c′ c)})[Γ{c/(c c′)}]
 WF(E;c:U;E′;Ind[p](ΓI := ΓC);E″)[Γ]
 WF
(E;c:U;E′;Ind[p+1](∀ x:UI{c/x} := ∀ x:UC{c/x});E″{|ΓIC|/|ΓICc})
[Γ{|ΓIC|/|ΓICc}]

One can similarly modify a global declaration by generalizing it over a previously defined constant c′. Below, if Γ is a context of the form [y1:A1;…;yn:An], we write Γ{c/u} to mean [y1:A1{c/u};…;yn:An{c/u}].

Second abstracting property:
 WF(E;c:=u:U;E′;c′:=t:T;E″)[Γ]
 WF(E;c:=u:U;E′;c′:=(let x:=u:U in t{c/x}):T{c/u};E″)[Γ]
 WF(E;c:=u:U;E′;c′:T;E″)[Γ]
 WF(E;c:=u:U;E′;c′:T{c/u};E″)[Γ]
 WF(E;c:=u:U;E′;Ind[p](ΓI := ΓC);E″)[Γ]
 WF(E;c:=u:U;E′;Ind[p](ΓI{c/u} := ΓC{c/u});E″)[Γ]
Pruning the local context.

If one abstracts or substitutes constants with the above rules then it may happen that some declared or defined constant does not occur any more in the subsequent global environment and in the local context. One can consequently derive the following property.

First pruning property:
 WF(E;c:U;E′)[Γ]      c  does not occur in E′ and Γ
 WF(E;E′)[Γ]
Second pruning property:
 WF(E;c:=u:U;E′)[Γ]      c  does not occur in E′ and Γ
 WF(E;E′)[Γ]

4.7  Co-inductive types

The implementation contains also co-inductive definitions, which are types inhabited by infinite objects. More information on co-inductive definitions can be found in [68, 70, 71].

4.8  The Calculus of Inductive Construction with impredicative Set

Coq can be used as a type-checker for the Calculus of Inductive Constructions with an impredicative sort Set by using the compiler option -impredicative-set. For example, using the ordinary coqtop command, the following is rejected.

Coq < Fail Definition id: Set := forall X:Set,X->X.
The command has indeed failed with message:
The term "forall X : Set, X -> X" has type "Type"
while it is expected to have type "Set".

while it will type-check, if one uses instead the coqtop -impredicative-set command.

The major change in the theory concerns the rule for product formation in the sort Set, which is extended to a domain in any sort:

Prod
E[Γ] ⊢ T : s    s ∈  S       E[Γ::(x:T)] ⊢ U : Set
 E[Γ] ⊢ ∀ x:T,U : Set

This extension has consequences on the inductive definitions which are allowed. In the impredicative system, one can build so-called large inductive definitions like the example of second-order existential quantifier (exSet).

There should be restrictions on the eliminations which can be performed on such definitions. The eliminations rules in the impredicative system for sort Set become:

Set
s ∈ {PropSet}
[I:Set|Is]
    
I  is a small inductive definition    s ∈ {Type(i)}
[I:Set|Is]