A compiler for arithmetic expressions #

THIS FILE IS SYNCHRONIZED WITH MATHLIB4. Any changes to this file require a corresponding PR to mathlib4.

A formalization of the correctness of a compiler from arithmetic expressions to machine language described by McCarthy and Painter, which is considered the first proof of compiler correctness.

Main definitions #

expr : the syntax of the source language.
value : the semantics of the source language.
instruction: the syntax of the target language.
step : the semantics of the target language.
compile : the compiler.

Main results #

compiler_correctness: the compiler correctness theorem.

Notation #

≃[t]/ac: partial equality of two machine states excluding registers x ≥ t and the accumulator.
≃[t] : partial equality of two machine states excluding registers x ≥ t.

References #

John McCarthy and James Painter. Correctness of a compiler for arithmetic expressions. In Mathematical Aspects of Computer Science, volume 19 of Proceedings of Symposia in Applied Mathematics. American Mathematical Society, 1967. http://jmc.stanford.edu/articles/mcpain/mcpain.pdf

Tags #

compiler

Types #

source

@[reducible]

def arithcc.word :

Type

Value type shared by both source and target languages.

Equations

arithcc.word = ℕ

source

@[reducible]

def arithcc.identifier :

Type

Variable identifier type in the source language.

Equations

arithcc.identifier = string

source

@[reducible]

def arithcc.register :

Type

Equations

arithcc.register = ℕ

source

theorem arithcc.register.lt_succ_self (r : arithcc.register) :

r < r + 1

source

theorem arithcc.register.le_of_lt_succ {r₁ r₂ : arithcc.register} :

r₁ < r₂ + 1 → r₁ ≤ r₂

Source language #

source

@[protected, instance]

def arithcc.expr.inhabited :

inhabited arithcc.expr

source

inductive arithcc.expr :

Type

const : arithcc.word → arithcc.expr
var : arithcc.identifier → arithcc.expr
sum : arithcc.expr → arithcc.expr → arithcc.expr

An expression in the source language is formed by constants, variables, and sums.

Instances for arithcc.expr

arithcc.expr.has_sizeof_inst
arithcc.expr.inhabited

source

@[simp]

def arithcc.value :

arithcc.expr → (arithcc.identifier → arithcc.word) → arithcc.word

The semantics of the source language (2.1).

Equations

arithcc.value (s₁.sum s₂) ξ = arithcc.value s₁ ξ + arithcc.value s₂ ξ
arithcc.value (arithcc.expr.var x) ξ = ξ x
arithcc.value (arithcc.expr.const v) _x = v

Target language #

source

@[protected, instance]

def arithcc.instruction.inhabited :

inhabited arithcc.instruction

source

inductive arithcc.instruction :

Type

li : arithcc.word → arithcc.instruction
load : arithcc.register → arithcc.instruction
sto : arithcc.register → arithcc.instruction
add : arithcc.register → arithcc.instruction

Instructions of the target machine language (3.1--3.7).

Instances for arithcc.instruction

arithcc.instruction.has_sizeof_inst
arithcc.instruction.inhabited

source

structure arithcc.state :

Type

ac : arithcc.word
rs : arithcc.register → arithcc.word

Machine state consists of the accumulator and a vector of registers.

The paper uses two functions c and a for accessing both the accumulator and registers. For clarity, we make accessing the accumulator explicit and use read/write for registers.

Instances for arithcc.state