Core Language (modulo changes, new features and bugs)
The provided core language (see Recalc.Language
)
implements a simple, dependently typed functional programming language
based on LambdaPi1, which is extended by
- cell references (sized, nested multi-dimensional arrays), and
- implicit arguments.
Syntax
Cell references and ranges are similar to Excel:
Where and must be printable, and
characters '[]\
need to be escaped with \
. The corresponding
prefixed versions may only contain alphanumeric characters and ._~
.
When is not (or partially) provided, it is implicitly defined as the and/or of the location where it is being read.
Terms and types share the syntax of a standard lambda calculus with dependent functions:
where includes signed decimals, int
, bool
, false
,
and true
. Identifiers are case-insensitive.
Strings may include variables which will be implicitly converted to string
, a
must escape "
, \
, a
, \b
, \f
, \n
, \r
,
\t
, and \v
. It must also escape $
if it is followed by a {
, otherwise
${var}
will be treated as a term show(val)
.
When a function type ignores its argument it can be written as and similarly for implicit arguments.
Typechecking and Inference
The typing judgements and evaluation rules follow directly the LambdaPi
implementation (a simple bi-directional typechecker). The only notable
difference is that we introduce another context which gives
the type and value of a cell.
The original rules are trivially extended, passing the additional around. For example, the rule would be extended as follows:
Cell References
Rules concerning cell references and cell ranges make use of the new type constructor for tensors (not part of the surface language) and are quite simple:
The first one just states that a simple cell reference has the same type as the type of the term at that location.
The second one is similar. Using the top left corner of a selection it makes sure that all cells in the referred range share that type . The type of a cell range is a tensor with elements .
(where is an identity for all types except tensor types for which it is defined as )
The dimensions of the resulting tensor are given by
That is, referring to an range of cells gives a tensor which potentially can become higher order when the elements are tensors by themselves.
Implicit Arguments
Implicit arguments are covered by
which follows the regular application rule, and
which uses unification to solve for implicict arguments (and leaves uninferrable untouched out). That is, is the set of variables which are left unsolved by , and is a substitution from implicit variables to the solved terms that unifies and .
The procedure works similar to in the sense that it returns unsolved implicits and a substitution with resolved terms.
Light-weight Type Classes
Using the procedure the language provides a way to overload operators
and then function show
:
For example, show
has type {t} -> {{show: t -> string}} -> t -> string
which
will unify t
with the type of its argument and then seek for a known implementation
{show: t -> string}
and insert it if it can find one.
Instances that will be d are:
string conversion:
{show: bool -> string}
{show: int -> string}
{show: string -> string}
plus (plus
, for which +
is syntactic sugar):
{plus: int -> int -> int}
{plus: string -> string -> string}
minus (minus
, for which -
is syntactic sugar):
{minus: int -> int -> int}
mult (mult
, for which *
is syntactic sugar):
{mult: int -> int -> int}
(arithmetic overloads for tensors TBD)
Evaluation
The evaluation rules for the language extended to spreadsheets are equally simple. They abstract the recalculation using the context introduced earlier and extends existing rules accordingly.
The new rules become:
(where flattens a tensor value and acts as an identity otherwise)
Thus references to cell ranges behave as nested arrays with dependently typed sizes:
-
Andres LΓΆh, Conor McBride, Wouter Swierstra. A Tutorial Implementation of a Dependently Typed Lambda Calculus. ↩