Context Navigation

← Previous Change
Next Change →

Changeset 7ff30d07 for doc

Timestamp:

Jun 14, 2016, 1:23:18 PM (9 years ago)

Author:

Thierry Delisle <tdelisle@…>

Branches:

ADT, aaron-thesis, arm-eh, ast-experimental, cleanup-dtors, ctor, deferred_resn, demangler, enum, forall-pointer-decay, gc_noraii, jacob/cs343-translation, jenkins-sandbox, master, memory, new-ast, new-ast-unique-expr, new-env, no_list, persistent-indexer, pthread-emulation, qualifiedEnum, resolv-new, with_gc

Children:

545ef59, c738ca4, ee51534

Parents:

6cbc25a (diff), c8c03683 (diff)
Note: this is a merge changeset, the changes displayed below correspond to the merge itself.
Use the (diff) links above to see all the changes relative to each parent.

Message:

Merge branch 'master' of plg.uwaterloo.ca:software/cfa/cfa-cc

Conflicts:

Makefile.in
aclocal.m4
configure
src/Makefile.in
src/Tests/Context.c
src/driver/Makefile.in
src/examples/Makefile.in
src/examples/ctxts.c
src/examples/esskaykay.c
src/libcfa/Makefile.in

Location:

doc

Files:

: 4 edited

LaTeXmacros/common.tex (modified) (6 diffs)
user/Cdecl.fig (modified) (3 diffs)
user/user.tex (modified) (40 diffs)
working/resolver_design.md (modified) (2 diffs)

Legend:

: Unmodified
: Added
: Removed

doc/LaTeXmacros/common.tex

-              r6cbc25a
+              r7ff30d07
 %% Created On       : Sat Apr  9 10:06:17 2016
 %% Last Modified By : Peter A. Buhr
 %% Last Modified On : Fri Jun  3 09:32:19 2016
 %% Update Count     : 62
+%% Last Modified On : Fri Jun 10 16:35:25 2016
+%% Update Count     : 101
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 …
 %   \belowdisplayskip \abovedisplayskip
 %}
+\usepackage{pslatex}                                                                    % reduce size of san serif font
 \usepackage{relsize}                                    % must be after change to small or selects old size
 …
     \vskip 50\p@
   }}
 \renewcommand\section{\@startsection{section}{1}{\z@}{-3.5ex \@plus -1ex \@minus -.2ex}{2.3ex \@plus .2ex}{\normalfont\large\bfseries}}
+\renewcommand\section{\@startsection{section}{1}{\z@}{-3.5ex \@plus -1ex \@minus -.2ex}{1.75ex \@plus .2ex}{\normalfont\large\bfseries}}
 \renewcommand\subsection{\@startsection{subsection}{2}{\z@}{-3.25ex \@plus -1ex \@minus -.2ex}{1.5ex \@plus .2ex}{\normalfont\normalsize\bfseries}}
 \renewcommand\subsubsection{\@startsection{subsubsection}{3}{\z@}{-2.5ex \@plus -1ex \@minus -.2ex}{1.0ex \@plus .2ex}{\normalfont\normalsize\bfseries}}
 …
 \newenvironment{quote2}{%
         \list{}{\lstset{resetmargins=true}\leftmargin=\parindent\rightmargin\leftmargin}%
+        \list{}{\lstset{resetmargins=true,aboveskip=0pt,belowskip=0pt}\topsep=4pt\parsep=0pt\leftmargin=\parindent\rightmargin\leftmargin}%
         \item\relax
 }{%
 …
 language=CFA,
 columns=flexible,
 basicstyle=\sf,
+basicstyle=\linespread{0.9}\sf,
 stringstyle=\tt,
 tabsize=4,
 …
 showlines=true,
 aboveskip=4pt,
 belowskip=2pt,
+belowskip=3pt,
 moredelim=**[is][\color{red}]{®}{®}, % red highlighting
+% moredelim=**[is][\color{blue}]{¢}{¢}, % blue highlighting
+moredelim=**[is][\color{blue}]{ß}{ß}, % blue highlighting
+moredelim=**[is][\color{OliveGreen}]{¢}{¢}, % green highlighting
 moredelim=[is][\lstset{keywords={}}]{¶}{¶}, % temporarily turn off keywords
+% literate={\\`}{\raisebox{0.3ex}{\ttfamily\upshape \hspace*{-2pt}`}}1, % escape \`, otherwise used for red highlighting
+literate={...}{{$\dots$}}1 {<-}{{$\leftarrow$}}1 {=>}{{$\Rightarrow$}}1,
+% replace/adjust listing characters that look bad in sanserif
+literate={-}{\raisebox{-0.15ex}{\texttt{-}}}1 {^}{\raisebox{0.6ex}{$\scriptscriptstyle\land\,$}}1
+        {~}{\raisebox{0.3ex}{$\scriptstyle\sim\,$}}1 {_}{\makebox[1.2ex][c]{\rule{1ex}{0.1ex}}}1 {`}{\ttfamily\upshape\hspace*{-0.1ex}`}1
+        {<-}{$\leftarrow$}2 {=>}{$\Rightarrow$}2 {...}{$\dots$}2,
 }%
 \lstMakeShortInline©    % single-character for \lstinline
-\makeatletter
-% replace/adjust listings characters that look bad in sanserif
-\lst@CCPutMacro
-\lst@ProcessOther{"22}{\lst@ttfamily{"}{\raisebox{0.3ex}{\ttfamily\upshape "}}} % replace double quote
-\lst@ProcessOther{"27}{\lst@ttfamily{'}{\raisebox{0.3ex}{\ttfamily\upshape '\hspace*{-2pt}}}} % replace single quote
-\lst@ProcessOther{"2D}{\lst@ttfamily{-}{\textbf{\texttt{-}}}} % replace minus
-\lst@ProcessOther{"3C}{\lst@ttfamily{<}{\textbf{\texttt{<}}}} % replace less than
-\lst@ProcessOther{"3E}{\lst@ttfamily{>}{\textbf{\texttt{>}}}} % replace greater than
-\lst@ProcessOther{"5E}{\raisebox{0.4ex}{$\scriptstyle\land\,$}} % replace circumflex
-\lst@ProcessOther{"5F}{\lst@ttfamily{\char95}{{\makebox[1.2ex][c]{\rule{1ex}{0.1ex}}}}} % replace underscore
-\lst@ProcessOther{"60}{\lst@ttfamily{`}{\raisebox{0.3ex}{\ttfamily\upshape \hspace*{-2pt}`}}} % replace backquote
-\lst@ProcessOther{"7E}{\raisebox{0.3ex}{$\scriptstyle\sim\,$}} % replace tilde
-%\lst@ProcessOther{"7E}{\raisebox{-.4ex}[1ex][0pt]{\textasciitilde}} % lower tilde
-\@empty\z@\@empty % NECESSARY DO NOT REMOVE
-\makeatother
 % Local Variables: %

doc/user/Cdecl.fig

-              r6cbc25a
+              r7ff30d07
 1 1.00 45.00 90.00
 1275 1950 1500
 1 0 50 -1 4 9 0.0000 2 105 90 1350 1650 0\001
 1 0 50 -1 4 9 0.0000 2 105 90 1500 1650 1\001
 1 0 50 -1 4 9 0.0000 2 105 90 1650 1650 2\001
 1 0 50 -1 4 9 0.0000 2 105 90 1800 1650 3\001
 1 0 50 -1 4 9 0.0000 2 105 90 1950 1650 4\001
 1 0 50 -1 4 9 0.0000 2 75 75 1200 1325 x\001
+1 0 50 -1 4 10 0.0000 2 105 90 1350 1650 0\001
+1 0 50 -1 4 10 0.0000 2 105 90 1500 1650 1\001
+1 0 50 -1 4 10 0.0000 2 105 90 1650 1650 2\001
+1 0 50 -1 4 10 0.0000 2 105 90 1800 1650 3\001
+1 0 50 -1 4 10 0.0000 2 105 90 1950 1650 4\001
+1 0 50 -1 4 10 0.0000 2 75 75 1200 1325 x\001
 -6
 2325 1200 3600 1350
 …
 2 0 1 0 7 50 -1 -1 0.000 0 0 -1 0 0 5
 1200 3600 1200 3600 1350 2850 1350 2850 1200
 1 0 50 -1 4 9 0.0000 2 105 90 2925 1325 0\001
 1 0 50 -1 4 9 0.0000 2 105 90 3075 1325 1\001
 1 0 50 -1 4 9 0.0000 2 105 90 3225 1325 2\001
 1 0 50 -1 4 9 0.0000 2 105 90 3375 1325 3\001
 1 0 50 -1 4 9 0.0000 2 105 90 3525 1325 4\001
+1 0 50 -1 4 10 0.0000 2 105 90 2925 1325 0\001
+1 0 50 -1 4 10 0.0000 2 105 90 3075 1325 1\001
+1 0 50 -1 4 10 0.0000 2 105 90 3225 1325 2\001
+1 0 50 -1 4 10 0.0000 2 105 90 3375 1325 3\001
+1 0 50 -1 4 10 0.0000 2 105 90 3525 1325 4\001
 -6
 2 0 1 0 7 50 -1 -1 0.000 0 0 -1 0 0 5
 …
 1275 2850 1275
 -6
 1 0 50 -1 4 9 0.0000 2 75 75 2400 1325 x\001
+1 0 50 -1 4 10 0.0000 2 75 75 2400 1325 x\001
 -6

doc/user/user.tex

-              r6cbc25a
+              r7ff30d07
 %% Created On       : Wed Apr  6 14:53:29 2016
 %% Last Modified By : Peter A. Buhr
 %% Last Modified On : Fri Jun  3 09:49:31 2016
 %% Update Count     : 281
+%% Last Modified On : Fri Jun 10 16:38:22 2016
+%% Update Count     : 394
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 …
 % inline code ©...© (copyright symbol) emacs: C-q M-)
+% red highlighting ®...® (registered trademark sumbol) emacs: C-q M-.
+% latex escape §...§ (section symbol) emacs: C-q M-'
+% red highlighting ®...® (registered trademark symbol) emacs: C-q M-.
+% blue highlighting ß...ß (sharp s symbol) emacs: C-q M-_
+% green highlighting ¢...¢ (cent symbol) emacs: C-q M-"
+% Latex escape §...§ (section symbol) emacs: C-q M-'
 % keyword escape ¶...¶ (pilcrow symbol) emacs: C-q M-^
 % math escape $...$ (dollar symbol)
 \documentclass[openright,twoside]{article}
+\documentclass[twoside,11pt]{article}
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 …
 \usepackage{fullpage,times,comment}
 \usepackage{epic,eepic}
 \usepackage{upquote}                                                                    % switch curled `' to straight `'
+\usepackage{upquote}                                                                    % switch curled `'" to straight `'"
 \usepackage{xspace}
 \usepackage{varioref}                                                                   % extended references
 \usepackage{listings}                                                                   % format program code
 \usepackage{footmisc}                                                                   % support label/reference in footnote
+\usepackage[flushmargin]{footmisc}                                              % support label/reference in footnote
 \usepackage{latexsym}                                   % \Box glyph
 \usepackage{mathptmx}                                   % better math font with "times"
+\usepackage[usenames]{color}
 \usepackage[pagewise]{lineno}
 \renewcommand{\linenumberfont}{\scriptsize\sffamily}
+\input{common}                                          % bespoke macros used in the document
 \usepackage[dvips,plainpages=false,pdfpagelabels,pdfpagemode=UseNone,colorlinks=true,pagebackref=true,linkcolor=blue,citecolor=blue,urlcolor=blue,pagebackref=true,breaklinks=true]{hyperref}
 \usepackage{breakurl}
 \renewcommand{\UrlFont}{\small\sf}
-%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
-% Bespoke macros used in the document.
-\input{common}
-%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 % Names used in the document.
 \newcommand{\Version}{1.0.0}
 \newcommand{\CS}{C\raisebox{-0.9ex}{\large$^\sharp$}\xspace}
+\newcommand{\Textbf}[2][red]{{\color{#1}{\textbf{#2}}}}
+\newcommand{\Emph}[2][red]{{\color{#1}\textbf{\emph{#2}}}}
+\newcommand{\R}[1]{\Textbf{#1}}
+\newcommand{\B}[1]{{\Textbf[blue]{#1}}}
+\newcommand{\G}[1]{{\Textbf[OliveGreen]{#1}}}
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 …
 \CFA\footnote{Pronounced ``C-for-all'', and written \CFA, CFA, or \CFL.} is a modern general-purpose programming-language, designed an as evolutionary step forward from the C programming language.
 The syntax of the \CFA language builds from C, and should look immediately familiar to C programmers.
+The syntax of the \CFA language builds from C, and should look immediately familiar to C/\CC programmers.
 % Any language feature that is not described here can be assumed to be using the standard C11 syntax.
 \CFA adds many modern programming-language features that directly leads to increased \emph{safety} and \emph{productivity}, while maintaining interoperability with existing C programs and achieving C performance.
+\CFA adds many modern programming-language features that directly lead to increased \emph{safety} and \emph{productivity}, while maintaining interoperability with existing C programs and achieving C performance.
 Like C, \CFA is a statically typed, procedural language with a low-overhead runtime, meaning there is no global garbage-collection.
 The primary new features include parametric-polymorphism routines and types, exceptions, concurrency, and modules.
 …
 New programs can be written in \CFA using a combination of C and \CFA features.
 \CC had a similar goal 30 years ago, but has struggled over the intervening time to incorporate modern programming-language features because of early design choices.
 \CFA has 30 years of hindsight and clean starting point.
+\CFA has 30 years of hindsight and a clean starting point.
 Like \CC, there may be both an old and new ways to achieve the same effect.
 For example, the following programs compare the \CFA and C I/O mechanisms.
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 #include <fstream>
 int main( void ) {
         int x = 0, y = 1, z = 2;
         sout | x | y | z | endl;
+        ®sout | x | y | z | endl;®
+}
 \end{lstlisting}
 …
 int main( void ) {
         int x = 0, y = 1, z = 2;
         printf( "%d %d %d\n", x, y, z );
+        ®printf( "%d %d %d\n", x, y, z );®
+}
 \end{lstlisting}
 …
 \end{quote2}
 Both programs output the same result.
 While the \CFA I/O looks similar to the \CC output style, there are several important differences, such as automatic spacing between variables as in Python (see also~\VRef{s:IOLibrary}).
 This document is a reference manual for the \CFA programming language, targeted at \CFA programmers.
+While the \CFA I/O looks similar to the \CC output style, there are important differences, such as automatic spacing between variables as in Python (see also~\VRef{s:IOLibrary}).
+This document is a user manual for the \CFA programming language, targeted at \CFA programmers.
 Implementers may refer to the \CFA Programming Language Specification for details about the language syntax and semantics.
 In its current state, this document covers the intended core features of the language.
 …
 \section{History}
 The \CFA project started with K-W C~\cite{Till89,Buhr94a}, which extended C with new declaration syntax, multiple return values from routines, and extended assignment capabilities using the notion of tuples.
+The \CFA project started with K-W C~\cite{Buhr94a,Till89}, which extended C with new declaration syntax, multiple return values from routines, and extended assignment capabilities using the notion of tuples.
 (See~\cite{Werther96} for some similar work, but for \CC.)
 The original \CFA project~\cite{Ditchfield92} extended the C type system with parametric polymorphism and overloading, as opposed to the \CC approach of object-oriented extensions to the C type-system.
 A first implementation of the core Cforall language was created~\cite{Bilson03,Esteves04}, but at the time there was little interesting in extending C, so work did not continue.
 As the saying goes, ``What goes around, comes around'', and there is now renewed interest in the C programming language because of legacy code-bases, so the \CFA project has been restarted.
+As the saying goes, ``What goes around, comes around.'', and there is now renewed interest in the C programming language because of legacy code-bases, so the \CFA project has been restarted.
 …
 Even with all its problems, C is a very popular programming language because it allows writing software at virtually any level in a computer system without restriction.
 For systems programming, where direct access to hardware and dealing with real-time issues is a requirement, C is usually the language of choice.
+For system programming, where direct access to hardware and dealing with real-time issues is a requirement, C is usually the language of choice.
 As well, there are millions of lines of C legacy code, forming the base for many software development projects (especially on UNIX systems).
 The TIOBE index (\url{http://www.tiobe.com/tiobe_index}) for March 2016 shows programming-language popularity, with Java 20.5\%, C 14.5\%, \CC 6.7\%, \CS 4.3\%, Python 4.3\%, and all other programming languages below 3\%.
+As well, for 30 years, C has been the number 1 and 2 most popular programming language:
+\begin{center}
+\setlength{\tabcolsep}{1.5ex}
+\begin{tabular}{@{}r|c|c|c|c|c|c|c@{}}
+Ranking & 2016  & 2011  & 2006  & 2001  & 1996  & 1991  & 1986          \\
+\hline
+Java    & 1             & 1             & 1             & 3             & 29    & -             & -                     \\
+\hline
+\R{C}   & \R{2} & \R{2} & \R{2} & \R{1} & \R{1} & \R{1} & \R{1}         \\
+\hline
+\CC             & 3             & 3             & 3             & 2             & 2             & 2             & 7                     \\
+\end{tabular}
+\end{center}
 Hence, C is still an extremely important programming language, with double the usage of \CC, where \CC itself is largely C code.
 Finally, love it or hate it, C has been an important and influential part of computer science for 40 years and it appears it will continue to be for many more years.
 …
 The goal of this project is to engineer modern language features into C in an evolutionary rather than revolutionary way.
 \CC~\cite{c++,ANSI14:C++} is an example of a similar project;
 however, it largely extended the language, and did not address existing problems.\footnote{%
+however, it largely extended the language, and did not address many existing problems.\footnote{%
 Two important existing problems addressed were changing the type of character literals from ©int© to ©char© and enumerator from ©int© to the type of its enumerators.}
 Fortran~\cite{Fortran08}, Ada~\cite{Ada12}, and Cobol~\cite{Cobol14} are examples of programming languages that took an evolutionary approach, where modern language features are added and problems fixed within the framework of the existing language.
+Fortran~\cite{Fortran08}, Ada~\cite{Ada12}, and Cobol~\cite{Cobol14} are examples of programming languages that took an evolutionary approach, where modern language features (e.g., objects, concurrency) are added and problems fixed within the framework of the existing language.
 Java~\cite{Java8}, Go~\cite{Go}, Rust~\cite{Rust} and D~\cite{D} are examples of the revolutionary approach for modernizing C/\CC, resulting in a new language rather than an extension of the descendent.
 These languages have different syntax and semantics from C, and do not interoperate directly with C, largely because of garbage collection.
 As a result, there is a significant learning curve to move to these languages, and C legacy-code must be rewritten.
 These costs can be prohibitive for many companies with a large software base in C/\CC, and many programmers that require retraining to a new programming language.
 The result of this project is a language that is largely backwards compatible with C11~\cite{C11}, but containing many modern language features and fixing some of the well known C problems.
 Without significant extension to the C programming language, C will be unable to cope with the needs of modern programming problems and programmers;
+These costs can be prohibitive for many companies with a large software base in C/\CC, and a significant number of programmers requiring retraining to a new programming language.
+The result of this project is a language that is largely backwards compatible with C11~\cite{C11}, but fixing some of the well known C problems and containing many modern language features.
+Without significant extension to the C programming language, it is becoming unable to cope with the needs of modern programming problems and programmers;
 as a result, it will fade into disuse.
 Considering the large body of existing C code and programmers, there is significant impetus to ensure C is transformed into a modern programming language.
 …
 This feature allows users of \CFA to take advantage of the existing panoply of C libraries from inside their \CFA code.
 In fact, one of the biggest issues for any new programming language is establishing a minimum level of library code to support a large body of activities.
 Programming-language developers often state that adequate library support takes more work than designing and implementing the language itself.
+Language developers often state that adequate library support takes more work than designing and implementing the language itself.
 Like \CC, \CFA starts with immediate access to all exiting C libraries, and in many cases, can easily wrap library routines with simpler and safer interfaces, at very low cost.
+Hence, \CFA begins by leveraging the large repository of C libraries with little cost.
 However, it is necessary to differentiate between C and \CFA code because of name overloading, as for \CC.
 For example, the C math-library provides the following routines for computing the absolute value of the basic type: ©abs©, ©labs©, ©llabs©, ©fabs©, ©fabsf©, ©fabsl©, ©cabsf©, ©cabs©, and ©cabsl©.
 Whereas, \CFA wraps each of these routines into one with the common name ©abs©.
+For example, the C math-library provides the following routines for computing the absolute value of the basic types: ©abs©, ©labs©, ©llabs©, ©fabs©, ©fabsf©, ©fabsl©, ©cabsf©, ©cabs©, and ©cabsl©.
+Whereas, \CFA wraps each of these routines into ones with the common name ©abs©:
 \begin{lstlisting}
 char abs( char );
 extern "C" {
 int abs( int );                         // use default C routine for int
+int abs( int );                                 // use default C routine for int
 } // extern "C"
 long int abs( long int );
 …
 \section[Compiling CFA Program]{Compiling \CFA Program}
+The command ©cfa© is used to compile \CFA program(s).
+This command works like the GNU ©gcc©\index{gcc} command, e.g.:
+The command ©cfa© is used to compile \CFA program(s), and is based on the GNU ©gcc©\index{gcc} command, e.g.:
 \begin{lstlisting}
 cfa§\indexc{cfa}\index{compilation!cfa@©cfa©}§ [ gcc-options ] C/§\CFA§-files [ assembler/loader-files ]
 …
 By default, \CFA programs having the following ©gcc© flags turned on:
 \begin{description}
 \item\hspace*{-4pt}\Indexc{-std=gnu99}\index{compilation option!-std=gnu99@{©-std=gnu99©}}
+\item\hspace*{-0.6ex}\Indexc{-std=gnu99}\index{compilation option!-std=gnu99@{©-std=gnu99©}}
 The 1999 C standard plus GNU extensions.
 \item\hspace*{-4pt}\Indexc{-fgnu89-¶inline¶}\index{compilation option!-fgnu89-inline@{©-fgnu89-¶inline¶©}}
+\item\hspace*{-0.6ex}\Indexc{-fgnu89-¶inline¶}\index{compilation option!-fgnu89-inline@{©-fgnu89-¶inline¶©}}
 Use the traditional GNU semantics for inline routines in C99 mode, which allows inline routines in header files.
 \end{description}
 The following new \CFA option is available:
 \begin{description}
 \item\hspace*{-4pt}\Indexc{-CFA}\index{compilation option!-CFA@{©-CFA©}}
+\item\hspace*{-0.6ex}\Indexc{-CFA}\index{compilation option!-CFA@{©-CFA©}}
 Only the C preprocessor and the \CFA translator steps are performed and the transformed program is written to standard output, which makes it possible to examine the code generated by the \CFA translator.
 \end{description}
 …
 The following preprocessor variables are available:
 \begin{description}
 \item\hspace*{-4pt}\Indexc{__CFA__}\index{preprocessor variables!__CFA__@{©__CFA__©}}
+\item\hspace*{-0.6ex}\Indexc{__CFA__}\index{preprocessor variables!__CFA__@{©__CFA__©}}
 is always available during preprocessing and its value is the current major \Index{version number} of \CFA.\footnote{
 The C preprocessor allows only integer values in a preprocessor variable so a value like ``\Version'' is not allowed.
 Hence, the need to have three variables for the major, minor and patch version number.}
 \item\hspace*{-4pt}\Indexc{__CFA_MINOR__}\index{preprocessor variables!__CFA_MINOR__@{©__CFA_MINOR__©}}
+\item\hspace*{-0.6ex}\Indexc{__CFA_MINOR__}\index{preprocessor variables!__CFA_MINOR__@{©__CFA_MINOR__©}}
 is always available during preprocessing and its value is the current minor \Index{version number} of \CFA.
 \item\hspace*{-4pt}\Indexc{__CFA_PATCH__}\index{preprocessor variables!__CFA_PATCH__@©__CFA_PATCH__©}
+\item\hspace*{-0.6ex}\Indexc{__CFA_PATCH__}\index{preprocessor variables!__CFA_PATCH__@©__CFA_PATCH__©}
 is always available during preprocessing and its value is the current patch \Index{version number} of \CFA.
 \item\hspace*{-4pt}\Indexc{__CFORALL__}\index{preprocessor variables!__CFORALL__@©__CFORALL__©}
 is always available during preprocessing and it has no value.
+\item\hspace*{-0.6ex}\Indexc{__CFA__}\index{preprocessor variables!__CFA__@©__CFA__©} and \Indexc{__CFORALL__}\index{preprocessor variables!__CFORALL__@©__CFORALL__©}
+are always available during preprocessing and have no value.
 \end{description}
 …
 \begin{lstlisting}
 #ifndef __CFORALL__
 #include <stdio.h>                      // C header file
+#include <stdio.h>                              // C header file
 #else
 #include <fstream>                      // §\CFA{}§ header file
+#include <fstream>                              // §\CFA{}§ header file
 #endif
 \end{lstlisting}
 …
 Numeric constants are extended to allow \Index{underscore}s within constants\index{constant!underscore}, e.g.:
 \begin{lstlisting}
 ®_®147®_®483®_®648;                            // decimal constant
 _ul;                                          // decimal unsigned long constant
 _377;                                          // octal constant
 x_ff_ff;                                       // hexadecimal constant
 x_ef3d_aa5c;                           // hexadecimal constant
 .141_592_654;                          // floating point constant
 _e_+1_00;                                     // floating point constant
 x_ff_ff_p_3;                           // hexadecimal floating point
 x_1.ffff_ffff_p_128_l;         // hexadecimal floating point long constant
 L_"\x_ff_ee";                           // wide character constant
+®_®147®_®483®_®648;                                    // decimal constant
+_ul;                                                  // decimal unsigned long constant
+_377;                                                  // octal constant
+x_ff_ff;                                               // hexadecimal constant
+x_ef3d_aa5c;                                   // hexadecimal constant
+.141_592_654;                                  // floating point constant
+_e_+1_00;                                             // floating point constant
+x_ff_ff_p_3;                                   // hexadecimal floating point
+x_1.ffff_ffff_p_128_l;                 // hexadecimal floating point long constant
+L_"\x_ff_ee";                                   // wide character constant
 \end{lstlisting}
 The rules for placement of underscores is as follows:
 …
 \end{enumerate}
 It is significantly easier to read and enter long constants when they are broken up into smaller groupings (most cultures use comma or period among digits for the same purpose).
 This extension is backwards compatible, matches with the use of underscore in variable names, and appears in Ada and Java.
+This extension is backwards compatible, matches with the use of underscore in variable names, and appears in Ada and Java 8.
 …
 \begin{quote2}
 \begin{tabular}{@{}ll@{}}
 \begin{lstlisting}[aboveskip=0pt,belowskip=0pt]
 int *x[ 5 ]
+\begin{lstlisting}
+int *x[5]
 \end{lstlisting}
+&
 …
 For example, a routine returning a pointer to an array of integers is defined and used in the following way:
 \begin{lstlisting}
 int (*f())[ 5 ] {...};  // definition mimics usage
 ... (*f())[ 3 ] += 1;
+int (*f())[5] {...};                    // definition mimics usage
+... (*f())[3] += 1;
 \end{lstlisting}
 Essentially, the return type is wrapped around the routine name in successive layers (like an onion).
 While attempting to make the two contexts consistent was a laudable goal, it has not worked out in practice.
+\CFA provides its own type, variable and routine declarations, using a slightly different syntax.
+The new declarations place modifiers to the left of the base type, while C declarations place modifiers to the right of the base type.
+\CFA provides its own type, variable and routine declarations, using a different syntax.
+The new declarations place qualifiers to the left of the base type, while C declarations place qualifiers to the right of the base type.
+In the following example, \R{red} is for the base type and \B{blue} is for the qualifiers.
+The \CFA declarations move the qualifiers to the left of the base type, i.e., blue to the left of the red, while the qualifiers have the same meaning but are ordered left to left to right to specify the variable's type.
+\begin{quote2}
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{lstlisting}
+ß[5] *ß ®int® x1;
+ß* [5]ß ®int® x2;
+ß[* [5] int]ß f®( int p )®;
+\end{lstlisting}
+&
+\begin{lstlisting}
+®int® ß*ß x1 ß[5]ß;
+®int® ß(*ßx2ß)[5]ß;
+ßint (*ßf®( int p )®ß)[5]ß;
+\end{lstlisting}
+\end{tabular}
+\end{quote2}
 The only exception is bit field specification, which always appear to the right of the base type.
+C and the new \CFA declarations may appear together in the same program block, but cannot be mixed within a specific declaration.
+In \CFA declarations, the same tokens are used as in C: the character ©*© is used to indicate a pointer, square brackets ©[©\,©]© are used to represent an array, and parentheses ©()© are used to indicate a routine parameter.
+However, unlike C, \CFA type declaration tokens are specified from left to right and the entire type specification is distributed across all variables in the declaration list.
+% Specifically, the character ©*© is used to indicate a pointer, square brackets ©[©\,©]© are used to represent an array or function return value, and parentheses ©()© are used to indicate a routine parameter.
+However, unlike C, \CFA type declaration tokens are distributed across all variables in the declaration list.
 For instance, variables ©x© and ©y© of type pointer to integer are defined in \CFA as follows:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 ®* int x, y;®
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{lstlisting}
+®*® int x, y;
 \end{lstlisting}
+&
 \begin{lstlisting}
 int *x, *y;
+int ®*®x, ®*®y;
 \end{lstlisting}
 \end{tabular}
 …
 Other examples are:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{\hspace{20pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c@{\hspace{20pt}}}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{\hspace{2em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c@{\hspace{2em}}}{\textbf{C}} \\
 \begin{lstlisting}
 [ 5 ] int z;
 …
 \end{quote2}
 All type qualifiers, i.e., ©const© and ©volatile©, are used in the normal way with the new declarations but appear left to right, e.g.:
+All type qualifiers, e.g., ©const©, ©volatile©, etc., are used in the normal way with the new declarations and also appear left to right, e.g.:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{\hspace{20pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c@{\hspace{20pt}}}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{1em}}l@{\hspace{1em}}l@{}}
+\multicolumn{1}{c@{\hspace{1em}}}{\textbf{\CFA}}        & \multicolumn{1}{c@{\hspace{1em}}}{\textbf{C}} \\
 \begin{lstlisting}
 const * const int x;
 …
 \end{tabular}
 \end{quote2}
 All declaration qualifiers, i.e., ©extern©, ©static©, etc., are used in the normal way with the new declarations but can only appear at the start of a \CFA routine declaration,\footnote{\label{StorageClassSpecifier}
+All declaration qualifiers, e.g., ©extern©, ©static©, etc., are used in the normal way with the new declarations but can only appear at the start of a \CFA routine declaration,\footnote{\label{StorageClassSpecifier}
 The placement of a storage-class specifier other than at the beginning of the declaration specifiers in a declaration is an obsolescent feature.~\cite[\S~6.11.5(1)]{C11}} e.g.:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{\hspace{20pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c@{\hspace{20pt}}}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{\hspace{2em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c@{\hspace{2em}}}{\textbf{C}} \\
 \begin{lstlisting}
 extern [ 5 ] int x;
 …
 e.g.:
 \begin{lstlisting}
 x;                                              // int x
 *y;                                             // int *y
 f( p1, p2 );                    // int f( int p1, int p2 );
 f( p1, p2 ) {}                  // int f( int p1, int p2 ) {}
 \end{lstlisting}
 As stated above, the two styles of declaration may appear together in the same block.
+x;                                                              // int x
+*y;                                                             // int *y
+f( p1, p2 );                                    // int f( int p1, int p2 );
+f( p1, p2 ) {}                                  // int f( int p1, int p2 ) {}
+\end{lstlisting}
+Finally, new \CFA declarations may appear together with C declarations in the same program block, but cannot be mixed within a specific declaration.
 Therefore, a programmer has the option of either continuing to use traditional C declarations or take advantage of the new style.
 Clearly, both styles need to be supported for some time due to existing C-style header-files, particularly for UNIX systems.
 …
 The new declaration syntax can be used in other contexts where types are required, e.g., casts and the pseudo-routine ©sizeof©:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 y = (* int)x;
 i = sizeof([ 5 ] * int);
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{lstlisting}
+y = (®* int®)x;
+i = sizeof(®[ 5 ] * int®);
 \end{lstlisting}
+&
 \begin{lstlisting}
 y = (int *)x;
 i = sizeof(int *[ 5 ]);
+y = (®int *®)x;
+i = sizeof(®int *[ 5 ]®);
 \end{lstlisting}
 \end{tabular}
 …
 \CFA also supports a new syntax for routine definition, as well as ISO C and K\&R routine syntax.
 The point of the new syntax is to allow returning multiple values from a routine~\cite{CLU,Galletly96}, e.g.:
+The point of the new syntax is to allow returning multiple values from a routine~\cite{Galletly96,CLU}, e.g.:
 \begin{lstlisting}
 ®[ int o1, int o2, char o3 ]® f( int i1, char i2, char i3 ) {
 …
 Declaration qualifiers can only appear at the start of a routine definition, e.g.:
 \begin{lstlisting}
 extern [ int x ] g( int y ) {§\,§}
+®extern® [ int x ] g( int y ) {§\,§}
 \end{lstlisting}
 Lastly, if there are no output parameters or input parameters, the brackets and/or parentheses must still be specified;
 in both cases the type is assumed to be void as opposed to old style C defaults of int return type and unknown parameter types, respectively, as in:
 \begin{lstlisting}
 [§\,§] g();                                             // no input or output parameters
 [ void ] g( void );                     // no input or output parameters
+[§\,§] g();                                                     // no input or output parameters
+[ void ] g( void );                             // no input or output parameters
 \end{lstlisting}
 …
 [ i, j, ch ] = f( 3, 'a', ch );
 \end{lstlisting}
 The list of return values from f and the grouping on the left-hand side of the assignment is called a tuple and discussed in Section 12.
+The list of return values from f and the grouping on the left-hand side of the assignment is called a \newterm{return list} and discussed in Section 12.
 \CFA style declarations cannot be used to declare parameters for K\&R style routine definitions because of the following ambiguity:
 …
 \begin{lstlisting}
 typedef int foo;
 int f( int (* foo) );           // foo is redefined as a parameter name
+int f( int (* foo) );                   // foo is redefined as a parameter name
 \end{lstlisting}
 The string ``©int (* foo)©'' declares a C-style named-parameter of type pointer to an integer (the parenthesis are superfluous), while the same string declares a \CFA style unnamed parameter of type routine returning integer with unnamed parameter of type pointer to foo.
 The redefinition of a type name in a parameter list is the only context in C where the character ©*© can appear to the left of a type name, and \CFA relies on all type modifier characters appearing to the right of the type name.
+The redefinition of a type name in a parameter list is the only context in C where the character ©*© can appear to the left of a type name, and \CFA relies on all type qualifier characters appearing to the right of the type name.
 The inability to use \CFA declarations in these two contexts is probably a blessing because it precludes programmers from arbitrarily switching between declarations forms within a declaration contexts.
 C-style declarations can be used to declare parameters for \CFA style routine definitions, e.g.:
 \begin{lstlisting}
 [ int ] f( * int, int * );      // returns an integer, accepts 2 pointers to integers
 [ * int, int * ] f( int );      // returns 2 pointers to integers, accepts an integer
+[ int ] f( * int, int * );              // returns an integer, accepts 2 pointers to integers
+[ * int, int * ] f( int );              // returns 2 pointers to integers, accepts an integer
 \end{lstlisting}
 The reason for allowing both declaration styles in the new context is for backwards compatibility with existing preprocessor macros that generate C-style declaration-syntax, as in:
 \begin{lstlisting}
 #define ptoa( n, d ) int (*n)[ d ]
 int f( ptoa(p,5) ) ...          // expands to int f( int (*p)[ 5 ] )
 [ int ] f( ptoa(p,5) ) ...      // expands to [ int ] f( int (*p)[ 5 ] )
+int f( ptoa( p, 5 ) ) ...               // expands to int f( int (*p)[ 5 ] )
+[ int ] f( ptoa( p, 5 ) ) ...   // expands to [ int ] f( int (*p)[ 5 ] )
 \end{lstlisting}
 Again, programmers are highly encouraged to use one declaration form or the other, rather than mixing the forms.
 \subsection{Returning Values}
 Named return values handle the case where it is necessary to define a local variable whose value is then returned in a ©return© statement, as in:
+\subsection{Named Return Values}
+\Index{Named return values} handle the case where it is necessary to define a local variable whose value is then returned in a ©return© statement, as in:
 \begin{lstlisting}
 int f() {
 …
 \end{lstlisting}
 Because the value in the return variable is automatically returned when a \CFA routine terminates, the ©return© statement \emph{does not} contain an expression, as in:
+\begin{lstlisting}
+®[ int x ]® f() {
+        ... x = 0; ... x = y; ...
+        ®return;® // implicitly return x
+}
+\end{lstlisting}
+When the return is encountered, the current value of ©x© is returned to the calling routine.
+\newline
+\begin{minipage}{\linewidth}
+\begin{lstlisting}
+®[ int x, int y ]® f() {
+        int z;
+        ... x = 0; ... y = z; ...
+        ®return;® // implicitly return x, y
+}
+\end{lstlisting}
+\end{minipage}
+\newline
+When the return is encountered, the current values of ©x© and ©y© are returned to the calling routine.
 As well, ``falling off the end'' of a routine without a ©return© statement is permitted, as in:
 \begin{lstlisting}
 [ int x ] f() {
         ... x = 0; ... x = y; ...
 } // implicitly return x
 \end{lstlisting}
 In this case, the current value of ©x© is returned to the calling routine just as if a ©return© had been encountered.
+[ int x, int y ] f() {
+        ...
+} // implicitly return x, y
+\end{lstlisting}
+In this case, the current values of ©x© and ©y© are returned to the calling routine just as if a ©return© had been encountered.
 …
 as well, parameter names are optional, e.g.:
 \begin{lstlisting}
 [ int x ] f ();                         // returning int with no parameters
 [ * int ] g (int y);            // returning pointer to int with int parameter
 [ ] h (int,char);                       // returning no result with int and char parameters
 [ * int,int ] j (int);          // returning pointer to int and int, with int parameter
+[ int x ] f ();                                 // returning int with no parameters
+[ * int ] g (int y);                    // returning pointer to int with int parameter
+[ ] h (int,char);                               // returning no result with int and char parameters
+[ * int,int ] j (int);                  // returning pointer to int and int, with int parameter
 \end{lstlisting}
 This syntax allows a prototype declaration to be created by cutting and pasting source text from the routine definition header (or vice versa).
 It is possible to declare multiple routine-prototypes in a single declaration, but the entire type specification is distributed across \emph{all} routine names in the declaration list (see~\VRef{s:Declarations}), e.g.:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 [ int ] f(int), g;
 …
 for example, the following is incorrect:
 \begin{lstlisting}
 * [ int x ] f () fp;            // routine name "f" is not allowed
+* [ int x ] f () fp;                    // routine name "f" is not allowed
 \end{lstlisting}
 …
 While in theory default arguments can be simulated with overloading, as in:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{default arguments}}  & \multicolumn{1}{c}{\textbf{overloading}}      \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{default arguments}}   & \multicolumn{1}{c}{\textbf{overloading}}      \\
 \begin{lstlisting}
 void p( int x, int y = 2, int z = 3 ) {...}
 …
 \CFA allows \Index{type nesting}, and type qualification of the nested types, where as C hoists\index{type hoisting} (refactors) nested types into the enclosing scope and has no type qualification.
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l|l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{C Type Nesting}}     & \multicolumn{1}{c}{\textbf{C Implicit Hoisting}}      & \multicolumn{1}{|c}{\textbf{\CFA}}    \\
+\begin{tabular}{@{}l@{\hspace{3em}}l|l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{C Type Nesting}}      & \multicolumn{1}{c}{\textbf{C Implicit Hoisting}}      & \multicolumn{1}{|c}{\textbf{\CFA}}    \\
 \hline
 \begin{lstlisting}
 …
 \section{Tuples}
+\section{Lexical List}
 In C and \CFA, lists of elements appear in several contexts, such as the parameter list for a routine call.
 (More contexts are added shortly.)
 A list of such elements is called a tuple.
 The general syntax of a tuple is:
+A list of such elements is called a \newterm{lexical list}.
+The general syntax of a lexical list is:
 \begin{lstlisting}
 [ §\emph{exprlist}§ ]
 \end{lstlisting}
 where ©$\emph{exprlist}$© is a list of one or more expressions separated by commas.
 The brackets, ©[]©, allow differentiating between tuples and expressions containing the C comma operator.
 The following are examples of tuples:
+The brackets, ©[]©, allow differentiating between lexical lists and expressions containing the C comma operator.
+The following are examples of lexical lists:
 \begin{lstlisting}
 [ x, y, z ]
 …
 \begin{figure}
 \centering
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 ®L1:® for ( ... ) {
 …
 \vspace*{0.25in}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{C}}        \\
 \begin{lstlisting}
 ®L1®: for ( ... ) {
 …
 Therefore, the ©case© clause is extended with a list of values, as in:
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{\hspace{20pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c@{\hspace{20pt}}}{\textbf{C}}        \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{\hspace{2em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c@{\hspace{2em}}}{\textbf{C}} \\
 \begin{lstlisting}
 switch ( i ) {
 …
 In addition, two forms of subranges are allowed to specify case values: the GNU C form and a new \CFA form.
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{\hspace{20pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c@{\hspace{20pt}}}{\textbf{GNU C}}    \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{\hspace{2em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c@{\hspace{2em}}}{\textbf{GNU C}}     \\
 \begin{lstlisting}
 switch ( i ) {
 …
 Auto type-inferencing occurs in a declaration where a variable's type is inferred from its initialization expression type.
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}ll@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CC}}        & \multicolumn{1}{c}{©gcc©}\index{gcc} \\
+\begin{tabular}{@{}l@{\hspace{3em}}ll@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CC}} & \multicolumn{1}{c}{©gcc©}\index{gcc} \\
 \begin{lstlisting}
 …
 The general case is printing out a sequence of variables separated by whitespace.
 \begin{quote2}
 \begin{tabular}{@{}l@{\hspace{30pt}}l@{}}
 \multicolumn{1}{c@{\hspace{30pt}}}{\textbf{\CFA}}       & \multicolumn{1}{c}{\textbf{\CC}}      \\
+\begin{tabular}{@{}l@{\hspace{3em}}l@{}}
+\multicolumn{1}{c@{\hspace{3em}}}{\textbf{\CFA}}        & \multicolumn{1}{c}{\textbf{\CC}}      \\
 \begin{lstlisting}
 int x = 0, y = 1, z = 2;

doc/working/resolver_design.md

-              r6cbc25a
+              r7ff30d07
 ## Conversion Costs ##
+Each possible resolution of an expression has a _cost_ consisting of four
+integer components: _unsafe_ conversion cost, _polymorphic_ specialization
+cost, _safe_ conversion cost, and a _count_ of conversions.
+These components form a lexically-ordered tuple which can be summed
+element-wise; summation starts at `(0, 0, 0, 0)`.
+Each possible resolution of an expression has a _cost_ tuple consisting of
+the following components: _unsafe_ conversion cost, _polymorphic_
+specialization cost, _safe_ conversion cost, a count of _explicit_
+conversions, and _qualifier_ conversion cost.
+These components are lexically-ordered and can be summed element-wise;
+summation starts at `(0, 0, 0, 0, 0)`.
 ### Lvalue and Qualifier Conversions ###
 …
 programmers.
+## Resolver Architecture ##
+### Function Application Resolution ###
+Our resolution algorithm for function application expressions is based on
+Baker's[3] single-pass bottom-up algorithm, with Cormack's[4] single-pass
+top-down algorithm applied where appropriate as an optimization.
+Broadly speaking, the cost of this resolution per expression will be
+proportional to `i^d`, where `i` is the number of interpretations of each
+program symbol, and `d` is the maximum depth of the expression DAG.
+Since `d` is determined by the user programmer (in general, bounded by a small
+constant), opportunities for resolver optimization primarily revolve around
+minimizing `i`, the number of interpretations of each symbol that are
+considered.
+[3] Baker, Theodore P. A one-pass algorithm for overload resolution in Ada.
+ACM Transactions on Programming Languages and Systems (1982) 4:4 p.601-614
+[4] Cormack, Gordon V. An algorithm for the selection of overloaded functions
+in Ada. SIGPLAN Notices (1981) 16:2 p.48-52
+Unlike Baker, our system allows implicit type conversions for function
+arguments and return types; the problem then becomes to find the valid
+interpretation for an expression that has the unique minimal conversion cost,
+if such exists.
+Interpretations can be produced both by overloaded names and implicit
+conversions applied to existing interpretations; we have proposals to reduce
+the number of interpretations considered from both sources.
+To simplify the problem for this discussion, we will consider application
+resolution restricted to a domain of functions applied to variables, possibly
+in a nested manner (e.g. `f( g( x ), y )`, where `x` and `y` are variables and
+`f` and `g` are functions), and possibly in a typed context such as a variable
+initialization (e.g. `int i = f( x );`); the other aspects of Cforall type
+resolution should be able to be straightforwardly mapped into this model.
+The types of the symbol tables used for variable and function declarations
+look somewhat like the following:
+        variable_table = name_map( variable_name, variable_map )
+        function_table = name_map( function_name, function_map )
+        variable_map = multi_index( by_type( variable_type ),
+                                    variable_decl_set )
+        function_map = multi_index( by_int( n_params ),
+                                                                by_type( return_type ),
+                                                                function_decl_set )
+`variable_name` and `function_name` are likely simple strings, with `name_map`
+a hash table (or perhaps trie) mapping string keys to values.
+`variable_decl_set` and `function_decl_set` can be thought of for the moment
+as simple bags of typed declarations, where the declaration types are linked
+to the graph of available conversions for that type.
+In a typed context both the `variable_decl_set` and the `function_decl_set`
+should be able to be selected upon by type; this is accomplished by the
+`by_type` index of both `variable_map` and `function_map`.
+The `by_int` index of `function_map` also provides a way to select functions
+by their number of parameters; this index may be used to swiftly discard any
+function declaration which does not have the appropriate number of parameters
+for the argument interpretations being passed to it; given the likely small
+number of entries in this map, it is possible that a binary search of a sorted
+vector or even a linear search of an unsorted vector would be more efficient
+than the usual hash-based index.
+Given these data structures, the general outline of our algorithm follows
+Baker, with Cormack's algorithm used as a heuristic filter in typed contexts.
+In an untyped context, we use a variant of Baker's bottom-up algorithm.
+The leaves of the interpretation DAG are drawn from the variable symbol table,
+with entries in the table each producing zero-cost interpretations, and each
+implicit conversion available to be applied to the type of an existing entry
+producing a further interpretation with the same cost as the conversion.
+As in Baker, if two or more interpretations have the same type, only the
+minimum cost interpretation with that type is produced; if there is no unique
+minimum cost interpretation than resolution with that type is ambiguous, and
+not permitted.
+It should be relatively simple to produce the list of interpretations sorted
+by cost by producing the interpretations via a breadth-first search of the
+conversion graph from the initial interpretations provided in the variable
+symbol table.
+To match a function at one of the internal nodes of the DAG, we first look up
+the function's name in the function symbol table, the appropriate number of
+parameters for the arguments that are provided through the `by_int` index of
+the returned `function_map`, then go through the resulting `function_decl_set`
+searching for functions where the parameter types can unify with the provided
+argument lists; any such matching function produces an interpretation with a
+cost that is the sum of its argument costs.
+Though this is not included in our simplified model, this unification step may
+include binding of polymorphic variables, which introduces a cost for the
+function binding itself which must be added to the argument costs.
+Also, checking of function assertions would likely be done at this step as
+well, possibly eliminating some possible matching functions (if no suitable
+assertions can be satisfied), or adding further conversion costs for the
+assertion satisfaction.
+Once the set of valid function interpretations is produced, these may also be
+expanded by the graph of implicit conversions on their return types, as the
+variable interpretations were.
+This implicit conversion-based expansion of interpretations should be skipped
+for the top-level expression if used in an untyped (void) context, e.g. for
+`f` in `f( g ( x ) );` or `x` in `x;`.
+On the other hand, if the top-level expression specifies a type, e.g. in
+`int i = f( x );`, only top level expressions that return that type are
+relevant to the search, so the candidates for `f` can be filtered first by
+those that return `int` (or a type convertable to it); this can be
+accomplished by performing a top-down filter of the interpretations of `f` by
+the `by_type` index of the `function_map` in a manner similar to Cormack's[4]
+algorithm.
+In a typed context, such as an initialization expression
+`T x = f( g( y ), z );`, only interpretations of `f( g( y ), z )` which have
+type `T` are valid; since there are likely to be valid interpretations of
+`f( g( y ), z )` which cannot be used to initialize a variable of type `T`, we
+can use this information to reduce the number of interpretations considered.
+Drawing from Cormack[4], we first search for interpretations of `f` where the
+return type is `T`; by breadth-first-search of the conversion graph, it should
+be straightforward to order the interpretations of `f` by the cost to convert
+their return type to `T`.
+We can also filter out interpretations of `f` with less than two parameters,
+since both `g( y )` and `z` must produce at least one parameter; we may not,
+however, rule out interpretations of `f` with more than two parameters, as
+there may be a valid interpretation of `g( y )` as a function returning more
+than one parameter (if the expression was `f( y, z )` instead, we could use an
+exact parameter count, assuming that variables of tuple type don't exist).
+For each compatible interpretation of `f`, we can add the type of the first
+parameter of that interpretation of `f` to a set `S`, and recursively search
+for interpretations of `g( y )` that return some type `Si` in `S`, and
+similarly for interpretations of `z` that match the type of any of the second
+parameters of some `f`.
+Naturally, if no matching interpretation of `g( y )` can be found for the
+first parameter of some `f`, the type of the second parameter of that `f` will
+not be added to the set of valid types for `z`.
+Each node in this interpretation DAG is given a cost the same way it would be
+in the bottom-up approach, with the exception that when going top-down there
+must be a final bottom-up pass to sum the interpretation costs and sort them
+as appropriate.
+If a parameter type for some `f` is a polymorphic type variable that is left
+unbound by the return type (e.g. `forall(otype S) int f(S x, int y)`), the
+matching arguments should be found using the bottom-up algorithm above for
+untyped contexts, because the polymorphic type variable does not sufficiently
+constrain the available interpretations of the argument expression.
+Similarly, it would likely be an advantage to use top-down resolution for
+cast expressions (e.g. `(int)x`), even when those cast expressions are
+subexpressions of an otherwise untyped expression.
+It may also be fruitful to switch between the bottom-up and top-down
+algorithms if the number of valid interpretations for a subexpression or valid
+types for an argument exceeds some heuristic threshold, but finding such
+a threshold (if any exists) will require experimental data.
+This hybrid top-down/bottom-up search provides more opportunities for pruning
+interpretations than either a bottom-up or top-down approach alone, and thus
+may be more efficient than either.
+A top-down-only approach, however, devolves to linear search through every
+possible interpretation in the solution space in an untyped context, and is
+thus likely to be inferior to a strictly bottom-up approach, though this
+hypothesis needs to be empirically validated.
+Both Baker and Cormack explicitly generate all possible interpretations of a
+given expression; thinking of the set of interpretations of an expression as a
+list sorted by cost, this is an eager evaluation of the list.
+However, since we generally expect that user programmers will not often use
+high-cost implicit conversions, one potentially effective way to prune the
+search space would be to first find the minimal-cost interpretations of any
+given subexpression, then to save the resolution progress of the
+subexpressions and attempt to resolve the superexpression using only those
+subexpression interpretations.
+If no valid interpretation of the superexpression can be found, the resolver
+would then repeatedly find the next-most-minimal cost interpretations of the
+subexpressions and attempt to produce the minimal cost interpretation of the
+superexpression.
+This process would proceed until all possible subexpression interpretations
+have been found and considered.
+A middle ground between the eager and lazy approaches can be taken by
+considering the lexical order on the cost tuple; essentially, every
+interpretation in each of the classes below will be strictly cheaper than any
+interpretation in the class after it, so if a min-cost valid interpretation
+can be found while only generating interpretations in a given class, that
+interpretation is guaranteed to be the best possible one:
+. Interpretations without polymorphic functions or implicit conversions
+. Interpretations without polymorphic functions using only safe conversions
+. Interpretations using polymorphic functions without unsafe conversions
+. Interpretations using unsafe conversions
+In this lazy-eager approach, all the interpretations in one class would be
+eagerly generated, while the interpretations in the next class would only be
+considered if no match was found in the previous class.
 ## Appendix A: Partial and Total Orders ##
 The `<=` relation on integers is a commonly known _total order_, and

Note: See TracChangeset for help on using the changeset viewer.