Context Navigation

source: doc/theses/fangren_yu_COOP_S20/Report.tex @ ab1a9ea

ADTast-experimentalenumforall-pointer-decaypthread-emulationqualifiedEnum

Last change on this file since ab1a9ea was 79e261b7, checked in by Peter A. Buhr <pabuhr@…>, 4 years ago
remove repeated pagebackref=true option
Property mode set to `100644`
File size: 35.1 KB

Rev	Line
[ebec8ed]	1	\documentclass[twoside,11pt]{article}
[ae45007]	2
	3	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
	4
	5	% Latex packages used in the document.
	6	\usepackage{fullpage,times,comment}
	7	\usepackage{epic,eepic}
	8	\usepackage{upquote} % switch curled `'" to straight
	9	\usepackage{calc}
	10	\usepackage{varioref} % extended references
	11	\usepackage[labelformat=simple,aboveskip=0pt,farskip=0pt]{subfig}
	12	\renewcommand{\thesubfigure}{\alph{subfigure})}
[ebec8ed]	13	\usepackage[flushmargin]{footmisc} % support label/reference in footnote
[ae45007]	14	\usepackage{latexsym} % \Box glyph
	15	\usepackage{mathptmx} % better math font with "times"
[ebec8ed]	16	\usepackage[toc]{appendix} % article does not have appendix
[ae45007]	17	\usepackage[usenames]{color}
	18	\input{common} % common CFA document macros
[79e261b7]	19	\usepackage[dvips,plainpages=false,pdfpagelabels,pdfpagemode=UseNone,colorlinks=true,linkcolor=blue,citecolor=blue,urlcolor=blue,pagebackref=true,breaklinks=true]{hyperref}
[ae45007]	20	\usepackage{breakurl}
[1526e86]	21	\urlstyle{sf}
	22
	23	% reduce spacing
	24	\setlist[itemize]{topsep=5pt,parsep=0pt}% global
	25	\setlist[enumerate]{topsep=5pt,parsep=0pt}% global
[ae45007]	26
	27	\usepackage[pagewise]{lineno}
	28	\renewcommand{\linenumberfont}{\scriptsize\sffamily}
	29
	30	% Default underscore is too low and wide. Cannot use lstlisting "literate" as replacing underscore
	31	% removes it as a variable-name character so keywords in variables are highlighted. MUST APPEAR
	32	% AFTER HYPERREF.
	33	\renewcommand{\textunderscore}{\leavevmode\makebox[1.2ex][c]{\rule{1ex}{0.075ex}}}
[a75d464]	34	\newcommand{\NOTE}{\textbf{NOTE}}
[cebfcb8]	35	\newcommand{\TODO}[1]{{\color{Purple}#1}}
[ae45007]	36
	37	\setlength{\topmargin}{-0.45in} % move running title into header
	38	\setlength{\headsep}{0.25in}
	39
	40	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
	41
[2260d9e1]	42	\CFAStyle % CFA code-style for all languages
[a75d464]	43	\lstset{
[2260d9e1]	44	language=C++,moredelim=**[is][\color{red}]{@}{@} % make C++ the default language
[a75d464]	45	}% lstset
[ae45007]	46	\lstnewenvironment{C++}[1][] % use C++ style
[2260d9e1]	47	{\lstset{language=C++,moredelim=**[is][\color{red}]{@}{@}}\lstset{#1}}{}
[ae45007]	48
	49	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
	50
	51	\setcounter{secnumdepth}{3} % number subsubsections
	52	\setcounter{tocdepth}{3} % subsubsections in table of contents
	53	\makeindex
	54
	55	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
	56
	57	\title{\Huge
[101cc3a]	58	\lstinline\|cfa-cc\| Developer's Reference
[ae45007]	59	}% title
	60
	61	\author{\LARGE
	62	Fangren Yu
	63	}% author
	64
	65	\date{
	66	\today
	67	}% date
	68
	69	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
	70
	71	\begin{document}
	72	\pagestyle{headings}
	73	% changed after setting pagestyle
	74	\renewcommand{\sectionmark}[1]{\markboth{\thesection\quad #1}{\thesection\quad #1}}
	75	\renewcommand{\subsectionmark}[1]{\markboth{\thesubsection\quad #1}{\thesubsection\quad #1}}
	76	\pagenumbering{roman}
	77	\linenumbers % comment out to turn off line numbering
	78
	79	\maketitle
	80	\pdfbookmark[1]{Contents}{section}
	81	\tableofcontents
	82
	83	\clearpage
	84	\thispagestyle{plain}
	85	\pagenumbering{arabic}
	86
	87
	88	\section{Overview}
	89
[2260d9e1]	90	@cfa-cc@ is the reference compiler for the \CFA programming language, which is a non-object-oriented extension to C.
[cebfcb8]	91	\CFA attempts to introduce productive modern programming language features to C while maintaining as much backward-compatibility as possible, so that most existing C programs can seamlessly work with \CFA.
	92
	93	Since the \CFA project dates back to the early 2000s, and only restarted in the past few years, there is a significant amount of legacy code in the current compiler codebase with little documentation.
	94	The lack of documentation makes it difficult to develop new features from the current implementation and diagnose problems.
	95
	96	Currently, the \CFA team is also facing poor compiler performance.
	97	For the development of a new programming language, writing standard libraries is an important component.
	98	The slow compiler causes building of the library files to take tens of minutes, making iterative development and testing almost impossible.
	99	There is an ongoing effort to rewrite the core data-structure of the compiler to overcome the performance issue, but many bugs have appeared during this work, and lack of documentation is hampering debugging.
	100
	101	This developer's reference manual begins the documentation and should be continuously im\-proved until it eventually covers the entire compiler codebase.
	102	For now, the focus is mainly on the parts being rewritten, and also the primary performance bottleneck, namely the resolution algorithm.
	103	Its aimed is to provide new project developers with guidance in understanding the codebase, and clarify the purpose and behaviour of certain functions that are not mentioned in the previous \CFA research papers~\cite{Bilson03,Ditchfield92,Moss19}.
[ae45007]	104
	105
	106	\section{Compiler Framework}
	107
[cebfcb8]	108	\CFA source code is first transformed into an abstract syntax tree (AST) by the parser before analyzed by the compiler.
	109
	110
[ae45007]	111	\subsection{AST Representation}
	112
	113
[cebfcb8]	114	There are 4 major categories of AST nodes used by the compiler, along with some derived structures.
[ae45007]	115
[cebfcb8]	116	\subsubsection{Declaration Nodes}
[ae45007]	117
	118	A declaration node represents either of:
	119	\begin{itemize}
	120	\item
[1526e86]	121	type declaration: @struct@, @union@, @typedef@ or type parameter (see \VRef[Appendix]{s:KindsTypeParameters})
[ae45007]	122	\item
[cebfcb8]	123	variable declaration
[ae45007]	124	\item
[cebfcb8]	125	function declaration
[ae45007]	126	\end{itemize}
	127	Declarations are introduced by standard C declarations, with the usual scoping rules.
[cebfcb8]	128	In addition, declarations can also be qualified by the \lstinline[language=CFA]@forall@ clause (which is the origin of \CFA's name):
[ae45007]	129	\begin{cfa}
[cebfcb8]	130	forall ( <$\emph{TypeParameterList}$> \| <$\emph{AssertionList}$> )
[ae45007]	131	$\emph{declaration}$
	132	\end{cfa}
[cebfcb8]	133	Type parameters in \CFA are similar to \CC template type parameters.
	134	The \CFA declaration
[ae45007]	135	\begin{cfa}
	136	forall (dtype T) ...
	137	\end{cfa}
[cebfcb8]	138	behaves similarly to the \CC template declaration
[ae45007]	139	\begin{C++}
	140	template <typename T> ...
	141	\end{C++}
	142
[cebfcb8]	143	Assertions are a distinctive feature of \CFA, similar to \emph{interfaces} in D and Go, and \emph{traits} in Rust.
	144	Contrary to the \CC template where arbitrary functions and operators can be used in a template definition, in a \CFA parametric function, operations on parameterized types must be declared in assertions.
[ae45007]	145	Consider the following \CC template:
	146	\begin{C++}
[cebfcb8]	147	@template@ forall<typename T> T foo( T t ) {
	148	return t + t * t;
[ae45007]	149	}
	150	\end{C++}
[cebfcb8]	151	where there are no explicit requirements on the type @T@.
	152	Therefore, the \CC compiler must deduce what operators are required during textual (macro) expansion of the template at each usage.
	153	As a result, templates cannot be compiled.
	154	\CFA assertions specify restrictions on type parameters:
[ae45007]	155	\begin{cfa}
[cebfcb8]	156	forall( dtype T \| @{ T ?+?( T, T ); T ?*?( T, T ) }@ ) int foo ( T t ) {
	157	return t + t * t;
[ae45007]	158	}
	159	\end{cfa}
[cebfcb8]	160	Assertions are written using the usual \CFA function declaration syntax.
	161	Only types with operators ``@+@'' and ``@*@'' work with this function, and the function prototype is sufficient to allow separate compilation.
	162
	163	Type parameters and assertions are used in the following compiler data-structures.
	164
[ae45007]	165
[cebfcb8]	166	\subsubsection{Type Nodes}
[ae45007]	167
[cebfcb8]	168	Type nodes represent the type of an object or expression.
	169	Named types reference the corresponding type declarations.
	170	The type of a function is its function pointer type (same as standard C).
	171	With the addition of type parameters, named types may contain a list of parameter values (actual parameter types).
[ae45007]	172
	173
[cebfcb8]	174	\subsubsection{Statement Nodes}
	175
	176	Statement nodes represent the executable statements in the program, including basic expression statements, control flows and blocks.
[ae45007]	177	Local declarations (within a block statement) are represented as declaration statements.
	178
	179
[cebfcb8]	180	\subsubsection{Expression Nodes}
	181
	182	Some expressions are represented differently before and after the resolution stage:
[ae45007]	183	\begin{itemize}
	184	\item
[cebfcb8]	185	Name expressions: @NameExpr@ pre-resolution, @VariableExpr@ post-resolution
[ae45007]	186	\item
[cebfcb8]	187	Member expressions: @UntypedMemberExpr@ pre-resolution, @MemberExpr@ post-resolution
[ae45007]	188	\item
[cebfcb8]	189	\begin{sloppypar}
	190	Function call expressions (including overloadable operators): @UntypedExpr@ pre-resolution, @ApplicationExpr@ post-resolution
	191	\end{sloppypar}
[ae45007]	192	\end{itemize}
[cebfcb8]	193	The pre-resolution representation contains only the symbols.
	194	Post-resolution links them to the actual variable and function declarations.
[ae45007]	195
	196
	197	\subsection{Compilation Passes}
	198
[cebfcb8]	199	Compilation steps are implemented as passes, which follows a general structural recursion pattern on the syntax tree.
[ae45007]	200
[cebfcb8]	201	The basic workflow of compilation passes follows preorder and postorder traversal on the AST data-structure, implemented with visitor pattern, and can be loosely described with the following pseudocode:
[a75d464]	202	\begin{C++}
[cebfcb8]	203	Pass::visit( node_t node ) {
	204	previsit( node );
	205	if ( visit_children )
[ae45007]	206	for each child of node:
[cebfcb8]	207	child.accept( this );
	208	postvisit( node );
[ae45007]	209	}
[a75d464]	210	\end{C++}
[cebfcb8]	211	Operations in @previsit@ happen in preorder (top to bottom) and operations in @postvisit@ happen in postorder (bottom to top).
	212	The precise order of recursive operations on child nodes can be found in @Common/PassVisitor.impl.h@ (old) and @AST/Pass.impl.hpp@ (new).
	213
	214	Implementations of compilation passes follow certain conventions:
[ae45007]	215	\begin{itemize}
	216	\item
[cebfcb8]	217	Passes \textbf{should not} directly override the visit method (Non-virtual Interface principle);
	218	if a pass desires different recursion behaviour, it should set @visit_children@ to false and perform recursive calls manually within previsit or postvisit procedures.
	219	To enable this option, inherit from the @WithShortCircuiting@ mixin.
[ae45007]	220	\item
[cebfcb8]	221	previsit may mutate the node but \textbf{must not} change the node type or return @nullptr@.
[ae45007]	222	\item
[cebfcb8]	223	postvisit may mutate the node, reconstruct it to a different node type, or delete it by returning @nullptr@.
[ae45007]	224	\item
	225	If the previsit or postvisit method is not defined for a node type, the step is skipped.
[cebfcb8]	226	If the return type is declared as @void@, the original node is returned by default.
	227	These behaviours are controlled by template specialization rules;
	228	see @Common/PassVisitor.proto.h@ (old) and @AST/@ @Pass.proto.hpp@ (new) for details.
[ae45007]	229	\end{itemize}
	230	Other useful mixin classes for compilation passes include:
	231	\begin{itemize}
	232	\item
[cebfcb8]	233	@WithGuards@ allows saving and restoring variable values automatically upon entering/exiting the current node.
[ae45007]	234	\item
[cebfcb8]	235	@WithVisitorRef@ creates a wrapped entity for the current pass (the actual argument passed to recursive calls internally) for explicit recursion, usually used together with @WithShortCircuiting@.
[ae45007]	236	\item
[cebfcb8]	237	@WithSymbolTable@ gives a managed symbol table with built-in scoping-rule handling (\eg on entering and exiting a block statement)
[ae45007]	238	\end{itemize}
[cebfcb8]	239	\NOTE: If a pass extends the functionality of another existing pass, due to \CC overloading resolution rules, it \textbf{must} explicitly introduce the inherited previsit and postvisit procedures to its own scope, or otherwise they are not picked up by template resolution:
[a75d464]	240	\begin{C++}
[ae45007]	241	class Pass2: public Pass1 {
[cebfcb8]	242	@using Pass1::previsit;@
	243	@using Pass1::postvisit;@
[ae45007]	244	// new procedures
	245	}
[a75d464]	246	\end{C++}
[ae45007]	247
	248
[cebfcb8]	249	\subsection{Data Structure Change (new-ast)}
[ae45007]	250
[cebfcb8]	251	It has been observed that excessive copying of syntax tree structures accounts for a majority of computation cost and significantly slows down the compiler.
	252	In the previous implementation of the syntax tree, every internal node has a unique parent;
	253	therefore all copies are required to duplicate the entire subtree.
	254	A new, experimental re-implementation of the syntax tree (source under directory @AST/@ hereby referred to as ``new-ast'') attempts to overcome this issue with a functional approach that allows sharing of common sub-structures and only makes copies when necessary.
[ae45007]	255
[cebfcb8]	256	The core of new-ast is a customized implementation of smart pointers, similar to @std::shared_ptr@ and @std::weak_ptr@ in the \CC standard library.
	257	Reference counting is used to detect sharing and allowing certain optimizations.
	258	For a purely functional (immutable) data-structure, all mutations are modelled by shallow copies along the path of mutation.
[ae45007]	259	With reference counting optimization, unique nodes are allowed to be mutated in place.
[cebfcb8]	260	This however, may potentially introduce some complications and bugs;
	261	a few issues are discussed near the end of this section.
[ae45007]	262
	263
[cebfcb8]	264	\subsubsection{Source: \lstinline{AST/Node.hpp}}
[ae45007]	265
[cebfcb8]	266	Class @ast::Node@ is the base class of all new-ast node classes, which implements reference counting mechanism.
	267	Two different counters are recorded: ``strong'' reference count for number of nodes semantically owning it;
	268	``weak'' reference count for number of nodes holding a mere reference and only need to observe changes.
	269	Class @ast::ptr_base@ is the smart pointer implementation and also takes care of resource management.
[ae45007]	270
[cebfcb8]	271	Direct access through the smart pointer is read-only.
	272	A mutable access should be obtained by calling @shallowCopy@ or mutate as below.
[ae45007]	273
[cebfcb8]	274	Currently, the weak pointers are only used to reference declaration nodes from a named type, or a variable expression.
	275	Since declaration nodes are intended to denote unique entities in the program, weak pointers always point to unique (unshared) nodes.
	276	This property may change in the future, and weak references to shared nodes may introduce some problems;
	277	see mutate function below.
[ae45007]	278
[cebfcb8]	279	All node classes should always use smart pointers in structure definitions versus raw pointers.
	280	Function
[d1864da]	281	\begin{C++}
	282	void ast::Node::increment(ref_type ref)
	283	\end{C++}
[cebfcb8]	284	increments this node's strong or weak reference count.
	285	Function
[d1864da]	286	\begin{C++}
	287	void ast::Node::decrement(ref_type ref, bool do_delete = true)
	288	\end{C++}
[cebfcb8]	289	decrements this node's strong or weak reference count.
	290	If strong reference count reaches zero, the node is deleted.
	291	\NOTE: Setting @do_delete@ to false may result in a detached node.
	292	Subsequent code should manually delete the node or assign it to a strong pointer to prevent memory leak.
	293
[d1864da]	294	Reference counting functions are internally called by @ast::ptr_base@.
[cebfcb8]	295	Function
[d1864da]	296	\begin{C++}
	297	template<typename node_t>
	298	node_t * shallowCopy(const node_t * node)
	299	\end{C++}
[cebfcb8]	300	returns a mutable, shallow copy of node: all child pointers are pointing to the same child nodes.
	301	Function
[d1864da]	302	\begin{C++}
	303	template<typename node_t>
	304	node_t * mutate(const node_t * node)
	305	\end{C++}
[cebfcb8]	306	returns a mutable pointer to the same node, if the node is unique (strong reference count is 1);
	307	otherwise, it returns @shallowCopy(node)@.
	308	It is an error to mutate a shared node that is weak-referenced.
	309	Currently this does not happen.
	310	A problem may appear once weak pointers to shared nodes (\eg expression nodes) are used;
	311	special care is needed.
	312
	313	\NOTE: This naive uniqueness check may not be sufficient in some cases.
	314	A discussion of the issue is presented at the end of this section.
	315	Functions
[d1864da]	316	\begin{C++}
	317	template<typename node_t, typename parent_t, typename field_t, typename assn_t>
[cebfcb8]	318	const node_t * mutate_field(const node_t * node, field_t parent_t::* field, assn_t && val)
[d1864da]	319	\end{C++}
	320	\begin{C++}
	321	template<typename node_t, typename parent_t, typename coll_t, typename ind_t,
	322	typename field_t>
	323	const node_t * mutate_field_index(const node_t * node, coll_t parent_t::* field, ind_t i,
	324	field_t && val)
	325	\end{C++}
[cebfcb8]	326	are helpers for mutating a field on a node using pointer to a member function (creates shallow copy when necessary).
[d1864da]	327
	328
[cebfcb8]	329	\subsubsection{Issue: Undetected Sharing}
	330
	331	The @mutate@ behaviour described above has a problem: deeper shared nodes may be
[d1864da]	332	mistakenly considered as unique. \VRef[Figure]{f:DeepNodeSharing} shows how the problem could arise:
	333	\begin{figure}
	334	\centering
	335	\input{DeepNodeSharing}
	336	\caption{Deep sharing of nodes}
	337	\label{f:DeepNodeSharing}
	338	\end{figure}
[cebfcb8]	339	Given the tree rooted at P1, which is logically the chain P1-A-B, and P2 is irrelevant, assume @mutate(B)@ is called.
	340	The algorithm considers B as unique since it is only directly owned by A.
	341	However, the other tree P2-A-B indirectly shares the node B and is therefore wrongly mutated.
[d1864da]	342
[cebfcb8]	343	To partly address this problem, if the mutation is called higher up the tree, a chain mutation helper can be used.
[d1864da]	344
[cebfcb8]	345	\subsubsection{Source: \lstinline{AST/Chain.hpp}}
[d1864da]	346
[cebfcb8]	347	Function
[d1864da]	348	\begin{C++}
	349	template<typename node_t, Node::ref_type ref_t>
	350	auto chain_mutate(ptr_base<node_t, ref_t> & base)
	351	\end{C++}
[cebfcb8]	352	returns a chain mutator handle that takes pointer-to-member to go down the tree, while creating shallow copies as necessary;
	353	see @struct _chain_mutator@ in the source code for details.
[d1864da]	354
[cebfcb8]	355	For example, in the above diagram, if mutation of B is wanted while at P1, the call using @chain_mutate@ looks like the following:
[a75d464]	356	\begin{C++}
	357	chain_mutate(P1.a)(&A.b) = new_value_of_b;
	358	\end{C++}
[cebfcb8]	359	\NOTE: if some node in chain mutate is shared (therefore shallow copied), it implies that every node further down is also copied, thus correctly executing the functional mutation algorithm.
	360	This example code creates copies of both A and B and performs mutation on the new nodes, so that the other tree P2-A-B is untouched.
	361	However, if a pass traverses down to node B and performs mutation, for example, in @postvisit(B)@, information on sharing higher up is lost.
	362	Since the new-ast structure is only in experimental use with the resolver algorithm, which mostly rebuilds the tree bottom-up, this issue does not actually happen.
	363	It should be addressed in the future when other compilation passes are migrated to new-ast and many of them contain procedural mutations, where it might cause accidental mutations to other logically independent trees (\eg common sub-expression) and become a bug.
	364
	365
[a75d464]	366	\section{Compiler Algorithm Documentation}
	367
[cebfcb8]	368	This compiler algorithm documentation covers most of the resolver, data structures used in variable and expression resolution, and a few directly related passes.
	369	Later passes involving code generation are not included yet;
	370	documentation for those will be done latter.
	371
[a75d464]	372
	373	\subsection{Symbol Table}
	374
[cebfcb8]	375	\NOTE: For historical reasons, the symbol-table data-structure is called @indexer@ in the old implementation.
	376	Hereby, the name is changed to @SymbolTable@.
	377	The symbol table stores a mapping from names to declarations, implements a similar name-space separation rule, and provides the same scoping rules as standard C.\footnote{ISO/IEC 9899:1999, Sections 6.2.1 and 6.2.3.}
	378	The difference in name-space rule is that @typedef@ aliases are no longer considered ordinary identifiers.
	379	In addition to C tag-types (@struct@, @union@, @enum@), \CFA introduces another tag type, @trait@, which is a named collection of assertions.
	380
	381
	382	\subsubsection{Source: \lstinline{AST/SymbolTable.hpp}}
[1526e86]	383
[cebfcb8]	384	Function
[a75d464]	385	\begin{C++}
	386	SymbolTable::addId(const DeclWithType * decl)
	387	\end{C++}
[cebfcb8]	388	provides name mangling of identifiers, since \CFA allows overloading of variables and functions.
	389	The mangling scheme is closely based on the Itanium \CC ABI,\footnote{\url{https://itanium-cxx-abi.github.io/cxx-abi/abi.html}, Section 5.1} while making adaptations to \CFA specific features, mainly assertions and overloaded variables by type.
[a75d464]	390
[cebfcb8]	391	Naming conflicts are handled by mangled names;
	392	lookup by name returns a list of declarations with the same identifier name.
	393	Functions
[a75d464]	394	\begin{C++}
	395	SymbolTable::addStruct(const StructDecl * decl)
	396	SymbolTable::addUnion(const UnionDecl * decl)
	397	SymbolTable::addEnum(const EnumDecl * decl)
	398	SymbolTable::addTrait(const TraitDecl * decl)
	399	\end{C++}
[cebfcb8]	400	add a tag-type declaration to the symbol table.
	401	Function
[a75d464]	402	\begin{C++}
	403	SymbolTable::addType(const NamedTypeDecl * decl)
	404	\end{C++}
[cebfcb8]	405	adds a @typedef@ alias to the symbol table.
[a75d464]	406
[cebfcb8]	407	\textbf{C Incompatibility Note}: Since \CFA allows using @struct@, @union@ and @enum@ type-names without a prefix keyword, as in \CC, @typedef@ names and tag-type names cannot be disambiguated by syntax rules.
	408	Currently the compiler puts them together and disallows collision.
	409	The following program is valid C but invalid \CFA (and \CC):
[a75d464]	410	\begin{C++}
	411	struct A {};
[cebfcb8]	412	typedef int A; // gcc: ok, cfa: Cannot redefine typedef A
	413	struct A sa; // C disambiguates via struct prefix
	414	A ia;
	415	\end{C++}
	416	In practices, such usage is extremely rare, and hence, this change (as in \CC) has minimal impact on existing C programs.
	417	The declaration
	418	\begin{C++}
	419	struct A {};
	420	typedef struct A A; // A is an alias for struct A
	421	A a;
	422	struct A b;
[a75d464]	423	\end{C++}
[cebfcb8]	424	is not an error because the alias name is identical to the original.
	425	Finally, the following program is allowed in \CFA:
[a75d464]	426	\begin{C++}
	427	typedef int A;
[cebfcb8]	428	void A(); // name mangled
[a75d464]	429	// gcc: A redeclared as different kind of symbol, cfa: ok
	430	\end{C++}
[cebfcb8]	431	because the function name is mangled.
	432
[a75d464]	433
	434	\subsection{Type Environment and Unification}
	435
[cebfcb8]	436	The following core ideas underlie the parametric type-resolution algorithm.
	437	A type environment organizes type parameters into \textbf{equivalent classes} and maps them to actual types.
	438	Unification is the algorithm that takes two (possibly parametric) types and parameter mappings, and attempts to produce a common type by matching information in the type environments.
[a75d464]	439
	440	The unification algorithm is recursive in nature and runs in two different modes internally:
	441	\begin{itemize}
	442	\item
[cebfcb8]	443	Exact unification mode requires equivalent parameters to match perfectly.
[a75d464]	444	\item
[cebfcb8]	445	Inexact unification mode allows equivalent parameters to be converted to a common type.
[a75d464]	446	\end{itemize}
[cebfcb8]	447	For a pair of matching parameters (actually, their equivalent classes), if either side is open (not bound to a concrete type yet), they are combined.
[a75d464]	448
[cebfcb8]	449	Within the inexact mode, types are allowed to differ on their cv-qualifiers (\eg @const@, @volatile@, \etc);
	450	additionally, if a type never appear either in a parameter list or as the base type of a pointer, it may also be widened (\ie safely converted).
	451	As \CFA currently does not implement subclassing as in object-oriented languages, widening conversions are only on the primitive types, \eg conversion from @int@ to @long int@.
[a75d464]	452
[cebfcb8]	453	The need for two unification modes comes from the fact that parametric types are considered compatible only if all parameters are exactly the same (not just compatible).
	454	Pointer types also behaves similarly;
	455	in fact, they may be viewed as a primitive kind of parametric types.
	456	@int @ and @long @ are different types, just like @vector(int)@ and @vector(long)@ are, for the parametric type @*(T)@ / @vector(T)@, respectively.
[a75d464]	457
[cebfcb8]	458	The resolver uses the following @public@ functions:\footnote{
	459	Actual code also tracks assertions on type parameters; those extra arguments are omitted here for conciseness.}
[a75d464]	460
	461
[cebfcb8]	462	\subsubsection{Source: \lstinline{ResolvExpr/Unify.cc}}
[a75d464]	463
[cebfcb8]	464	Function
[a75d464]	465	\begin{C++}
[cebfcb8]	466	bool unify(const Type * type1, const Type * type2, TypeEnvironment & env,
	467	OpenVarSet & openVars, const SymbolTable & symtab, Type *& commonType)
[a75d464]	468	\end{C++}
[cebfcb8]	469	returns a boolean indicating if the unification succeeds or fails after attempting to unify @type1@ and @type2@ within current type environment.
	470	If the unify succeeds, @env@ is modified by combining the equivalence classes of matching parameters in @type1@ and @type2@, and their common type is written to @commonType@.
	471	If the unify fails, nothing changes.
	472	Functions
[a75d464]	473	\begin{C++}
[cebfcb8]	474	bool typesCompatible(const Type * type1, const Type * type2, const SymbolTable & symtab,
	475	const TypeEnvironment & env)
	476	bool typesCompatibleIgnoreQualifiers(const Type * type1, const Type * type2,
	477	const SymbolTable & symtab, const TypeEnvironment & env)
[a75d464]	478	\end{C++}
[cebfcb8]	479	return a boolean indicating if types @type1@ and @type2@ can possibly be the same type.
	480	The second version ignores the outermost cv-qualifiers if present.\footnote{
	481	In \lstinline@const int * const@, only the second \lstinline@const@ is ignored.}
	482	These function have no side effects.
[a75d464]	483
[cebfcb8]	484	\NOTE: No attempt is made to widen the types (exact unification is used), although the function names may suggest otherwise, \eg @typesCompatible(int, long)@ returns false.
[a75d464]	485
	486
	487	\subsection{Expression Resolution}
	488
[cebfcb8]	489	The design of the current version of expression resolver is outlined in the Ph.D.\ thesis by Aaron Moss~\cite{Moss19}.
[a75d464]	490	A summary of the resolver algorithm for each expression type is presented below.
	491
[cebfcb8]	492	All overloadable operators are modelled as function calls.
	493	For a function call, interpretations of the function and arguments are found recursively.
	494	Then the following steps produce a filtered list of valid interpretations:
[a75d464]	495	\begin{enumerate}
	496	\item
[cebfcb8]	497	From all possible combinations of interpretations of the function and arguments, those where argument types may be converted to function parameter types are considered valid.
[a75d464]	498	\item
	499	Valid interpretations with the minimum sum of argument costs are kept.
	500	\item
[cebfcb8]	501	\label{p:argcost}
	502	Argument costs are then discarded; the actual cost for the function call expression is the sum of conversion costs from the argument types to parameter types.
[a75d464]	503	\item
[cebfcb8]	504	\label{p:returntype}
	505	For each return type, the interpretations with satisfiable assertions are then sorted by actual cost computed in step~\ref{p:argcost}.
	506	If for a given type, the minimum cost interpretations are not unique, that return type is ambiguous.
	507	If the minimum cost interpretation is unique but contains an ambiguous argument, it is also ambiguous.
[a75d464]	508	\end{enumerate}
[cebfcb8]	509	Therefore, for each return type, the resolver produces:
[a75d464]	510	\begin{itemize}
	511	\item
[cebfcb8]	512	no alternatives
[a75d464]	513	\item
[cebfcb8]	514	a single valid alternative
[a75d464]	515	\item
[cebfcb8]	516	an ambiguous alternative
[a75d464]	517	\end{itemize}
[cebfcb8]	518	\NOTE: an ambiguous alternative may be discarded at the parent expressions because a different return type matches better for the parent expressions.
[a75d464]	519
[cebfcb8]	520	The \emph{non}-overloadable expressions in \CFA are: cast expressions, address-of (unary @&@) expressions, short-circuiting logical expressions (@&&@, @\|\|@) and ternary conditional expression (@?:@).
[a75d464]	521
[cebfcb8]	522	For a cast expression, the convertible argument types are kept.
	523	Then the result is selected by lowest argument cost, and further by lowest conversion cost to target type.
	524	If the lowest cost is still not unique or an ambiguous argument interpretation is selected, the cast expression is ambiguous.
	525	In an expression statement, the top level expression is implicitly cast to @void@.
[a75d464]	526
	527	For an address-of expression, only lvalue results are kept and the minimum cost is selected.
	528
[cebfcb8]	529	For logical expressions @&&@ and @\|\|@, arguments are implicitly cast to @bool@, and follow the rules fr cast expression above.
[a75d464]	530
[cebfcb8]	531	For the ternary conditional expression, the condition is implicitly cast to @bool@, and the branch expressions must have compatible types.
	532	Each pair of compatible branch expression types produce a possible interpretation, and the cost is defined as the sum of the expression costs plus the sum of conversion costs to the common type.
[a75d464]	533
[1526e86]	534
[2ff78aae]	535	\subsection{Conversion and Application Cost}
[1526e86]	536
	537	There were some unclear parts in the previous documentation in the cost system, as described in the Moss thesis~\cite{Moss19}, section 4.1.2.
	538	Some clarification are presented in this section.
[2ff78aae]	539
	540	\begin{enumerate}
	541	\item
[1526e86]	542	Conversion to a type denoted by parameter may incur additional cost if the match is not exact.
	543	For example, if a function is declared to accept @(T, T)@ and receives @(int, long)@, @T@ is deducted @long@ and an additional widening conversion cost is added for @int@ to @T@.
[2ff78aae]	544
	545	\item
[1526e86]	546	The specialization level of a function is the sum of the least depth of an appearance of a type parameter (counting pointers, references and parameterized types), plus the number of assertions.
	547	A higher specialization level is favoured if argument conversion costs are equal.
[2ff78aae]	548
	549	\item
[1526e86]	550	Coercion of pointer types is only allowed in explicit cast expressions;
	551	the only allowed implicit pointer casts are adding qualifiers to the base type and cast to @void*@, and these counts as safe conversions.
	552	Note that implicit cast from @void *@ to other pointer types is no longer valid, as opposed to standard C.
[2ff78aae]	553	\end{enumerate}
[a75d464]	554
	555
	556	\subsection{Assertion Satisfaction}
	557
[cebfcb8]	558	The resolver tries to satisfy assertions on expressions only when it is needed: either while selecting from multiple alternatives of a same result type for a function call (step \ref{p:returntype} of resolving function calls) or upon reaching the top level of an expression statement.
[a75d464]	559
[cebfcb8]	560	Unsatisfiable alternatives are discarded.
	561	Satisfiable alternatives receive \textbf{implicit parameters}: in \CFA, parametric functions may be separately compiled, as opposed to \CC templates which are only compiled at instantiation.
	562	Given the parametric function-definition:
[a75d464]	563	\begin{C++}
	564	forall (otype T \| {void foo(T);})
	565	void bar (T t) { foo(t); }
	566	\end{C++}
[cebfcb8]	567	the function @bar@ does not know which @foo@ to call when compiled without knowing the call site, so it requests a function pointer to be passed as an extra argument.
	568	At the call site, implicit parameters are automatically inserted by the compiler.
[a75d464]	569
[1526e86]	570	Implementation of implicit parameters is discussed in \VRef[Appendix]{s:ImplementationParametricFunctions}.
[a75d464]	571
	572	\section{Tests}
	573
	574	\subsection{Test Suites}
	575
[cebfcb8]	576	Automatic test suites are located under the @tests/@ directory.
	577	A test case consists of an input CFA source file (suffix @.cfa@), and an expected output file located in the @tests/.expect/@ directory, with the same file name ending with suffix @.txt@.
	578	For example, the test named @tests/tuple/tupleCast.cfa@ has the following files, for example:
[a75d464]	579	\begin{C++}
	580	tests/
[cebfcb8]	581	tuple/
	582	.expect/
	583	tupleCast.txt
	584	tupleCast.cfa
[a75d464]	585	\end{C++}
[cebfcb8]	586	If compilation fails, the error output is compared to the expect file.
	587	If the compilation succeeds but does not generate an executable, the compilation output is compared to the expect file.
	588	If the compilation succeeds and generates an executable, the executable is run and its output is compared to the expect file.
	589	To run the tests, execute the test script @test.py@ under the @tests/@ directory, with a list of test names to be run, or @--all@ (or @make all-tests@) to run all tests.
	590	The test script reports test cases fail/success, compilation time and program run time.
	591	To see all the options available for @test.py@ using the @--help@ option.
[a75d464]	592
	593
	594	\subsection{Performance Reports}
	595
[cebfcb8]	596	To turn on performance reports, pass the @-XCFA -S@ flag to the compiler.
	597	Three kinds of performance reports are available:
[a75d464]	598	\begin{enumerate}
	599	\item
	600	Time, reports time spent in each compilation step
	601	\item
	602	Heap, reports number of dynamic memory allocations, total bytes allocated, and
	603	maximum heap memory usage
	604	\item
	605	Counters, for certain predefined statistics; counters can be registered anywhere in
	606	the compiler as a static object, and the interface can be found at
	607	@Common/Stats/Counter.h@.
	608	\end{enumerate}
[cebfcb8]	609	It is suggested to run performance tests with optimization (@g++@ flag @-O3@).
[a75d464]	610
[1526e86]	611
[ebec8ed]	612	\appendix
[2ff78aae]	613	\section{Appendix}
	614
	615	\subsection{Kinds of Type Parameters}
[1526e86]	616	\label{s:KindsTypeParameters}
	617
[ebec8ed]	618	A type parameter in a @forall@ clause has 3 kinds:
[1526e86]	619	\begin{enumerate}[listparindent=0pt]
	620	\item
[ebec8ed]	621	@dtype@: any data type (built-in or user defined) that is not a concrete type.
[1526e86]	622
[ebec8ed]	623	A non-concrete type is an incomplete type such as an opaque type or pointer/reference with an implicit (pointer) size and implicitly generated reference and dereference operations.
[2ff78aae]	624	\item
[ebec8ed]	625	@otype@: any data type (built-in or user defined) that is concrete type.
[1526e86]	626
[ebec8ed]	627	A concrete type is a complete type, \ie types that can be used to create a variable, which also implicitly asserts the existence of default and copy constructors, assignment, and destructor\footnote{\CFA implements the same automatic resource management (RAII) semantics as \CC.}.
	628	% \item
	629	% @ftype@: any function type.
	630	%
	631	% @ftype@ provides two purposes:
	632	% \begin{itemize}
	633	% \item
	634	% Differentiate function pointer from data pointer because (in theory) some systems have different sizes for these pointers.
	635	% \item
	636	% Disallow a function pointer to match an overloaded data pointer, since variables and functions can have the same names.
	637	% \end{itemize}
[1526e86]	638
[2ff78aae]	639	\item
[1526e86]	640	@ttype@: tuple (variadic) type.
[2ff78aae]	641
[ebec8ed]	642	Restricted to the type for the last parameter in a function, it provides a type-safe way to implement variadic functions.
[1526e86]	643	Note however, that it has certain restrictions, as described in the implementation section below.
[2ff78aae]	644	\end{enumerate}
	645
[1526e86]	646
[2ff78aae]	647	\subsection{GNU C Nested Functions}
	648
	649	\CFA is designed to be mostly compatible with GNU C, an extension to ISO C99 and C11 standards. The \CFA compiler also implements some language features by GCC extensions, most notably nested functions.
	650
	651	In ISO C, function definitions are not allowed to be nested. GCC allows nested functions with full lexical scoping. The following example is taken from GCC documentation\footnote{\url{https://gcc.gnu.org/onlinedocs/gcc/Nested-Functions.html}}:
	652	\begin{C++}
[1526e86]	653	void bar( int * array, int offset, int size ) {
	654	int access( int * array, int index ) { return array[index + offset]; }
	655	int i;
	656	/* ... */
	657	for ( i = 0; i < size; i++ )
	658	/* ... / access (array, i) / ... */
[2ff78aae]	659	}
	660	\end{C++}
[1526e86]	661	GCC nested functions behave identically to \CC lambda functions with default by-reference capture (stack-allocated, lifetime ends upon exiting the declared block), while also possible to be passed as arguments with standard function pointer types.
[2ff78aae]	662
	663
	664	\subsection{Implementation of Parametric Functions}
[1526e86]	665	\label{s:ImplementationParametricFunctions}
	666
	667	\CFA implements parametric functions using the implicit parameter approach: required assertions are passed to the callee by function pointers;
	668	size of a parametric type must also be known if referenced directly (\ie not as a pointer).
[2ff78aae]	669
	670	The implementation is similar to the one from Scala\footnote{\url{https://www.scala-lang.org/files/archive/spec/2.13/07-implicits.html}}, with some notable differences in resolution:
	671	\begin{enumerate}
	672	\item
[ebec8ed]	673	All types, variables, and functions are candidates of implicit parameters
[2ff78aae]	674	\item
	675	The parameter (assertion) name must match the actual declarations.
	676	\end{enumerate}
	677
	678	For example, the \CFA function declaration
	679	\begin{cfa}
[1526e86]	680	forall( otype T \| { int foo( T, int ); } )
[2ff78aae]	681	int bar(T);
	682	\end{cfa}
	683	after implicit parameter expansion, has the actual signature\footnote{\textbf{otype} also requires the type to have constructor and destructor, which are the first two function pointers preceding the one for \textbf{foo}.}
	684	\begin{C++}
[1526e86]	685	int bar( T, size_t, void ()(T&), void ()(T&), int (*)(T, int) );
[2ff78aae]	686	\end{C++}
[1526e86]	687	The implicit parameter approach has an apparent issue: when the satisfying declaration is also parametric, it may require its own implicit parameters too.
	688	That also causes the supplied implicit parameter to have a different \textbf{actual} type than the \textbf{nominal} type, so it cannot be passed directly.
	689	Therefore, a wrapper with matching actual type must be created, and it is here where GCC nested functions are used internally by the compiler.
[2ff78aae]	690
	691	Consider the following program:
	692	\begin{cfa}
	693	int assertion(int);
	694
[1526e86]	695	forall( otype T \| { int assertion(T); } )
[2ff78aae]	696	void foo(T);
	697
[1526e86]	698	forall(otype T \| { void foo(T); } )
[2ff78aae]	699	void bar(T t) {
	700	foo(t);
	701	}
	702	\end{cfa}
[1526e86]	703	The \CFA compiler translates the program to non-parametric form\footnote{In the final code output, \lstinline@T@ needs to be replaced by an opaque type, and arguments must be accessed by a frame pointer offset table, due to the unknown sizes. The presented code here is simplified for better understanding.}
[2ff78aae]	704	\begin{C++}
	705	// ctor, dtor and size arguments are omitted
	706	void foo(T, int (*)(T));
	707
	708	void bar(T t, void (*foo)(T)) {
	709	foo(t);
	710	}
	711	\end{C++}
	712	However, when @bar(1)@ is called, @foo@ cannot be directly provided as an argument:
	713	\begin{C++}
	714	bar(1, foo); // WRONG: foo has different actual type
	715	\end{C++}
	716	and an additional step is required:
	717	\begin{C++}
	718	{
	719	void _foo_wrapper(int t) {
[1526e86]	720	foo( t, assertion );
[2ff78aae]	721	}
[1526e86]	722	bar( 1, _foo_wrapper );
[2ff78aae]	723	}
	724	\end{C++}
[1526e86]	725	Nested assertions and implicit parameter creation may continue indefinitely.
	726	This issue is a limitation of implicit parameter implementation.
	727	In particular, polymorphic variadic recursion must be structural (\ie the number of arguments decreases in any possible recursive calls), otherwise code generation gets into an infinite loop.
	728	The \CFA compiler sets a limit on assertion depth and reports an error if assertion resolution does not terminate within the limit (as for \lstinline[language=C++]@templates@ in \CC).
[a75d464]	729
[101cc3a]	730	\addcontentsline{toc}{section}{\refname}
[ae45007]	731	\bibliographystyle{plain}
	732	\bibliography{pl}
	733
	734
	735	\end{document}
	736
	737	% Local Variables: %
	738	% tab-width: 4 %
	739	% fill-column: 100 %
	740	% compile-command: "make" %
	741	% End: %

Note: See TracBrowser for help on using the repository browser.

Download in other formats: