Context Navigation

source: doc/theses/andrew_beach_MMath/features.tex@ c2016b6

Visit:

ADT ast-experimental enum forall-pointer-decay jacob/cs343-translation pthread-emulation qualifiedEnum

Last change on this file since c2016b6 was 13afd0c, checked in by Andrew Beach <ajbeach@…>, 4 years ago
Andrew MMath: Features chapter updated from Peter's focused review.
Property mode set to `100644`
File size: 36.5 KB

Line
1	\chapter{Exception Features}
2	\label{c:features}
3
4	This chapter covers the design and user interface of the \CFA EHM
5	and begins with a general overview of EHMs. It is not a strict
6	definition of all EHMs nor an exhaustive list of all possible features.
7	However it does cover the most common structure and features found in them.
8
9	\section{Overview of EHMs}
10	% We should cover what is an exception handling mechanism and what is an
11	% exception before this. Probably in the introduction. Some of this could
12	% move there.
13	\subsection{Raise / Handle}
14	An exception operation has two main parts: raise and handle.
15	These terms are sometimes known as throw and catch but this work uses
16	throw/catch as a particular kind of raise/handle.
17	These are the two parts that the user writes and may
18	be the only two pieces of the EHM that have any syntax in a language.
19
20	\paragraph{Raise}
21	The raise is the starting point for exception handling,
22	by raising an exception, which passes it to
23	the EHM.
24
25	Some well known examples include the @throw@ statements of \Cpp and Java and
26	the \code{Python}{raise} statement of Python. In real systems, a raise may
27	perform some other work (such as memory management) but for the
28	purposes of this overview that can be ignored.
29
30	\paragraph{Handle}
31	The primary purpose of an EHM is to run some user code to handle a raised
32	exception. This code is given, along with some other information,
33	in a handler.
34
35	A handler has three common features: the previously mentioned user code, a
36	region of code it guards and an exception label/condition that matches
37	against the raised exception.
38	Only raises inside the guarded region and raising exceptions that match the
39	label can be handled by a given handler.
40	If multiple handlers could can handle an exception,
41	EHMs define a rule to pick one, such as ``best match" or ``first found".
42
43	The @try@ statements of \Cpp, Java and Python are common examples. All three
44	also show another common feature of handlers, they are grouped by the guarded
45	region.
46
47	\subsection{Propagation}
48	After an exception is raised comes what is usually the biggest step for the
49	EHM: finding and setting up the handler for execution.
50	The propagation from raise to
51	handler can be broken up into three different tasks: searching for a handler,
52	matching against the handler and installing the handler.
53
54	\paragraph{Searching}
55	The EHM begins by searching for handlers that might be used to handle
56	the exception.
57	The search will find handlers that have the raise site in their guarded
58	region.
59	The search includes handlers in the current function, as well as any in
60	callers on the stack that have the function call in their guarded region.
61
62	\paragraph{Matching}
63	Each handler found is with the raised exception. The exception
64	label defines a condition that is used with the exception and decides if
65	there is a match or not.
66	%
67	In languages where the first match is used, this step is intertwined with
68	searching; a match check is performed immediately after the search finds
69	a handler.
70
71	\paragraph{Installing}
72	After a handler is chosen, it must be made ready to run.
73	The implementation can vary widely to fit with the rest of the
74	design of the EHM. The installation step might be trivial or it could be
75	the most expensive step in handling an exception. The latter tends to be the
76	case when stack unwinding is involved.
77
78	If a matching handler is not guaranteed to be found, the EHM needs a
79	different course of action for this case.
80	This situation only occurs with unchecked exceptions as checked exceptions
81	(such as in Java) can make the guarantee.
82	The unhandled action is usually very general, such as aborting the program.
83
84	\paragraph{Hierarchy}
85	A common way to organize exceptions is in a hierarchical structure.
86	This pattern comes from object-orientated languages where the
87	exception hierarchy is a natural extension of the object hierarchy.
88
89	Consider the following exception hierarchy:
90	\begin{center}
91	\input{exception-hierarchy}
92	\end{center}
93	A handler labeled with any given exception can handle exceptions of that
94	type or any child type of that exception. The root of the exception hierarchy
95	(here \code{C}{exception}) acts as a catch-all, leaf types catch single types
96	and the exceptions in the middle can be used to catch different groups of
97	related exceptions.
98
99	This system has some notable advantages, such as multiple levels of grouping,
100	the ability for libraries to add new exception types and the isolation
101	between different sub-hierarchies.
102	This design is used in \CFA even though it is not a object-orientated
103	language; so different tools are used to create the hierarchy.
104
105	% Could I cite the rational for the Python IO exception rework?
106
107	\subsection{Completion}
108	After the handler has finished, the entire exception operation has to complete
109	and continue executing somewhere else. This step is usually simple,
110	both logically and in its implementation, as the installation of the handler
111	is usually set up to do most of the work.
112
113	The EHM can return control to many different places, where
114	the most common are after the handler definition (termination)
115	and after the raise (resumption).
116
117	\subsection{Communication}
118	For effective exception handling, additional information is often passed
119	from the raise to the handler and back again.
120	So far, only communication of the exceptions' identity is covered.
121	A common communication method for adding information to an exception
122	is putting fields into the exception instance
123	and giving the handler access to them.
124	% You can either have pointers/references in the exception, or have p/rs to
125	% the exception when it doesn't have to be copied.
126	Passing references or pointers allows data at the raise location to be
127	updated, passing information in both directions.
128
129	\section{Virtuals}
130	\label{s:virtuals}
131	Virtual types and casts are not part of \CFA's EHM nor are they required for
132	an EHM.
133	However, one of the best ways to support an exception hierarchy
134	is via a virtual hierarchy and dispatch system.
135	Ideally, the virtual system would have been part of \CFA before the work
136	on exception handling began, but unfortunately it was not.
137	Hence, only the features and framework needed for the EHM were
138	designed and implemented for this thesis.
139	Other features were considered to ensure that
140	the structure could accommodate other desirable features in the future
141	but are not implemented.
142	The rest of this section only discusses the implemented subset of the
143	virtual system design.
144
145	The virtual system supports multiple ``trees" of types. Each tree is
146	a simple hierarchy with a single root type. Each type in a tree has exactly
147	one parent -- except for the root type which has zero parents -- and any
148	number of children.
149	Any type that belongs to any of these trees is called a virtual type.
150	% A type's ancestors are its parent and its parent's ancestors.
151	% The root type has no ancestors.
152	% A type's descendants are its children and its children's descendants.
153
154	For the purposes of illustration, a proposed -- but unimplemented syntax --
155	will be used. Each virtual type is represented by a trait with an annotation
156	that makes it a virtual type. This annotation is empty for a root type, which
157	creates a new tree:
158	\begin{cfa}
159	trait root_type(T) virtual() {}
160	\end{cfa}
161	The annotation may also refer to any existing virtual type to make this new
162	type a child of that type and part of the same tree. The parent may itself
163	be a child or a root type and may have any number of existing children.
164	\begin{cfa}
165	trait child_a(T) virtual(root_type) {}
166	trait grandchild(T) virtual(child_a) {}
167	trait child_b(T) virtual(root_type) {}
168	\end{cfa}
169	\todo{Update the diagram in vtable.fig to show the new type tree.}
170
171	Every virtual type also has a list of virtual members and a unique id,
172	both are stored in a virtual table.
173	Every instance of a virtual type also has a pointer to a virtual table stored
174	in it, although there is no per-type virtual table as in many other languages.
175
176	The list of virtual members is built up down the tree. Every virtual type
177	inherits the list of virtual members from its parent and may add more
178	virtual members to the end of the list which are passed on to its children.
179	Again, using the unimplemented syntax this might look like:
180	\begin{cfa}
181	trait root_type(T) virtual() {
182	const char * to_string(T const & this);
183	unsigned int size;
184	}
185
186	trait child_type(T) virtual(root_type) {
187	char * irrelevant_function(int, char);
188	}
189	\end{cfa}
190	% Consider adding a diagram, but we might be good with the explanation.
191
192	As @child_type@ is a child of @root_type@ it has the virtual members of
193	@root_type@ (@to_string@ and @size@) as well as the one it declared
194	(@irrelevant_function@).
195
196	It is important to note that these are virtual members, and may contain
197	arbitrary fields, functions or otherwise.
198	The names ``size" and ``align" are reserved for the size and alignment of the
199	virtual type, and are always automatically initialized as such.
200	The other special case are uses of the trait's polymorphic argument
201	(@T@ in the example), which are always updated to refer to the current
202	virtual type. This allows functions that refer to to polymorphic argument
203	to act as traditional virtual methods (@to_string@ in the example), as the
204	object can always be passed to a virtual method in its virtual table.
205
206	Up until this point the virtual system is similar to ones found in
207	object-oriented languages but this is where \CFA diverges.
208	Objects encapsulate a single set of methods in each type,
209	universally across the entire program,
210	and indeed all programs that use that type definition.
211	The only way to change any method is to inherit and define a new type with
212	its own universal implementation. In this sense,
213	these object-oriented types are ``closed" and cannot be altered.
214	% Because really they are class oriented.
215
216	In \CFA, types do not encapsulate any code.
217	Whether or not satisfies any given assertion, and hence any trait, is
218	context sensitive. Types can begin to satisfy a trait, stop satisfying it or
219	satisfy the same trait at any lexical location in the program.
220	In this sense, an type's implementation in the set of functions and variables
221	that allow it to satisfy a trait is ``open" and can change
222	throughout the program.
223	This capability means it is impossible to pick a single set of functions
224	that represent a type's implementation across a program.
225
226	\CFA side-steps this issue by not having a single virtual table for each
227	type. A user can define virtual tables that are filled in at their
228	declaration and given a name. Anywhere that name is visible, even if it is
229	defined locally inside a function (although in this case the user must ensure
230	it outlives any objects that use it), it can be used.
231	Specifically, a virtual type is ``bound" to a virtual table that
232	sets the virtual members for that object. The virtual members can be accessed
233	through the object.
234
235	This means virtual tables are declared and named in \CFA.
236	They are declared as variables, using the type
237	@vtable(VIRTUAL_TYPE)@ and any valid name. For example:
238	\begin{cfa}
239	vtable(virtual_type_name) table_name;
240	\end{cfa}
241
242	Like any variable they may be forward declared with the @extern@ keyword.
243	Forward declaring virtual tables is relatively common.
244	Many virtual types have an ``obvious" implementation that works in most
245	cases.
246	A pattern that has appeared in the early work using virtuals is to
247	implement a virtual table with the the obvious definition and place a forward
248	declaration of it in the header beside the definition of the virtual type.
249
250	Even on the full declaration, no initializer should be used.
251	Initialization is automatic.
252	The type id and special virtual members ``size" and ``align" only depend on
253	the virtual type, which is fixed given the type of the virtual table and
254	so the compiler fills in a fixed value.
255	The other virtual members are resolved, using the best match to the member's
256	name and type, in the same context as the virtual table is declared using
257	\CFA's normal resolution rules.
258
259	While much of the virtual infrastructure is created, it is currently only used
260	internally for exception handling. The only user-level feature is the virtual
261	cast, which is the same as the \Cpp \code{C++}{dynamic_cast}.
262	\label{p:VirtualCast}
263	\begin{cfa}
264	(virtual TYPE)EXPRESSION
265	\end{cfa}
266	Note, the syntax and semantics matches a C-cast, rather than the function-like
267	\Cpp syntax for special casts. Both the type of @EXPRESSION@ and @TYPE@ must be
268	a pointer to a virtual type.
269	The cast dynamically checks if the @EXPRESSION@ type is the same or a sub-type
270	of @TYPE@, and if true, returns a pointer to the
271	@EXPRESSION@ object, otherwise it returns @0p@ (null pointer).
272
273	\section{Exceptions}
274
275	The syntax for declaring an exception is the same as declaring a structure
276	except the keyword that is swapped out:
277	\begin{cfa}
278	exception TYPE_NAME {
279	FIELDS
280	};
281	\end{cfa}
282
283	Fields are filled in the same way as a structure as well. However an extra
284	field is added that contains the pointer to the virtual table.
285	It must be explicitly initialized by the user when the exception is
286	constructed.
287
288	Here is an example of declaring an exception type along with a virtual table,
289	assuming the exception has an ``obvious" implementation and a default
290	virtual table makes sense.
291
292	\begin{minipage}[t]{0.4\textwidth}
293	Header:
294	\begin{cfa}
295	exception Example {
296	int data;
297	};
298
299	extern vtable(Example)
300	example_base_vtable;
301	\end{cfa}
302	\end{minipage}
303	\begin{minipage}[t]{0.6\textwidth}
304	Source:
305	\begin{cfa}
306	vtable(Example) example_base_vtable
307	\end{cfa}
308	\vfil
309	\end{minipage}
310
311	%\subsection{Exception Details}
312	This is the only interface needed when raising and handling exceptions.
313	However it is actually a short hand for a more complex
314	trait based interface.
315
316	The language views exceptions through a series of traits.
317	If a type satisfies them, then it can be used as an exception. The following
318	is the base trait all exceptions need to match.
319	\begin{cfa}
320	trait is_exception(exceptT &, virtualT &) {
321	// Numerous imaginary assertions.
322	};
323	\end{cfa}
324	The trait is defined over two types: the exception type and the virtual table
325	type. Each exception type should have a single virtual table type.
326	There are no actual assertions in this trait because the trait system
327	cannot express them yet (adding such assertions would be part of
328	completing the virtual system). The imaginary assertions would probably come
329	from a trait defined by the virtual system, and state that the exception type
330	is a virtual type, is a descendant of @exception_t@ (the base exception type)
331	and allow the user to find the virtual table type.
332
333	% I did have a note about how it is the programmer's responsibility to make
334	% sure the function is implemented correctly. But this is true of every
335	% similar system I know of (except Agda's I guess) so I took it out.
336
337	There are two more traits for exceptions defined as follows:
338	\begin{cfa}
339	trait is_termination_exception(
340	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
341	void defaultTerminationHandler(exceptT &);
342	};
343
344	trait is_resumption_exception(
345	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
346	void defaultResumptionHandler(exceptT &);
347	};
348	\end{cfa}
349	Both traits ensure a pair of types is an exception type, its virtual table
350	type
351	and defines one of the two default handlers. The default handlers are used
352	as fallbacks and are discussed in detail in \vref{s:ExceptionHandling}.
353
354	However, all three of these traits can be tricky to use directly.
355	While there is a bit of repetition required,
356	the largest issue is that the virtual table type is mangled and not in a user
357	facing way. So these three macros are provided to wrap these traits to
358	simplify referring to the names:
359	@IS_EXCEPTION@, @IS_TERMINATION_EXCEPTION@ and @IS_RESUMPTION_EXCEPTION@.
360
361	All three take one or two arguments. The first argument is the name of the
362	exception type. The macro passes its unmangled and mangled form to the trait.
363	The second (optional) argument is a parenthesized list of polymorphic
364	arguments. This argument is only used with polymorphic exceptions and the
365	list is be passed to both types.
366	In the current set-up, the two types always have the same polymorphic
367	arguments so these macros can be used without losing flexibility.
368
369	For example consider a function that is polymorphic over types that have a
370	defined arithmetic exception:
371	\begin{cfa}
372	forall(Num \| IS_EXCEPTION(Arithmetic, (Num)))
373	void some_math_function(Num & left, Num & right);
374	\end{cfa}
375
376	\section{Exception Handling}
377	\label{s:ExceptionHandling}
378	As stated,
379	\CFA provides two kinds of exception handling: termination and resumption.
380	These twin operations are the core of \CFA's exception handling mechanism.
381	This section covers the general patterns shared by the two operations and
382	then goes on to cover the details each individual operation.
383
384	Both operations follow the same set of steps.
385	First, a user raises an exception.
386	Second, the exception propagates up the stack, searching for a handler.
387	Third, if a handler is found, the exception is caught and the handler is run.
388	After that control continues at a raise-dependent location.
389	As an alternate to the third step,
390	if a handler is not found, a default handler is run and, if it returns,
391	then control
392	continues after the raise.
393
394	The differences between the two operations include how propagation is
395	performed, where execution continues after an exception is handled
396	and which default handler is run.
397
398	\subsection{Termination}
399	\label{s:Termination}
400	Termination handling is the familiar kind of handling
401	and used in most programming
402	languages with exception handling.
403	It is a dynamic, non-local goto. If the raised exception is matched and
404	handled, the stack is unwound and control (usually) continues in the function
405	on the call stack that defined the handler.
406	Termination is commonly used when an error has occurred and recovery is
407	impossible locally.
408
409	% (usually) Control can continue in the current function but then a different
410	% control flow construct should be used.
411
412	A termination raise is started with the @throw@ statement:
413	\begin{cfa}
414	throw EXPRESSION;
415	\end{cfa}
416	The expression must return a reference to a termination exception, where the
417	termination exception is any type that satisfies the trait
418	@is_termination_exception@ at the call site.
419	Through \CFA's trait system, the trait functions are implicitly passed into the
420	throw code for use by the EHM.
421	A new @defaultTerminationHandler@ can be defined in any scope to
422	change the throw's behaviour when a handler is not found (see below).
423
424	The throw copies the provided exception into managed memory to ensure
425	the exception is not destroyed if the stack is unwound.
426	It is the user's responsibility to ensure the original exception is cleaned
427	up whether the stack is unwound or not. Allocating it on the stack is
428	usually sufficient.
429
430	% How to say propagation starts, its first sub-step is the search.
431	Then propagation starts with the search. \CFA uses a ``first match" rule so
432	matching is performed with the copied exception as the search key.
433	It starts from the raise site and proceeds towards base of the stack,
434	from callee to caller.
435	At each stack frame, a check is made for termination handlers defined by the
436	@catch@ clauses of a @try@ statement.
437	\begin{cfa}
438	try {
439	GUARDED_BLOCK
440	} catch (EXCEPTION_TYPE$\(_1\)$ * [NAME$\(_1\)$]) {
441	HANDLER_BLOCK$\(_1\)$
442	} catch (EXCEPTION_TYPE$\(_2\)$ * [NAME$\(_2\)$]) {
443	HANDLER_BLOCK$\(_2\)$
444	}
445	\end{cfa}
446	When viewed on its own, a try statement simply executes the statements
447	in the \snake{GUARDED_BLOCK} and when those are finished,
448	the try statement finishes.
449
450	However, while the guarded statements are being executed, including any
451	invoked functions, all the handlers in these statements are included in the
452	search path.
453	Hence, if a termination exception is raised, these handlers may be matched
454	against the exception and may handle it.
455
456	Exception matching checks the handler in each catch clause in the order
457	they appear, top to bottom. If the representation of the raised exception type
458	is the same or a descendant of @EXCEPTION_TYPE@$_i$, then @NAME@$_i$
459	(if provided) is
460	bound to a pointer to the exception and the statements in @HANDLER_BLOCK@$_i$
461	are executed. If control reaches the end of the handler, the exception is
462	freed and control continues after the try statement.
463
464	If no termination handler is found during the search, then the default handler
465	(\defaultTerminationHandler) visible at the raise statement is called.
466	Through \CFA's trait system the best match at the raise statement is used.
467	This function is run and is passed the copied exception.
468	If the default handler finishes, control continues after the raise statement.
469
470	There is a global @defaultTerminationHandler@ that is polymorphic over all
471	termination exception types.
472	The global default termination handler performs a cancellation
473	(as described in \vref{s:Cancellation})
474	on the current stack with the copied exception.
475	Since it is so general, a more specific handler can be defined,
476	overriding the default behaviour for the specific exception types.
477
478	\subsection{Resumption}
479	\label{s:Resumption}
480
481	Resumption exception handling is less familar form of exception handling,
482	but is
483	just as old~\cite{Goodenough75} and is simpler in many ways.
484	It is a dynamic, non-local function call. If the raised exception is
485	matched, a closure is taken from up the stack and executed,
486	after which the raising function continues executing.
487	The common uses for resumption exceptions include
488	potentially repairable errors, where execution can continue in the same
489	function once the error is corrected, and
490	ignorable events, such as logging where nothing needs to happen and control
491	should always continue from the raise site.
492
493	Except for the changes to fit into that pattern, resumption exception
494	handling is symmetric with termination exception handling, by design
495	(see \autoref{s:Termination}).
496
497	A resumption raise is started with the @throwResume@ statement:
498	\begin{cfa}
499	throwResume EXPRESSION;
500	\end{cfa}
501	\todo{Decide on a final set of keywords and use them everywhere.}
502	It works much the same way as the termination raise, except the
503	type must satisfy the \snake{is_resumption_exception} that uses the
504	default handler: \defaultResumptionHandler.
505	This can be specialized for particular exception types.
506
507	At run-time, no exception copy is made. Since
508	resumption does not unwind the stack nor otherwise remove values from the
509	current scope, there is no need to manage memory to keep the exception
510	allocated.
511
512	Then propagation starts with the search,
513	following the same search path as termination,
514	from the raise site to the base of stack and top of try statement to bottom.
515	However, the handlers on try statements are defined by @catchResume@ clauses.
516	\begin{cfa}
517	try {
518	GUARDED_BLOCK
519	} catchResume (EXCEPTION_TYPE$\(_1\)$ * [NAME$\(_1\)$]) {
520	HANDLER_BLOCK$\(_1\)$
521	} catchResume (EXCEPTION_TYPE$\(_2\)$ * [NAME$\(_2\)$]) {
522	HANDLER_BLOCK$\(_2\)$
523	}
524	\end{cfa}
525	Note that termination handlers and resumption handlers may be used together
526	in a single try statement, intermixing @catch@ and @catchResume@ freely.
527	Each type of handler only interacts with exceptions from the matching
528	kind of raise.
529	Like @catch@ clauses, @catchResume@ clauses have no effect if an exception
530	is not raised.
531
532	The matching rules are exactly the same as well.
533	The first major difference here is that after
534	@EXCEPTION_TYPE@$_i$ is matched and @NAME@$_i$ is bound to the exception,
535	@HANDLER_BLOCK@$_i$ is executed right away without first unwinding the stack.
536	After the block has finished running control jumps to the raise site, where
537	the just handled exception came from, and continues executing after it,
538	not after the try statement.
539
540	\subsubsection{Resumption Marking}
541	\label{s:ResumptionMarking}
542	A key difference between resumption and termination is that resumption does
543	not unwind the stack. A side effect is that, when a handler is matched
544	and run, its try block (the guarded statements) and every try statement
545	searched before it are still on the stack. There presence can lead to
546	the recursive resumption problem.
547	\todo{Is there a citation for the recursive resumption problem?}
548
549	The recursive resumption problem is any situation where a resumption handler
550	ends up being called while it is running.
551	Consider a trivial case:
552	\begin{cfa}
553	try {
554	throwResume (E &){};
555	} catchResume(E *) {
556	throwResume (E &){};
557	}
558	\end{cfa}
559	When this code is executed, the guarded @throwResume@ starts a
560	search and matches the handler in the @catchResume@ clause. This
561	call is placed on the stack above the try-block.
562	Now the second raise in the handler searches the same try block,
563	matches again and then puts another instance of the
564	same handler on the stack leading to infinite recursion.
565
566	While this situation is trivial and easy to avoid, much more complex cycles
567	can form with multiple handlers and different exception types.
568	To prevent all of these cases, each try statement is ``marked" from the
569	time the exception search reaches it to either when a handler completes
570	handling that exception or when the search reaches the base
571	of the stack.
572	While a try statement is marked, its handlers are never matched, effectively
573	skipping over it to the next try statement.
574
575	\begin{center}
576	\input{stack-marking}
577	\end{center}
578
579	There are other sets of marking rules that could be used,
580	for instance, marking just the handlers that caught the exception,
581	would also prevent recursive resumption.
582	However, the rules selected mirrors what happens with termination,
583	so this reduces the amount of rules and patterns a programmer has to know.
584
585	The marked try statements are the ones that would be removed from
586	the stack for a termination exception, \ie those on the stack
587	between the handler and the raise statement.
588	This symmetry applies to the default handler as well, as both kinds of
589	default handlers are run at the raise statement, rather than (physically
590	or logically) at the bottom of the stack.
591	% In early development having the default handler happen after
592	% unmarking was just more useful. We assume that will continue.
593
594	\section{Conditional Catch}
595	Both termination and resumption handler clauses can be given an additional
596	condition to further control which exceptions they handle:
597	\begin{cfa}
598	catch (EXCEPTION_TYPE * [NAME] ; CONDITION)
599	\end{cfa}
600	First, the same semantics is used to match the exception type. Second, if the
601	exception matches, @CONDITION@ is executed. The condition expression may
602	reference all names in scope at the beginning of the try block and @NAME@
603	introduced in the handler clause. If the condition is true, then the handler
604	matches. Otherwise, the exception search continues as if the exception type
605	did not match.
606
607	The condition matching allows finer matching by checking
608	more kinds of information than just the exception type.
609	\begin{cfa}
610	try {
611	handle1 = open( f1, ... );
612	handle2 = open( f2, ... );
613	handle3 = open( f3, ... );
614	...
615	} catch( IOFailure * f ; fd( f ) == f1 ) {
616	// Only handle IO failure for f1.
617	} catch( IOFailure * f ; fd( f ) == f3 ) {
618	// Only handle IO failure for f3.
619	}
620	// Handle a failure relating to f2 further down the stack.
621	\end{cfa}
622	In this example the file that experienced the IO error is used to decide
623	which handler should be run, if any at all.
624
625	\begin{comment}
626	% I know I actually haven't got rid of them yet, but I'm going to try
627	% to write it as if I had and see if that makes sense:
628	\section{Reraising}
629	\label{s:Reraising}
630	Within the handler block or functions called from the handler block, it is
631	possible to reraise the most recently caught exception with @throw@ or
632	@throwResume@, respectively.
633	\begin{cfa}
634	try {
635	...
636	} catch( ... ) {
637	... throw;
638	} catchResume( ... ) {
639	... throwResume;
640	}
641	\end{cfa}
642	The only difference between a raise and a reraise is that reraise does not
643	create a new exception; instead it continues using the current exception, \ie
644	no allocation and copy. However the default handler is still set to the one
645	visible at the raise point, and hence, for termination could refer to data that
646	is part of an unwound stack frame. To prevent this problem, a new default
647	handler is generated that does a program-level abort.
648	\end{comment}
649
650	\subsection{Comparison with Reraising}
651	In languages without conditional catch, that is no ability to match an
652	exception based on something other than its type, it can be mimicked
653	by matching all exceptions of the right type, checking any additional
654	conditions inside the handler and re-raising the exception if it does not
655	match those.
656
657	Here is a minimal example comparing both patterns, using @throw;@
658	(no argument) to start a re-raise.
659	\begin{center}
660	\begin{tabular}{l r}
661	\begin{cfa}
662	try {
663	do_work_may_throw();
664	} catch(exception_t * exc ;
665	can_handle(exc)) {
666	handle(exc);
667	}
668
669
670
671	\end{cfa}
672	&
673	\begin{cfa}
674	try {
675	do_work_may_throw();
676	} catch(exception_t * exc) {
677	if (can_handle(exc)) {
678	handle(exc);
679	} else {
680	throw;
681	}
682	}
683	\end{cfa}
684	\end{tabular}
685	\end{center}
686	At first glance catch-and-reraise may appear to just be a quality of life
687	feature, but there are some significant differences between the two
688	stratagies.
689
690	A simple difference that is more important for \CFA than many other languages
691	is that the raise site changes, with a re-raise but does not with a
692	conditional catch.
693	This is important in \CFA because control returns to the raise site to run
694	the per-site default handler. Because of this only a conditional catch can
695	allow the original raise to continue.
696
697	The more complex issue comes from the difference in how conditional
698	catches and re-raises handle multiple handlers attached to a single try
699	statement. A conditional catch will continue checking later handlers while
700	a re-raise will skip them.
701	If the different handlers could handle some of the same exceptions,
702	translating a try statement that uses one to use the other can quickly
703	become non-trivial:
704
705	\noindent
706	Original, with conditional catch:
707	\begin{cfa}
708	...
709	} catch (an_exception * e ; check_a(e)) {
710	handle_a(e);
711	} catch (exception_t * e ; check_b(e)) {
712	handle_b(e);
713	}
714	\end{cfa}
715	Translated, with re-raise:
716	\begin{cfa}
717	...
718	} catch (exception_t * e) {
719	an_exception * an_e = (virtual an_exception *)e;
720	if (an_e && check_a(an_e)) {
721	handle_a(an_e);
722	} else if (check_b(e)) {
723	handle_b(e);
724	} else {
725	throw;
726	}
727	}
728	\end{cfa}
729	(There is a simpler solution if @handle_a@ never raises exceptions,
730	using nested try statements.)
731
732	% } catch (an_exception * e ; check_a(e)) {
733	% handle_a(e);
734	% } catch (exception_t * e ; !(virtual an_exception *)e && check_b(e)) {
735	% handle_b(e);
736	% }
737	%
738	% } catch (an_exception * e)
739	% if (check_a(e)) {
740	% handle_a(e);
741	% } else throw;
742	% } catch (exception_t * e)
743	% if (check_b(e)) {
744	% handle_b(e);
745	% } else throw;
746	% }
747	In similar simple examples translating from re-raise to conditional catch
748	takes less code but it does not have a general trivial solution either.
749
750	So, given that the two patterns do not trivially translate into each other,
751	it becomes a matter of which on should be encouraged and made the default.
752	From the premise that if a handler that could handle an exception then it
753	should, it follows that checking as many handlers as possible is preferred.
754	So conditional catch and checking later handlers is a good default.
755
756	\section{Finally Clauses}
757	\label{s:FinallyClauses}
758	Finally clauses are used to preform unconditional clean-up when leaving a
759	scope and are placed at the end of a try statement after any handler clauses:
760	\begin{cfa}
761	try {
762	GUARDED_BLOCK
763	} ... // any number or kind of handler clauses
764	... finally {
765	FINALLY_BLOCK
766	}
767	\end{cfa}
768	The @FINALLY_BLOCK@ is executed when the try statement is removed from the
769	stack, including when the @GUARDED_BLOCK@ finishes, any termination handler
770	finishes or during an unwind.
771	The only time the block is not executed is if the program is exited before
772	the stack is unwound.
773
774	Execution of the finally block should always finish, meaning control runs off
775	the end of the block. This requirement ensures control always continues as if
776	the finally clause is not present, \ie finally is for cleanup not changing
777	control flow.
778	Because of this requirement, local control flow out of the finally block
779	is forbidden. The compiler precludes any @break@, @continue@, @fallthru@ or
780	@return@ that causes control to leave the finally block. Other ways to leave
781	the finally block, such as a long jump or termination are much harder to check,
782	and at best requiring additional run-time overhead, and so are only
783	discouraged.
784
785	Not all languages with unwinding have finally clauses. Notably \Cpp does
786	without it as destructors, and the RAII design pattern, serve a similar role.
787	Although destructors and finally clauses can be used for the same cases,
788	they have their own strengths, similar to top-level function and lambda
789	functions with closures.
790	Destructors take more work to create, but if there is clean-up code
791	that needs to be run every time a type is used, they are much easier
792	to set-up for each use. % It's automatic.
793	On the other hand finally clauses capture the local context, so is easy to
794	use when the clean-up is not dependent on the type of a variable or requires
795	information from multiple variables.
796
797	\section{Cancellation}
798	\label{s:Cancellation}
799	Cancellation is a stack-level abort, which can be thought of as as an
800	uncatchable termination. It unwinds the entire current stack, and if
801	possible forwards the cancellation exception to a different stack.
802
803	Cancellation is not an exception operation like termination or resumption.
804	There is no special statement for starting a cancellation; instead the standard
805	library function @cancel_stack@ is called passing an exception. Unlike a
806	raise, this exception is not used in matching only to pass information about
807	the cause of the cancellation.
808	Finally, as no handler is provided, there is no default handler.
809
810	After @cancel_stack@ is called the exception is copied into the EHM's memory
811	and the current stack is unwound.
812	The behaviour after that depends on the kind of stack being cancelled.
813
814	\paragraph{Main Stack}
815	The main stack is the one used by the program main at the start of execution,
816	and is the only stack in a sequential program.
817	After the main stack is unwound there is a program-level abort.
818
819	The first reason for this behaviour is for sequential programs where there
820	is only one stack, and hence to stack to pass information to.
821	Second, even in concurrent programs, the main stack has no dependency
822	on another stack and no reliable way to find another living stack.
823	Finally, keeping the same behaviour in both sequential and concurrent
824	programs is simple and easy to understand.
825
826	\paragraph{Thread Stack}
827	A thread stack is created for a \CFA @thread@ object or object that satisfies
828	the @is_thread@ trait.
829	After a thread stack is unwound, the exception is stored until another
830	thread attempts to join with it. Then the exception @ThreadCancelled@,
831	which stores a reference to the thread and to the exception passed to the
832	cancellation, is reported from the join to the joining thread.
833	There is one difference between an explicit join (with the @join@ function)
834	and an implicit join (from a destructor call). The explicit join takes the
835	default handler (@defaultResumptionHandler@) from its calling context while
836	the implicit join provides its own; which does a program abort if the
837	@ThreadCancelled@ exception cannot be handled.
838
839	The communication and synchronization are done here because threads only have
840	two structural points (not dependent on user-code) where
841	communication/synchronization happens: start and join.
842	Since a thread must be running to perform a cancellation (and cannot be
843	cancelled from another stack), the cancellation must be after start and
844	before the join, so join is used.
845
846	% TODO: Find somewhere to discuss unwind collisions.
847	The difference between the explicit and implicit join is for safety and
848	debugging. It helps prevent unwinding collisions by avoiding throwing from
849	a destructor and prevents cascading the error across multiple threads if
850	the user is not equipped to deal with it.
851	It is always possible to add an explicit join if that is the desired behaviour.
852
853	With explicit join and a default handler that triggers a cancellation, it is
854	possible to cascade an error across any number of threads,
855	alternating between the resumption (possibly termination) and cancellation,
856	cleaning up each
857	in turn, until the error is handled or the main thread is reached.
858
859	\paragraph{Coroutine Stack}
860	A coroutine stack is created for a @coroutine@ object or object that
861	satisfies the @is_coroutine@ trait.
862	After a coroutine stack is unwound, control returns to the @resume@ function
863	that most recently resumed it. @resume@ reports a
864	@CoroutineCancelled@ exception, which contains a references to the cancelled
865	coroutine and the exception used to cancel it.
866	The @resume@ function also takes the \defaultResumptionHandler{} from the
867	caller's context and passes it to the internal report.
868
869	A coroutine only knows of two other coroutines,
870	its starter and its last resumer.
871	The starter has a much more distant connection, while the last resumer just
872	(in terms of coroutine state) called resume on this coroutine, so the message
873	is passed to the latter.
874
875	With a default handler that triggers a cancellation, it is possible to
876	cascade an error across any number of coroutines,
877	alternating between the resumption (possibly termination) and cancellation,
878	cleaning up each in turn,
879	until the error is handled or a thread stack is reached.

Note: See TracBrowser for help on using the repository browser.

Download in other formats: