Context Navigation

source: doc/theses/andrew_beach_MMath/features.tex @ 7780bd2

ADTast-experimentalenumforall-pointer-decayjacob/cs343-translationnew-ast-unique-exprpthread-emulationqualifiedEnum

Last change on this file since 7780bd2 was aa173d8, checked in by Peter A. Buhr <pabuhr@…>, 3 years ago
proofread features chapter of Andrew's thesis
Property mode set to `100644`
File size: 37.9 KB

Line
1	\chapter{Exception Features}
2	\label{c:features}
3
4	This chapter covers the design and user interface of the \CFA EHM
5	and begins with a general overview of EHMs. It is not a strict
6	definition of all EHMs nor an exhaustive list of all possible features.
7	However it does cover the most common structure and features found in them.
8
9	\section{Overview of EHMs}
10	% We should cover what is an exception handling mechanism and what is an
11	% exception before this. Probably in the introduction. Some of this could
12	% move there.
13	\subsection{Raise / Handle}
14	An exception operation has two main parts: raise and handle.
15	These terms are sometimes known as throw and catch but this work uses
16	throw/catch as a particular kind of raise/handle.
17	These are the two parts that the user writes and may
18	be the only two pieces of the EHM that have any syntax in a language.
19
20	\paragraph{Raise}
21	The raise is the starting point for exception handling
22	by raising an exception, which passes it to
23	the EHM.
24
25	Some well known examples include the @throw@ statements of \Cpp and Java and
26	the \code{Python}{raise} statement of Python. In real systems, a raise may
27	perform some other work (such as memory management) but for the
28	purposes of this overview that can be ignored.
29
30	\paragraph{Handle}
31	The primary purpose of an EHM is to run some user code to handle a raised
32	exception. This code is given, with some other information, in a handler.
33
34	A handler has three common features: the previously mentioned user code, a
35	region of code it guards, and an exception label/condition that matches
36	the raised exception.
37	Only raises inside the guarded region and raising exceptions that match the
38	label can be handled by a given handler.
39	If multiple handlers could can handle an exception,
40	EHMs define a rule to pick one, such as ``best match" or ``first found".
41
42	The @try@ statements of \Cpp, Java and Python are common examples. All three
43	show the common features of guarded region, raise, matching and handler.
44	\begin{cfa}
45	try { // guarded region
46	...
47	throw exception; // raise
48	...
49	} catch( exception ) { // matching condition, with exception label
50	... // handler code
51	}
52	\end{cfa}
53
54	\subsection{Propagation}
55	After an exception is raised comes what is usually the biggest step for the
56	EHM: finding and setting up the handler for execution. The propagation from raise to
57	handler can be broken up into three different tasks: searching for a handler,
58	matching against the handler and installing the handler.
59
60	\paragraph{Searching}
61	The EHM begins by searching for handlers that might be used to handle
62	the exception. The search is restricted to
63	handlers that have the raise site in their guarded
64	region.
65	The search includes handlers in the current function, as well as any in
66	callers on the stack that have the function call in their guarded region.
67
68	\paragraph{Matching}
69	Each handler found is matched with the raised exception. The exception
70	label defines a condition that is used with the exception and decides if
71	there is a match or not.
72	In languages where the first match is used, this step is intertwined with
73	searching; a match check is performed immediately after the search finds
74	a handler.
75
76	\paragraph{Installing}
77	After a handler is chosen, it must be made ready to run.
78	The implementation can vary widely to fit with the rest of the
79	design of the EHM. The installation step might be trivial or it could be
80	the most expensive step in handling an exception. The latter tends to be the
81	case when stack unwinding is involved.
82
83	If a matching handler is not guaranteed to be found, the EHM needs a
84	different course of action for this case.
85	This situation only occurs with unchecked exceptions as checked exceptions
86	(such as in Java) are guaranteed to find a matching handler.
87	The unhandled action is usually very general, such as aborting the program.
88
89	\paragraph{Hierarchy}
90	A common way to organize exceptions is in a hierarchical structure.
91	This pattern comes from object-orientated languages where the
92	exception hierarchy is a natural extension of the object hierarchy.
93
94	Consider the following exception hierarchy:
95	\begin{center}
96	\input{exception-hierarchy}
97	\end{center}
98	A handler labeled with any given exception can handle exceptions of that
99	type or any child type of that exception. The root of the exception hierarchy
100	(here \code{C}{exception}) acts as a catch-all, leaf types catch single types,
101	and the exceptions in the middle can be used to catch different groups of
102	related exceptions.
103
104	This system has some notable advantages, such as multiple levels of grouping,
105	the ability for libraries to add new exception types, and the isolation
106	between different sub-hierarchies.
107	This design is used in \CFA even though it is not a object-orientated
108	language; so different tools are used to create the hierarchy.
109
110	% Could I cite the rational for the Python IO exception rework?
111
112	\subsection{Completion}
113	After the handler has finished, the entire exception operation has to complete
114	and continue executing somewhere else. This step is usually simple,
115	both logically and in its implementation, as the installation of the handler
116	is usually set up to do most of the work.
117
118	The EHM can return control to many different places, where
119	the most common are after the handler definition (termination)
120	and after the raise (resumption).
121
122	\subsection{Communication}
123	For effective exception handling, additional information is often passed
124	from the raise to the handler and back again.
125	So far, only communication of the exception's identity is covered.
126	A common communication method for passing more information is putting fields into the exception instance
127	and giving the handler access to them.
128	Using reference fields pointing to data at the raise location allows data to be
129	passed in both directions.
130
131	\section{Virtuals}
132	Virtual types and casts are not part of \CFA's EHM nor are they required for
133	an EHM.
134	However, one of the best ways to support an exception hierarchy
135	is via a virtual hierarchy and dispatch system.
136
137	Ideally, the virtual system should have been part of \CFA before the work
138	on exception handling began, but unfortunately it was not.
139	Hence, only the features and framework needed for the EHM were
140	designed and implemented for this thesis. Other features were considered to ensure that
141	the structure could accommodate other desirable features in the future
142	but are not implemented.
143	The rest of this section only discusses the implemented subset of the
144	virtual-system design.
145
146	The virtual system supports multiple ``trees" of types. Each tree is
147	a simple hierarchy with a single root type. Each type in a tree has exactly
148	one parent -- except for the root type which has zero parents -- and any
149	number of children.
150	Any type that belongs to any of these trees is called a virtual type.
151
152	% A type's ancestors are its parent and its parent's ancestors.
153	% The root type has no ancestors.
154	% A type's descendants are its children and its children's descendants.
155
156	Every virtual type also has a list of virtual members. Children inherit
157	their parent's list of virtual members but may add new members to it.
158	It is important to note that these are virtual members, not virtual methods
159	of object-orientated programming, and can be of any type.
160
161	\PAB{Need to look at these when done.
162
163	\CFA still supports virtual methods as a special case of virtual members.
164	Function pointers that take a pointer to the virtual type are modified
165	with each level of inheritance so that refers to the new type.
166	This means an object can always be passed to a function in its virtual table
167	as if it were a method.
168	\todo{Clarify (with an example) virtual methods.}
169
170	Each virtual type has a unique id.
171	This id and all the virtual members are combined
172	into a virtual table type. Each virtual type has a pointer to a virtual table
173	as a hidden field.
174	\todo{Might need a diagram for virtual structure.}
175	}%
176
177	Up until this point the virtual system is similar to ones found in
178	object-orientated languages but this is where \CFA diverges. Objects encapsulate a
179	single set of methods in each type, universally across the entire program,
180	and indeed all programs that use that type definition. Even if a type inherits and adds methods, it still encapsulate a
181	single set of methods. In this sense,
182	object-oriented types are ``closed" and cannot be altered.
183
184	In \CFA, types do not encapsulate any code. Traits are local for each function and
185	types can satisfy a local trait, stop satisfying it or, satisfy the same
186	trait in a different way at any lexical location in the program where a function is call.
187	In this sense, the set of functions/variables that satisfy a trait for a type is ``open" as the set can change at every call site.
188	This capability means it is impossible to pick a single set of functions
189	that represent a type's implementation across a program.
190
191	\CFA side-steps this issue by not having a single virtual table for each
192	type. A user can define virtual tables that are filled in at their
193	declaration and given a name. Anywhere that name is visible, even if it is
194	defined locally inside a function \PAB{What does this mean? (although that means it does not have a
195	static lifetime)}, it can be used.
196	Specifically, a virtual type is ``bound" to a virtual table that
197	sets the virtual members for that object. The virtual members can be accessed
198	through the object.
199
200	While much of the virtual infrastructure is created, it is currently only used
201	internally for exception handling. The only user-level feature is the virtual
202	cast, which is the same as the \Cpp \code{C++}{dynamic_cast}.
203	\label{p:VirtualCast}
204	\begin{cfa}
205	(virtual TYPE)EXPRESSION
206	\end{cfa}
207	Note, the syntax and semantics matches a C-cast, rather than the function-like
208	\Cpp syntax for special casts. Both the type of @EXPRESSION@ and @TYPE@ must be
209	a pointer to a virtual type.
210	The cast dynamically checks if the @EXPRESSION@ type is the same or a sub-type
211	of @TYPE@, and if true, returns a pointer to the
212	@EXPRESSION@ object, otherwise it returns @0p@ (null pointer).
213
214	\section{Exception}
215	% Leaving until later, hopefully it can talk about actual syntax instead
216	% of my many strange macros. Syntax aside I will also have to talk about the
217	% features all exceptions support.
218
219	Exceptions are defined by the trait system; there are a series of traits, and
220	if a type satisfies them, then it can be used as an exception. The following
221	is the base trait all exceptions need to match.
222	\begin{cfa}
223	trait is_exception(exceptT &, virtualT &) {
224	// Numerous imaginary assertions.
225	};
226	\end{cfa}
227	The trait is defined over two types, the exception type and the virtual table
228	type. Each exception type should have a single virtual table type.
229	There are no actual assertions in this trait because the trait system
230	cannot express them yet (adding such assertions would be part of
231	completing the virtual system). The imaginary assertions would probably come
232	from a trait defined by the virtual system, and state that the exception type
233	is a virtual type, is a descendant of @exception_t@ (the base exception type),
234	and note its virtual table type.
235
236	% I did have a note about how it is the programmer's responsibility to make
237	% sure the function is implemented correctly. But this is true of every
238	% similar system I know of (except Agda's I guess) so I took it out.
239
240	There are two more traits for exceptions defined as follows:
241	\begin{cfa}
242	trait is_termination_exception(
243	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
244	void defaultTerminationHandler(exceptT &);
245	};
246
247	trait is_resumption_exception(
248	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
249	void defaultResumptionHandler(exceptT &);
250	};
251	\end{cfa}
252	Both traits ensure a pair of types are an exception type, its virtual table
253	type,
254	and defines one of the two default handlers. The default handlers are used
255	as fallbacks and are discussed in detail in \vref{s:ExceptionHandling}.
256
257	However, all three of these traits can be tricky to use directly.
258	While there is a bit of repetition required,
259	the largest issue is that the virtual table type is mangled and not in a user
260	facing way. So these three macros are provided to wrap these traits to
261	simplify referring to the names:
262	@IS_EXCEPTION@, @IS_TERMINATION_EXCEPTION@, and @IS_RESUMPTION_EXCEPTION@.
263
264	All three take one or two arguments. The first argument is the name of the
265	exception type. The macro passes its unmangled and mangled form to the trait.
266	The second (optional) argument is a parenthesized list of polymorphic
267	arguments. This argument is only used with polymorphic exceptions and the
268	list is be passed to both types.
269	In the current set-up, the two types always have the same polymorphic
270	arguments so these macros can be used without losing flexibility.
271
272	For example consider a function that is polymorphic over types that have a
273	defined arithmetic exception:
274	\begin{cfa}
275	forall(Num \| IS_EXCEPTION(Arithmetic, (Num)))
276	void some_math_function(Num & left, Num & right);
277	\end{cfa}
278
279	\section{Exception Handling}
280	\label{s:ExceptionHandling}
281	As stated,
282	\CFA provides two kinds of exception handling: termination and resumption.
283	These twin operations are the core of \CFA's exception handling mechanism.
284	This section covers the general patterns shared by the two operations and
285	then goes on to cover the details of each individual operation.
286
287	Both operations follow the same set of steps.
288	First, a user raises an exception.
289	Second, the exception propagates up the stack.
290	Third, if a handler is found, the exception is caught and the handler is run.
291	After that control continues at a raise-dependent location.
292	Fourth, if a handler is not found, a default handler is run and, if it returns, then control
293	continues after the raise.
294
295	%This general description covers what the two kinds have in common.
296	The differences in the two operations include how propagation is performed, where execution continues
297	after an exception is caught and handled, and which default handler is run.
298
299	\subsection{Termination}
300	\label{s:Termination}
301	Termination handling is the familiar EHM and used in most programming
302	languages with exception handling.
303	It is a dynamic, non-local goto. If the raised exception is matched and
304	handled, the stack is unwound and control (usually) continues in the function
305	on the call stack that defined the handler.
306	Termination is commonly used when an error has occurred and recovery is
307	impossible locally.
308
309	% (usually) Control can continue in the current function but then a different
310	% control flow construct should be used.
311
312	A termination raise is started with the @throw@ statement:
313	\begin{cfa}
314	throw EXPRESSION;
315	\end{cfa}
316	The expression must return a reference to a termination exception, where the
317	termination exception is any type that satisfies the trait
318	@is_termination_exception@ at the call site.
319	Through \CFA's trait system, the trait functions are implicitly passed into the
320	throw code for use by the EHM.
321	A new @defaultTerminationHandler@ can be defined in any scope to
322	change the throw's behaviour when a handler is not found (see below).
323
324	The throw copies the provided exception into managed memory to ensure
325	the exception is not destroyed if the stack is unwound.
326	It is the user's responsibility to ensure the original exception is cleaned
327	up whether the stack is unwound or not. Allocating it on the stack is
328	usually sufficient.
329
330	% How to say propagation starts, its first sub-step is the search.
331	Then propagation starts with the search. \CFA uses a ``first match" rule so
332	matching is performed with the copied exception as the search key.
333	It starts from the raise in the throwing function and proceeds towards the base of the stack,
334	from callee to caller.
335	At each stack frame, a check is made for termination handlers defined by the
336	@catch@ clauses of a @try@ statement.
337	\begin{cfa}
338	try {
339	GUARDED_BLOCK
340	} catch (EXCEPTION_TYPE$\(_1\)$ * [NAME$\(_1\)$]) {
341	HANDLER_BLOCK$\(_1\)$
342	} catch (EXCEPTION_TYPE$\(_2\)$ * [NAME$\(_2\)$]) {
343	HANDLER_BLOCK$\(_2\)$
344	}
345	\end{cfa}
346	When viewed on its own, a try statement simply executes the statements
347	in the \snake{GUARDED_BLOCK}, and when those are finished,
348	the try statement finishes.
349
350	However, while the guarded statements are being executed, including any
351	invoked functions, all the handlers in these statements are included in the
352	search path.
353	Hence, if a termination exception is raised, these handlers may be matched
354	against the exception and may handle it.
355
356	Exception matching checks the handler in each catch clause in the order
357	they appear, top to bottom. If the representation of the raised exception type
358	is the same or a descendant of @EXCEPTION_TYPE@$_i$, then @NAME@$_i$
359	(if provided) is
360	bound to a pointer to the exception and the statements in @HANDLER_BLOCK@$_i$
361	are executed. If control reaches the end of the handler, the exception is
362	freed and control continues after the try statement.
363
364	If no termination handler is found during the search, then the default handler
365	(\defaultTerminationHandler) visible at the raise statement is called.
366	Through \CFA's trait system the best match at the raise statement is used.
367	This function is run and is passed the copied exception.
368	If the default handler finishes, control continues after the raise statement.
369
370	There is a global @defaultTerminationHandler@ that is polymorphic over all
371	termination exception types.
372	The global default termination handler performs a cancellation
373	(see \vref{s:Cancellation} for the justification) on the current stack with the copied exception.
374	Since it is so general, a more specific handler is usually
375	defined, possibly with a detailed message, and used for specific exception type, effectively overriding the default handler.
376
377	\subsection{Resumption}
378	\label{s:Resumption}
379
380	Resumption exception handling is the less familar EHM, but is
381	just as old~\cite{Goodenough75} and is simpler in many ways.
382	It is a dynamic, non-local function call. If the raised exception is
383	matched, a closure is taken from up the stack and executed,
384	after which the raising function continues executing.
385	The common uses for resumption exceptions include
386	potentially repairable errors, where execution can continue in the same
387	function once the error is corrected, and
388	ignorable events, such as logging where nothing needs to happen and control
389	should always continue from the raise point.
390
391	A resumption raise is started with the @throwResume@ statement:
392	\begin{cfa}
393	throwResume EXPRESSION;
394	\end{cfa}
395	\todo{Decide on a final set of keywords and use them everywhere.}
396	It works much the same way as the termination throw.
397	The expression must return a reference to a resumption exception,
398	where the resumption exception is any type that satisfies the trait
399	@is_resumption_exception@ at the call site.
400	The assertions from this trait are available to
401	the exception system while handling the exception.
402
403	At run-time, no exception copy is made, since
404	resumption does not unwind the stack nor otherwise remove values from the
405	current scope, so there is no need to manage memory to keep the exception in scope.
406
407	Then propagation starts with the search. It starts from the raise in the
408	resuming function and proceeds towards the base of the stack,
409	from callee to caller.
410	At each stack frame, a check is made for resumption handlers defined by the
411	@catchResume@ clauses of a @try@ statement.
412	\begin{cfa}
413	try {
414	GUARDED_BLOCK
415	} catchResume (EXCEPTION_TYPE$\(_1\)$ * [NAME$\(_1\)$]) {
416	HANDLER_BLOCK$\(_1\)$
417	} catchResume (EXCEPTION_TYPE$\(_2\)$ * [NAME$\(_2\)$]) {
418	HANDLER_BLOCK$\(_2\)$
419	}
420	\end{cfa}
421	% PAB, you say this above.
422	% When a try statement is executed, it simply executes the statements in the
423	% @GUARDED_BLOCK@ and then finishes.
424	%
425	% However, while the guarded statements are being executed, including any
426	% invoked functions, all the handlers in these statements are included in the
427	% search path.
428	% Hence, if a resumption exception is raised, these handlers may be matched
429	% against the exception and may handle it.
430	%
431	% Exception matching checks the handler in each catch clause in the order
432	% they appear, top to bottom. If the representation of the raised exception type
433	% is the same or a descendant of @EXCEPTION_TYPE@$_i$, then @NAME@$_i$
434	% (if provided) is bound to a pointer to the exception and the statements in
435	% @HANDLER_BLOCK@$_i$ are executed.
436	% If control reaches the end of the handler, execution continues after the
437	% the raise statement that raised the handled exception.
438	%
439	% Like termination, if no resumption handler is found during the search,
440	% then the default handler (\defaultResumptionHandler) visible at the raise
441	% statement is called. It will use the best match at the raise sight according
442	% to \CFA's overloading rules. The default handler is
443	% passed the exception given to the raise. When the default handler finishes
444	% execution continues after the raise statement.
445	%
446	% There is a global @defaultResumptionHandler{} is polymorphic over all
447	% resumption exceptions and performs a termination throw on the exception.
448	% The \defaultTerminationHandler{} can be overridden by providing a new
449	% function that is a better match.
450
451	The @GUARDED_BLOCK@ and its associated nested guarded statements work the same
452	for resumption as for termination, as does exception matching at each
453	@catchResume@. Similarly, if no resumption handler is found during the search,
454	then the currently visible default handler (\defaultResumptionHandler) is
455	called and control continues after the raise statement if it returns. Finally,
456	there is also a global @defaultResumptionHandler@, which can be overridden,
457	that is polymorphic over all resumption exceptions but performs a termination
458	throw on the exception rather than a cancellation.
459
460	Throwing the exception in @defaultResumptionHandler@ has the positive effect of
461	walking the stack a second time for a recovery handler. Hence, a programmer has
462	two chances for help with a problem, fixup or recovery, should either kind of
463	handler appear on the stack. However, this dual stack walk leads to following
464	apparent anomaly:
465	\begin{cfa}
466	try {
467	throwResume E;
468	} catch (E) {
469	// this handler runs
470	}
471	\end{cfa}
472	because the @catch@ appears to handle a @throwResume@, but a @throwResume@ only
473	matches with @catchResume@. The anomaly results because the unmatched
474	@catchResuem@, calls @defaultResumptionHandler@, which in turn throws @E@.
475
476	% I wonder if there would be some good central place for this.
477	Note, termination and resumption handlers may be used together
478	in a single try statement, intermixing @catch@ and @catchResume@ freely.
479	Each type of handler only interacts with exceptions from the matching
480	kind of raise.
481
482	\subsubsection{Resumption Marking}
483	\label{s:ResumptionMarking}
484	A key difference between resumption and termination is that resumption does
485	not unwind the stack. A side effect is that, when a handler is matched
486	and run, its try block (the guarded statements) and every try statement
487	searched before it are still on the stack. There presence can lead to
488	the \emph{recursive resumption problem}.
489
490	The recursive resumption problem is any situation where a resumption handler
491	ends up being called while it is running.
492	Consider a trivial case:
493	\begin{cfa}
494	try {
495	throwResume (E &){};
496	} catchResume(E *) {
497	throwResume (E &){};
498	}
499	\end{cfa}
500	When this code is executed, the guarded @throwResume@ starts a
501	search and matches the handler in the @catchResume@ clause. This
502	call is placed on the stack above the try-block. Now the second raise in the handler
503	searches the same try block, matches, and puts another instance of the
504	same handler on the stack leading to infinite recursion.
505
506	While this situation is trivial and easy to avoid, much more complex cycles can
507	form with multiple handlers and different exception types. The key point is
508	that the programmer's intuition expects every raise in a handler to start
509	searching \emph{below} the @try@ statement, making it difficult to understand
510	and fix the problem.
511
512	To prevent all of these cases, each try statement is ``marked" from the
513	time the exception search reaches it to either when a matching handler
514	completes or when the search reaches the base
515	of the stack.
516	While a try statement is marked, its handlers are never matched, effectively
517	skipping over it to the next try statement.
518
519	\begin{center}
520	\input{stack-marking}
521	\end{center}
522
523	There are other sets of marking rules that could be used,
524	for instance, marking just the handlers that caught the exception,
525	would also prevent recursive resumption.
526	However, the rule selected mirrors what happens with termination,
527	and hence, matches programmer intuition that a raise searches below a try.
528
529	In detail, the marked try statements are the ones that would be removed from
530	the stack for a termination exception, \ie those on the stack
531	between the handler and the raise statement.
532	This symmetry applies to the default handler as well, as both kinds of
533	default handlers are run at the raise statement, rather than (physically
534	or logically) at the bottom of the stack.
535	% In early development having the default handler happen after
536	% unmarking was just more useful. We assume that will continue.
537
538	\section{Conditional Catch}
539	Both termination and resumption handler clauses can be given an additional
540	condition to further control which exceptions they handle:
541	\begin{cfa}
542	catch (EXCEPTION_TYPE * [NAME] ; CONDITION)
543	\end{cfa}
544	First, the same semantics is used to match the exception type. Second, if the
545	exception matches, @CONDITION@ is executed. The condition expression may
546	reference all names in scope at the beginning of the try block and @NAME@
547	introduced in the handler clause. If the condition is true, then the handler
548	matches. Otherwise, the exception search continues as if the exception type
549	did not match.
550
551	The condition matching allows finer matching by checking
552	more kinds of information than just the exception type.
553	\begin{cfa}
554	try {
555	handle1 = open( f1, ... );
556	handle2 = open( f2, ... );
557	handle3 = open( f3, ... );
558	...
559	} catch( IOFailure * f ; fd( f ) == f1 ) {
560	// Only handle IO failure for f1.
561	} catch( IOFailure * f ; fd( f ) == f3 ) {
562	// Only handle IO failure for f3.
563	}
564	// Handle a failure relating to f2 further down the stack.
565	\end{cfa}
566	In this example the file that experienced the IO error is used to decide
567	which handler should be run, if any at all.
568
569	\begin{comment}
570	% I know I actually haven't got rid of them yet, but I'm going to try
571	% to write it as if I had and see if that makes sense:
572	\section{Reraising}
573	\label{s:Reraising}
574	Within the handler block or functions called from the handler block, it is
575	possible to reraise the most recently caught exception with @throw@ or
576	@throwResume@, respectively.
577	\begin{cfa}
578	try {
579	...
580	} catch( ... ) {
581	... throw;
582	} catchResume( ... ) {
583	... throwResume;
584	}
585	\end{cfa}
586	The only difference between a raise and a reraise is that reraise does not
587	create a new exception; instead it continues using the current exception, \ie
588	no allocation and copy. However the default handler is still set to the one
589	visible at the raise point, and hence, for termination could refer to data that
590	is part of an unwound stack frame. To prevent this problem, a new default
591	handler is generated that does a program-level abort.
592	\end{comment}
593
594	\subsection{Comparison with Reraising}
595	Without conditional catch, the only approach to match in more detail is to reraise
596	the exception after it has been caught, if it could not be handled.
597	\begin{center}
598	\begin{tabular}{l\|l}
599	\begin{cfa}
600	try {
601	do_work_may_throw();
602	} catch(excep_t * ex; can_handle(ex)) {
603
604	handle(ex);
605
606
607
608	}
609	\end{cfa}
610	&
611	\begin{cfa}
612	try {
613	do_work_may_throw();
614	} catch(excep_t * ex) {
615	if (can_handle(ex)) {
616	handle(ex);
617	} else {
618	throw;
619	}
620	}
621	\end{cfa}
622	\end{tabular}
623	\end{center}
624	Notice catch-and-reraise increases complexity by adding additional data and
625	code to the exception process. Nevertheless, catch-and-reraise can simulate
626	conditional catch straightforwardly, when exceptions are disjoint, \ie no
627	inheritance.
628
629	However, catch-and-reraise simulation becomes unusable for exception inheritance.
630	\begin{flushleft}
631	\begin{cfa}[xleftmargin=6pt]
632	exception E1;
633	exception E2(E1); // inheritance
634	\end{cfa}
635	\begin{tabular}{l\|l}
636	\begin{cfa}
637	try {
638	... foo(); ... // raise E1/E2
639	... bar(); ... // raise E1/E2
640	} catch( E2 e; e.rtn == foo ) {
641	...
642	} catch( E1 e; e.rtn == foo ) {
643	...
644	} catch( E1 e; e.rtn == bar ) {
645	...
646	}
647
648	\end{cfa}
649	&
650	\begin{cfa}
651	try {
652	... foo(); ...
653	... bar(); ...
654	} catch( E2 e ) {
655	if ( e.rtn == foo ) { ...
656	} else throw; // reraise
657	} catch( E1 e ) {
658	if (e.rtn == foo) { ...
659	} else if (e.rtn == bar) { ...
660	else throw; // reraise
661	}
662	\end{cfa}
663	\end{tabular}
664	\end{flushleft}
665	The derived exception @E2@ must be ordered first in the catch list, otherwise
666	the base exception @E1@ catches both exceptions. In the catch-and-reraise code
667	(right), the @E2@ handler catches exceptions from both @foo@ and
668	@bar@. However, the reraise misses the following catch clause. To fix this
669	problem, an enclosing @try@ statement is need to catch @E2@ for @bar@ from the
670	reraise, and its handler must duplicate the inner handler code for @bar@. To
671	generalize, this fix for any amount of inheritance and complexity of try
672	statement requires a technique called \emph{try-block
673	splitting}~\cite{Krischer02}, which is not discussed in this thesis. It is
674	sufficient to state that conditional catch is more expressive than
675	catch-and-reraise in terms of complexity.
676
677	\begin{comment}
678	That is, they have the same behaviour in isolation.
679	Two things can expose differences between these cases.
680
681	One is the existence of multiple handlers on a single try statement.
682	A reraise skips all later handlers for a try statement but a conditional
683	catch does not.
684	% Hence, if an earlier handler contains a reraise later handlers are
685	% implicitly skipped, with a conditional catch they are not.
686	Still, they are equivalently powerful,
687	both can be used two mimic the behaviour of the other,
688	as reraise can pack arbitrary code in the handler and conditional catches
689	can put arbitrary code in the predicate.
690	% I was struggling with a long explanation about some simple solutions,
691	% like repeating a condition on later handlers, and the general solution of
692	% merging everything together. I don't think it is useful though unless its
693	% for a proof.
694	% https://en.cppreference.com/w/cpp/language/throw
695
696	The question then becomes ``Which is a better default?"
697	We believe that not skipping possibly useful handlers is a better default.
698	If a handler can handle an exception it should and if the handler can not
699	handle the exception then it is probably safer to have that explicitly
700	described in the handler itself instead of implicitly described by its
701	ordering with other handlers.
702	% Or you could just alter the semantics of the throw statement. The handler
703	% index is in the exception so you could use it to know where to start
704	% searching from in the current try statement.
705	% No place for the `goto else;` metaphor.
706
707	The other issue is all of the discussion above assumes that the only
708	way to tell apart two raises is the exception being raised and the remaining
709	search path.
710	This is not true generally, the current state of the stack can matter in
711	a number of cases, even only for a stack trace after an program abort.
712	But \CFA has a much more significant need of the rest of the stack, the
713	default handlers for both termination and resumption.
714
715	% For resumption it turns out it is possible continue a raise after the
716	% exception has been caught, as if it hadn't been caught in the first place.
717	This becomes a problem combined with the stack unwinding used in termination
718	exception handling.
719	The stack is unwound before the handler is installed, and hence before any
720	reraises can run. So if a reraise happens the previous stack is gone,
721	the place on the stack where the default handler was supposed to run is gone,
722	if the default handler was a local function it may have been unwound too.
723	There is no reasonable way to restore that information, so the reraise has
724	to be considered as a new raise.
725	This is the strongest advantage conditional catches have over reraising,
726	they happen before stack unwinding and avoid this problem.
727
728	% The one possible disadvantage of conditional catch is that it runs user
729	% code during the exception search. While this is a new place that user code
730	% can be run destructors and finally clauses are already run during the stack
731	% unwinding.
732	%
733	% https://www.cplusplus.com/reference/exception/current_exception/
734	% `exception_ptr current_exception() noexcept;`
735	% https://www.python.org/dev/peps/pep-0343/
736	\end{comment}
737
738	\section{Finally Clauses}
739	\label{s:FinallyClauses}
740	Finally clauses are used to preform unconditional clean-up when leaving a
741	scope and are placed at the end of a try statement after any handler clauses:
742	\begin{cfa}
743	try {
744	GUARDED_BLOCK
745	} ... // any number or kind of handler clauses
746	... finally {
747	FINALLY_BLOCK
748	}
749	\end{cfa}
750	The @FINALLY_BLOCK@ is executed when the try statement is removed from the
751	stack, including when the @GUARDED_BLOCK@ finishes, any termination handler
752	finishes, or during an unwind.
753	The only time the block is not executed is if the program is exited before
754	the stack is unwound.
755
756	Execution of the finally block should always finish, meaning control runs off
757	the end of the block. This requirement ensures control always continues as if
758	the finally clause is not present, \ie finally is for cleanup not changing
759	control flow.
760	Because of this requirement, local control flow out of the finally block
761	is forbidden. The compiler precludes any @break@, @continue@, @fallthru@ or
762	@return@ that causes control to leave the finally block. Other ways to leave
763	the finally block, such as a long jump or termination are much harder to check,
764	and at best requiring additional run-time overhead, and so are only
765	discouraged.
766
767	Not all languages with unwinding have finally clauses. Notably \Cpp does
768	without it as destructors, and the RAII design pattern, serve a similar role.
769	Although destructors and finally clauses can be used for the same cases,
770	they have their own strengths, similar to top-level function and lambda
771	functions with closures.
772	Destructors take more work for their creation, but if there is clean-up code
773	that needs to be run every time a type is used, they are much easier
774	to set-up.
775	On the other hand finally clauses capture the local context, so is easy to
776	use when the clean-up is not dependent on the type of a variable or requires
777	information from multiple variables.
778
779	\section{Cancellation}
780	\label{s:Cancellation}
781	Cancellation is a stack-level abort, which can be thought of as as an
782	uncatchable termination. It unwinds the entire current stack, and if
783	possible forwards the cancellation exception to a different stack.
784
785	Cancellation is not an exception operation like termination or resumption.
786	There is no special statement for starting a cancellation; instead the standard
787	library function @cancel_stack@ is called passing an exception. Unlike a
788	raise, this exception is not used in matching only to pass information about
789	the cause of the cancellation.
790	Finaly, since a cancellation only unwinds and forwards, there is no default handler.
791
792	After @cancel_stack@ is called the exception is copied into the EHM's memory
793	and the current stack is unwound.
794	The behaviour after that depends on the kind of stack being cancelled.
795
796	\paragraph{Main Stack}
797	The main stack is the one used by the program main at the start of execution,
798	and is the only stack in a sequential program.
799	After the main stack is unwound there is a program-level abort.
800
801	The reasons for this semantics in a sequential program is that there is no more code to execute.
802	This semantics also applies to concurrent programs, too, even if threads are running.
803	That is, if any threads starts a cancellation, it implies all threads terminate.
804	Keeping the same behaviour in sequential and concurrent programs is simple.
805	Also, even in concurrent programs there may not currently be any other stacks
806	and even if other stacks do exist, main has no way to know where they are.
807
808	\paragraph{Thread Stack}
809	A thread stack is created for a \CFA @thread@ object or object that satisfies
810	the @is_thread@ trait.
811	After a thread stack is unwound, the exception is stored until another
812	thread attempts to join with it. Then the exception @ThreadCancelled@,
813	which stores a reference to the thread and to the exception passed to the
814	cancellation, is reported from the join to the joining thread.
815	There is one difference between an explicit join (with the @join@ function)
816	and an implicit join (from a destructor call). The explicit join takes the
817	default handler (@defaultResumptionHandler@) from its calling context while
818	the implicit join provides its own; which does a program abort if the
819	@ThreadCancelled@ exception cannot be handled.
820
821	The communication and synchronization are done here because threads only have
822	two structural points (not dependent on user-code) where
823	communication/synchronization happens: start and join.
824	Since a thread must be running to perform a cancellation (and cannot be
825	cancelled from another stack), the cancellation must be after start and
826	before the join, so join is used.
827
828	% TODO: Find somewhere to discuss unwind collisions.
829	The difference between the explicit and implicit join is for safety and
830	debugging. It helps prevent unwinding collisions by avoiding throwing from
831	a destructor and prevents cascading the error across multiple threads if
832	the user is not equipped to deal with it.
833	It is always possible to add an explicit join if that is the desired behaviour.
834
835	With explicit join and a default handler that triggers a cancellation, it is
836	possible to cascade an error across any number of threads, cleaning up each
837	in turn, until the error is handled or the main thread is reached.
838
839	\paragraph{Coroutine Stack}
840	A coroutine stack is created for a @coroutine@ object or object that
841	satisfies the @is_coroutine@ trait.
842	After a coroutine stack is unwound, control returns to the @resume@ function
843	that most recently resumed it. @resume@ reports a
844	@CoroutineCancelled@ exception, which contains a references to the cancelled
845	coroutine and the exception used to cancel it.
846	The @resume@ function also takes the \defaultResumptionHandler{} from the
847	caller's context and passes it to the internal report.
848
849	A coroutine only knows of two other coroutines, its starter and its last resumer.
850	The starter has a much more distant connection, while the last resumer just
851	(in terms of coroutine state) called resume on this coroutine, so the message
852	is passed to the latter.
853
854	With a default handler that triggers a cancellation, it is possible to
855	cascade an error across any number of coroutines, cleaning up each in turn,
856	until the error is handled or a thread stack is reached.
857
858	\PAB{Part of this I do not understand. A cancellation cannot be caught. But you
859	talk about handling a cancellation in the last sentence. Which is correct?}

Note: See TracBrowser for help on using the repository browser.

Download in other formats: