Context Navigation

source: doc/theses/andrew_beach_MMath/features.tex @ 6a99803

ADTarm-ehast-experimentalenumforall-pointer-decayjacob/cs343-translationnew-ast-unique-exprpthread-emulationqualifiedEnum

Last change on this file since 6a99803 was 1c1c180, checked in by Andrew Beach <ajbeach@…>, 4 years ago
Fixed line length and trailing whitespace on modified tex files.
Property mode set to `100644`
File size: 19.7 KB

Line
1	\chapter{Exception Features}
2
3	This chapter covers the design and user interface of the \CFA
4	exception-handling mechanism.
5
6	\section{Virtuals}
7	Virtual types and casts are not required for a basic exception-system but are
8	useful for advanced exception features. However, \CFA is not object-oriented so
9	there is no obvious concept of virtuals. Hence, to create advanced exception
10	features for this work, I needed to designed and implemented a virtual-like
11	system for \CFA.
12
13	Object-oriented languages often organized exceptions into a simple hierarchy,
14	\eg Java.
15	\begin{center}
16	\setlength{\unitlength}{4000sp}%
17	\begin{picture}(1605,612)(2011,-1951)
18	\put(2100,-1411){\vector(1, 0){225}}
19	\put(3450,-1411){\vector(1, 0){225}}
20	\put(3550,-1411){\line(0,-1){225}}
21	\put(3550,-1636){\vector(1, 0){150}}
22	\put(3550,-1636){\line(0,-1){225}}
23	\put(3550,-1861){\vector(1, 0){150}}
24	\put(2025,-1490){\makebox(0,0)[rb]{\LstBasicStyle{exception}}}
25	\put(2400,-1460){\makebox(0,0)[lb]{\LstBasicStyle{arithmetic}}}
26	\put(3750,-1460){\makebox(0,0)[lb]{\LstBasicStyle{underflow}}}
27	\put(3750,-1690){\makebox(0,0)[lb]{\LstBasicStyle{overflow}}}
28	\put(3750,-1920){\makebox(0,0)[lb]{\LstBasicStyle{zerodivide}}}
29	\end{picture}%
30	\end{center}
31	The hierarchy provides the ability to handle an exception at different degrees
32	of specificity (left to right). Hence, it is possible to catch a more general
33	exception-type in higher-level code where the implementation details are
34	unknown, which reduces tight coupling to the lower-level implementation.
35	Otherwise, low-level code changes require higher-level code changes, \eg,
36	changing from raising @underflow@ to @overflow@ at the low level means changing
37	the matching catch at the high level versus catching the general @arithmetic@
38	exception. In detail, each virtual type may have a parent and can have any
39	number of children. A type's descendants are its children and its children's
40	descendants. A type may not be its own descendant.
41
42	The exception hierarchy allows a handler (@catch@ clause) to match multiple
43	exceptions, \eg a base-type handler catches both base and derived
44	exception-types.
45	\begin{cfa}
46	try {
47	...
48	} catch(arithmetic &) {
49	... // handle arithmetic, underflow, overflow, zerodivide
50	}
51	\end{cfa}
52	Most exception mechanisms perform a linear search of the handlers and select
53	the first matching handler, so the order of handers is now important because
54	matching is many to one.
55
56	Each virtual type needs an associated virtual table. A virtual table is a
57	structure with fields for all the virtual members of a type. A virtual type has
58	all the virtual members of its parent and can add more. It may also update the
59	values of the virtual members and often does.
60
61	While much of the virtual infrastructure is created, it is currently only used
62	internally for exception handling. The only user-level feature is the virtual
63	cast, which is the same as the \CC \lstinline[language=C++]\|dynamic_cast\|.
64	\label{p:VirtualCast}
65	\begin{cfa}
66	(virtual TYPE)EXPRESSION
67	\end{cfa}
68	Note, the syntax and semantics matches a C-cast, rather than the unusual \CC
69	syntax for special casts. Both the type of @EXPRESSION@ and @TYPE@ must be a
70	pointer to a virtual type. The cast dynamically checks if the @EXPRESSION@ type
71	is the same or a subtype of @TYPE@, and if true, returns a pointer to the
72	@EXPRESSION@ object, otherwise it returns @0p@ (null pointer).
73
74	\section{Exception}
75	% Leaving until later, hopefully it can talk about actual syntax instead
76	% of my many strange macros. Syntax aside I will also have to talk about the
77	% features all exceptions support.
78
79	Exceptions are defined by the trait system; there are a series of traits, and
80	if a type satisfies them, then it can be used as an exception. The following
81	is the base trait all exceptions need to match.
82	\begin{cfa}
83	trait is_exception(exceptT &, virtualT &) {
84	virtualT const & @get_exception_vtable@(exceptT *);
85	};
86	\end{cfa}
87	The function takes any pointer, including the null pointer, and returns a
88	reference to the virtual-table object. Defining this function also establishes
89	the virtual type and a virtual-table pair to the \CFA type-resolver and
90	promises @exceptT@ is a virtual type and a child of the base exception-type.
91
92	\PAB{I do not understand this paragraph.}
93	One odd thing about @get_exception_vtable@ is that it should always be a
94	constant function, returning the same value regardless of its argument. A
95	pointer or reference to the virtual table instance could be used instead,
96	however using a function has some ease of implementation advantages and allows
97	for easier disambiguation because the virtual type name (or the address of an
98	instance that is in scope) can be used instead of the mangled virtual table
99	name. Also note the use of the word ``promise'' in the trait
100	description. Currently, \CFA cannot check to see if either @exceptT@ or
101	@virtualT@ match the layout requirements. This is considered part of
102	@get_exception_vtable@'s correct implementation.
103
104	\section{Raise}
105	\CFA provides two kinds of exception raise: termination
106	\see{\VRef{s:Termination}} and resumption \see{\VRef{s:Resumption}}, which are
107	specified with the following traits.
108	\begin{cfa}
109	trait is_termination_exception(
110	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
111	void @defaultTerminationHandler@(exceptT &);
112	};
113	\end{cfa}
114	The function is required to allow a termination raise, but is only called if a
115	termination raise does not find an appropriate handler.
116
117	Allowing a resumption raise is similar.
118	\begin{cfa}
119	trait is_resumption_exception(
120	exceptT &, virtualT & \| is_exception(exceptT, virtualT)) {
121	void @defaultResumptionHandler@(exceptT &);
122	};
123	\end{cfa}
124	The function is required to allow a resumption raise, but is only called if a
125	resumption raise does not find an appropriate handler.
126
127	Finally there are three convenience macros for referring to the these traits:
128	@IS_EXCEPTION@, @IS_TERMINATION_EXCEPTION@ and @IS_RESUMPTION_EXCEPTION@. Each
129	takes the virtual type's name, and for polymorphic types only, the
130	parenthesized list of polymorphic arguments. These macros do the name mangling
131	to get the virtual-table name and provide the arguments to both sides
132	\PAB{What's a ``side''?}
133
134	\subsection{Termination}
135	\label{s:Termination}
136
137	Termination raise, called ``throw'', is familiar and used in most programming
138	languages with exception handling. The semantics of termination is: search the
139	stack for a matching handler, unwind the stack frames to the matching handler,
140	execute the handler, and continue execution after the handler. Termination is
141	used when execution \emph{cannot} return to the throw. To continue execution,
142	the program must \emph{recover} in the handler from the failed (unwound)
143	execution at the raise to safely proceed after the handler.
144
145	A termination raise is started with the @throw@ statement:
146	\begin{cfa}
147	throw EXPRESSION;
148	\end{cfa}
149	The expression must return a termination-exception reference, where the
150	termination exception has a type with a @void defaultTerminationHandler(T &)@
151	(default handler) defined. The handler is found at the call site using \CFA's
152	trait system and passed into the exception system along with the exception
153	itself.
154
155	At runtime, a representation of the exception type and an instance of the
156	exception type is copied into managed memory (heap) to ensure it remains in
157	scope during unwinding. It is the user's responsibility to ensure the original
158	exception object at the throw is freed when it goes out of scope. Being
159	allocated on the stack is sufficient for this.
160
161	Then the exception system searches the stack starting from the throw and
162	proceeding towards the base of the stack, from callee to caller. At each stack
163	frame, a check is made for termination handlers defined by the @catch@ clauses
164	of a @try@ statement.
165	\begin{cfa}
166	try {
167	GUARDED_BLOCK
168	} @catch (EXCEPTION_TYPE$\(_1\)$ * NAME)@ { // termination handler 1
169	HANDLER_BLOCK$\(_1\)$
170	} @catch (EXCEPTION_TYPE$\(_2\)$ * NAME)@ { // termination handler 2
171	HANDLER_BLOCK$\(_2\)$
172	}
173	\end{cfa}
174	The statements in the @GUARDED_BLOCK@ are executed. If those statements, or any
175	functions invoked from those statements, throws an exception, and the exception
176	is not handled by a try statement further up the stack, the termination
177	handlers are searched for a matching exception type from top to bottom.
178
179	Exception matching checks the representation of the thrown exception-type is
180	the same or a descendant type of the exception types in the handler clauses. If
181	there is a match, a pointer to the exception object created at the throw is
182	bound to @NAME@ and the statements in the associated @HANDLER_BLOCK@ are
183	executed. If control reaches the end of the handler, the exception is freed,
184	and control continues after the try statement.
185
186	The default handler visible at the throw statement is used if no matching
187	termination handler is found after the entire stack is searched. At that point,
188	the default handler is called with a reference to the exception object
189	generated at the throw. If the default handler returns, the system default
190	action is executed, which often terminates the program. This feature allows
191	each exception type to define its own action, such as printing an informative
192	error message, when an exception is not handled in the program.
193
194	\subsection{Resumption}
195	\label{s:Resumption}
196
197	Resumption raise, called ``resume'', is as old as termination
198	raise~\cite{Goodenough75} but is less popular. In many ways, resumption is
199	simpler and easier to understand, as it is simply a dynamic call (as in
200	Lisp). The semantics of resumption is: search the stack for a matching handler,
201	execute the handler, and continue execution after the resume. Notice, the stack
202	cannot be unwound because execution returns to the raise point. Resumption is
203	used used when execution \emph{can} return to the resume. To continue
204	execution, the program must \emph{correct} in the handler for the failed
205	execution at the raise so execution can safely continue after the resume.
206
207	A resumption raise is started with the @throwResume@ statement:
208	\begin{cfa}
209	throwResume EXPRESSION;
210	\end{cfa}
211	The semantics of the @throwResume@ statement are like the @throw@, but the
212	expression has a type with a @void defaultResumptionHandler(T &)@ (default
213	handler) defined, where the handler is found at the call site by the type
214	system. At runtime, a representation of the exception type and an instance of
215	the exception type is \emph{not} copied because the stack is maintained during
216	the handler search.
217
218	Then the exception system searches the stack starting from the resume and
219	proceeding towards the base of the stack, from callee to caller. At each stack
220	frame, a check is made for resumption handlers defined by the @catchResume@
221	clauses of a @try@ statement.
222	\begin{cfa}
223	try {
224	GUARDED_BLOCK
225	} @catchResume (EXCEPTION_TYPE$\(_1\)$ * NAME)@ { // resumption handler 1
226	HANDLER_BLOCK$\(_1\)$
227	} @catchResume (EXCEPTION_TYPE$\(_2\)$ * NAME)@ { // resumption handler 2
228	HANDLER_BLOCK$\(_2\)$
229	}
230	\end{cfa}
231	The statements in the @GUARDED_BLOCK@ are executed. If those statements, or any
232	functions invoked from those statements, resumes an exception, and the
233	exception is not handled by a try statement further up the stack, the
234	resumption handlers are searched for a matching exception type from top to
235	bottom. (Note, termination and resumption handlers may be intermixed in a @try@
236	statement but the kind of raise (throw/resume) only matches with the
237	corresponding kind of handler clause.)
238
239	The exception search and matching for resumption is the same as for
240	termination, including exception inheritance. The difference is when control
241	reaches the end of the handler: the resumption handler returns after the resume
242	rather than after the try statement. The resume point assumes the handler has
243	corrected the problem so execution can safely continue.
244
245	Like termination, if no resumption handler is found, the default handler
246	visible at the resume statement is called, and the system default action is
247	executed.
248
249	For resumption, the exception system uses stack marking to partition the
250	resumption search. If another resumption exception is raised in a resumption
251	handler, the second exception search does not start at the point of the
252	original raise. (Remember the stack is not unwound and the current handler is
253	at the top of the stack.) The search for the second resumption starts at the
254	current point on the stack because new try statements may have been pushed by
255	the handler or functions called from the handler. If there is no match back to
256	the point of the current handler, the search skips\label{p:searchskip} the stack frames already
257	searched by the first resume and continues after the try statement. The default
258	handler always continues from default handler associated with the point where
259	the exception is created.
260
261	% This might need a diagram. But it is an important part of the justification
262	% of the design of the traversal order.
263	\begin{verbatim}
264	throwResume2 ----------.
265	\| \|
266	generated from handler \|
267	\| \|
268	handler \|
269	\| \|
270	throwResume1 -----. :
271	\| \| :
272	try \| : search skip
273	\| \| :
274	catchResume <----' :
275	\| \|
276	\end{verbatim}
277
278	This resumption search-pattern reflect the one for termination, which matches
279	with programmer expectations. However, it avoids the \emph{recursive
280	resumption} problem. If parts of the stack are searched multiple times, loops
281	can easily form resulting in infinite recursion.
282
283	Consider the trivial case:
284	\begin{cfa}
285	try {
286	throwResume$\(_1\)$ (E &){};
287	} catch( E * ) {
288	throwResume;
289	}
290	\end{cfa}
291	Based on termination semantics, programmer expectation is for the re-resume to
292	continue searching the stack frames after the try statement. However, the
293	current try statement is still on the stack below the handler issuing the
294	reresume \see{\VRef{s:Reraise}}. Hence, the try statement catches the re-raise
295	again and does another re-raise \emph{ad infinitum}, which is confusing and
296	difficult to debug. The \CFA resumption search-pattern skips the try statement
297	so the reresume search continues after the try, mathcing programmer
298	expectation.
299
300	\section{Conditional Catch}
301	Both termination and resumption handler-clauses may perform conditional matching:
302	\begin{cfa}
303	catch (EXCEPTION_TYPE * NAME ; @CONDITION@)
304	\end{cfa}
305	First, the same semantics is used to match the exception type. Second, if the
306	exception matches, @CONDITION@ is executed. The condition expression may
307	reference all names in scope at the beginning of the try block and @NAME@
308	introduced in the handler clause. If the condition is true, then the handler
309	matches. Otherwise, the exception search continues at the next appropriate kind
310	of handler clause in the try block.
311	\begin{cfa}
312	try {
313	f1 = open( ... );
314	f2 = open( ... );
315	...
316	} catch( IOFailure * f ; fd( f ) == f1 ) {
317	// only handle IO failure for f1
318	}
319	\end{cfa}
320	Note, catching @IOFailure@, checking for @f1@ in the handler, and reraising the
321	exception if not @f1@ is different because the reraise does not examine any of
322	remaining handlers in the current try statement.
323
324	\section{Reraise}
325	\label{s:Reraise}
326	Within the handler block or functions called from the handler block, it is
327	possible to reraise the most recently caught exception with @throw@ or
328	@throwResume@, respective.
329	\begin{cfa}
330	catch( ... ) {
331	... throw; // rethrow
332	} catchResume( ... ) {
333	... throwResume; // reresume
334	}
335	\end{cfa}
336	The only difference between a raise and a reraise is that reraise does not
337	create a new exception; instead it continues using the current exception, \ie
338	no allocation and copy. However the default handler is still set to the one
339	visible at the raise point, and hence, for termination could refer to data that
340	is part of an unwound stack frame. To prevent this problem, a new default
341	handler is generated that does a program-level abort.
342
343
344	\section{Finally Clauses}
345	A @finally@ clause may be placed at the end of a @try@ statement.
346	\begin{cfa}
347	try {
348	GUARDED_BLOCK
349	} ... // any number or kind of handler clauses
350	} finally {
351	FINALLY_BLOCK
352	}
353	\end{cfa}
354	The @FINALLY_BLOCK@ is executed when the try statement is unwound from the
355	stack, \ie when the @GUARDED_BLOCK@ or any handler clause finishes. Hence, the
356	finally block is always executed.
357
358	Execution of the finally block should always finish, meaning control runs off
359	the end of the block. This requirement ensures always continues as if the
360	finally clause is not present, \ie finally is for cleanup not changing control
361	flow. Because of this requirement, local control flow out of the finally block
362	is forbidden. The compiler precludes any @break@, @continue@, @fallthru@ or
363	@return@ that causes control to leave the finally block. Other ways to leave
364	the finally block, such as a long jump or termination are much harder to check,
365	and at best requiring additional run-time overhead, and so are discouraged.
366
367	\section{Cancellation}
368	Cancellation is a stack-level abort, which can be thought of as as an
369	uncatchable termination. It unwinds the entirety of the current stack, and if
370	possible forwards the cancellation exception to a different stack.
371
372	There is no special statement for starting a cancellation; instead the standard
373	library function @cancel_stack@ is called passing an exception. Unlike a
374	raise, this exception is not used in matching only to pass information about
375	the cause of the cancellation.
376
377	Handling of a cancellation depends on which stack is being cancelled.
378	\begin{description}
379	\item[Main Stack:]
380	The main stack is the one used by the program main at the start of execution,
381	and is the only stack in a sequential program. Hence, when cancellation is
382	forwarded to the main stack, there is no other forwarding stack, so after the
383	stack is unwound, there is a program-level abort.
384
385	\item[Thread Stack:]
386	A thread stack is created for a @thread@ object or object that satisfies the
387	@is_thread@ trait. A thread only has two points of communication that must
388	happen: start and join. As the thread must be running to perform a
389	cancellation, it must occur after start and before join, so join is a
390	cancellation point. After the stack is unwound, the thread halts and waits for
391	another thread to join with it. The joining thread, checks for a cancellation,
392	and if present, resumes exception @ThreadCancelled@.
393
394	There is a subtle difference between the explicit join (@join@ function) and
395	implicit join (from a destructor call). The explicit join takes the default
396	handler (@defaultResumptionHandler@) from its calling context, which is used if
397	the exception is not caught. The implicit join does a program abort instead.
398
399	This semantics is for safety. One difficult problem for any exception system is
400	defining semantics when an exception is raised during an exception search:
401	which exception has priority, the original or new exception? No matter which
402	exception is selected, it is possible for the selected one to disrupt or
403	destroy the context required for the other. \PAB{I do not understand the
404	following sentences.} This loss of information can happen with join but as the
405	thread destructor is always run when the stack is being unwound and one
406	termination/cancellation is already active. Also since they are implicit they
407	are easier to forget about.
408
409	\item[Coroutine Stack:] A coroutine stack is created for a @coroutine@ object
410	or object that satisfies the @is_coroutine@ trait. A coroutine only knows of
411	two other coroutines, its starter and its last resumer. The last resumer has
412	the tightest coupling to the coroutine it activated. Hence, cancellation of
413	the active coroutine is forwarded to the last resumer after the stack is
414	unwound, as the last resumer has the most precise knowledge about the current
415	execution. When the resumer restarts, it resumes exception
416	@CoroutineCancelled@, which is polymorphic over the coroutine type and has a
417	pointer to the cancelled coroutine.
418
419	The resume function also has an assertion that the @defaultResumptionHandler@
420	for the exception. So it will use the default handler like a regular throw.
421	\end{description}

Note: See TracBrowser for help on using the repository browser.

Download in other formats: