Context Navigation

source: doc/theses/andrew_beach_MMath/implement.tex @ daf4c89

Last change on this file since daf4c89 was 814f87d, checked in by Andrew Beach <ajbeach@…>, 3 years ago
Andrew MMath: Updated thesis with Yizhou Zhang's feedback.
Property mode set to `100644`
File size: 41.8 KB

Line
1	\chapter{Implementation}
2	\label{c:implement}
3
4	% Local Helpers:
5	\newcommand\transformline[1][becomes...]{
6	\hrulefill#1\hrulefill
7	\medskip
8	}
9
10	The implementation work for this thesis covers the two components: virtual
11	system and exceptions. Each component is discussed in detail.
12
13	\section{Virtual System}
14	\label{s:VirtualSystem}
15	% Virtual table rules. Virtual tables, the pointer to them and the cast.
16	While the \CFA virtual system currently has only two public features, virtual
17	cast and virtual tables,
18	substantial structure is required to support them,
19	and provide features for exception handling and the standard library.
20
21	\subsection{Virtual Type}
22	A virtual type~(see \autoref{s:virtuals}) has a pointer to a virtual table,
23	called the \emph{virtual-table pointer},
24	which binds each instance of a virtual type to a virtual table.
25	Internally, the field is called \snake{virtual_table}
26	and is fixed after construction.
27	This pointer is also the table's id and how the system accesses the
28	virtual table and the virtual members there.
29	It is always the first field in the
30	structure so that its location is always known.
31
32	% We have no special rules for these constructors.
33	Virtual table pointers are passed to the constructors of virtual types
34	as part of field-by-field construction.
35
36	\subsection{Type ID}
37	Every virtual type has a unique ID.
38	These are used in type equality, to check if the representation of two values
39	are the same, and to access the type's type information.
40	This uniqueness means across a program composed of multiple translation
41	units (TU), not uniqueness across all programs or even across multiple
42	processes on the same machine.
43
44	Our approach for program uniqueness is using a static declaration for each
45	type ID, where the run-time storage address of that variable is guaranteed to
46	be unique during program execution.
47	The type ID storage can also be used for other purposes,
48	and is used for type information.
49
50	The problem is that a type ID may appear in multiple TUs that compose a
51	program (see \autoref{ss:VirtualTable}), so the initial solution would seem
52	to be make it external in each translation unit. However, the type ID must
53	have a declaration in (exactly) one of the TUs to create the storage.
54	No other declaration related to the virtual type has this property, so doing
55	this through standard C declarations would require the user to do it manually.
56
57	Instead, the linker is used to handle this problem.
58	% I did not base anything off of C++17; they are solving the same problem.
59	A new feature has been added to \CFA for this purpose, the special attribute
60	\snake{cfa_linkonce}, which uses the special section @.gnu.linkonce@.
61	When used as a prefix (\eg @.gnu.linkonce.example@), the linker does
62	not combine these sections, but instead discards all but one with the same
63	full name.
64
65	So, each type ID must be given a unique section name with the \snake{linkonce}
66	prefix. Luckily, \CFA already has a way to get unique names, the name mangler.
67	For example, this could be written directly in \CFA:
68	\begin{cfa}
69	__attribute__((cfa_linkonce)) void f() {}
70	\end{cfa}
71	This is translated to:
72	\begin{cfa}
73	__attribute__((section(".gnu.linkonce._X1fFv___1"))) void _X1fFv___1() {}
74	\end{cfa}
75	This is done internally to access the name mangler.
76	This attribute is useful for other purposes, any other place a unique
77	instance required, and should eventually be made part of a public and
78	stable feature in \CFA.
79
80	\subsection{Type Information}
81
82	There is data stored at the type ID's declaration, the type information.
83	The type information currently is only the parent's type ID or, if the
84	type has no parent, the null pointer.
85	The ancestors of a virtual type are found by traversing type IDs through
86	the type information.
87	An example using helper macros looks like:
88	\begin{cfa}
89	struct INFO_TYPE(TYPE) {
90	INFO_TYPE(PARENT) const * parent;
91	};
92
93	__attribute__((cfa_linkonce))
94	INFO_TYPE(TYPE) const INFO_NAME(TYPE) = {
95	&INFO_NAME(PARENT),
96	};
97	\end{cfa}
98
99	Type information is constructed as follows:
100	\begin{enumerate}[nosep]
101	\item
102	Use the type's name to generate a name for the type information structure,
103	which is saved so it can be reused.
104	\item
105	Generate a new structure definition to store the type
106	information. The layout is the same in each case, just the parent's type ID,
107	but the types used change from instance to instance.
108	The generated name is used for both this structure and, if relevant, the
109	parent pointer.
110	If the virtual type is polymorphic then the type information structure is
111	polymorphic as well, with the same polymorphic arguments.
112	\item
113	A separate name for instances is generated from the type's name.
114	\item
115	The definition is generated and initialized.
116	The parent ID is set to the null pointer or to the address of the parent's
117	type information instance. Name resolution handles the rest.
118	\item
119	\CFA's name mangler does its regular name mangling encoding the type of
120	the declaration into the instance name.
121	This process gives a completely unique name
122	including different instances of the same polymorphic type.
123	\end{enumerate}
124
125	Writing that code manually, with helper macros for the early name mangling,
126	would look like this:
127	\begin{cfa}
128	struct INFO_TYPE(TYPE) {
129	INFO_TYPE(PARENT) const * parent;
130	};
131
132	__attribute__((cfa_linkonce))
133	INFO_TYPE(TYPE) const INFO_NAME(TYPE) = {
134	&INFO_NAME(PARENT),
135	};
136	\end{cfa}
137
138	\begin{comment}
139	\subsubsection{\lstinline{cfa\_linkonce} Attribute}
140	% I just realized: This is an extension of the inline keyword.
141	% An extension of C's at least, it is very similar to C++'s.
142	Another feature added to \CFA is a new attribute: \texttt{cfa\_linkonce}.
143	This attribute is attached to an object or function definition
144	(any global declaration with a name and a type)
145	allowing it to be defined multiple times.
146	All matching definitions mush have the link-once attribute
147	and their implementations should be identical as well.
148
149	A single definition with the attribute can be included in a header
150	file as if it was a forward declaration, except no definition is required.
151
152	This technique is used for type ID instances. A link-once definition is
153	generated each time the structure is seen. This will result in multiple
154	copies but the link-once attribute ensures all but one are removed for a
155	unique instance.
156
157	Internally, @cfa_linkonce@ is replaced with
158	@section(".gnu.linkonce.NAME")@ where \texttt{NAME} is replaced by the
159	mangled name of the object.
160	Any other @section@ attributes are removed from the declaration.
161	The prefix \texttt{.gnu.linkonce} in section names is recognized by the
162	linker. If two of these sections appear with the same name, including
163	everything that comes after the special prefix, then only one is used
164	and the other is discarded.
165	\end{comment}
166
167	\subsection{Virtual Table}
168	\label{ss:VirtualTable}
169	Each virtual type has a virtual table type that stores its type ID and
170	virtual members.
171	An instance of a virtual type is bound to a virtual table instance,
172	which have the values of the virtual members.
173	Both the layout of the fields (in the virtual table type)
174	and their value (in the virtual table instance) are decided by the rules given
175	below.
176
177	The layout always comes in three parts (see \autoref{f:VirtualTableLayout}).
178	The first section is just the type ID at the head of the table. It is always
179	there to ensure that it can be found even when the accessing code does not
180	know which virtual type it has.
181	The second section is all the virtual members of the parent, in the same
182	order as they appear in the parent's virtual table. Note that the type may
183	change slightly as references to the ``this" change. This is limited to
184	inside pointers/references and via function pointers so that the size (and
185	hence the offsets) are the same.
186	The third section is similar to the second except that it is the new virtual
187	members introduced at this level in the hierarchy.
188
189	\begin{figure}
190	\begin{center}
191	\input{vtable-layout}
192	\end{center}
193	\caption{Virtual Table Layout}
194	\label{f:VirtualTableLayout}
195	\end{figure}
196
197	The first and second sections together mean that every virtual table has a
198	prefix that has the same layout and types as its parent virtual table.
199	This, combined with the fixed offset to the virtual table pointer, means that
200	for any virtual type, it is always safe to access its virtual table and,
201	from there, it is safe to check the type ID to identify the exact type of the
202	underlying object, access any of the virtual members and pass the object to
203	any of the method-like virtual members.
204
205	When a virtual table is declared, the user decides where to declare it and its
206	name. The initialization of the virtual table is entirely automatic based on
207	the context of the declaration.
208
209	The type ID is always fixed, with each virtual table type having
210	exactly one possible type ID.
211	The virtual members are usually filled in by type resolution.
212	The best match for a given name and type at the declaration site is used.
213	There are two exceptions to that rule: the @size@ field, the type's size,
214	is set using a @sizeof@ expression, and the @align@ field, the
215	type's alignment, is set using an @alignof@ expression.
216
217	Most of these tools are already inside the compiler. Using simple
218	code transformations early on in compilation allows most of that work to be
219	handed off to the existing tools. \autoref{f:VirtualTableTransformation}
220	shows an example transformation; this example shows an exception virtual table.
221	It also shows the transformation on the full declaration.
222	For a forward declaration, the @extern@ keyword is preserved and the
223	initializer is not added.
224
225	\begin{figure}[htb]
226	\begin{cfa}
227	vtable(example_type) example_name;
228	\end{cfa}
229	\transformline
230	% Check mangling.
231	\begin{cfa}
232	const struct example_type_vtable example_name = {
233	.__cfavir_typeid : &__cfatid_example_type,
234	.size : sizeof(example_type),
235	.copy : copy,
236	.^?{} : ^?{},
237	.msg : msg,
238	};
239	\end{cfa}
240	\caption{Virtual Table Transformation}
241	\label{f:VirtualTableTransformation}
242	\end{figure}
243
244	\subsection{Concurrency Integration}
245	Coroutines and threads need instances of @CoroutineCancelled@ and
246	@ThreadCancelled@ respectively to use all of their functionality. When a new
247	data type is declared with @coroutine@ or @thread@, a forward declaration for
248	the instance is created as well. The definition of the virtual table is created
249	at the definition of the main function.
250
251	These transformations are shown through code re-writing in
252	\autoref{f:CoroutineTypeTransformation} and
253	\autoref{f:CoroutineMainTransformation}.
254	Threads use the same pattern, with some names and types changed.
255	In both cases, the original declaration is not modified,
256	only new ones are added.
257
258	\begin{figure}[htb]
259	\begin{cfa}
260	coroutine Example {
261	// fields
262	};
263	\end{cfa}
264
265	\transformline[appends...]
266
267	\begin{cfa}
268	__attribute__((cfa_linkonce))
269	struct __cfatid_struct_CoroutineCancelled(Example)
270	__cfatid_CoroutineCancelled = {
271	&EXCEPTION_TYPE_ID,
272	};
273	extern CoroutineCancelled_vtable _default_vtable_object_declaration;
274	extern CoroutineCancelled_vtable & _default_vtable;
275	\end{cfa}
276	\caption{Coroutine Type Transformation}
277	\label{f:CoroutineTypeTransformation}
278	\end{figure}
279
280	\begin{figure}[htb]
281	\begin{cfa}
282	void main(Example & this) {
283	// body
284	}
285	\end{cfa}
286
287	\transformline[appends...]
288
289	\begin{cfa}
290	CoroutineCancelled_vtable _default_vtable_object_declaration = {
291	__cfatid_CoroutineCancelled,
292	// Virtual member initialization.
293	};
294
295	CoroutineCancelled_vtable & _default_vtable =
296	&_default_vtable_object_declaration;
297	\end{cfa}
298	\caption{Coroutine Main Transformation}
299	\label{f:CoroutineMainTransformation}
300	\end{figure}
301
302	\subsection{Virtual Cast}
303	Virtual casts are implemented as a function call that does the subtype check
304	and a C coercion-cast to do the type conversion.
305	% The C-cast is just to make sure the generated code is correct so the rest of
306	% the section is about that function.
307	The function is implemented in the standard library and has the following
308	signature:
309	\begin{cfa}
310	void * __cfa__virtual_cast(
311	struct __cfavir_type_id * parent,
312	struct __cfavir_type_id * const * child );
313	\end{cfa}
314	The type ID for the target type of the virtual cast is passed in as
315	@parent@ and
316	the cast target is passed in as @child@.
317	The generated C code wraps both arguments and the result with type casts.
318	There is also an internal check inside the compiler to make sure that the
319	target type is a virtual type.
320	% It also checks for conflicting definitions.
321
322	The virtual cast either returns the original pointer or the null pointer
323	as the new type.
324	The function does the parent check and returns the appropriate value.
325	The parent check is a simple linear search of the child's ancestors using the
326	type information.
327
328	\section{Exceptions}
329	% The implementation of exception types.
330
331	Creating exceptions can be roughly divided into two parts:
332	the exceptions themselves and the virtual system interactions.
333
334	Creating an exception type is just a matter of prepending the field
335	with the virtual table pointer to the list of the fields
336	(see \autoref{f:ExceptionTypeTransformation}).
337
338	\begin{figure}[htb]
339	\begin{cfa}
340	exception new_exception {
341	// EXISTING FIELDS
342	};
343	\end{cfa}
344	\transformline
345	\begin{cfa}
346	struct new_exception {
347	struct new_exception_vtable const * virtual_table;
348	// EXISTING FIELDS
349	};
350	\end{cfa}
351	\caption{Exception Type Transformation}
352	\label{f:ExceptionTypeTransformation}
353	\end{figure}
354
355	The integration between exceptions and the virtual system is a bit more
356	complex simply because of the nature of the virtual system prototype.
357	The primary issue is that the virtual system has no way to detect when it
358	should generate any of its internal types and data. This is handled by
359	the exception code, which tells the virtual system when to generate
360	its components.
361
362	All types associated with a virtual type,
363	the types of the virtual table and the type ID,
364	are generated when the virtual type (the exception) is first found.
365	The type ID (the instance) is generated with the exception, if it is
366	a monomorphic type.
367	However, if the exception is polymorphic, then a different type ID has to
368	be generated for every instance. In this case, generation is delayed
369	until a virtual table is created.
370	% There are actually some problems with this, which is why it is not used
371	% for monomorphic types.
372	When a virtual table is created and initialized, two functions are created
373	to fill in the list of virtual members.
374	The first is the @copy@ function that adapts the exception's copy constructor
375	to work with pointers, avoiding some issues with the current copy constructor
376	interface.
377	Second is the @msg@ function that returns a C-string with the type's name,
378	including any polymorphic parameters.
379
380	\section{Unwinding}
381	% Adapt the unwind chapter, just describe the sections of libunwind used.
382	% Mention that termination and cancellation use it. Maybe go into why
383	% resumption doesn't as well.
384
385	% Many modern languages work with an internal stack that function push and pop
386	% their local data to. Stack unwinding removes large sections of the stack,
387	% often across functions.
388
389	Stack unwinding is the process of removing stack frames (activations) from the
390	stack. On function entry and return, unwinding is handled directly by the
391	call/return code embedded in the function.
392
393	% Discussing normal stack unwinding:
394	Usually, the stack-frame size is known statically based on parameter and
395	local variable declarations. Even for a dynamic stack-size, the information
396	to determine how much of the stack has to be removed is still contained
397	within the function.
398	Allocating/deallocating stack space is usually an $O(1)$ operation achieved by
399	bumping the hardware stack-pointer up or down as needed.
400	Constructing/destructing values within a stack frame has
401	a similar complexity but larger constants.
402
403	% Discussing multiple frame stack unwinding:
404	Unwinding across multiple stack frames is more complex, because that
405	information is no longer contained within the current function.
406	With separate compilation,
407	a function does not know its callers nor their frame layout.
408	Even using the return address, that information is encoded in terms of
409	actions in code, intermixed with the actions required to finish the function.
410	Without changing the main code path it is impossible to select one of those
411	two groups of actions at the return site.
412
413	The traditional unwinding mechanism for C is implemented by saving a snapshot
414	of a function's state with @setjmp@ and restoring that snapshot with
415	@longjmp@. This approach bypasses the need to know stack details by simply
416	resetting to a snapshot of an arbitrary but existing function frame on the
417	stack. It is up to the programmer to ensure the snapshot is valid when it is
418	reset and that all required cleanup from the unwound stacks is performed.
419	Because it does not automate or check any of this cleanup,
420	it can be easy to make mistakes and always must be handled manually.
421
422	With respect to the extra work in the surrounding code,
423	many languages define cleanup actions that must be taken when certain
424	sections of the stack are removed, such as when the storage for a variable
425	is removed from the stack, possibly requiring a destructor call,
426	or when a try statement with a finally clause is
427	(conceptually) popped from the stack.
428	None of these cases should be handled by the user -- that would contradict the
429	intention of these features -- so they need to be handled automatically.
430
431	To safely remove sections of the stack, the language must be able to find and
432	run these cleanup actions even when removing multiple functions unknown at
433	the beginning of the unwinding.
434
435	One of the most popular tools for stack management is libunwind, a low-level
436	library that provides tools for stack walking, handler execution, and
437	unwinding. What follows is an overview of all the relevant features of
438	libunwind needed for this work.
439	Following that is the description of the \CFA code that uses libunwind
440	to implement termination.
441
442	\subsection{libunwind Usage}
443	Libunwind, accessed through @unwind.h@ on most platforms, is a C library that
444	provides \Cpp-style stack-unwinding. Its operation is divided into two phases:
445	search and cleanup. The dynamic target search -- phase 1 -- is used to scan the
446	stack and decide where unwinding should stop (but no unwinding occurs). The
447	cleanup -- phase 2 -- does the unwinding and also runs any cleanup code.
448
449	To use libunwind, each function must have a personality function and a Language
450	Specific Data Area (LSDA). The LSDA has the unique information for each
451	function to tell the personality function where a function is executing, its
452	current stack frame, and what handlers should be checked. Theoretically, the
453	LSDA can contain any information but conventionally it is a table with entries
454	representing regions of a function and what has to be done there during
455	unwinding. These regions are bracketed by instruction addresses. If the
456	instruction pointer is within a region's start/end, then execution is currently
457	executing in that region. Regions are used to mark out the scopes of objects
458	with destructors and try blocks.
459
460	% Libunwind actually does very little, it simply moves down the stack from
461	% function to function. Most of the actions are implemented by the personality
462	% function which libunwind calls on every function. Since this is shared across
463	% many functions or even every function in a language it will need a bit more
464	% information.
465
466	The GCC compilation flag @-fexceptions@ causes the generation of an LSDA and
467	attaches a personality function to each function.
468	In plain C (which \CFA currently compiles down to) this
469	flag only handles the cleanup attribute:
470	%\label{code:cleanup}
471	\begin{cfa}
472	void clean_up( int * var ) { ... }
473	int avar __attribute__(( cleanup(clean_up) ));
474	\end{cfa}
475	The attribute is used on a variable and specifies a function,
476	in this case @clean_up@, run when the variable goes out of scope.
477	This feature is enough to mimic destructors,
478	but not try statements that affect
479	the unwinding.
480
481	To get full unwinding support, all of these features must be handled directly
482	in assembly and assembler directives; particularly the cfi directives
483	\snake{.cfi_lsda} and \snake{.cfi_personality}.
484
485	\subsection{Personality Functions}
486	Personality functions have a complex interface specified by libunwind. This
487	section covers some of the important parts of the interface.
488
489	A personality function can perform different actions depending on how it is
490	called.
491	\begin{lstlisting}
492	typedef _Unwind_Reason_Code (*_Unwind_Personality_Fn) (
493	_Unwind_Action action,
494	_Unwind_Exception_Class exception_class,
495	_Unwind_Exception * exception,
496	struct _Unwind_Context * context);
497	\end{lstlisting}
498	The @action@ argument is a bitmask of possible actions:
499	\begin{enumerate}[topsep=5pt]
500	\item
501	@_UA_SEARCH_PHASE@ specifies a search phase and tells the personality function
502	to check for handlers. If there is a handler in a stack frame, as defined by
503	the language, the personality function returns @_URC_HANDLER_FOUND@; otherwise
504	it return @_URC_CONTINUE_UNWIND@.
505
506	\item
507	@_UA_CLEANUP_PHASE@ specifies a cleanup phase, where the entire frame is
508	unwound and all cleanup code is run. The personality function does whatever
509	cleanup the language defines (such as running destructors/finalizers) and then
510	generally returns @_URC_CONTINUE_UNWIND@.
511
512	\item
513	\begin{sloppypar}
514	@_UA_HANDLER_FRAME@ specifies a cleanup phase on a function frame that found a
515	handler. The personality function must prepare to return to normal code
516	execution and return @_URC_INSTALL_CONTEXT@.
517	\end{sloppypar}
518
519	\item
520	@_UA_FORCE_UNWIND@ specifies a forced unwind call. Forced unwind only performs
521	the cleanup phase and uses a different means to decide when to stop
522	(see \autoref{s:ForcedUnwind}).
523	\end{enumerate}
524
525	The @exception_class@ argument is a copy of the
526	\code{C}{exception}'s @exception_class@ field,
527	which is a number that identifies the EHM
528	that created the exception.
529
530	The \code{C}{exception} argument is a pointer to a user
531	provided storage object. It has two public fields: the @exception_class@,
532	which is described above, and the @exception_cleanup@ function.
533	The cleanup function is used by the EHM to clean up the exception. If it
534	should need to be freed at an unusual time, it takes an argument that says
535	why it had to be cleaned up.
536
537	The @context@ argument is a pointer to an opaque type passed to helper
538	functions called inside the personality function.
539
540	The return value, @_Unwind_Reason_Code@, is an enumeration of possible messages
541	that can be passed several places in libunwind. It includes a number of
542	messages for special cases (some of which should never be used by the
543	personality function) and error codes. However, unless otherwise noted, the
544	personality function always returns @_URC_CONTINUE_UNWIND@.
545
546	\subsection{Raise Exception}
547	Raising an exception is the central function of libunwind and it performs
548	two-staged unwinding.
549	\begin{cfa}
550	_Unwind_Reason_Code _Unwind_RaiseException(_Unwind_Exception *);
551	\end{cfa}
552	First, the function begins the search phase, calling the personality function
553	of the most recent stack frame. It continues to call personality functions
554	traversing the stack from newest to oldest until a function finds a handler or
555	the end of the stack is reached. In the latter case,
556	@_Unwind_RaiseException@ returns @_URC_END_OF_STACK@.
557
558	Second, when a handler is matched, @_Unwind_RaiseException@
559	moves to the cleanup phase and walks the stack a second time.
560	Once again, it calls the personality functions of each stack frame from newest
561	to oldest. This pass stops at the stack frame containing the matching handler.
562	If that personality function has not installed a handler, it is an error.
563
564	If an error is encountered, @_Unwind_RaiseException@ returns either
565	@_URC_FATAL_PHASE1_ERROR@ or @_URC_FATAL_PHASE2_ERROR@ depending on when the
566	error occurred.
567
568	\subsection{Forced Unwind}
569	\label{s:ForcedUnwind}
570	Forced Unwind is the other central function in libunwind.
571	\begin{cfa}
572	_Unwind_Reason_Code _Unwind_ForcedUnwind(_Unwind_Exception *,
573	_Unwind_Stop_Fn, void *);
574	\end{cfa}
575	It also unwinds the stack but it does not use the search phase. Instead,
576	another
577	function, the stop function, is used to stop searching. The exception is the
578	same as the one passed to @_Unwind_RaiseException@.
579	The extra arguments are the stop
580	function and the stop parameter. The stop function has a similar interface as a
581	personality function, except it is also passed the stop parameter.
582	\begin{lstlisting}
583	typedef _Unwind_Reason_Code (*_Unwind_Stop_Fn)(
584	_Unwind_Action action,
585	_Unwind_Exception_Class exception_class,
586	_Unwind_Exception * exception,
587	struct _Unwind_Context * context,
588	void * stop_parameter);
589	\end{lstlisting}
590
591	The stop function is called at every stack frame before the personality
592	function is called and then once more after all frames of the stack are
593	unwound.
594
595	Each time it is called, the stop function should return @_URC_NO_REASON@ or
596	transfer control directly to other code outside of libunwind. The framework
597	does not provide any assistance here.
598
599	\begin{sloppypar}
600	Its arguments are the same as the paired personality function. The actions
601	\snake{_UA_CLEANUP_PHASE} and \snake{_UA_FORCE_UNWIND} are always set when it is
602	called. Beyond the libunwind standard, both GCC and Clang add an extra action
603	on the last call at the end of the stack: \snake{_UA_END_OF_STACK}.
604	\end{sloppypar}
605
606	\section{Exception Context}
607	% Should I have another independent section?
608	% There are only two things in it, top_resume and current_exception. How it is
609	% stored changes depending on whether or not the thread-library is linked.
610
611	The exception context is global storage used to maintain data across different
612	exception operations and to communicate among different components.
613
614	Each stack must have its own exception context. In a sequential \CFA program,
615	there is only one stack with a single global exception-context. However, when
616	the library @libcfathread@ is linked, there are multiple stacks and each
617	needs its own exception context.
618
619	The current exception context should be retrieved by calling the function
620	\snake{this_exception_context}.
621	For sequential execution, this function is defined as
622	a weak symbol in the \CFA system-library, @libcfa@. When a \CFA program is
623	concurrent, it links with @libcfathread@, where this function is defined with a
624	strong symbol replacing the sequential version.
625
626	The sequential @this_exception_context@ returns a hard-coded pointer to the
627	global exception context.
628	The concurrent version adds the exception context to the data stored at the
629	base of each stack. When @this_exception_context@ is called, it retrieves the
630	active stack and returns the address of the context saved there.
631
632	\section{Termination}
633	% Memory management & extra information, the custom function used to implement
634	% catches. Talk about GCC nested functions.
635
636	\CFA termination exceptions use libunwind heavily because they match
637	\Cpp exceptions closely. The main complication for \CFA is that the
638	compiler generates C code, making it very difficult to generate the assembly to
639	form the LSDA for try blocks or destructors.
640
641	\subsection{Memory Management}
642	The first step of a termination raise is to copy the exception into memory
643	managed by the exception system. Currently, the system uses @malloc@, rather
644	than reserved memory or the stack top. The EHM manages
645	memory for the exception as well as memory for libunwind and the system's own
646	per-exception storage.
647
648	\begin{figure}
649	\centering
650	\input{exception-layout}
651	\caption{Exception Layout}
652	\label{f:ExceptionLayout}
653	\end{figure}
654
655	Exceptions are stored in variable-sized blocks
656	(see \autoref{f:ExceptionLayout}).
657	The first component is a fixed-sized data structure that contains the
658	information for libunwind and the exception system. The second component is an
659	area of memory big enough to store the exception. Macros with pointer arthritic
660	and type cast are used to move between the components or go from the embedded
661	@_Unwind_Exception@ to the entire node.
662
663	Multiple exceptions can exist at the same time because exceptions can be
664	raised inside handlers, destructors and finally blocks.
665	Figure~\vref{f:MultipleExceptions} shows a program that has multiple
666	exceptions active at one time.
667	Each time an exception is thrown and caught the stack unwinds and the finally
668	clause runs. This handler throws another exception (until @num_exceptions@ gets
669	high enough), which must be allocated. The previous exceptions may not be
670	freed because the handler/catch clause has not been run.
671	Therefore, the EHM must keep all unhandled exceptions alive
672	while it allocates exceptions for new throws.
673
674	\begin{figure}
675	\centering
676	\newsavebox{\codeBox}
677	\newsavebox{\stackBox}
678	\begin{lrbox}{\codeBox}
679	\begin{cfa}
680	unsigned num_exceptions = 0;
681	void throws() {
682	try {
683	try {
684	++num_exceptions;
685	throw (Example){table};
686	} finally {
687	if (num_exceptions < 3) {
688	throws();
689	}
690	}
691	} catch (exception_t *) {
692	--num_exceptions;
693	}
694	}
695	int main() {
696	throws();
697	}
698	\end{cfa}
699	\end{lrbox}
700
701	\begin{lrbox}{\stackBox}
702	\begin{lstlisting}
703	\| finally block (Example)
704	\| try block
705	throws()
706	\| finally block (Example)
707	\| try block
708	throws()
709	\| finally block (Example)
710	\| try block
711	throws()
712	main()
713	\end{lstlisting}
714	\end{lrbox}
715
716	{\usebox\codeBox}
717	\hspace{25pt}
718	{\usebox\stackBox}
719
720	\caption{Multiple Exceptions}
721	\label{f:MultipleExceptions}
722	\end{figure}
723
724	All exceptions are stored in nodes, which are then linked together in lists
725	one list per stack, with the
726	list head stored in the exception context. Within each linked list, the most
727	recently thrown exception is at the head, followed by older thrown
728	exceptions. This format allows exceptions to be thrown, while a different
729	exception is being handled. The exception at the head of the list is currently
730	being handled, while other exceptions wait for the exceptions before them to be
731	handled and removed.
732
733	The virtual members in the exception's virtual table provide the size of the
734	exception, the copy function, and the free function, so they are specific to an
735	exception type. The size and copy function are used immediately to copy an
736	exception into managed memory. After the exception is handled, the free
737	function is used to clean up the exception and then the entire node is
738	passed to @free@, returning the memory back to the heap.
739
740	\subsection{Try Statements and Catch Clauses}
741	The try statement with termination handlers is complex because it must
742	compensate for the C code-generation versus proper
743	assembly-code generated from \CFA. Libunwind
744	requires an LSDA and personality function for control to unwind across a
745	function. The LSDA in particular is hard to mimic in generated C code.
746
747	The workaround is a function called \snake{__cfaehm_try_terminate} in the
748	standard \CFA library. The contents of a try block and the termination
749	handlers are converted into nested functions. These are then passed to the
750	try terminate function and it calls them, appropriately.
751	Because this function is known and fixed (and not an arbitrary function that
752	happens to contain a try statement), its LSDA can be generated ahead
753	of time.
754
755	Both the LSDA and the personality function for \snake{__cfaehm_try_terminate}
756	are set ahead of time using
757	embedded assembly. This assembly code is handcrafted using C @asm@ statements
758	and contains
759	enough information for the single try statement the function represents.
760
761	The three functions passed to try terminate are:
762	\begin{description}
763	\item[try function:] This function is the try block. It is where all the code
764	from inside the try block is placed. It takes no parameters and has no
765	return value. This function is called during regular execution to run the try
766	block.
767
768	\item[match function:] This function is called during the search phase and
769	decides if a catch clause matches the termination exception. It is constructed
770	from the conditional part of each handler and runs each check, top to bottom,
771	in turn, to see if the exception matches this handler.
772	The match is performed in two steps: first, a virtual cast is used to check
773	if the raised exception is an instance of the declared exception type or
774	one of its descendant types, and then the condition is evaluated, if
775	present.
776	The match function takes a pointer to the exception and returns 0 if the
777	exception is not handled here. Otherwise, the return value is the ID of the
778	handler that matches the exception.
779
780	\item[handler function:] This function handles the exception, and contains
781	all the code from the handlers in the try statement, joined with a switch
782	statement on the handler's id.
783	It takes a
784	pointer to the exception and the handler's id and returns nothing. It is called
785	after the cleanup phase.
786	\end{description}
787	All three functions are created with GCC nested functions. GCC nested functions
788	can be used to create closures;
789	in other words,
790	functions that can refer to variables in their lexical scope even though
791	those variables are part of a different function.
792	This approach allows the functions to refer to all the
793	variables in scope for the function containing the @try@ statement. These
794	nested functions and all other functions besides @__cfaehm_try_terminate@ in
795	\CFA use the GCC personality function and the @-fexceptions@ flag to generate
796	the LSDA.
797	Using this pattern, \CFA implements destructors with the cleanup attribute.
798
799	\autoref{f:TerminationTransformation} shows the pattern used to transform
800	a \CFA try statement with catch clauses into the appropriate C functions.
801
802	\begin{figure}
803	\begin{cfa}
804	try {
805	// TRY BLOCK
806	} catch (Exception1 * name1 ; check(name1)) {
807	// CATCH BLOCK 1
808	} catch (Exception2 * name2) {
809	// CATCH BLOCK 2
810	}
811	\end{cfa}
812
813	\transformline
814
815	\begin{cfa}
816	void try(void) {
817	// TRY BLOCK
818	}
819	int match(exception_t * __exception_inst) {
820	{
821	Exception1 * name1;
822	if (name1 = (virtual Exception1 *)__exception_inst
823	&& check(name1)) {
824	return 1;
825	}
826	}
827	{
828	Exception2 * name2;
829	if (name2 = (virtual Exception2 *)__exception_inst) {
830	return 2;
831	}
832	}
833	return 0;
834	}
835	void catch(exception_t * __exception_inst, int __handler_index) {
836	switch (__handler_index) {
837	case 1:
838	{
839	Exception1 * name1 = (virtual Exception1 *)__exception_inst;
840	// CATCH BLOCK 1
841	}
842	return;
843	case 2:
844	{
845	Exception2 * name2 = (virtual Exception2 *)__exception_inst;
846	// CATCH BLOCK 2
847	}
848	return;
849	}
850	}
851	{
852	__cfaehm_try_terminate(try, catch, match);
853	}
854	\end{cfa}
855
856	\caption{Termination Transformation}
857	\label{f:TerminationTransformation}
858	\end{figure}
859
860	\section{Resumption}
861	% The stack-local data, the linked list of nodes.
862
863	Resumption is simpler to implement than termination
864	because there is no stack unwinding.
865	Instead of storing the data in a special area using assembly,
866	there is just a linked list of possible handlers for each stack,
867	with each node on the list representing a try statement on the stack.
868
869	The head of the list is stored in the exception context.
870	The nodes are stored in order, with the more recent try statements closer
871	to the head of the list.
872	Instead of traversing the stack, resumption handling traverses the list.
873	At each node, the EHM checks to see if the try statement the node represents
874	can handle the exception. If it can, then the exception is handled and
875	the operation finishes; otherwise, the search continues to the next node.
876	If the search reaches the end of the list without finding a try statement
877	with a handler clause
878	that can handle the exception, the default handler is executed.
879	If the default handler returns, control continues after the raise statement.
880
881	Each node has a handler function that does most of the work.
882	The handler function is passed the raised exception and returns true
883	if the exception is handled and false otherwise.
884	The handler function checks each of its internal handlers in order,
885	top-to-bottom, until it finds a match. If a match is found that handler is
886	run, after which the function returns true, ignoring all remaining handlers.
887	If no match is found the function returns false.
888	The match is performed in two steps. First a virtual cast is used to see
889	if the raised exception is an instance of the declared exception type or one
890	of its descendant types, if so, then the second step is to see if the
891	exception passes the custom predicate
892	if one is defined.
893	% You need to make sure the type is correct before running the predicate
894	% because the predicate can depend on that.
895
896	\autoref{f:ResumptionTransformation} shows the pattern used to transform
897	a \CFA try statement with catchResume clauses into the appropriate
898	C functions.
899
900	\begin{figure}
901	\begin{cfa}
902	try {
903	// TRY BLOCK
904	} catchResume (Exception1 * name1 ; check(name1)) {
905	// CATCH BLOCK 1
906	} catchResume (Exception2 * name2) {
907	// CATCH BLOCK 2
908	}
909	\end{cfa}
910
911	\transformline
912
913	\begin{cfa}
914	bool handle(exception_t * __exception_inst) {
915	{
916	Exception1 * name1;
917	if (name1 = (virtual Exception1 *)__exception_inst
918	&& check(name1)) {
919	// CATCH BLOCK 1
920	return 1;
921	}
922	}
923	{
924	Exception2 * name2;
925	if (name2 = (virtual Exception2 *)__exception_inst) {
926	// CATCH BLOCK 2
927	return 2;
928	}
929	}
930	return false;
931	}
932	struct __try_resume_node __resume_node
933	__attribute__((cleanup( __cfaehm_try_resume_cleanup )));
934	__cfaehm_try_resume_setup( &__resume_node, handler );
935	\end{cfa}
936
937	\caption{Resumption Transformation}
938	\label{f:ResumptionTransformation}
939	\end{figure}
940
941	% Recursive Resumption Stuff:
942	\autoref{f:ResumptionMarking} shows search skipping
943	(see \autoref{s:ResumptionMarking}), which ignores parts of
944	the stack
945	already examined, and is accomplished by updating the front of the list as
946	the search continues.
947	Before the handler is called at a matching node, the head of the list
948	is updated to the next node of the current node. After the search is complete,
949	successful or not, the head of the list is reset.
950	% No paragraph?
951	This mechanism means the current handler and every handler that has already
952	been checked are not on the list while a handler is run. If a resumption is
953	thrown during the handling of another resumption, the active handlers and all
954	the other handlers checked up to this point are not checked again.
955	% No paragraph?
956	This structure also supports new handlers added while the resumption is being
957	handled. These are added to the front of the list, pointing back along the
958	stack -- the first one points over all the checked handlers --
959	and the ordering is maintained.
960
961	\begin{figure}
962	\centering
963	\input{resumption-marking}
964	\caption{Resumption Marking}
965	\label{f:ResumptionMarking}
966	\end{figure}
967
968	\label{p:zero-cost}
969	Finally, the resumption implementation has a cost for entering/exiting a try
970	statement with @catchResume@ clauses, whereas a try statement with @catch@
971	clauses has zero-cost entry/exit. While resumption does not need the stack
972	unwinding and cleanup provided by libunwind, it could use the search phase to
973	providing zero-cost enter/exit using the LSDA. Unfortunately, there is no way
974	to return from a libunwind search without installing a handler or raising an
975	error. Although workarounds might be possible, they are beyond the scope of
976	this thesis. The current resumption implementation has simplicity in its
977	favour.
978	% Seriously, just compare the size of the two chapters and then consider
979	% that unwind is required knowledge for that chapter.
980
981	\section{Finally}
982	% Uses destructors and GCC nested functions.
983
984	%\autoref{code:cleanup}
985	A finally clause is handled by converting it into a once-off destructor.
986	The code inside the clause is placed into a GCC nested-function
987	with a unique name, and no arguments or return values.
988	This nested function is
989	then set as the cleanup function of an empty object that is declared at the
990	beginning of a block placed around the context of the associated try
991	statement, as shown in \autoref{f:FinallyTransformation}.
992
993	\begin{figure}
994	\begin{cfa}
995	try {
996	// TRY BLOCK
997	} finally {
998	// FINALLY BLOCK
999	}
1000	\end{cfa}
1001
1002	\transformline
1003
1004	\begin{cfa}
1005	{
1006	void finally(void *__hook){
1007	// FINALLY BLOCK
1008	}
1009	__attribute__ ((cleanup(finally)))
1010	struct __cfaehm_cleanup_hook __finally_hook;
1011	{
1012	// TRY BLOCK
1013	}
1014	}
1015	\end{cfa}
1016
1017	\caption{Finally Transformation}
1018	\label{f:FinallyTransformation}
1019	\end{figure}
1020
1021	The rest is handled by GCC.
1022	The TRY BLOCK
1023	contains the try block itself as well as all code generated for handlers.
1024	Once that code has completed,
1025	control exits the block and the empty object is cleaned
1026	up, which runs the function that contains the finally code.
1027
1028	\section{Cancellation}
1029	% Stack selections, the three internal unwind functions.
1030
1031	Cancellation also uses libunwind to do its stack traversal and unwinding.
1032	However, it uses a different primary function: @_Unwind_ForcedUnwind@. Details
1033	of its interface can be found in Section~\vref{s:ForcedUnwind}.
1034
1035	The first step of cancellation is to find the cancelled stack and its type:
1036	coroutine, thread or main thread.
1037	In \CFA, a thread (the construct the user works with) is a user-level thread
1038	(point of execution) paired with a coroutine, the thread's main coroutine.
1039	The thread library also stores pointers to the main thread and the current
1040	thread.
1041	If the current thread's main and current coroutines are the same then the
1042	current stack is a thread stack, otherwise it is a coroutine stack.
1043	If the current stack is a thread stack, it is also the main thread stack
1044	if and only if the main and current threads are the same.
1045
1046	However, if the threading library is not linked, the sequential execution is on
1047	the main stack. Hence, the entire check is skipped because the weak-symbol
1048	function is loaded. Therefore, main thread cancellation is unconditionally
1049	performed.
1050
1051	Regardless of how the stack is chosen, the stop function and parameter are
1052	passed to the forced-unwind function. The general pattern of all three stop
1053	functions is the same: continue unwinding until the end of stack and
1054	then perform the appropriate transfer.
1055
1056	For main stack cancellation, the transfer is just a program abort.
1057
1058	For coroutine cancellation, the exception is stored on the coroutine's stack,
1059	and the coroutine context switches to its last resumer. The rest is handled on
1060	the backside of the resume, which checks if the resumed coroutine is
1061	cancelled. If cancelled, the exception is retrieved from the resumed coroutine,
1062	and a @CoroutineCancelled@ exception is constructed and loaded with the
1063	cancelled exception. It is then resumed as a regular exception with the default
1064	handler coming from the context of the resumption call.
1065
1066	For thread cancellation, the exception is stored on the thread's main stack and
1067	then context switched to the scheduler. The rest is handled by the thread
1068	joiner. When the join is complete, the joiner checks if the joined thread is
1069	cancelled. If cancelled, the exception is retrieved and the joined thread, and
1070	a @ThreadCancelled@ exception is constructed and loaded with the cancelled
1071	exception. The default handler is passed in as a function pointer. If it is
1072	null (as it is for the auto-generated joins on destructor call), the default is
1073	used, which is a program abort.
1074	%; which gives the required handling on implicate join.

Note: See TracBrowser for help on using the repository browser.

Download in other formats: