Context Navigation

source: doc/theses/andrew_beach_MMath/implement.tex @ 0a55a53

ADTast-experimentalenumforall-pointer-decayjacob/cs343-translationpthread-emulationqualifiedEnum

Last change on this file since 0a55a53 was 0a55a53, checked in by Andrew Beach <ajbeach@…>, 3 years ago
Andrew MMath: Implement chapter updated from Peter's focused review.
Property mode set to `100644`
File size: 42.1 KB

Line
1	\chapter{Implementation}
2	\label{c:implement}
3
4	% Local Helpers:
5	\newcommand\transformline[1][becomes...]{
6	\hrulefill#1\hrulefill
7	\medskip
8	}
9
10	The implementation work for this thesis covers the two components: virtual
11	system and exceptions. Each component is discussed in detail.
12
13	\section{Virtual System}
14	\label{s:VirtualSystem}
15	% Virtual table rules. Virtual tables, the pointer to them and the cast.
16	While the \CFA virtual system currently has only one public features, virtual
17	cast and virtual tables,
18	% ??? refs (see the virtual cast feature \vpageref{p:VirtualCast}),
19	substantial structure is required to support them,
20	and provide features for exception handling and the standard library.
21
22	\subsection{Virtual Type}
23	A virtual type~(see \autoref{s:virtuals}) has a pointer to a virtual table,
24	called the \emph{virtual-table pointer},
25	which binds each instance of a virtual type to a virtual table.
26	Internally, the field is called \snake{virtual_table}
27	and is fixed after construction.
28	This pointer is also the table's id and how the system accesses the
29	virtual table and the virtual members there.
30	It is always the first field in the
31	structure so that its location is always known.
32
33	% We have no special rules for these constructors.
34	Virtual table pointers are passed to the constructors of virtual types
35	as part of field-by-field construction.
36
37	\subsection{Type Id}
38	Every virtual type has a unique id.
39	These are used in type equality, to check if the representation of two values
40	are the same, and to access the type's type information.
41	This uniqueness means across a program composed of multiple translation
42	units (TU), not uniqueness across all programs or even across multiple
43	processes on the same machine.
44
45	Our approach for program uniqueness is using a static declaration for each
46	type id, where the run-time storage address of that variable is guaranteed to
47	be unique during program execution.
48	The type id storage can also be used for other purposes,
49	and is used for type information.
50
51	The problem is that a type id may appear in multiple TUs that compose a
52	program (see \autoref{ss:VirtualTable}); so the initial solution would seem
53	to be make it external in each translation unit. Hovever, the type id must
54	have a declaration in (exactly) one of the TUs to create the storage.
55	No other declaration related to the virtual type has this property, so doing
56	this through standard C declarations would require the user to do it manually.
57
58	Instead the linker is used to handle this problem.
59	% I did not base anything off of C++17; they are solving the same problem.
60	A new feature has been added to \CFA for this purpose, the special attribute
61	\snake{cfa_linkonce}, which uses the special section @.gnu.linkonce@.
62	When used as a prefix (\eg @.gnu.linkonce.example@) the linker does
63	not combine these sections, but instead discards all but one with the same
64	full name.
65
66	So each type id must be given a unique section name with the linkonce
67	prefix. Luckily \CFA already has a way to get unique names, the name mangler.
68	For example, this could be written directly in \CFA:
69	\begin{cfa}
70	__attribute__((cfa_linkonce)) void f() {}
71	\end{cfa}
72	This is translated to:
73	\begin{cfa}
74	__attribute__((section(".gnu.linkonce._X1fFv___1"))) void _X1fFv___1() {}
75	\end{cfa}
76	This is done internally to access the name manglers.
77	This attribute is useful for other purposes, any other place a unique
78	instance required, and should eventually be made part of a public and
79	stable feature in \CFA.
80
81	\subsection{Type Information}
82
83	There is data stored at the type id's declaration, the type information.
84	The type information currently is only the parent's type id or, if the
85	type has no parent, the null pointer.
86	The ancestors of a virtual type are found by traversing type ids through
87	the type information.
88	An example using helper macros looks like:
89	\begin{cfa}
90	struct INFO_TYPE(TYPE) {
91	INFO_TYPE(PARENT) const * parent;
92	};
93
94	__attribute__((cfa_linkonce))
95	INFO_TYPE(TYPE) const INFO_NAME(TYPE) = {
96	&INFO_NAME(PARENT),
97	};
98	\end{cfa}
99
100	Type information is constructed as follows:
101	\begin{enumerate}
102	\item
103	Use the type's name to generate a name for the type information structure,
104	which is saved so it can be reused.
105	\item
106	Generate a new structure definition to store the type
107	information. The layout is the same in each case, just the parent's type id,
108	but the types used change from instance to instance.
109	The generated name is used for both this structure and, if relevant, the
110	parent pointer.
111	If the virtual type is polymorphic then the type information structure is
112	polymorphic as well, with the same polymorphic arguments.
113	\item
114	A separate name for instances is generated from the type's name.
115	\item
116	The definition is generated and initialized.
117	The parent id is set to the null pointer or to the address of the parent's
118	type information instance. Name resolution handles the rest.
119	\item
120	\CFA's name mangler does its regular name mangling encoding the type of
121	the declaration into the instance name.
122	This process gives a completely unique name
123	including different instances of the same polymorphic type.
124	\end{enumerate}
125	\todo{The list is making me realize, some of this isn't ordered.}
126
127	Writing that code manually, with helper macros for the early name mangling,
128	would look like this:
129	\begin{cfa}
130	struct INFO_TYPE(TYPE) {
131	INFO_TYPE(PARENT) const * parent;
132	};
133
134	__attribute__((cfa_linkonce))
135	INFO_TYPE(TYPE) const INFO_NAME(TYPE) = {
136	&INFO_NAME(PARENT),
137	};
138	\end{cfa}
139
140	\begin{comment}
141	\subsubsection{\lstinline{cfa\_linkonce} Attribute}
142	% I just realized: This is an extension of the inline keyword.
143	% An extension of C's at least, it is very similar to C++'s.
144	Another feature added to \CFA is a new attribute: \texttt{cfa\_linkonce}.
145	This attribute is attached to an object or function definition
146	(any global declaration with a name and a type)
147	allowing it to be defined multiple times.
148	All matching definitions mush have the link-once attribute
149	and their implementations should be identical as well.
150
151	A single definition with the attribute can be included in a header
152	file as if it was a forward declaration, except no definition is required.
153
154	This technique is used for type-id instances. A link-once definition is
155	generated each time the structure is seen. This will result in multiple
156	copies but the link-once attribute ensures all but one are removed for a
157	unique instance.
158
159	Internally, @cfa_linkonce@ is replaced with
160	@section(".gnu.linkonce.NAME")@ where \texttt{NAME} is replaced by the
161	mangled name of the object.
162	Any other @section@ attributes are removed from the declaration.
163	The prefix \texttt{.gnu.linkonce} in section names is recognized by the
164	linker. If two of these sections appear with the same name, including
165	everything that comes after the special prefix, then only one is used
166	and the other is discarded.
167	\end{comment}
168
169	\subsection{Virtual Table}
170	\label{ss:VirtualTable}
171	Each virtual type has a virtual table type that stores its type id and
172	virtual members.
173	Each virtual type instance is bound to a table instance that is filled with
174	the values of virtual members.
175	Both the layout of the fields and their value are decided by the rules given
176	below.
177
178	The layout always comes in three parts.
179	\todo{Add labels to the virtual table layout figure.}
180	The first section is just the type id at the head of the table. It is always
181	there to ensure that it can be found even when the accessing code does not
182	know which virtual type it has.
183	The second section are all the virtual members of the parent, in the same
184	order as they appear in the parent's virtual table. Note that the type may
185	change slightly as references to the ``this" change. This is limited to
186	inside pointers/references and via function pointers so that the size (and
187	hence the offsets) are the same.
188	The third section is similar to the second except that it is the new virtual
189	members introduced at this level in the hierarchy.
190
191	\begin{figure}
192	\input{vtable-layout}
193	\caption{Virtual Table Layout}
194	\label{f:VirtualTableLayout}
195	\todo*{Improve the Virtual Table Layout diagram.}
196	\end{figure}
197
198	The first and second sections together mean that every virtual table has a
199	prefix that has the same layout and types as its parent virtual table.
200	This, combined with the fixed offset to the virtual table pointer, means that
201	for any virtual type, it is always safe to access its virtual table and,
202	from there, it is safe to check the type id to identify the exact type of the
203	underlying object, access any of the virtual members and pass the object to
204	any of the method-like virtual members.
205
206	When a virtual table is declared, the user decides where to declare it and its
207	name. The initialization of the virtual table is entirely automatic based on
208	the context of the declaration.
209
210	The type id is always fixed; with each virtual table type having
211	exactly one possible type id.
212	The virtual members are usually filled in by type resolution.
213	The best match for a given name and type at the declaration site is used.
214	There are two exceptions to that rule: the @size@ field, the type's size,
215	is set using a @sizeof@ expression and the @align@ field, the
216	type's alignment, is set using an @alignof@ expression.
217
218	Most of these tools are already inside the compiler. Using simple
219	code transformations early on in compilation, allows most of that work to be
220	handed off to the existing tools. \autoref{f:VirtualTableTransformation}
221	shows an example transformation, this example shows an exception virtual table.
222	It also shows the transformation on the full declaration.
223	For a forward declaration, the @extern@ keyword is preserved and the
224	initializer is not added.
225
226	\begin{figure}[htb]
227	\begin{cfa}
228	vtable(example_type) example_name;
229	\end{cfa}
230	\transformline
231	% Check mangling.
232	\begin{cfa}
233	const struct example_type_vtable example_name = {
234	.__cfavir_typeid : &__cfatid_example_type,
235	.size : sizeof(example_type),
236	.copy : copy,
237	.^?{} : ^?{},
238	.msg : msg,
239	};
240	\end{cfa}
241	\caption{Virtual Table Transformation}
242	\label{f:VirtualTableTransformation}
243	\end{figure}
244
245	\subsection{Concurrency Integration}
246	Coroutines and threads need instances of @CoroutineCancelled@ and
247	@ThreadCancelled@ respectively to use all of their functionality. When a new
248	data type is declared with @coroutine@ or @thread@, a forward declaration for
249	the instance is created as well. The definition of the virtual table is created
250	at the definition of the main function.
251
252	These transformations are shown through code re-writing in
253	\autoref{f:CoroutineTypeTransformation} and
254	\autoref{f:CoroutineMainTransformation}.
255	Threads use the same pattern, with some names and types changed.
256	In both cases, the original declaration is not modified,
257	only new ones are added.
258
259	\begin{figure}[htb]
260	\begin{cfa}
261	coroutine Example {
262	// fields
263	};
264	\end{cfa}
265
266	\transformline[appends...]
267
268	\begin{cfa}
269	__attribute__((cfa_linkonce))
270	struct __cfatid_struct_CoroutineCancelled(Example)
271	__cfatid_CoroutineCancelled = {
272	&EXCEPTION_TYPE_ID,
273	};
274	extern CoroutineCancelled_vtable _default_vtable_object_declaration;
275	extern CoroutineCancelled_vtable & _default_vtable;
276	\end{cfa}
277	\caption{Coroutine Type Transformation}
278	\label{f:CoroutineTypeTransformation}
279	\end{figure}
280
281	\begin{figure}[htb]
282	\begin{cfa}
283	void main(Example & this) {
284	// body
285	}
286	\end{cfa}
287
288	\transformline[appends...]
289
290	\begin{cfa}
291	CoroutineCancelled_vtable _default_vtable_object_declaration = {
292	__cfatid_CoroutineCancelled,
293	// Virtual member initialization.
294	};
295
296	CoroutineCancelled_vtable & _default_vtable =
297	&_default_vtable_object_declaration;
298	\end{cfa}
299	\caption{Coroutine Main Transformation}
300	\label{f:CoroutineMainTransformation}
301	\end{figure}
302
303	\subsection{Virtual Cast}
304	Virtual casts are implemented as a function call that does the subtype check
305	and a C coercion-cast to do the type conversion.
306	% The C-cast is just to make sure the generated code is correct so the rest of
307	% the section is about that function.
308	The function is implemented in the standard library and has the following
309	signature:
310	\begin{cfa}
311	void * __cfa__virtual_cast(
312	struct __cfavir_type_id * parent,
313	struct __cfavir_type_id * const * child );
314	\end{cfa}
315	The type id for the target type of the virtual cast is passed in as
316	@parent@ and
317	the cast target is passed in as @child@.
318	The generated C code wraps both arguments and the result with type casts.
319	There is also an internal check inside the compiler to make sure that the
320	target type is a virtual type.
321	% It also checks for conflicting definitions.
322
323	The virtual cast either returns the original pointer or the null pointer
324	as the new type.
325	So the function does the parent check and returns the appropriate value.
326	The parent check is a simple linear search of child's ancestors using the
327	type information.
328
329	\section{Exceptions}
330	% The implementation of exception types.
331
332	Creating exceptions can roughly divided into two parts,
333	the exceptions themselves and the virtual system interactions.
334
335	Creating an exception type is just a matter of prepending the field
336	with the virtual table pointer to the list of the fields
337	(see \autoref{f:ExceptionTypeTransformation}).
338
339	\begin{figure}[htb]
340	\begin{cfa}
341	exception new_exception {
342	// EXISTING FIELDS
343	};
344	\end{cfa}
345	\transformline
346	\begin{cfa}
347	struct new_exception {
348	struct new_exception_vtable const * virtual_table;
349	// EXISTING FIELDS
350	};
351	\end{cfa}
352	\caption{Exception Type Transformation}
353	\label{f:ExceptionTypeTransformation}
354	\end{figure}
355
356	The integration between exceptions and the virtual system is a bit more
357	complex simply because of the nature of the virtual system prototype.
358	The primary issue is that the virtual system has no way to detect when it
359	should generate any of its internal types and data. This is handled by
360	the exception code, which tells the virtual system when to generate
361	its components.
362
363	All types associated with a virtual type,
364	the types of the virtual table and the type id,
365	are generated when the virtual type (the exception) is first found.
366	The type id (the instance) is generated with the exception, if it is
367	a monomorphic type.
368	However, if the exception is polymorphic, then a different type id has to
369	be generated for every instance. In this case, generation is delayed
370	until a virtual table is created.
371	% There are actually some problems with this, which is why it is not used
372	% for monomorphic types.
373	When a virtual table is created and initialized, two functions are created
374	to fill in the list of virtual members.
375	The first is a copy function that adapts the exception's copy constructor
376	to work with pointers, avoiding some issues with the current copy constructor
377	interface.
378	Second is the msg function that returns a C-string with the type's name,
379	including any polymorphic parameters.
380
381	\section{Unwinding}
382	% Adapt the unwind chapter, just describe the sections of libunwind used.
383	% Mention that termination and cancellation use it. Maybe go into why
384	% resumption doesn't as well.
385
386	% Many modern languages work with an internal stack that function push and pop
387	% their local data to. Stack unwinding removes large sections of the stack,
388	% often across functions.
389
390	Stack unwinding is the process of removing stack frames (activations) from the
391	stack. On function entry and return, unwinding is handled directly by the
392	call/return code embedded in the function.
393
394	% Discussing normal stack unwinding:
395	Usually, the stack-frame size is known statically based on parameter and
396	local variable declarations. Even for a dynamic stack-size, the information
397	to determine how much of the stack has to be removed is still contained
398	within the function.
399	Allocating/deallocating stack space is usually an $O(1)$ operation achieved by
400	bumping the hardware stack-pointer up or down as needed.
401	Constructing/destructing values within a stack frame has
402	a similar complexity but larger constants.
403
404	% Discussing multiple frame stack unwinding:
405	Unwinding across multiple stack frames is more complex because that
406	information is no longer contained within the current function.
407	With separate compilation,
408	a function does not know its callers nor their frame layout.
409	Even using the return address, that information is encoded in terms of
410	actions in code, intermixed with the actions required finish the function.
411	Without changing the main code path it is impossible to select one of those
412	two groups of actions at the return site.
413
414	The traditional unwinding mechanism for C is implemented by saving a snap-shot
415	of a function's state with @setjmp@ and restoring that snap-shot with
416	@longjmp@. This approach bypasses the need to know stack details by simply
417	reseting to a snap-shot of an arbitrary but existing function frame on the
418	stack. It is up to the programmer to ensure the snap-shot is valid when it is
419	reset and that all required clean-up from the unwound stacks is performed.
420	This approach is fragile and requires extra work in the surrounding code.
421
422	With respect to the extra work in the surrounding code,
423	many languages define clean-up actions that must be taken when certain
424	sections of the stack are removed. Such as when the storage for a variable
425	is removed from the stack, possibly requiring a destructor call,
426	or when a try statement with a finally clause is
427	(conceptually) popped from the stack.
428	None of these cases should be handled by the user --- that would contradict the
429	intention of these features --- so they need to be handled automatically.
430
431	To safely remove sections of the stack, the language must be able to find and
432	run these clean-up actions even when removing multiple functions unknown at
433	the beginning of the unwinding.
434
435	One of the most popular tools for stack management is libunwind, a low-level
436	library that provides tools for stack walking, handler execution, and
437	unwinding. What follows is an overview of all the relevant features of
438	libunwind needed for this work, and how \CFA uses them to implement exception
439	handling.
440
441	\subsection{libunwind Usage}
442	Libunwind, accessed through @unwind.h@ on most platforms, is a C library that
443	provides \Cpp-style stack-unwinding. Its operation is divided into two phases:
444	search and cleanup. The dynamic target search -- phase 1 -- is used to scan the
445	stack and decide where unwinding should stop (but no unwinding occurs). The
446	cleanup -- phase 2 -- does the unwinding and also runs any cleanup code.
447
448	To use libunwind, each function must have a personality function and a Language
449	Specific Data Area (LSDA). The LSDA has the unique information for each
450	function to tell the personality function where a function is executing, its
451	current stack frame, and what handlers should be checked. Theoretically, the
452	LSDA can contain any information but conventionally it is a table with entries
453	representing regions of a function and what has to be done there during
454	unwinding. These regions are bracketed by instruction addresses. If the
455	instruction pointer is within a region's start/end, then execution is currently
456	executing in that region. Regions are used to mark out the scopes of objects
457	with destructors and try blocks.
458
459	% Libunwind actually does very little, it simply moves down the stack from
460	% function to function. Most of the actions are implemented by the personality
461	% function which libunwind calls on every function. Since this is shared across
462	% many functions or even every function in a language it will need a bit more
463	% information.
464
465	The GCC compilation flag @-fexceptions@ causes the generation of an LSDA and
466	attaches a personality function to each function.
467	In plain C (which \CFA currently compiles down to) this
468	flag only handles the cleanup attribute:
469	%\label{code:cleanup}
470	\begin{cfa}
471	void clean_up( int * var ) { ... }
472	int avar __attribute__(( cleanup(clean_up) ));
473	\end{cfa}
474	The attribute is used on a variable and specifies a function,
475	in this case @clean_up@, run when the variable goes out of scope.
476	This feature is enough to mimic destructors,
477	but not try statements that affect
478	the unwinding.
479
480	To get full unwinding support, all of these features must be handled directly
481	in assembly and assembler directives; particularly the cfi directives
482	\snake{.cfi_lsda} and \snake{.cfi_personality}.
483
484	\subsection{Personality Functions}
485	Personality functions have a complex interface specified by libunwind. This
486	section covers some of the important parts of the interface.
487
488	A personality function can perform different actions depending on how it is
489	called.
490	\begin{lstlisting}
491	typedef _Unwind_Reason_Code (*_Unwind_Personality_Fn) (
492	_Unwind_Action action,
493	_Unwind_Exception_Class exception_class,
494	_Unwind_Exception * exception,
495	struct _Unwind_Context * context);
496	\end{lstlisting}
497	The @action@ argument is a bitmask of possible actions:
498	\begin{enumerate}[topsep=5pt]
499	\item
500	@_UA_SEARCH_PHASE@ specifies a search phase and tells the personality function
501	to check for handlers. If there is a handler in a stack frame, as defined by
502	the language, the personality function returns @_URC_HANDLER_FOUND@; otherwise
503	it return @_URC_CONTINUE_UNWIND@.
504
505	\item
506	@_UA_CLEANUP_PHASE@ specifies a cleanup phase, where the entire frame is
507	unwound and all cleanup code is run. The personality function does whatever
508	cleanup the language defines (such as running destructors/finalizers) and then
509	generally returns @_URC_CONTINUE_UNWIND@.
510
511	\item
512	\begin{sloppypar}
513	@_UA_HANDLER_FRAME@ specifies a cleanup phase on a function frame that found a
514	handler. The personality function must prepare to return to normal code
515	execution and return @_URC_INSTALL_CONTEXT@.
516	\end{sloppypar}
517
518	\item
519	@_UA_FORCE_UNWIND@ specifies a forced unwind call. Forced unwind only performs
520	the cleanup phase and uses a different means to decide when to stop
521	(see \autoref{s:ForcedUnwind}).
522	\end{enumerate}
523
524	The @exception_class@ argument is a copy of the
525	\code{C}{exception}'s @exception_class@ field,
526	which is a number that identifies the EHM
527	that created the exception.
528
529	The \code{C}{exception} argument is a pointer to a user
530	provided storage object. It has two public fields: the @exception_class@,
531	which is described above, and the @exception_cleanup@ function.
532	The clean-up function is used by the EHM to clean-up the exception, if it
533	should need to be freed at an unusual time, it takes an argument that says
534	why it had to be cleaned up.
535
536	The @context@ argument is a pointer to an opaque type passed to helper
537	functions called inside the personality function.
538
539	The return value, @_Unwind_Reason_Code@, is an enumeration of possible messages
540	that can be passed several places in libunwind. It includes a number of
541	messages for special cases (some of which should never be used by the
542	personality function) and error codes. However, unless otherwise noted, the
543	personality function always returns @_URC_CONTINUE_UNWIND@.
544
545	\subsection{Raise Exception}
546	Raising an exception is the central function of libunwind and it performs
547	two-staged unwinding.
548	\begin{cfa}
549	_Unwind_Reason_Code _Unwind_RaiseException(_Unwind_Exception *);
550	\end{cfa}
551	First, the function begins the search phase, calling the personality function
552	of the most recent stack frame. It continues to call personality functions
553	traversing the stack from newest to oldest until a function finds a handler or
554	the end of the stack is reached. In the latter case, raise exception returns
555	@_URC_END_OF_STACK@.
556
557	Second, when a handler is matched, raise exception moves to the clean-up
558	phase and walks the stack a second time.
559	Once again, it calls the personality functions of each stack frame from newest
560	to oldest. This pass stops at the stack frame containing the matching handler.
561	If that personality function has not install a handler, it is an error.
562
563	If an error is encountered, raise exception returns either
564	@_URC_FATAL_PHASE1_ERROR@ or @_URC_FATAL_PHASE2_ERROR@ depending on when the
565	error occurred.
566
567	\subsection{Forced Unwind}
568	\label{s:ForcedUnwind}
569	Forced Unwind is the other central function in libunwind.
570	\begin{cfa}
571	_Unwind_Reason_Code _Unwind_ForcedUnwind(_Unwind_Exception *,
572	_Unwind_Stop_Fn, void *);
573	\end{cfa}
574	It also unwinds the stack but it does not use the search phase. Instead another
575	function, the stop function, is used to stop searching. The exception is the
576	same as the one passed to raise exception. The extra arguments are the stop
577	function and the stop parameter. The stop function has a similar interface as a
578	personality function, except it is also passed the stop parameter.
579	\begin{lstlisting}
580	typedef _Unwind_Reason_Code (*_Unwind_Stop_Fn)(
581	_Unwind_Action action,
582	_Unwind_Exception_Class exception_class,
583	_Unwind_Exception * exception,
584	struct _Unwind_Context * context,
585	void * stop_parameter);
586	\end{lstlisting}
587
588	The stop function is called at every stack frame before the personality
589	function is called and then once more after all frames of the stack are
590	unwound.
591
592	Each time it is called, the stop function should return @_URC_NO_REASON@ or
593	transfer control directly to other code outside of libunwind. The framework
594	does not provide any assistance here.
595
596	\begin{sloppypar}
597	Its arguments are the same as the paired personality function. The actions
598	\snake{_UA_CLEANUP_PHASE} and \snake{_UA_FORCE_UNWIND} are always set when it is
599	called. Beyond the libunwind standard, both GCC and Clang add an extra action
600	on the last call at the end of the stack: \snake{_UA_END_OF_STACK}.
601	\end{sloppypar}
602
603	\section{Exception Context}
604	% Should I have another independent section?
605	% There are only two things in it, top_resume and current_exception. How it is
606	% stored changes depending on whether or not the thread-library is linked.
607
608	The exception context is global storage used to maintain data across different
609	exception operations and to communicate among different components.
610
611	Each stack must have its own exception context. In a sequential \CFA program,
612	there is only one stack with a single global exception-context. However, when
613	the library @libcfathread@ is linked, there are multiple stacks and each
614	needs its own exception context.
615
616	The current exception context should be retrieved by calling the function
617	\snake{this_exception_context}.
618	For sequential execution, this function is defined as
619	a weak symbol in the \CFA system-library, @libcfa@. When a \CFA program is
620	concurrent, it links with @libcfathread@, where this function is defined with a
621	strong symbol replacing the sequential version.
622
623	The sequential @this_exception_context@ returns a hard-coded pointer to the
624	global exception context.
625	The concurrent version adds the exception context to the data stored at the
626	base of each stack. When @this_exception_context@ is called, it retrieves the
627	active stack and returns the address of the context saved there.
628
629	\section{Termination}
630	% Memory management & extra information, the custom function used to implement
631	% catches. Talk about GCC nested functions.
632
633	\CFA termination exceptions use libunwind heavily because they match
634	\Cpp exceptions closely. The main complication for \CFA is that the
635	compiler generates C code, making it very difficult to generate the assembly to
636	form the LSDA for try blocks or destructors.
637
638	\subsection{Memory Management}
639	The first step of a termination raise is to copy the exception into memory
640	managed by the exception system. Currently, the system uses @malloc@, rather
641	than reserved memory or the stack top. The EHM manages
642	memory for the exception as well as memory for libunwind and the system's own
643	per-exception storage.
644
645	\begin{figure}
646	\centering
647	\input{exception-layout}
648	\caption{Exception Layout}
649	\label{f:ExceptionLayout}
650	\end{figure}
651
652	Exceptions are stored in variable-sized blocks
653	(see \autoref{f:ExceptionLayout}).
654	The first component is a fixed-sized data structure that contains the
655	information for libunwind and the exception system. The second component is an
656	area of memory big enough to store the exception. Macros with pointer arthritic
657	and type cast are used to move between the components or go from the embedded
658	@_Unwind_Exception@ to the entire node.
659
660	Multiple exceptions can exist at the same time because exceptions can be
661	raised inside handlers, destructors and finally blocks.
662	Figure~\vref{f:MultipleExceptions} shows a program that has multiple
663	exceptions active at one time.
664	Each time an exception is thrown and caught the stack unwinds and the finally
665	clause runs. This handler throws another exception (until @num_exceptions@ gets
666	high enough), which must be allocated. The previous exceptions may not be
667	freed because the handler/catch clause has not been run.
668	Therefore, the EHM must keep all unhandled exceptions alive
669	while it allocates exceptions for new throws.
670
671	\begin{figure}
672	\centering
673	\newsavebox{\codeBox}
674	\newsavebox{\stackBox}
675	\begin{lrbox}{\codeBox}
676	\begin{lstlisting}[language=CFA,{moredelim=**[is][\color{red}]{@}{@}}]
677	unsigned num_exceptions = 0;
678	void throws() {
679	try {
680	try {
681	++num_exceptions;
682	throw (Example){table};
683	} finally {
684	if (num_exceptions < 3) {
685	throws();
686	}
687	}
688	} catch (exception_t *) {
689	--num_exceptions;
690	}
691	}
692	int main() {
693	throws();
694	}
695	\end{lstlisting}
696	\end{lrbox}
697
698	\begin{lrbox}{\stackBox}
699	\begin{lstlisting}
700	\| try-finally
701	\| try-catch (Example)
702	throws()
703	\| try-finally
704	\| try-catch (Example)
705	throws()
706	\| try-finally
707	\| try-catch (Example)
708	throws()
709	main()
710	\end{lstlisting}
711	\end{lrbox}
712
713	{\usebox\codeBox}
714	\hspace{25pt}
715	{\usebox\stackBox}
716
717	\caption{Multiple Exceptions}
718	\label{f:MultipleExceptions}
719	\end{figure}
720	\todo*{Work on multiple exceptions code sample.}
721
722	All exceptions are stored in nodes, which are then linked together in lists
723	one list per stack, with the
724	list head stored in the exception context. Within each linked list, the most
725	recently thrown exception is at the head followed by older thrown
726	exceptions. This format allows exceptions to be thrown, while a different
727	exception is being handled. The exception at the head of the list is currently
728	being handled, while other exceptions wait for the exceptions before them to be
729	handled and removed.
730
731	The virtual members in the exception's virtual table provide the size of the
732	exception, the copy function, and the free function, so they are specific to an
733	exception type. The size and copy function are used immediately to copy an
734	exception into managed memory. After the exception is handled, the free
735	function is used to clean up the exception and then the entire node is
736	passed to free, returning the memory back to the heap.
737
738	\subsection{Try Statements and Catch Clauses}
739	The try statement with termination handlers is complex because it must
740	compensate for the C code-generation versus proper
741	assembly-code generated from \CFA. Libunwind
742	requires an LSDA and personality function for control to unwind across a
743	function. The LSDA in particular is hard to mimic in generated C code.
744
745	The workaround is a function called \snake{__cfaehm_try_terminate} in the
746	standard \CFA library. The contents of a try block and the termination
747	handlers are converted into nested functions. These are then passed to the
748	try terminate function and it calls them, appropriately.
749	Because this function is known and fixed (and not an arbitrary function that
750	happens to contain a try statement), its LSDA can be generated ahead
751	of time.
752
753	Both the LSDA and the personality function for \snake{__cfaehm_try_terminate}
754	are set ahead of time using
755	embedded assembly. This assembly code is handcrafted using C @asm@ statements
756	and contains
757	enough information for the single try statement the function represents.
758
759	The three functions passed to try terminate are:
760	\begin{description}
761	\item[try function:] This function is the try block, it is where all the code
762	from inside the try block is placed. It takes no parameters and has no
763	return value. This function is called during regular execution to run the try
764	block.
765
766	\item[match function:] This function is called during the search phase and
767	decides if a catch clause matches the termination exception. It is constructed
768	from the conditional part of each handler and runs each check, top to bottom,
769	in turn, to see if the exception matches this handler.
770	The match is performed in two steps, first a virtual cast is used to check
771	if the raised exception is an instance of the declared exception type or
772	one of its descendant types, and then the condition is evaluated, if
773	present.
774	The match function takes a pointer to the exception and returns 0 if the
775	exception is not handled here. Otherwise the return value is the id of the
776	handler that matches the exception.
777
778	\item[handler function:] This function handles the exception, and contains
779	all the code from the handlers in the try statement, joined with a switch
780	statement on the handler's id.
781	It takes a
782	pointer to the exception and the handler's id and returns nothing. It is called
783	after the cleanup phase.
784	\end{description}
785	All three functions are created with GCC nested functions. GCC nested functions
786	can be used to create closures,
787	in other words,
788	functions that can refer to variables in their lexical scope even
789	those variables are part of a different function.
790	This approach allows the functions to refer to all the
791	variables in scope for the function containing the @try@ statement. These
792	nested functions and all other functions besides @__cfaehm_try_terminate@ in
793	\CFA use the GCC personality function and the @-fexceptions@ flag to generate
794	the LSDA.
795	Using this pattern, \CFA implements destructors with the cleanup attribute.
796
797	\autoref{f:TerminationTransformation} shows the pattern used to transform
798	a \CFA try statement with catch clauses into the appropriate C functions.
799	\todo{Explain the Termination Transformation figure.}
800
801	\begin{figure}
802	\begin{cfa}
803	try {
804	// TRY BLOCK
805	} catch (Exception1 * name1 ; check(name1)) {
806	// CATCH BLOCK 1
807	} catch (Exception2 * name2) {
808	// CATCH BLOCK 2
809	}
810	\end{cfa}
811
812	\transformline
813
814	\begin{cfa}
815	void try(void) {
816	// TRY BLOCK
817	}
818	int match(exception_t * __exception_inst) {
819	{
820	Exception1 * name1;
821	if (name1 = (virtual Exception1 *)__exception_inst
822	&& check(name1)) {
823	return 1;
824	}
825	}
826	{
827	Exception2 * name2;
828	if (name2 = (virtual Exception2 *)__exception_inst) {
829	return 2;
830	}
831	}
832	return 0;
833	}
834	void catch(exception_t * __exception_inst, int __handler_index) {
835	switch (__handler_index) {
836	case 1:
837	{
838	Exception1 * name1 = (virtual Exception1 *)__exception_inst;
839	// CATCH BLOCK 1
840	}
841	return;
842	case 2:
843	{
844	Exception2 * name2 = (virtual Exception2 *)__exception_inst;
845	// CATCH BLOCK 2
846	}
847	return;
848	}
849	}
850	{
851	__cfaehm_try_terminate(try, catch, match);
852	}
853	\end{cfa}
854
855	\caption{Termination Transformation}
856	\label{f:TerminationTransformation}
857	\todo*{Improve (compress?) Termination Transformations.}
858	\end{figure}
859
860	\section{Resumption}
861	% The stack-local data, the linked list of nodes.
862
863	Resumption is simpler to implement than termination
864	because there is no stack unwinding.
865	Instead of storing the data in a special area using assembly,
866	there is just a linked list of possible handlers for each stack,
867	with each node on the list representing a try statement on the stack.
868
869	The head of the list is stored in the exception context.
870	The nodes are stored in order, with the more recent try statements closer
871	to the head of the list.
872	Instead of traversing the stack, resumption handling traverses the list.
873	At each node, the EHM checks to see if the try statement the node represents
874	can handle the exception. If it can, then the exception is handled and
875	the operation finishes, otherwise the search continues to the next node.
876	If the search reaches the end of the list without finding a try statement
877	with a handler clause
878	that can handle the exception, the default handler is executed.
879	If the default handler returns, control continues after the raise statement.
880
881	Each node has a handler function that does most of the work.
882	The handler function is passed the raised exception and returns true
883	if the exception is handled and false otherwise.
884	The handler function checks each of its internal handlers in order,
885	top-to-bottom, until it funds a match. If a match is found that handler is
886	run, after which the function returns true, ignoring all remaining handlers.
887	If no match is found the function returns false.
888	The match is performed in two steps, first a virtual cast is used to see
889	if the raised exception is an instance of the declared exception type or one
890	of its descendant types, if so then it is passed to the custom predicate
891	if one is defined.
892	% You need to make sure the type is correct before running the predicate
893	% because the predicate can depend on that.
894
895	\autoref{f:ResumptionTransformation} shows the pattern used to transform
896	a \CFA try statement with catch clauses into the appropriate C functions.
897	\todo{Explain the Resumption Transformation figure.}
898
899	\begin{figure}
900	\begin{cfa}
901	try {
902	// TRY BLOCK
903	} catchResume (Exception1 * name1 ; check(name1)) {
904	// CATCH BLOCK 1
905	} catchResume (Exception2 * name2) {
906	// CATCH BLOCK 2
907	}
908	\end{cfa}
909
910	\transformline
911
912	\begin{cfa}
913	bool handle(exception_t * __exception_inst) {
914	{
915	Exception1 * name1;
916	if (name1 = (virtual Exception1 *)__exception_inst
917	&& check(name1)) {
918	// CATCH BLOCK 1
919	return 1;
920	}
921	}
922	{
923	Exception2 * name2;
924	if (name2 = (virtual Exception2 *)__exception_inst) {
925	// CATCH BLOCK 2
926	return 2;
927	}
928	}
929	return false;
930	}
931	struct __try_resume_node __resume_node
932	__attribute__((cleanup( __cfaehm_try_resume_cleanup )));
933	__cfaehm_try_resume_setup( &__resume_node, handler );
934	\end{cfa}
935
936	\caption{Resumption Transformation}
937	\label{f:ResumptionTransformation}
938	\todo*{Improve (compress?) Resumption Transformations.}
939	\end{figure}
940
941	% Recursive Resumption Stuff:
942	\autoref{f:ResumptionMarking} shows search skipping
943	(see \vpageref{s:ResumptionMarking}), which ignores parts of
944	the stack
945	already examined, and is accomplished by updating the front of the list as
946	the search continues.
947	Before the handler is called at a matching node, the head of the list
948	is updated to the next node of the current node. After the search is complete,
949	successful or not, the head of the list is reset.
950	% No paragraph?
951	This mechanism means the current handler and every handler that has already
952	been checked are not on the list while a handler is run. If a resumption is
953	thrown during the handling of another resumption, the active handlers and all
954	the other handlers checked up to this point are not checked again.
955	% No paragraph?
956	This structure also supports new handlers added while the resumption is being
957	handled. These are added to the front of the list, pointing back along the
958	stack --- the first one points over all the checked handlers ---
959	and the ordering is maintained.
960
961	\begin{figure}
962	\centering
963	\input{resumption-marking}
964	\caption{Resumption Marking}
965	\label{f:ResumptionMarking}
966	\todo*{Label Resumption Marking to aid clarity.}
967	\end{figure}
968
969	\label{p:zero-cost}
970	Finally, the resumption implementation has a cost for entering/exiting a try
971	statement with @catchResume@ clauses, whereas a try statement with @catch@
972	clauses has zero-cost entry/exit. While resumption does not need the stack
973	unwinding and cleanup provided by libunwind, it could use the search phase to
974	providing zero-cost enter/exit using the LSDA. Unfortunately, there is no way
975	to return from a libunwind search without installing a handler or raising an
976	error. Although workarounds might be possible, they are beyond the scope of
977	this thesis. The current resumption implementation has simplicity in its
978	favour.
979	% Seriously, just compare the size of the two chapters and then consider
980	% that unwind is required knowledge for that chapter.
981
982	\section{Finally}
983	% Uses destructors and GCC nested functions.
984
985	%\autoref{code:cleanup}
986	A finally clause is handled by converting it into a once-off destructor.
987	The code inside the clause is placed into GCC nested-function
988	with a unique name, and no arguments or return values.
989	This nested function is
990	then set as the cleanup function of an empty object that is declared at the
991	beginning of a block placed around the context of the associated try
992	statement (see \autoref{f:FinallyTransformation}).
993
994	\begin{figure}
995	\begin{cfa}
996	try {
997	// TRY BLOCK
998	} finally {
999	// FINALLY BLOCK
1000	}
1001	\end{cfa}
1002
1003	\transformline
1004
1005	\begin{cfa}
1006	{
1007	void finally(void *__hook){
1008	// FINALLY BLOCK
1009	}
1010	__attribute__ ((cleanup(finally)))
1011	struct __cfaehm_cleanup_hook __finally_hook;
1012	{
1013	// TRY BLOCK
1014	}
1015	}
1016	\end{cfa}
1017
1018	\caption{Finally Transformation}
1019	\label{f:FinallyTransformation}
1020	\end{figure}
1021
1022	The rest is handled by GCC.
1023	The TRY BLOCK
1024	contains the try block itself as well as all code generated for handlers.
1025	Once that code has completed,
1026	control exits the block and the empty object is cleaned
1027	up, which runs the function that contains the finally code.
1028
1029	\section{Cancellation}
1030	% Stack selections, the three internal unwind functions.
1031
1032	Cancellation also uses libunwind to do its stack traversal and unwinding,
1033	however it uses a different primary function: @_Unwind_ForcedUnwind@. Details
1034	of its interface can be found in the Section~\vref{s:ForcedUnwind}.
1035
1036	The first step of cancellation is to find the cancelled stack and its type:
1037	coroutine, thread or main thread.
1038	In \CFA, a thread (the construct the user works with) is a user-level thread
1039	(point of execution) paired with a coroutine, the thread's main coroutine.
1040	The thread library also stores pointers to the main thread and the current
1041	thread.
1042	If the current thread's main and current coroutines are the same then the
1043	current stack is a thread stack, otherwise it is a coroutine stack.
1044	If the current stack is a thread stack, it is also the main thread stack
1045	if and only if the main and current threads are the same.
1046
1047	However, if the threading library is not linked, the sequential execution is on
1048	the main stack. Hence, the entire check is skipped because the weak-symbol
1049	function is loaded. Therefore, main thread cancellation is unconditionally
1050	performed.
1051
1052	Regardless of how the stack is chosen, the stop function and parameter are
1053	passed to the forced-unwind function. The general pattern of all three stop
1054	functions is the same: continue unwinding until the end of stack and
1055	then perform the appropriate transfer.
1056
1057	For main stack cancellation, the transfer is just a program abort.
1058
1059	For coroutine cancellation, the exception is stored on the coroutine's stack,
1060	and the coroutine context switches to its last resumer. The rest is handled on
1061	the backside of the resume, which checks if the resumed coroutine is
1062	cancelled. If cancelled, the exception is retrieved from the resumed coroutine,
1063	and a @CoroutineCancelled@ exception is constructed and loaded with the
1064	cancelled exception. It is then resumed as a regular exception with the default
1065	handler coming from the context of the resumption call.
1066
1067	For thread cancellation, the exception is stored on the thread's main stack and
1068	then context switched to the scheduler. The rest is handled by the thread
1069	joiner. When the join is complete, the joiner checks if the joined thread is
1070	cancelled. If cancelled, the exception is retrieved and the joined thread, and
1071	a @ThreadCancelled@ exception is constructed and loaded with the cancelled
1072	exception. The default handler is passed in as a function pointer. If it is
1073	null (as it is for the auto-generated joins on destructor call), the default is
1074	used, which is a program abort.
1075	%; which gives the required handling on implicate join.

Note: See TracBrowser for help on using the repository browser.

Download in other formats: