1 | \chapter{Generic Types} |
---|
2 | \label{generic-chap} |
---|
3 | |
---|
4 | A significant shortcoming in standard C is the lack of reusable type-safe abstractions for generic data structures and algorithms. |
---|
5 | Broadly speaking, there are three approaches to implement abstract data structures in C. |
---|
6 | One approach is to write bespoke data structures for each context in which they are needed. |
---|
7 | While this approach is flexible and supports integration with the C type checker and tooling, it is also tedious and error prone, especially for more complex data structures. |
---|
8 | A second approach is to use !void*!-based polymorphism, \eg{} the C standard library functions !bsearch! and !qsort!, which allow for the reuse of common functionality. |
---|
9 | However, basing all polymorphism on !void*! eliminates the type checker's ability to ensure that argument types are properly matched, often requiring a number of extra function parameters, pointer indirection, and dynamic allocation that is otherwise not needed. |
---|
10 | A third approach to generic code is to use preprocessor macros, which does allow the generated code to be both generic and type checked, but errors in such code may be difficult to locate and debug. |
---|
11 | Furthermore, writing and using preprocessor macros is unnatural and inflexible. |
---|
12 | Figure~\ref{bespoke-generic-fig} demonstrates the bespoke approach for a simple linked list with !insert! and !head! operations, while Figure~\ref{void-generic-fig} and Figure~\ref{macro-generic-fig} show the same example using !void*!- and !#define!-based polymorphism, respectively. |
---|
13 | |
---|
14 | \begin{figure} |
---|
15 | \begin{cfa} |
---|
16 | struct int_list { int value; struct int_list* next; }; |
---|
17 | |
---|
18 | void int_list_insert( struct int_list** ls, int x ) { |
---|
19 | struct int_list* node = malloc(sizeof(struct int_list)); |
---|
20 | node->value = x; node->next = *ls; |
---|
21 | *ls = node; |
---|
22 | } |
---|
23 | |
---|
24 | int int_list_head( const struct int_list* ls ) { return ls->value; } |
---|
25 | |
---|
26 | $\C[\textwidth]{// all code must be duplicated for every generic instantiation}$ |
---|
27 | |
---|
28 | struct string_list { const char* value; struct string_list* next; }; |
---|
29 | |
---|
30 | void string_list_insert( struct string_list** ls, const char* x ) { |
---|
31 | struct string_list* node = malloc(sizeof(struct string_list)); |
---|
32 | node->value = x; node->next = *ls; |
---|
33 | *ls = node; |
---|
34 | } |
---|
35 | |
---|
36 | const char* string_list_head( const struct string_list* ls ) |
---|
37 | { return ls->value; } |
---|
38 | |
---|
39 | $\C[\textwidth]{// use is efficient and idiomatic}$ |
---|
40 | |
---|
41 | int main() { |
---|
42 | struct int_list* il = NULL; |
---|
43 | int_list_insert( &il, 42 ); |
---|
44 | printf("%d\n", int_list_head(il)); |
---|
45 | |
---|
46 | struct string_list* sl = NULL; |
---|
47 | string_list_insert( &sl, "hello" ); |
---|
48 | printf("%s\n", string_list_head(sl)); |
---|
49 | } |
---|
50 | \end{cfa} |
---|
51 | |
---|
52 | \caption{Bespoke code for linked list implementation.} \label{bespoke-generic-fig} |
---|
53 | \end{figure} |
---|
54 | |
---|
55 | \begin{figure} |
---|
56 | \begin{cfa} |
---|
57 | // single code implementation |
---|
58 | |
---|
59 | struct list { void* value; struct list* next; }; |
---|
60 | |
---|
61 | $\C[\textwidth]{// internal memory management requires helper functions}$ |
---|
62 | |
---|
63 | void list_insert( struct list** ls, void* x, void* (*copy)(void*) ) { |
---|
64 | struct list* node = malloc(sizeof(struct list)); |
---|
65 | node->value = copy(x); node->next = *ls; |
---|
66 | *ls = node; |
---|
67 | } |
---|
68 | |
---|
69 | void* list_head( const struct list* ls ) { return ls->value; } |
---|
70 | |
---|
71 | $\C[\textwidth]{// helpers duplicated per type}$ |
---|
72 | |
---|
73 | void* int_copy(void* x) { |
---|
74 | int* n = malloc(sizeof(int)); |
---|
75 | *n = *(int*)x; |
---|
76 | return n; |
---|
77 | } |
---|
78 | |
---|
79 | void* string_copy(void* x) { return strdup((const char*)x); } |
---|
80 | |
---|
81 | int main() { |
---|
82 | struct list* il = NULL; |
---|
83 | int i = 42; |
---|
84 | list_insert( &il, &i, int_copy ); |
---|
85 | printf("%d\n", *(int*)list_head(il)); $\C[2in]{// unsafe type cast}$ |
---|
86 | |
---|
87 | struct list* sl = NULL; |
---|
88 | list_insert( &sl, "hello", string_copy ); |
---|
89 | printf("%s\n", (char*)list_head(sl)); $\C[2in]{// unsafe type cast}$ |
---|
90 | } |
---|
91 | \end{cfa} |
---|
92 | |
---|
93 | \caption{\lstinline{void*}-polymorphic code for linked list implementation.} \label{void-generic-fig} |
---|
94 | \end{figure} |
---|
95 | |
---|
96 | \begin{figure} |
---|
97 | \begin{cfa} |
---|
98 | $\C[\textwidth]{// code is nested in macros}$ |
---|
99 | |
---|
100 | #define list(N) N ## _list |
---|
101 | |
---|
102 | #define list_insert(N) N ## _list_insert |
---|
103 | |
---|
104 | #define list_head(N) N ## _list_head |
---|
105 | |
---|
106 | #define define_list(N, T) $\C[0.25in]{ \textbackslash }$ |
---|
107 | struct list(N) { T value; struct list(N)* next; }; $\C[0.25in]{ \textbackslash }$ |
---|
108 | $\C[0.25in]{ \textbackslash }$ |
---|
109 | void list_insert(N)( struct list(N)** ls, T x ) { $\C[0.25in]{ \textbackslash }$ |
---|
110 | struct list(N)* node = malloc(sizeof(struct list(N))); $\C[0.25in]{ \textbackslash }$ |
---|
111 | node->value = x; node->next = *ls; $\C[0.25in]{ \textbackslash }$ |
---|
112 | *ls = node; $\C[0.25in]{ \textbackslash }$ |
---|
113 | } $\C[0.25in]{ \textbackslash }$ |
---|
114 | $\C[0.25in]{ \textbackslash }$ |
---|
115 | T list_head(N)( const struct list(N)* ls ) { return ls->value; } |
---|
116 | |
---|
117 | define_list(int, int); $\C[3in]{// defines int\_list}$ |
---|
118 | define_list(string, const char*); $\C[3in]{// defines string\_list}$ |
---|
119 | |
---|
120 | $\C[\textwidth]{// use is efficient, but syntactically idiosyncratic}$ |
---|
121 | |
---|
122 | int main() { |
---|
123 | struct list(int)* il = NULL; $\C[3in]{// does not match compiler-visible name}$ |
---|
124 | list_insert(int)( &il, 42 ); |
---|
125 | printf("%d\n", list_head(int)(il)); |
---|
126 | |
---|
127 | struct list(string)* sl = NULL; |
---|
128 | list_insert(string)( &sl, "hello" ); |
---|
129 | printf("%s\n", list_head(string)(sl)); |
---|
130 | } |
---|
131 | \end{cfa} |
---|
132 | |
---|
133 | \caption{Macros for generic linked list implementation.} \label{macro-generic-fig} |
---|
134 | \end{figure} |
---|
135 | |
---|
136 | \CC{}, Java, and other languages use \emph{generic types} to produce type-safe abstract data types. |
---|
137 | Design and implementation of generic types for \CFA{} is the first major contribution of this thesis, a summary of which is published in \cite{Moss18}. |
---|
138 | \CFA{} generic types integrate efficiently and naturally with the existing polymorphic functions in \CFA{}, while retaining backward compatibility with C in layout and support for separate compilation. |
---|
139 | A generic type can be declared in \CFA{} by placing a !forall! specifier on a !struct! or !union! declaration, and instantiated using a parenthesized list of types after the generic name. |
---|
140 | An example comparable to the C polymorphism examples in Figures~\ref{bespoke-generic-fig}, \ref{void-generic-fig}, and \ref{macro-generic-fig} can be seen in Figure~\ref{cfa-generic-fig} \TODO{test this code}. |
---|
141 | |
---|
142 | \begin{figure} |
---|
143 | \begin{cfa} |
---|
144 | forall(otype T) struct list { T value; list(T)* next; }; |
---|
145 | |
---|
146 | $\C[\textwidth]{// single polymorphic implementation of each function}$ |
---|
147 | $\C[\textwidth]{// overloading reduces need for namespace prefixes}$ |
---|
148 | |
---|
149 | forall(otype T) void insert( list(T)** ls, T x ) { |
---|
150 | list(T)* node = alloc(); $\C{// type-inferring alloc}$ |
---|
151 | (*node){ x, *ls }; $\C{// concise constructor syntax}$ |
---|
152 | *ls = node; |
---|
153 | } |
---|
154 | |
---|
155 | forall(otype T) T head( const list(T)* ls ) { return ls->value; } |
---|
156 | |
---|
157 | $\C[\textwidth]{// use is clear and efficient}$ |
---|
158 | |
---|
159 | int main() { |
---|
160 | list(int)* il = 0; |
---|
161 | insert( &il, 42 ); $\C{// inferred polymorphic T}$ |
---|
162 | printf("%d\n", head(il)); |
---|
163 | |
---|
164 | list(const char*)* sl = 0; |
---|
165 | insert( &sl, "hello" ); |
---|
166 | printf("%s\n", head(sl)); |
---|
167 | } |
---|
168 | \end{cfa} |
---|
169 | |
---|
170 | \caption{\CFA{} generic linked list implementation.} \label{cfa-generic-fig} |
---|
171 | \end{figure} |
---|
172 | |
---|
173 | \section{Design} |
---|
174 | |
---|
175 | Though a number of languages have some implementation of generic types, backward compatibility with both C and existing \CFA{} polymorphism presented some unique design constraints for this project. |
---|
176 | The guiding principle was to maintain an unsurprising language model for C programmers without compromising runtime efficiency. |
---|
177 | A key insight for this design was that C already possesses a handful of built-in generic types (\emph{compound types} in the language of the standard\cit{}), notably pointer (!T*!) and array (!T[]!), and that user-definable generics should act similarly. |
---|
178 | |
---|
179 | \subsection{Related Work} |
---|
180 | |
---|
181 | One approach to the design of generic types is that taken by \CC{} templates\cite{C++}. |
---|
182 | The template approach is closely related to the macro-expansion approach to C polymorphism demonstrated in Figure~\ref{macro-generic-fig}, but where the macro-expansion syntax has been given first-class language support. |
---|
183 | Template expansion has the benefit of generating code with near-optimal runtime efficiency, as distinct optimizations can be applied for each instantiation of the template. |
---|
184 | On the other hand, template expansion can also lead to significant code bloat, exponential in the worst case\cit{}, and the costs of increased instruction cache pressure at runtime and wasted developer time when compiling cannot be discounted. |
---|
185 | The most significant restriction of the \CC{} template model is that it breaks separate compilation and C's translation-unit-based encapsulation mechanisms. |
---|
186 | Because a \CC{} template is not actually code, but rather a sort of ``recipe'' to generate code, template code must be visible at its call site to be used. |
---|
187 | Furthermore, \CC{} template code cannot be type-checked without instantiating it, a time consuming process with no hope of improvement until \CC{} concepts\cite{C++Concepts} are standardized in \CCtwenty{}. |
---|
188 | C code, by contrast, only needs a !struct! or function declaration to call that function or use (by-pointer) values of that type, a desirable property to maintain for \CFA{}. |
---|
189 | |
---|
190 | Java\cite{Java8} has another prominent implementation for generic types, introduced in Java~5 and based on a significantly different approach than \CC{}. |
---|
191 | The Java approach has much more in common with the !void*!-polymorphism shown in Figure~\ref{void-generic-fig}; since in Java nearly all data is stored by reference, the Java approach to polymorphic data is to store pointers to arbitrary data and insert type-checked implicit casts at compile-time. |
---|
192 | This process of \emph{type erasure} has the benefit of allowing a single instantiation of polymorphic code, but relies heavily on Java's object model and garbage collector. |
---|
193 | To use this model, a more C-like language such as \CFA{} would be required to dynamically allocate internal storage for variables, track their lifetime, and properly clean them up afterward. |
---|
194 | |
---|
195 | Cyclone\cite{Grossman06} is another language extending C, and also provides capabilities for polymorphic functions and existential types, similar to \CFA{}'s !forall! functions and generic types. |
---|
196 | Cyclone existential types can include function pointers in a construct similar to a virtual function table, but these pointers must be explicitly initialized at some point in the code, which is tedious and error-prone compared to \CFA{}'s implicit assertion satisfaction. |
---|
197 | Furthermore, Cyclone's polymorphic functions and types are restricted to abstraction over types with the same layout and calling convention as !void*!, \ie{} only pointer types and !int!. |
---|
198 | In the \CFA{} terminology discussed in Section~\ref{generic-impl-sec}, all Cyclone polymorphism must be dtype-static. |
---|
199 | While the Cyclone polymorphism design provides the efficiency benefits discussed in Section~\ref{dtype-static-sec} for dtype-static polymorphism, it is more restrictive than the more general model of \CFA{}. |
---|
200 | |
---|
201 | Many other languages include some form of generic types. |
---|
202 | As a brief survey, ML\cite{ML} was the first language to support parameteric polymorphism, but unlike \CFA{} does not support the use of assertions and traits to constrain type arguments. |
---|
203 | Haskell\cite{Haskell10} combines ML-style polymorphism with the notion of type classes, similar to \CFA{} traits, but requiring an explicit association with their implementing types, unlike \CFA{}. |
---|
204 | Objective-C\cite{obj-c-book} is an extension to C which has had some industrial success; however, it did not support type-checked generics until recently\cite{xcode7}, and it's garbage-collected, message-passing object-oriented model is a radical departure from C. |
---|
205 | Go\cite{Go}, and Rust\cite{Rust} are modern compiled languages with abstraction features similar to \CFA{} traits, \emph{interfaces} in Go and \emph{traits} in Rust. |
---|
206 | Go has implicit interface implementation and uses a ``fat pointer'' construct to pass polymorphic objects to functions, similar in principle to \CFA{}'s implicit forall paramters. |
---|
207 | Go does not, however, allow user code to define generic types, restricting Go programmers to the small set of generic types defined by the compiler. |
---|
208 | Rust has powerful abstractions for generic programming, including explicit implemenation of traits and options for both separately-compiled virtual dispatch and template-instantiated static dispatch in functions. |
---|
209 | On the other hand, the safety guarantees of Rust's \emph{lifetime} abstraction and borrow checker impose a distinctly idiosyncratic programming style and steep learning curve; \CFA{}, with its more modest safety features, allows direct ports of C code while maintaining the idiomatic style of the original source. |
---|
210 | |
---|
211 | \subsection{\CFA{} Generics} |
---|
212 | |
---|
213 | The generic types design in \CFA{} draws inspiration from both \CC{} and Java generics, capturing the better aspects of each. |
---|
214 | Like \CC{} template types, generic !struct!s and !union!s in \CFA{} have macro-expanded storage layouts, but, like Java generics, \CFA{} generic types can be used with separately-compiled polymorphic functions without requiring either the type or function definition to be visible to the other. |
---|
215 | The fact that the storage layout of any instantiation of a \CFA{} generic type is identical to that of the monomorphic type produced by simple macro replacement of the generic type parameters is important to provide consistent and predictable runtime performance, and to not impose any undue abstraction penalty on generic code. |
---|
216 | As an example, consider the following generic type and function \TODO{test this}: |
---|
217 | |
---|
218 | \begin{cfa} |
---|
219 | forall( otype R, otype S ) struct pair { R first; S second; }; |
---|
220 | |
---|
221 | pair(const char*, int) with_len( const char* s ) { |
---|
222 | return (pair(const char*), int){ s, strlen(s) }; |
---|
223 | } |
---|
224 | \end{cfa} |
---|
225 | |
---|
226 | In this example, !with_len! is defined at the same scope as !pair!, but it could be called from any context that can see the definition of !pair! and a declaration of !with_len!. |
---|
227 | If its return type was !pair(const char*, int)*!, callers of !with_len! would only need the declaration !forall(otype R, otype S) struct pair! to call it, in accordance with the usual C rules for opaque types. |
---|
228 | |
---|
229 | !with_len! is itself a monomorphic function, returning a type that is structurally identical to !struct { const char* first; int second; }!, and as such could be called from C given an appropriate redeclaration and demangling flags. |
---|
230 | However, the definition of !with_len! depends on a polymorphic function call to the !pair! constructor, which only needs to be written once (in this case, implicitly by the compiler according to the usual \CFA{} constructor generation\cite{Moss18}) and can be re-used for a wide variety of !pair! instantiations. |
---|
231 | Since the parameters to this polymorphic constructor call are all statically known, compiler inlining can eliminate any runtime overhead of this polymorphic call. |
---|
232 | |
---|
233 | \CFA{} deliberately does not support \CC{}-style partial specializations of generic types. |
---|
234 | A particularly infamous example in the \CC{} standard library is !vector<bool>!, which is represented as a bitstring rather than the array representation of the other !vector! instantiations. |
---|
235 | Complications from this inconsistency (chiefly the fact that a single bit is not addressable, unlike an array element) make the \CC{} !vector! unpleasant to use in generic contexts due to the break in its public interface. |
---|
236 | Rather than attempting to plug leaks in the template specialization abstraction with a detailed method interface, \CFA{} takes the more principled position that two types with an unrelated data layout are in fact unrelated types, and should be handled with different code. |
---|
237 | Of course, to the degree that distinct types are similar enough to share an interface, the \CFA{} !trait! system allows one to be defined, and objects of types implementing that !trait! to be operated on in the same polymorphic functions. |
---|
238 | |
---|
239 | Since \CFA{} polymorphic functions can operate over polymorphic generic types, functions over such types can be partially or completely specialized using the usual overload selection rules. |
---|
240 | As an example, the !with_len! function above could be an optimization of the following more general function: |
---|
241 | |
---|
242 | \begin{cfa} |
---|
243 | forall(otype T, otype I | { I len(T); }) |
---|
244 | pair(T, I) with_len( T s ) { |
---|
245 | return (pair(T,I)){ s, len(s) }; |
---|
246 | } |
---|
247 | \end{cfa} |
---|
248 | |
---|
249 | \CFA{} generic types also support the type constraints from !forall! functions. |
---|
250 | For example, the following declaration of a sorted set type ensures that the set key implements equality and relational comparison: |
---|
251 | |
---|
252 | \begin{cfa} |
---|
253 | forall(otype Key | { int ?==?(Key, Key); int ?<?(Key, Key); }) struct sorted_set; |
---|
254 | \end{cfa} |
---|
255 | |
---|
256 | These constraints are implemented by applying equivalent constraints to the compiler-generated constructors for this type. |
---|
257 | |
---|
258 | \section{Implementation} \label{generic-impl-sec} |
---|
259 | |
---|
260 | The ability to use generic types in polymorphic contexts means that the \CFA{} implementation in \CFACC{} must support a mechanism for accessing fields of generic types dynamically at runtime. |
---|
261 | While \CFACC{} could in principle use this same mechanism for accessing fields of all generic types, such an approach would throw away compiler knowledge of static types and impose an unnecessary runtime cost, limiting the utility of the generic type design. |
---|
262 | Instead, my design for generic type support in \CFACC{} distinguishes between \emph{concrete} generic types that have a fixed memory layout regardless of type parameters and \emph{dynamic} generic types that may vary in memory layout depending on their type parameters. |
---|
263 | A \emph{dtype-static} type has polymorphic parameters but is still concrete. |
---|
264 | Polymorphic pointers are an example of dtype-static types; given some type variable !T!, T is a polymorphic type, but !T*! has a fixed size and can therefore be represented by a !void*! in code generation. |
---|
265 | In particular, generic types where all parameters are un-!sized! (\ie{} they do not conform to the built-in !sized! trait because the compiler does not know their size and alignment) are always concrete, as there is no possibility for their layout to vary based on type parameters of unknown size and alignment. |
---|
266 | More precisely, a type is concrete if and only if all of its !sized! type parameters are concrete, and a concrete type is dtype-static if any of its type parameters are (possibly recursively) polymorphic. |
---|
267 | To illustrate, the following code using the !pair! type from above \TODO{test this} has each use of !pair! commented with its class: |
---|
268 | |
---|
269 | \begin{cfa} |
---|
270 | //dynamic, layout varies based on T |
---|
271 | forall(otype T) T value( pair(const char*, T) p ) { return p.second; } |
---|
272 | |
---|
273 | // dtype-static, F* and T* are concrete but recursively polymorphic |
---|
274 | forall(dtype F, otype T) T value( pair(F*, T*) ) { return *p.second; } |
---|
275 | |
---|
276 | pair(const char*, int) p = {"magic", 42}; $\C[2.5in]{// concrete}$ |
---|
277 | int i = value(p); |
---|
278 | pair(void*, int*) q = {0, &p.second}; $\C[2.5in]{// concrete}$ |
---|
279 | i = value(q); |
---|
280 | double d = 1.0; |
---|
281 | pair(double*, double*) r = {&d, &d}; $\C[2.5in]{// concrete}$ |
---|
282 | d = value(r); |
---|
283 | \end{cfa} |
---|
284 | |
---|
285 | \subsection{Concrete Generic Types} |
---|
286 | |
---|
287 | The \CFACC{} translator template expands concrete generic types into new structure types, affording maximal inlining. |
---|
288 | To enable interoperation among equivalent instantiations of a generic type, \CFACC{} saves the set of instantiations currently in scope and reuses the generated structure declarations where appropriate. |
---|
289 | In particular, tuple types are implemented as a single compiler-generated generic type definition per tuple arity, and can be instantiated and reused according to the usual rules for generic types. |
---|
290 | A function declaration that accepts or returns a concrete generic type produces a declaration for the instantiated structure in the same scope, which all callers may reuse. |
---|
291 | As an example, the concrete instantiation for !pair(const char*, int)! is\footnote{This omits the field name mangling performed by \CFACC{} for overloading purposes.\label{mangle-foot}} |
---|
292 | |
---|
293 | \begin{cfa} |
---|
294 | struct _pair_conc0 { const char * first; int second; }; |
---|
295 | \end{cfa} |
---|
296 | |
---|
297 | A concrete generic type with dtype-static parameters is also expanded to a structure type, but this type is used for all matching instantiations. |
---|
298 | In the example above, the !pair(F*, T*)! parameter to !value! is such a type; its expansion is below\footref{mangle-foot}, and it is used as the type of the variables !q! and !r! as well, with casts for member access where appropriate. |
---|
299 | |
---|
300 | \begin{cfa} |
---|
301 | struct _pair_conc1 { void* first; void* second; }; |
---|
302 | \end{cfa} |
---|
303 | |
---|
304 | \subsection{Dynamic Generic Types} |
---|
305 | |
---|
306 | In addition to this efficient implementation of concrete generic types, \CFA{} also offers flexibility with powerful support for dynamic generic types. |
---|
307 | In the pre-existing compiler design, !otype! (and all !sized!) type parameters come with implicit size and alignment parameters provided by the caller. |
---|
308 | The design for generic types presented here adds an \emph{offset array} containing structure-member offsets for dynamic generic !struct! types. |
---|
309 | A dynamic generic !union! needs no such offset array, as all members are at offset 0, but size and alignment are still necessary. |
---|
310 | Access to members of a dynamic structure is provided at runtime via base displacement addressing the structure pointer and the member offset (similar to the !offsetof! macro), moving a compile-time offset calculation to runtime. |
---|
311 | |
---|
312 | the offset arrays are statically generated where possible. |
---|
313 | If a dynamic generic type is passed or returned by value from a polymorphic function, \CFACC{} can safely assume that the generic type is complete (\ie{} has a known layout) at any call site, and the offset array is passed from the caller; if the generic type is concrete at the call site, the elements of this offset array can even be statically generated using the C !offsetof! macro. |
---|
314 | As an example, the body of the second !value! function above is implemented as |
---|
315 | |
---|
316 | \begin{cfa} |
---|
317 | _assign_T( _retval, p + _offsetof_pair[1] ); $\C[2in]{// return *p.second}$ |
---|
318 | \end{cfa} |
---|
319 | |
---|
320 | Here, !_assign_T! is passed in as an implicit parameter from !otype T! and takes two !T*! (!void*! in the generated code), a destination and a source, and !_retval! is the pointer to a caller-allocated buffer for the return value, the usual \CFA{} method to handle dynamically-sized return types. |
---|
321 | !_offsetof_pair! is the offset array passed into !value!; this array is generated at the call site as |
---|
322 | |
---|
323 | \begin{cfa} |
---|
324 | size_t _offsetof_pair[] = {offsetof(_pair_conc0, first), offsetof(_pair_conc0, second)}; |
---|
325 | \end{cfa} |
---|
326 | |
---|
327 | \subsubsection{Layout Functions} |
---|
328 | |
---|
329 | In some cases, the offset arrays cannot be statically generated. |
---|
330 | For instance, modularity is generally provided in C by including an opaque forward declaration of a structure and associated accessor and mutator functions in a header file, with the actual implementations in a separately-compiled \texttt{.c} file. |
---|
331 | \CFA{} supports this pattern for generic types, implying that the caller of a polymorphic function may not know the actual layout or size of a dynamic generic type and only holds it by pointer. |
---|
332 | \CFACC{} automatically generates \emph{layout functions} for cases where the size, alignment, and offset array of a generic struct cannot be passed into a function from that functions's caller. |
---|
333 | These layout functions take as arguments pointers to size and alignment variables and a caller-allocated array of member offsets, as well as the size and alignment of all !sized! parameters to the generic structure. |
---|
334 | Un!sized! parameters not passed because they are forbidden from being used in a context that affects layout by C's usual rules about incomplete types. |
---|
335 | Similarly, the layout function can only safely be called from a context where the generic type definition is visible, because otherwise the caller will not know how large to allocate the array of member offsets. |
---|
336 | |
---|
337 | The C standard does not specify a memory layout for structs, but the POSIX ABI for x86\cit{} does; this memory layout is common for C implementations, but is a platform-specific issue for porting \CFA{}. |
---|
338 | This algorithm, sketched below in pseudo-\CFA{}, is a straightforward mapping of consecutive fields into the first properly-aligned offset in the !struct! layout; layout functions for !union! types omit the offset array and simply calculate the maximum size and alignment over all union variants. |
---|
339 | Since \CFACC{} generates a distinct layout function for each type, constant-folding and loop unrolling are applied. |
---|
340 | |
---|
341 | \begin{cfa} |
---|
342 | forall(dtype T1, dtype T2, ... | sized(T1) | sized(T2) | ...) |
---|
343 | void layout(size_t* size, size_t* align, size_t* offsets) { |
---|
344 | // initialize values |
---|
345 | *size = 0; *align = 1; |
---|
346 | // set up members |
---|
347 | for ( int i = 0; i < n_fields; ++i ) { |
---|
348 | // pad to alignment |
---|
349 | size_t off_align = *size % alignof(field[i]); |
---|
350 | if ( off_align != 0 ) { *size += alignof(field[i]) - off_align; } |
---|
351 | // mark member, increase size, and fix alignment |
---|
352 | offsets[i] = *size; |
---|
353 | *size += sizeof(field[i]); |
---|
354 | if ( *align < alignof(field[i]) ) { *align = alignof(field[i]); } |
---|
355 | } |
---|
356 | // final padding to alignment |
---|
357 | size_t off_align = *size % *align; |
---|
358 | if ( off_align != 0 ) { *size += *align - off_align; } |
---|
359 | } |
---|
360 | \end{cfa} |
---|
361 | |
---|
362 | Results of layout function calls are cached so that they are only computed once per type per function. |
---|
363 | Layout functions also allow generic types to be used in a function definition without reflecting them in the function signature, an important implemenation-hiding constraint of the design. |
---|
364 | For instance, a function that strips duplicate values from an unsorted !list(T)! likely has a reference to the list as its only explicit parameter, but uses some sort of !set(T)! internally to test for duplicate values. |
---|
365 | This function could acquire the layout for !set(T)! by calling its layout function, providing as an argument the layout of !T! implicitly passed into that function. |
---|
366 | |
---|
367 | Whether a type is concrete, dtype-static, or dynamic is decided solely on the basis of the type arguments and !forall! clause type paramters. |
---|
368 | This design allows opaque forward declarations of generic types, \eg{} !forall(otype T) struct Box;! like in C, all uses of $Box(T)$ can be separately compiled, and callers from other translation units know the proper calling conventions to use. |
---|
369 | In an alternate design where the definition of a structure type is included in deciding whether a generic type is dynamic or concrete, some further types may be recognized as dtype-static --- \eg{} !Box! could be defined with a body !{ T* p; }!, and would thus not depend on !T! for its layout. |
---|
370 | However, the existence of an !otype! parameter !T! means that !Box! \emph{could} depend on !T! for its layout if this definition is not visible, and we judged preserving separate compilation (and the associated C compatibility) in the implemented design to be an acceptable trade-off. |
---|
371 | |
---|
372 | \subsection{Applications of Dtype-static Types} \label{dtype-static-sec} |
---|
373 | |
---|
374 | The reuse of dtype-static structure instantiations enables useful programming patterns at zero runtime cost. |
---|
375 | The most important such pattern is using !forall(dtype T) T*! as a type-checked replacement for !void*!, \eg{} creating a lexicographic comparison function for pairs of pointers. |
---|
376 | |
---|
377 | \begin{cfa} |
---|
378 | forall(dtype T) |
---|
379 | int lexcmp( pair(T*, T*)* a, pair(T*, T*)* b, int (*cmp)(T*, T*) ) { |
---|
380 | int c = cmp( a->first, b->first ); |
---|
381 | return c ? c : cmp( a->second, b->second ); |
---|
382 | } |
---|
383 | \end{cfa} |
---|
384 | |
---|
385 | Since !pair(T*, T*)! is a concrete type, there are no implicit parameters passed to !lexcmp!; hence, the generated code is identical to a function written in standard C using !void*!, yet the \CFA{} version is type-checked to ensure members of both pairs and arguments to the comparison function match in type. |
---|
386 | |
---|
387 | Another useful pattern enabled by reused dtype-static type instantiations is zero-cost \emph{tag structures}. |
---|
388 | Sometimes, information is only used for type checking and can be omitted at runtime. |
---|
389 | In the example below, !scalar! is a dtype-static type; hence, all uses have a single structure definition containing !unsigned long! and can share the same implementations of common functions like !?+?!. |
---|
390 | These implementations may even be separately compiled, unlike \CC{} template functions. |
---|
391 | However, the \CFA{} type checker ensures matching types are used by all calls to !?+?!, preventing nonsensical computations like adding a length to a volume. |
---|
392 | |
---|
393 | \begin{cfa} |
---|
394 | forall(dtype Unit) struct scalar { unsigned long value; }; |
---|
395 | struct metres {}; |
---|
396 | struct litres {}; |
---|
397 | |
---|
398 | forall(dtype U) scalar(U) ?+?(scalar(U) a, scalar(U) b) { |
---|
399 | return (scalar(U)){ a.value + b.value }; |
---|
400 | } |
---|
401 | |
---|
402 | scalar(metres) half_marathon = { 21098 }; |
---|
403 | scalar(litres) pool = { 2500000 }; |
---|
404 | scalar(metres) marathon = half_marathon + half_marathon; |
---|
405 | `marathon + pool;` $\C[4in]{// compiler ERROR, mismatched types}$ |
---|
406 | \end{cfa} |
---|
407 | |
---|
408 | \section{Performance Experiments} \label{generic-performance-sec} |
---|
409 | |
---|
410 | \TODO{pull benchmarks from Moss et al.} |
---|
411 | |
---|
412 | \section{Future Work} |
---|
413 | |
---|
414 | The generic types design presented here is already sufficiently expressive to implement a variety of useful library types. |
---|
415 | However, some other features based on this design could further improve \CFA{}. |
---|
416 | |
---|
417 | The most pressing addition is the ability to have non-type generic parameters. |
---|
418 | C already supports fixed-length array types, \eg{} !int[10]!; these types are essentially generic types with unsigned integer parameters, and allowing \CFA{} users the capability to build similar types is a requested feature. |
---|
419 | More exotically, the ability to have these non-type parameters depend on dynamic runtime values rather than static compile-time constants opens up interesting opportunities for type-checking problematic code patterns. |
---|
420 | For example, if a collection iterator was parameterized over the pointer to the collection it was drawn from, then a sufficiently powerful static analysis pass could ensure that that iterator was only used for that collection, eliminating one source of hard-to-find bugs. |
---|
421 | |
---|
422 | The implementation mechanisms behind this generic types design can also be used to add new features to \CFA{}. |
---|
423 | One such potential feature would be to add \emph{field assertions} to the existing function and variable assertions on polymorphic type variables. |
---|
424 | Implementation of these field assertions would be based on the same code that supports member access by dynamic offset calculation for dynamic generic types. |
---|
425 | Simulating field access can already be done more flexibly in \CFA{} by declaring a trait containing an accessor function to be called from polymorphic code, but these accessor functions impose some overhead both to write and call, and directly providing field access via an implicit offset parameter would be both more concise and more efficient. |
---|
426 | Of course, there are language design trade-offs to such an approach, notably that providing the two similar features of field and function assertions would impose a burden of choice on programmers writing traits, with field assertions more efficient, but function assertions more general; given this open design question we have deferred a decision on field assertions until we have more experience using \CFA{}. |
---|
427 | If field assertions are included in the language, a natural extension would be to provide a structural inheritance mechanism for every !struct! type that simply turns the list of !struct! fields into a list of field assertions, allowing monomorphic functions over that type to be generalized to polymorphic functions over other similar types with added or reordered fields. |
---|
428 | \CFA{} could also support a packed or otherwise size-optimized representation for generic types based on a similar mechanism --- the layout function would need to be re-written, but nothing in the use of the offset arrays implies that the field offsets need be monotonically increasing. |
---|
429 | |
---|
430 | With respect to the broader \CFA{} polymorphism design, the experimental results in Section~\ref{generic-performance-sec} demonstrate that though the runtime impact of \CFA{}'s dynamic virtual dispatch is low, it is not as low as the static dispatch of \CC{} template inlining. |
---|
431 | However, rather than subject all \CFA{} users to the compile-time costs of ubiquitous template expansion, we are considering more targeted mechanisms for performance-sensitive code. |
---|
432 | Two promising approaches are are an !inline! annotation at polymorphic function call sites to create a template specialization of the function (provided the code is visible) or placing a different !inline! annotation on polymorphic function definitions to instantiate a specialized version of the function for some set of types. |
---|
433 | These approaches are not mutually exclusive and allow performance optimizations to be applied only when necessary, without suffering global code bloat. |
---|
434 | In general, the \CFA{} team believes that separate compilation works well with loaded hardware caches by producing smaller code, which may offset the benefit of larger inlined code. |
---|