source: doc/proposals/virtual.txt@ 5da5a96

ADT aaron-thesis arm-eh ast-experimental cleanup-dtors deferred_resn demangler enum forall-pointer-decay jacob/cs343-translation jenkins-sandbox new-ast new-ast-unique-expr new-env no_list persistent-indexer pthread-emulation qualifiedEnum with_gc
Last change on this file since 5da5a96 was c3b96677, checked in by Andrew Beach <ajbeach@…>, 8 years ago

Some ramblings for whoever picks up the work I left behind.

  • Property mode set to 100644
File size: 12.7 KB
RevLine 
[63c2bca]1Proposal for virtual functionality
2
[d49bfa8]3There are two types of virtual inheritance in this proposal, relaxed
[c72f9fd]4(implicit) and strict (explicit). Relaxed is the simpler case that uses the
[d49bfa8]5existing trait system with the addition of trait references and vtables.
[c72f9fd]6Strict adds some constraints and requires some additional notation but allows
[d49bfa8]7for down-casting.
8
9Relaxed Virtual Inheritance:
10
[63c2bca]11Imagine the following code :
12
13trait drawable(otype T) {
14 void draw(T* );
15};
16
17struct text {
18 char* text;
19};
20
21void draw(text*);
22
23struct line{
24 vec2 start;
25 vec2 end;
26};
27
28void draw(line*);
29
[c72f9fd]30While all the members of this simple UI support drawing, creating a UI that
31easily supports both these UI requires some tedious boiler-plate code:
[63c2bca]32
33enum type_t { text, line };
34
35struct widget {
36 type_t type;
37 union {
38 text t;
39 line l;
40 };
41};
42
43void draw(widget* w) {
44 switch(w->type) {
45 case text : draw(&w->text); break;
[5a0735ac]46 case line : draw(&w->line); break;
[63c2bca]47 default : handle_error(); break;
48 }
49}
50
[c72f9fd]51While this code will work as implemented, adding any new widgets or any new
[d49bfa8]52widget behaviors requires changing existing code to add the desired
53functionality. To ease this maintenance effort required CFA introduces the
54concept of trait references.
[63c2bca]55
[d49bfa8]56Using trait references to implement the above gives the following :
[63c2bca]57
[d49bfa8]58trait drawable objects[10];
[63c2bca]59fill_objects(objects);
60
61while(running) {
[d49bfa8]62 for(drawable object : objects) {
[63c2bca]63 draw(object);
64 }
65}
66
[d49bfa8]67The keyword trait is optional (by the same rules as the struct keyword). This
[e1e4aa9]68is not currently supported in CFA and the lookup is not possible to implement
[d49bfa8]69statically. Therefore we need to add a new feature to handle having dynamic
70lookups like this.
[63c2bca]71
[d49bfa8]72What we really want to do is express the fact that calling draw() on a trait
73reference should find the underlying type of the given parameter and find how
74it implements the routine, as in the example with the enumeration and union.
[63c2bca]75
[d49bfa8]76For instance specifying that the drawable trait reference looks up the type
[e1e4aa9]77of the first argument to find the implementation would be :
[63c2bca]78
79trait drawable(otype T) {
80 void draw(virtual T* );
81};
82
[d49bfa8]83This could be implied in simple cases like this one (single parameter on the
84trait and single generic parameter on the function). In more complex cases it
[c72f9fd]85would have to be explicitly given, or a strong convention would have to be
[e1e4aa9]86enforced (e.g. implementation of trait functions is always drawn from the
[c72f9fd]87first polymorphic parameter).
[63c2bca]88
[d49bfa8]89Once a function in a trait has been marked as virtual it defines a new
90function that takes in that trait's reference and then dynamically calls the
[e1e4aa9]91underlying type implementation. Hence a trait reference becomes a kind of
[d49bfa8]92abstract type, cannot be directly instantiated but can still be used.
[63c2bca]93
[d49bfa8]94One of the limitations of this design is that it does not support double
95dispatching, which concretely means traits cannot have routines with more than
96one virtual parameter. The program must have a single table to look up the
97function on. Using trait references with traits with more than one parameter
[c72f9fd]98is also restricted, initially forbidden, see extension.
[63c2bca]99
[c72f9fd]100Extension: Multi-parameter Virtual Traits:
[63c2bca]101
[e1e4aa9]102This implementation can be extended to traits with multiple parameters if
[d49bfa8]103one is called out as being the virtual trait. For example :
[63c2bca]104
[d49bfa8]105trait iterator(otype T, dtype Item) {
106 Maybe(Item) next(virtual T *);
107}
[63c2bca]108
[d49bfa8]109iterator(int) generators[10];
[63c2bca]110
[d49bfa8]111Which creates a collection of iterators that produce integers, regardless of
112how those iterators are implemented. This may require a note that this trait
113is virtual on T and not Item, but noting it on the functions may be enough.
[da81e1d0]114
[63c2bca]115
[d49bfa8]116Strict Virtual Inheritance:
[63c2bca]117
[d49bfa8]118One powerful feature relaxed virtual does not support is the idea of down
119casting. Once something has been converted into a trait reference there is
120very little we can do to recover and of the type information, only the trait's
121required function implementations are kept.
[63c2bca]122
[d49bfa8]123To allow down casting strict virtual requires that all traits and structures
[e1e4aa9]124involved be organized into a tree. Each trait or struct must have a unique
[d49bfa8]125position on this tree (no multiple inheritance).
[63c2bca]126
[d49bfa8]127This is declared as follows :
[63c2bca]128
[d49bfa8]129trait error(otype T) virtual {
130 const char * msg(T *);
[63c2bca]131}
132
[d49bfa8]133trait io_error(otype T) virtual error {
134 FILE * src(T *);
[63c2bca]135}
136
[d49bfa8]137struct eof_error virtual io_error {
138 FILE * fd;
139};
140
141So the trait error is the head of a new tree and io_error is a child of it.
142
[c72f9fd]143Also the parent trait is implicitly part of the assertions of the children,
[d49bfa8]144so all children implement the same operations as the parent. By the unique
[c72f9fd]145path down the tree, we can also uniquely order them so that a prefix of a
146child's vtable has the same format as its parent's.
[d49bfa8]147
148This gives us an important extra feature, runtime checking of the parent-child
[c3b96677]149relationship with virtual cast, where a pointer (and maybe a reference) to
150a virtual type can be cast to another virtual cast. However the cast is
151dynamicly check and only occurs if the underlying type is a child of the type
152being cast to. Otherwise null is returned.
153
154(virtual TYPE)EXPR
155
156As an extention, the TYPE may be ommitted if it can be determained from
157context, for instance if the cast occurs on the right hand side of an
158assignment.
[d49bfa8]159
[e1e4aa9]160Extension: Multiple Parents
[d49bfa8]161
162Although each trait/struct must have a unique position on each tree, it could
163have positions on multiple trees. All this requires is the ability to give
164multiple parents, as here :
165
166trait region(otype T) virtual drawable, collider;
[63c2bca]167
[d49bfa8]168The restriction being, the parents must come from different trees. This
169object (and all of its children) can be cast to either tree. This is handled
[e1e4aa9]170by generating a separate vtable for each tree the structure is in.
[63c2bca]171
[e1e4aa9]172Extension: Multi-parameter Strict Virtual
[d49bfa8]173
[e1e4aa9]174If a trait has multiple parameters then one must be called out to be the one
175we generate separate vtables for, as in :
[d49bfa8]176
177trait example(otype T, otype U) virtual(T) ...
178
[e1e4aa9]179This can generate a separate vtable for each U for which all the T+U
180implementations are provided. These are then separate nodes in the tree (or
181the root of different trees) as if each was created individually. Providing a
182single unique instance of these nodes would be the most difficult aspect of
183this extension, possibly intractable, though with sufficient hoisting and
[c72f9fd]184link-once duplication it may be possible.
[d49bfa8]185
186Example:
187
188trait argument(otype T) virtual {
189 char short_name(virtual T *);
190 bool is_set(virtual T *);
[63c2bca]191};
192
[d49bfa8]193trait value_argument(otype T, otype U) virtual(T) argument {
194 U get_value(virtual T *);
195};
196
[e1e4aa9]197Extension: Structural Inheritance
[d49bfa8]198
199Currently traits must be the internal nodes and structs the leaf nodes.
[e1e4aa9]200Structs could be made internal nodes as well, in which case the child structs
[c72f9fd]201would likely structurally inherit the fields of their parents.
[d49bfa8]202
203
204Storing the Virtual Lookup Table (vtable):
205
[c72f9fd]206We have so far been silent on how the vtable is created, stored and accessed.
[d49bfa8]207
208Creation happens at compile time. Function pointers are found by using the
209same best match rules as elsewhere (additional rules for defaults from the
[e1e4aa9]210parent may or may not be required). For strict virtual this must happen at the
211global scope and forbidding static functions, to ensure that a single unique
[d49bfa8]212vtable is created. Similarly, there may have to be stricter matching rules
213for the functions that go into the vtable, possibly requiring an exact match.
214Relaxed virtual could relax both restrictions, if we allow different vtable
215at different conversion (struct to trait reference) sites. If it is allowed
216local functions being bound to a vtable could cause issues when they go out
217of scope, however this should follow the lifetime rules most C programs
[e1e4aa9]218already follow implicitly.
[d49bfa8]219
220Most vtables should be stored statically, the only exception being some of
221the relaxed vtables that could have local function pointers. These may be able
222to be stack allocated. All vtables should be immutable and require no manual
223cleanup.
224
225Access has two main options:
226
227The first is through the use of fat pointers, or a tuple of pointers. When the
228object is converted to a trait reference, the pointers to its vtables are
229stored along side it.
230
[e1e4aa9]231This allows for compatibility with existing structures (such as those imported
[d49bfa8]232from C) and is the default storage method unless a different one is given.
233
234The other is by inlining the vtable pointer as "intrusive vtables". This adds
[e1e4aa9]235a field to the structure to the vtable. The trait reference then has a single
[d49bfa8]236pointer to this field, the vtable includes an offset to find the beginning of
237the structure again.
238
239This is used if you specify a vtable field in the structure. If given in the
240trait the vtable pointer in the trait reference can then become a single
241pointer to the vtable field and use that to recover the original object
[e1e4aa9]242pointer as well as retrieve all operations.
[d49bfa8]243
244trait drawable(otype T) {
245 vtable drawable;
246};
247
248struct line {
249 vtable drawable;
250 vec2 start;
251 vec2 end;
252};
253
254This inline code allows trait references to be converted to plain pointers
255(although they still must be called specially). The vtable field may just be
256an opaque block of memory or it may allow user access to the vtable. If so
[e1e4aa9]257then there should be some way to retrieve the type of the vtable, which will be
[d49bfa8]258autogenerated and often unique.
259
[c3b96677]260It may be worth looking into a way to force the vtable pointer to be in a
261particular location, which would save the storage to store the offset and
262maybe the offset operation itself (offset = 0). However it may not be worth
263introducing a new language feature for.
264As of writing, exceptions actually use this system.
265
[d49bfa8]266
267Keyword Usage:
268
269It may be desirable to add fewer new keywords than discussed in this proposal.
270It is possible that "virtual" could replace both "vtable" above with
271unambiguous contextual meaning. However, for purposes of clarity in the design
272discussion it is beneficial to keep the keywords for separate concepts distinct.
273
274
275Trait References and Operations:
[63c2bca]276
[d49bfa8]277sizeof(drawable) will return the size of the trait object itself. However :
[63c2bca]278
[d49bfa8]279line a_line;
280drawable widget = a_line;
281sizeof(widget);
[da81e1d0]282
[d49bfa8]283Will instead return the sizeof the underlying object, although the trait must
284require that its implementation is sized for there to be a meaningful value
285to return. You may also get the size of the trait reference with
[da81e1d0]286
[d49bfa8]287sizeof(&widget);
[da81e1d0]288
[d49bfa8]289Calling free on a trait reference will free the memory for the object. It will
290leave the vtables alone, as those are (always?) statically allocated.
[c3b96677]291
292
293Special Traits:
294
295trait is_virtual_parent(dtype parent, dtype child) { ... };
296
297There are others but I believe this one to be the most important. The trait
298holds if the parent type is a strict virtual ancestor (any number of levels)
299of child. It will have to exist at least internally to check for upcasts and
300it can also be used to optimize virtual casts into upcasts. Or a null value or
301error if the cast would never succeed. Exporting it to a special trait allows
302users to express that requirement in their own polymorphic code.
303
304
305Implementation:
306
307Before we can generate any of the nessasary code, the compiler has to get some
308additional information about the code that it currently does not collect.
309
310First it should establish all child->parent links so that it may travel up the
311hierarchy to grab the nessasary information, and create the actual parent
312pointers in the strict virtual tables. It should also maintain the connections
313between the virtual type (structure or trait), the vtable type and the vtable
314instance (or default instance for relaxed virtual if multiple are allowed). To
315this end a sub-node should be created with the nessasary pointers. Traits and
316structs with virtual can create an instance and store all the nessasary data.
317
318With the hierarchy in place it can generate the vtable type for each type,
319it will generally have a function pointer field for each type assertion in
320some consistant order. Strict virtual will also have a pointer to the parent's
321vtable and intrusive vtables will also have the offset to recover the original
322pointer. Sized types will also carry the size.
323
324Wheither the vtable is intrusive or not should also be save so that the trait
325object/reference/pointer knows if it has to store 1 or 2 pointers. A wrapper
326function will have to be generated for each type assertion so that they may
327be called on the trait type, these can probably be inlined.
328
329The virtual parameter will also have to be marked (implicately or explicately)
330until code generation so that the wrapper functions know where to go to get
331the vtable for the function look up. That could probably be added as a
332storageclass, although one that is only valid on type assertions.
333
334The generated vtable will than have to have a vtable instance created and
335filled with all the approprate values. Stricter matching may have to be used
336to ensure that the functions used are stable. It will also have to use
337".gnu.linkonce" or equilant to ensure only one copy exists in the final code
338base.
Note: See TracBrowser for help on using the repository browser.