[fc568163] | 1 | Operator Defaults
|
---|
| 2 | =================
|
---|
| 3 |
|
---|
| 4 | This proposal introduces a new syntax for requesting a default implementation
|
---|
| 5 | of an operator and the rules for generating them.
|
---|
| 6 |
|
---|
| 7 | This is to build on the implied relationships between operators. Most default
|
---|
| 8 | operators will be implemented in terms of other operators. As a fall-back
|
---|
| 9 | operators can try to figure out an intuitive definition from inspection of the
|
---|
| 10 | type's sue declaration.
|
---|
| 11 |
|
---|
| 12 | Syntax
|
---|
| 13 | ------
|
---|
| 14 |
|
---|
| 15 | The syntax for requesting a default implementation is part of function
|
---|
| 16 | definition. After the function signature (after the parameter list) instead
|
---|
| 17 | of putting the function body you declare it to be equal to default.
|
---|
| 18 |
|
---|
| 19 | box ?+?(box const & left, box const & right) = default;
|
---|
| 20 |
|
---|
| 21 | These can exist along side any forward definitions of the function, but would
|
---|
| 22 | conflict with any other complete definitions or deletions of the function.
|
---|
| 23 |
|
---|
| 24 | It could be valid syntax on any function, but possibly all non-operators would
|
---|
| 25 | report that no default implementation available.
|
---|
| 26 |
|
---|
| 27 | If default implementations are really popular and we don't need additional
|
---|
| 28 | information about the signature a more compact syntax could be added.
|
---|
| 29 |
|
---|
| 30 | default box( ?(), ?=?, ?!=?, ?<?, ?>?, ?<=?, ?>=? );
|
---|
| 31 |
|
---|
| 32 | Generation Strategies
|
---|
| 33 | ---------------------
|
---|
| 34 |
|
---|
| 35 | There exists a system around the default generation that selects how to
|
---|
| 36 | generate a given function if one can be generated at all. This section
|
---|
| 37 | describes that system and some of the logic behind it.
|
---|
| 38 |
|
---|
| 39 | There are two main strategies for generating an operator implementation.
|
---|
| 40 |
|
---|
| 41 | The first is to mimic the relationship between operators on the primitive
|
---|
| 42 | types by defining a new operator in terms of an existing operator on the same
|
---|
| 43 | type. For instance `++i` is equivalent to `i += 1`, so the generated
|
---|
| 44 | implementation will "look like" that.
|
---|
| 45 |
|
---|
| 46 | The second is to inspect the structure of the declaration to guess at what
|
---|
| 47 | the default implementation would be. Because of that it is dependent on the
|
---|
| 48 | kind of declaration, a structure doesn't have the same rules as an
|
---|
| 49 | enumeration. Also it is similar to the implicate definitions currently created
|
---|
| 50 | by the compiler and the generation methods for many are carried over.
|
---|
| 51 |
|
---|
| 52 | From these strategies a method of generation (a particular set of rules that
|
---|
| 53 | define an implementation from the type and existing functions) has to be
|
---|
| 54 | picked, but often there is more than on reasonable choice. In these cases they
|
---|
| 55 | are ordered and the first (best) one whose requirements is met is then used.
|
---|
| 56 | See "Circular Requirements" below for some exceptions and extensions to this
|
---|
| 57 | pattern.
|
---|
| 58 |
|
---|
| 59 | Generally the methods based on operators come first as they propagate any
|
---|
| 60 | unusual implementations from the explicate operators to the ones being
|
---|
| 61 | generated. If all of those fail then the intuitive definition based on the
|
---|
| 62 | declaration's shape is used. The general patterns in this area for the
|
---|
| 63 | different sue types follow.
|
---|
| 64 |
|
---|
| 65 | ### Structures
|
---|
| 66 | Structures will usually apply the operation to each field, or when there are
|
---|
| 67 | two parameters the matching pairs of fields from each, and then combine the
|
---|
| 68 | results.
|
---|
| 69 |
|
---|
| 70 | This does require that the fields have certain operators defined on them.
|
---|
| 71 | In this respect it is still operator based generation, but we use inspection
|
---|
| 72 | on the structure to find out which operators to use.
|
---|
| 73 |
|
---|
| 74 | Also, for the purposes of default generation types declared with the
|
---|
| 75 | concurrency modifiers (coroutine, monitor and thread) are considered structs.
|
---|
| 76 | The default implementations should be the same as if you had written out the
|
---|
| 77 | extra field and functions by hand.
|
---|
| 78 |
|
---|
| 79 | ### Enumerations
|
---|
| 80 | The two ways of using enumerations are considered. First as "one of" the
|
---|
| 81 | list options as in normal use, the second is as a set of flags where each
|
---|
| 82 | option represents a flag that may or may not be set.
|
---|
| 83 |
|
---|
| 84 | Currently there is no way to specify which nor does the system attempt to
|
---|
| 85 | guess by checking assigned values. There is one case where an operator could
|
---|
| 86 | have a meaningful default in both versions. If both are included then we can
|
---|
| 87 | try to pick one by scanning the enumeration to see what values its options
|
---|
| 88 | are given (a linear series or powers of 2) could be used. In all other cases
|
---|
| 89 | the definition that makes sense can be assumed.
|
---|
| 90 |
|
---|
| 91 | ### Unions
|
---|
| 92 | Unions are the hardest to deal with because the instance does not show which
|
---|
| 93 | field in the union is being used. Because of that there are very few intuitive
|
---|
| 94 | definitions to use and the ones that do depend on bit-wise operations and only
|
---|
| 95 | if the union is made of primitive types.
|
---|
| 96 |
|
---|
| 97 | ### Traits
|
---|
| 98 | Default operations are not supported on traits. A function implemented by the
|
---|
| 99 | default generation may be used to satisfy an assertion. However a default
|
---|
| 100 | implementation may not be requested on a polymorphic function.
|
---|
| 101 |
|
---|
| 102 | It could in theory, limiting to operation based generation and using the
|
---|
| 103 | operations available in the assertion list. There are a few problems:
|
---|
| 104 | + Knowing the entire set of functions being generated is very useful in some
|
---|
| 105 | cases and this information is quickly lost with polymorphic functions.
|
---|
| 106 | + The rules for choosing a generation method do not match how a polymorphic
|
---|
| 107 | function is selected so the results can be inconsistent.
|
---|
| 108 | + It is easily to mimic with a polymorphic function already, writing out one
|
---|
| 109 | generic function and including it.
|
---|
| 110 |
|
---|
| 111 | Default Generation
|
---|
| 112 | ------------------
|
---|
| 113 |
|
---|
| 114 | Here are the generation methods. Unless otherwise stated they are listed in
|
---|
| 115 | priority order. That is the first one mentioned that a type fits (has all the
|
---|
| 116 | required operators or its form matches) will be used.
|
---|
| 117 |
|
---|
| 118 | The operator based constructions can be used on any sue type, those that
|
---|
| 119 | require a particular kind mention that.
|
---|
| 120 |
|
---|
| 121 | ### Constructor: ?{}
|
---|
| 122 | Note that requesting any constructor to be generated counts as defining a
|
---|
| 123 | constructor for purposes of disabling the implicate constructors. There are
|
---|
| 124 | no operator based methods for generating constructors.
|
---|
| 125 |
|
---|
| 126 | For structures: For the zero argument constructor (aka the default constructor,
|
---|
| 127 | which takes just a reference to the value to construct) each field is
|
---|
| 128 | constructed with its zero argument constructor. For the copy constructor each
|
---|
| 129 | field is copy constructed from the same field in the copied structure. For
|
---|
| 130 | zero_t and one_t each field is also constructed from zero_t or one_t.
|
---|
| 131 |
|
---|
| 132 | For enumerations: For the zero argument constructor the value is set to 0 if
|
---|
| 133 | one of the enumeration options is 0 (or set to the first value in the enum).
|
---|
| 134 | For the zero_t constructor it is the same except the check is skipped (and no
|
---|
| 135 | or). The copy constructor is the same as memcpy.
|
---|
| 136 |
|
---|
| 137 | For unions: For the zero argument constructor of a union that is constructed
|
---|
| 138 | entirely of primitive types (or other types that zero argument construct to
|
---|
| 139 | all 0s) the union filled with 0s. For the copy constructor of a union of types
|
---|
| 140 | that all have trivial copy constructors memcpy is used.
|
---|
| 141 |
|
---|
| 142 | ### Destructor: ^?{}
|
---|
| 143 | Requesting the default destructor should be the same as having it implicitly
|
---|
| 144 | defined. Destructors only have one signature and the intuitive definition for
|
---|
| 145 | that is the same as without the signature.
|
---|
| 146 |
|
---|
| 147 | Still it should be allowed for consistency. It also allows it to be forward
|
---|
| 148 | declared and then generated in a .cfa file.
|
---|
| 149 |
|
---|
| 150 | ### Assignment: ?=?
|
---|
| 151 | Default assignment is only supported between two objects of the same type.
|
---|
| 152 | For structures it is field to field assignment. For enumerations and unions
|
---|
| 153 | of primitives or trivially copiable types it the same as memcpy.
|
---|
| 154 |
|
---|
| 155 | ### Equality: ?==? ?!=?
|
---|
| 156 | Both equality operations can be implemented by negating the result of the
|
---|
| 157 | other operations.
|
---|
| 158 |
|
---|
| 159 | For structures: Equality can be implement by checking equality on each pair of
|
---|
| 160 | matching fields and taking the logical and of the results. Inequality can be
|
---|
| 161 | implemented by checking inequality on each pair of matching fields and taking
|
---|
| 162 | the logical or of the results.
|
---|
| 163 |
|
---|
| 164 | Both logical operations could be short circuiting. Without side effects it is
|
---|
| 165 | purely an optimization.
|
---|
| 166 |
|
---|
| 167 | For enumerations: Both operations are the same as on the underlying integral.
|
---|
| 168 |
|
---|
| 169 | For unions: If it is assumed that the different branches represent different
|
---|
| 170 | views of the same data and this data is primitive, than bit-wise comparisons
|
---|
| 171 | can be used.
|
---|
| 172 |
|
---|
| 173 | ### Comparison: ?<? ?>? ?<=? ?>=?
|
---|
| 174 | Less than can be implemented by flipping the arguments on greater than.
|
---|
| 175 | Greater than can be implemented by flipping less then. Less than or equal to
|
---|
| 176 | can be implemented by flipping greater than or equal to. Greater than or equal
|
---|
| 177 | to can be implemented by flipping less than or equal to.
|
---|
| 178 |
|
---|
| 179 | Less than or equal to can be implemented by using less than, equals and taking
|
---|
| 180 | the or of the results. Greater than or equal to can be implemented by using
|
---|
| 181 | greater than, equals and taking the or of the results.
|
---|
| 182 |
|
---|
| 183 | > The trick of negating comparisons is not used. As an example ?<? is not
|
---|
| 184 | > (boolean) not ?>=? unless the type is strictly ordered. For operator based
|
---|
| 185 | > overloads that might not be true in very reasonable implementations so it is
|
---|
| 186 | > not assumed.
|
---|
| 187 |
|
---|
| 188 | Opposite less than can be implemented as less than or equal to and not equal
|
---|
| 189 | to. Greater than can be implemented as greater than or equal to and not equal
|
---|
| 190 | to.
|
---|
| 191 |
|
---|
| 192 | For enumerations: Enumerations that represent one of all operations are the
|
---|
| 193 | same as on the underlying integral. Enumerations that represent a set of
|
---|
| 194 | options could replace less then with subset of and greator than with superset
|
---|
| 195 | of and use bit-wise masking to implement those operations.
|
---|
| 196 |
|
---|
| 197 | ### Binary & Relative Assignment Operators: ?_? ?_=?
|
---|
| 198 | This applies to each operator in the form of `T ?_?(T, T);` for some type T
|
---|
| 199 | and has a matching relative assignment operator `T& ?_=?(T&, T)` where the
|
---|
| 200 | `_` in both is replaced by some string of operator characters.
|
---|
| 201 |
|
---|
| 202 | The binary operator can be created by copying the left argument, using the
|
---|
| 203 | relative assignment with the right argument and returning the updated copy.
|
---|
| 204 |
|
---|
| 205 | The relative assignment operator can be implemented by using the binary
|
---|
| 206 | operation to create a copy, then assigning the result to the left argument.
|
---|
| 207 | The left argument should then be returned. For the signature above it would
|
---|
| 208 | return by reference. The signature `T ?_=?(T&, T)` could also be supported in
|
---|
| 209 | which case it would return by copy.
|
---|
| 210 |
|
---|
| 211 | ### Minus: -? ?-?
|
---|
| 212 | Unary minus can be implemented by subtracting argument from the value created
|
---|
| 213 | from zero_t.
|
---|
| 214 |
|
---|
| 215 | Binary minus can be implemented by negating (with unary minus) the right
|
---|
| 216 | argument and adding the result to the left argument.
|
---|
| 217 |
|
---|
| 218 | ### Increment & Decrement: ++? ?++ --? ?--
|
---|
| 219 | Either pre- operation can implemented by applying the post- operation and then
|
---|
| 220 | returning a reference to, or copy of, the updated value. Either post-
|
---|
| 221 | operation can be implemented by copying the argument, applying the pre-
|
---|
| 222 | operation to the original and returning the copy.
|
---|
| 223 |
|
---|
| 224 | Pre-increment can be implemented by using addition assignment by the value
|
---|
| 225 | constructed from one_t. Pre-decrement can be implemented by using subtraction
|
---|
| 226 | assignment by the value from one_t.
|
---|
| 227 |
|
---|
| 228 | Because many of the operations used have there own default implementations
|
---|
| 229 | (for example: ?+=? from ?+? or field-wise one_t construction) this list could
|
---|
| 230 | be expanded by replacing a function call with the default implementation for
|
---|
| 231 | that function. This might work for the post- operations generated from
|
---|
| 232 | relative assignment, the one_t constructed object and copying, but the logical
|
---|
| 233 | connection becomes weaker and weaker as that process continues.
|
---|
| 234 |
|
---|
| 235 | ### Bit Manipulation: ?|? ?&? ~? ?<<? ?>>?
|
---|
| 236 | No operation based construction is provided for bit manipulation operators.
|
---|
| 237 |
|
---|
| 238 | For enumerations that are considered sets of flags: And returns a set with all
|
---|
| 239 | flags set in both operand sets, or returns a set with all flags set that are
|
---|
| 240 | set in either operand sets and not returns a set with all flags set that are
|
---|
| 241 | not set in the operand set.
|
---|
| 242 |
|
---|
| 243 | > And it would be possible to actually implement this for any sized type and
|
---|
| 244 | > just do the bitwise operation and trust the user that it makes sense if they
|
---|
| 245 | > request it.
|
---|
| 246 |
|
---|
| 247 | Also for set enumerations, the signature `T ?<<?(one_t, unsigned)` also has
|
---|
| 248 | some use, as it can be used to implement a loop that goes over each flag,
|
---|
| 249 | instead of each combination of flags.
|
---|
| 250 |
|
---|
| 251 | ### Logical Negation: !?
|
---|
| 252 | Not can be implemented by negating the result of a conversion to boolean, the
|
---|
| 253 | does not equal 0 test.
|
---|
| 254 |
|
---|
| 255 | Circular Requirements
|
---|
| 256 | ---------------------
|
---|
| 257 |
|
---|
| 258 | There are several cases where there are two operators that can be implemented
|
---|
| 259 | with the other operator. If both are implemented that way calling either of
|
---|
| 260 | them could result in infinite recursion.
|
---|
| 261 |
|
---|
| 262 | The simplest way to handle the issue would be to tell the user to not do that,
|
---|
| 263 | they are responsible for providing the base operations. This is C like but is
|
---|
| 264 | perhaps more error prone than it would save in work and if we do check we can
|
---|
| 265 | automatically use fallbacks.
|
---|
| 266 |
|
---|
| 267 | Before generating any default implementations the compiler should generate
|
---|
| 268 | a list of everything it has been requested and ignore any generation methods
|
---|
| 269 | that would lead to chains instead of counting other defaults that could lead
|
---|
| 270 | to a loop. As a further improvment this could be done selectively to break
|
---|
| 271 | rings while allowing chains of non-recurive implementations.
|
---|
| 272 |
|
---|
| 273 | For instance if both ?==? and ?!=? are requested they cannot both be defined
|
---|
| 274 | as the negation of each other. In the simple version they would both be
|
---|
| 275 | generated by introspection on the declaration. With the more selective version
|
---|
| 276 | one could be generated by introspection and the other by negating that result.
|
---|
| 277 |
|
---|
| 278 | However there are still ways to get around this by placing the function
|
---|
| 279 | definitions in different translation units or defining a function that uses
|
---|
| 280 | an operator that is default generated to use it. Searching for all these cases
|
---|
| 281 | is probably not worth it, although checking for some might be useful warnings.
|
---|