[7a0e8c8] | 1 | Tuples
|
---|
| 2 | ======
|
---|
| 3 |
|
---|
[12c4a5f] | 4 | This is a proposal discusses update to Cforall tuples as they exist after
|
---|
| 5 | the work done by Robert Schluntz (after they were added to Cforall by
|
---|
| 6 | Rodolfo Esteves and to K-W C by Dave Till). The new features and ideas
|
---|
| 7 | discussed here are designed to address problems that have appeared as tuples
|
---|
| 8 | have been used in the past 6 years. Some or all of the changes discussed may
|
---|
| 9 | already be implemented.
|
---|
[7a0e8c8] | 10 |
|
---|
[1b770e40] | 11 | The core change is breaking the current restructurable tuples into unstructured
|
---|
| 12 | and structured tuples. Unstructured tuples handle most of the existing uses,
|
---|
| 13 | with structured tuples filling in a few missing use cases.
|
---|
[7a0e8c8] | 14 |
|
---|
| 15 | Current State of Tuples
|
---|
| 16 | -----------------------
|
---|
[12c4a5f] | 17 | An overview of the current tuple design is the starting place for the proposal.
|
---|
[7a0e8c8] | 18 |
|
---|
| 19 | ### Tuple Syntax
|
---|
| 20 |
|
---|
[1b770e40] | 21 | Currently, tuples have three main components: tuple types, tuple
|
---|
| 22 | expressions/values (constructing tuples), and tuple index expressions
|
---|
| 23 | (deconstructing tuples).
|
---|
| 24 |
|
---|
[12c4a5f] | 25 | Current syntax for tuple types:
|
---|
[7a0e8c8] | 26 |
|
---|
| 27 | - Nullary: [void] or []
|
---|
[12c4a5f] | 28 |
|
---|
| 29 | [void] f();
|
---|
| 30 | [] f();
|
---|
| 31 |
|
---|
[7a0e8c8] | 32 | - Unary: [TYPE]
|
---|
[1b770e40] | 33 |
|
---|
[12c4a5f] | 34 | [int] f();
|
---|
| 35 | [double] x = 1.5;
|
---|
[1b770e40] | 36 |
|
---|
[12c4a5f] | 37 | - Multiary: [TYPE, TYPE, ...]
|
---|
[1b770e40] | 38 |
|
---|
[12c4a5f] | 39 | [bool, char] f();
|
---|
| 40 | [short, int, long] x;
|
---|
[1b770e40] | 41 |
|
---|
[12c4a5f] | 42 | Tuple types can appear anywhere regular types may be used, except for the
|
---|
| 43 | nullary tuple which can only appear where void can be used (usually return
|
---|
| 44 | types), with the exception of special case when void is used to make an empty
|
---|
| 45 | parameter list.
|
---|
[7a0e8c8] | 46 |
|
---|
| 47 | Current syntax for tuple expressions:
|
---|
| 48 |
|
---|
[1b770e40] | 49 | - Nullary: (Same as `void`, use return without an expression.)
|
---|
| 50 |
|
---|
[12c4a5f] | 51 | [] f() { return; }
|
---|
[1b770e40] | 52 |
|
---|
[12c4a5f] | 53 | - Unary: EXPR (in return only) or {EXPR}
|
---|
[1b770e40] | 54 |
|
---|
[12c4a5f] | 55 | [int] f() { return 3; }
|
---|
| 56 | [int] x = {3};
|
---|
[1b770e40] | 57 |
|
---|
[12c4a5f] | 58 | - Multiary: [EXPR, EXPR, ...]
|
---|
[7a0e8c8] | 59 |
|
---|
[12c4a5f] | 60 | [int, int] f() { return [3, 4]; }
|
---|
| 61 | [bool, char, int] x = [true, 'a', 3];
|
---|
[1b770e40] | 62 |
|
---|
[12c4a5f] | 63 | Tuple expressions should be useable whenever an expression of that type is
|
---|
| 64 | expected. Issues with the current state will be discussed later.
|
---|
[1b770e40] | 65 |
|
---|
| 66 | Current syntax for tuple indexing is an integer constant, where its value
|
---|
| 67 | selects a tuple member, e.g.:
|
---|
[7a0e8c8] | 68 |
|
---|
[12c4a5f] | 69 | [char, int] tup = ['a', 0];
|
---|
| 70 | char ch = tup.0;
|
---|
| 71 | int value = tup.1;
|
---|
[7a0e8c8] | 72 |
|
---|
| 73 | ### Mass and Multi-Assignment
|
---|
| 74 |
|
---|
[1b770e40] | 75 | Two special forms of assignment can be used to set values in tuples: mass and
|
---|
| 76 | multi. Mass assignment assigns every element in the destination tuple to a
|
---|
| 77 | single source value.
|
---|
[7a0e8c8] | 78 |
|
---|
| 79 | [int, long, float] dst;
|
---|
[12c4a5f] | 80 | int src = 4;
|
---|
| 81 | dst = src;
|
---|
| 82 | // Becomes: dst.0 = src; dst.1 = src; dst.2 = src;
|
---|
[7a0e8c8] | 83 |
|
---|
[1b770e40] | 84 | Multi-assignment assigns every element in the destination tuple to the
|
---|
| 85 | corresponding element in the source tuple. Both tuples must be the same size
|
---|
[12c4a5f] | 86 | and the elements pairs must be assignment compatible conversions.
|
---|
[7a0e8c8] | 87 |
|
---|
[1b770e40] | 88 | [long, int, float] dst;
|
---|
| 89 | [int, char, double] src = [1, 'a', 300.0];
|
---|
[12c4a5f] | 90 | dst = src;
|
---|
| 91 | // Becomes: dst.0 = src.0; dst.1 = src.1; dst.2 = src.1;
|
---|
[7a0e8c8] | 92 |
|
---|
| 93 | ### Tuple Restructuring
|
---|
| 94 |
|
---|
| 95 | Tuples can be restructured as part of overload resolution. Function calls
|
---|
[1b770e40] | 96 | unpack tuples and repack tuples to match signatures. This semantics is a form
|
---|
| 97 | of implicit conversion and is considered during overload resolution.
|
---|
[7a0e8c8] | 98 |
|
---|
| 99 | A simple example is matching multiple parameters of a function to a single
|
---|
[1b770e40] | 100 | argument expression, where each parameter is bound to a different element of
|
---|
| 101 | the returned tuple.
|
---|
[7a0e8c8] | 102 |
|
---|
[1b770e40] | 103 | [int, int] argFunc();
|
---|
| 104 | void parmFunc(int a, int b, int c, int d);
|
---|
| 105 |
|
---|
| 106 | parmFunc(argFunc(), argFunc());
|
---|
[7a0e8c8] | 107 |
|
---|
| 108 | // Roughly equivilent to:
|
---|
[1b770e40] | 109 | [int, int] fst = argFunc();
|
---|
| 110 | [int, int] snd = argFunc();
|
---|
| 111 | parmFunc(fst.0, fst.1, snd.0, snd.1);
|
---|
[7a0e8c8] | 112 |
|
---|
| 113 | ### Tuple Casts
|
---|
| 114 |
|
---|
[b0fcd0e] | 115 | C-style casts can be used on tuples. These are usually conversion casts (casts
|
---|
[1b770e40] | 116 | that perform operations on the cast type, as opposed to reinterpreting the
|
---|
[7a0e8c8] | 117 | existing value).
|
---|
| 118 |
|
---|
| 119 | Tuple casts can remove elements from the end of a tuple and apply a recursive
|
---|
| 120 | cast to any of the elements. As an example:
|
---|
| 121 |
|
---|
| 122 | [char, char, char] x = ['a', 'b', 'c'];
|
---|
| 123 | ([int, char])x;
|
---|
| 124 |
|
---|
[1b770e40] | 125 | This casts the first element type of x from a char to an int and drops the last
|
---|
[b0fcd0e] | 126 | element. The second line can be replaced with the following code, which creates
|
---|
[1b770e40] | 127 | a new tuple by directly casting the kept elements of the old tuple:
|
---|
[7a0e8c8] | 128 |
|
---|
| 129 | [(int)x.0, (char)x.1];
|
---|
| 130 |
|
---|
| 131 | Note, tuple casting is more restricted than the implicit tuple restructuring.
|
---|
| 132 | It cannot do any restructuring beyond dropping the trailing elements of
|
---|
[1b770e40] | 133 | tuples. For example,
|
---|
| 134 |
|
---|
| 135 | int i; char c; bool b;
|
---|
| 136 | [i, b, c, i] = [i, [b, c], i];
|
---|
| 137 | [int, [bool, char], int] w;
|
---|
| 138 | [i, b, c, i] = w;
|
---|
[12c4a5f] | 139 | [i, b, c] = ([int, [bool, char]])w;
|
---|
| 140 | // But this does not work:
|
---|
| 141 | [i, b, c, i] = ([int, bool, char, int])w;
|
---|
[7a0e8c8] | 142 |
|
---|
| 143 | ### Polymorphic Tuple Parameters
|
---|
| 144 |
|
---|
| 145 | One kind of polymorphic parameter is the tuple type parameter, previously
|
---|
| 146 | noted by the `ttype` keyword and now marked by a tailing ellipses (`...`).
|
---|
| 147 | Although described as a tuple type, it is usually viewed as its flattened
|
---|
| 148 | sequence of types.
|
---|
| 149 |
|
---|
[12c4a5f] | 150 | // Fixed Length Max - Base Case:
|
---|
| 151 | forall(T | { int ?>?( T, T ); })
|
---|
| 152 | T max(T v1, T v2) { return v1 > v2 ? v1 : v2; }
|
---|
[1b770e40] | 153 |
|
---|
[12c4a5f] | 154 | // Variable Length Max - Recursive:
|
---|
| 155 | forall(T, Ts... | { T max(T, T); T max(Ts); })
|
---|
[7a0e8c8] | 156 | T max(T arg, Ts args) {
|
---|
| 157 | return max(arg, max(args));
|
---|
| 158 | }
|
---|
| 159 |
|
---|
[12c4a5f] | 160 | This feature introduces a type name into scope (Ts). It is used as a type and
|
---|
| 161 | the value it declares (args) is a tuple value, and that the second assertion
|
---|
| 162 | "T max(Ts);" matches takes a single tuple, but with restructuring it can also
|
---|
| 163 | match functions that take a series of non-tuple arguments.
|
---|
| 164 |
|
---|
| 165 | A good way to explain this is to show a partial implementation the following
|
---|
| 166 | call `max(int_a, int_b, int_c);` into actual code:
|
---|
| 167 |
|
---|
| 168 | void _thunk0(int v1, int v2) {
|
---|
| 169 | return max{?>?}(v1, v2);
|
---|
| 170 | }
|
---|
| 171 | void _thunk1([int, int] tup) {
|
---|
| 172 | return max{?>?}(tup.0, tup.1);
|
---|
| 173 | }
|
---|
| 174 | max{_thunk0, _thunk1}(int_a, [int_b, int_c]);
|
---|
| 175 |
|
---|
| 176 | The part to highlight is that "_thunk1", one of the functions created to
|
---|
| 177 | make an inexact match that can be used as a call site into an exact match
|
---|
| 178 | so the function can be passed through function parameters.
|
---|
| 179 | Although, the tuple type `[int, int]` matches a parameter list `int, int`,
|
---|
| 180 | just with a bit of low lever restructuring.
|
---|
[1b770e40] | 181 |
|
---|
[12c4a5f] | 182 | In larger cases (with four or more parameters in the first case), the
|
---|
| 183 | recursive case will work similarly, creating an intermidate function to
|
---|
| 184 | restructure the tuple into a new call such as:
|
---|
| 185 |
|
---|
| 186 | void _thunk2([int, int, int] tup) {
|
---|
| 187 | return max{_thunk0, _thunk1}(tup.0, [tup.1, tup.2]);
|
---|
| 188 | }
|
---|
[7a0e8c8] | 189 |
|
---|
| 190 | Issues with the Current State
|
---|
| 191 | -----------------------------
|
---|
[1b770e40] | 192 | There are a variety of problems with the current implementation which need to
|
---|
| 193 | be fixed.
|
---|
[7a0e8c8] | 194 |
|
---|
| 195 | ### Tuples are not Objects
|
---|
| 196 |
|
---|
[1b770e40] | 197 | Spoilers: this proposal actually takes them even further away from being
|
---|
| 198 | objects, but illustrates why the current version is not as useful is it could
|
---|
| 199 | be.
|
---|
| 200 |
|
---|
[12c4a5f] | 201 | Tuples do not have the lifetime operators (copy construction, copy assignment
|
---|
| 202 | and destructor) that define an object type in Cforall. This means they are
|
---|
| 203 | not polymorphic otypes, they are objects in C's usage of the word. This means
|
---|
| 204 | that tuples cannot be used as a single object in
|
---|
| 205 |
|
---|
| 206 | There are several reasons for this. The fluid nature of tuples (flattening
|
---|
| 207 | and restructuring) means that matching against a tuple type is fluid in some
|
---|
| 208 | inconvent ways.
|
---|
[7a0e8c8] | 209 |
|
---|
[12c4a5f] | 210 | forall(T, U) void ?{}([T, U] & this;
|
---|
| 211 | void ?{}([int, int, int] & this);
|
---|
| 212 | // What constructor(s) should be used here?
|
---|
| 213 | [int, [int, int]] pair;
|
---|
[7a0e8c8] | 214 |
|
---|
[12c4a5f] | 215 | Should the polymorpic constructor be applied twice or should the tuple be
|
---|
| 216 | restructured and the three element constructor be used? This is not really
|
---|
| 217 | something a user should be wondering about (especially since tuples should
|
---|
| 218 | always be simple containers wrapping their elements).
|
---|
[7a0e8c8] | 219 |
|
---|
[b0fcd0e] | 220 | ### Providing TType Arguments is Inconsistent
|
---|
[7a0e8c8] | 221 |
|
---|
[1b770e40] | 222 | The syntax for ttype arguments is slightly inconsistent. It has not come up
|
---|
| 223 | much yet, because you do not directly provide ttype polymorphic arguments to
|
---|
[b0fcd0e] | 224 | functions and there are very few existing use-cases for ttype structures.
|
---|
[7a0e8c8] | 225 |
|
---|
[12c4a5f] | 226 | forall(T, Us...)
|
---|
| 227 | void function(T head, Us tail) {}
|
---|
| 228 |
|
---|
| 229 | forall(T, Us...)
|
---|
| 230 | struct Type {};
|
---|
| 231 |
|
---|
| 232 | int a; bool b; char c;
|
---|
| 233 | function(a, b, c);
|
---|
| 234 | function(a, [b, c]);
|
---|
| 235 | // Doesn't work: Type(int, bool, char) x;
|
---|
| 236 | Type(int, [bool, char]) x;
|
---|
| 237 |
|
---|
| 238 | The `function` case works either way because the tuple value can be
|
---|
| 239 | restructured if it does not match exactly. The `Type` case though does not
|
---|
| 240 | allow restructuring when the type is passed as a type value. This may be
|
---|
| 241 | the most correct form, but seems surprising compared to the common use.
|
---|
[7a0e8c8] | 242 |
|
---|
[12c4a5f] | 243 | ### Unary Tuple Value Syntax
|
---|
| 244 |
|
---|
| 245 | Unary tuples don't use the standard tuple syntax and one of the two alternate
|
---|
| 246 | syntax options your may use instead doesn't work consistently. Solving both
|
---|
| 247 | of these issues would help with consistency.
|
---|
[7a0e8c8] | 248 |
|
---|
| 249 | ### Syntax Conflict
|
---|
| 250 |
|
---|
[12c4a5f] | 251 | The tuple syntax conflicts with designators and the C23 (C++-style) attribute
|
---|
[1b770e40] | 252 | syntax.
|
---|
| 253 |
|
---|
[12c4a5f] | 254 | int a[10] = { [2] = 3 };
|
---|
| 255 |
|
---|
| 256 | Here [2] = 3 is an array designator, but has the same syntax as an assignment
|
---|
| 257 | to a tuple of references, it isn't until the resolver determains that 2 is
|
---|
| 258 | not a reference that that case can be ruled out.
|
---|
| 259 |
|
---|
| 260 | [[ unused ]] [[int, int], int] x;
|
---|
| 261 |
|
---|
| 262 | Here the opening `[[` could be a nested tuple or the beginning of an
|
---|
| 263 | attribute, the parser can't figure out until later.
|
---|
[1b770e40] | 264 |
|
---|
| 265 | These conflicts break C compatibility goals of Cforall. Designators had to
|
---|
| 266 | their syntax change and Cforall cannot parse the new attributes.
|
---|
[12c4a5f] | 267 | The behaviour of these cases, if they could be parsed, should be unchanged.
|
---|
[7a0e8c8] | 268 |
|
---|
| 269 | Change in Model
|
---|
| 270 | ---------------
|
---|
| 271 | This proposal modifies the existing tuples to better handle its main use
|
---|
[12c4a5f] | 272 | cases. There are some use cases that are no longer handled, so in addition
|
---|
| 273 | to changing the existing "restructured tuples" to "unstructured tuples"
|
---|
| 274 | (or packs) a new "structured tuple" is added in the form of struct tuple.
|
---|
[7a0e8c8] | 275 |
|
---|
[12c4a5f] | 276 | The new tuples is even more "unstructured" than before. New tuples are
|
---|
[1b770e40] | 277 | considered a sequence of types or typed entities. These packs are then unpacked
|
---|
| 278 | into the surrounding context. Put differently, tuples are now flattened as much
|
---|
| 279 | as possible, with some places (like parameter lists) being treated as an
|
---|
| 280 | implicit tuple and the tuple being flattened into that.
|
---|
[7a0e8c8] | 281 |
|
---|
[1b770e40] | 282 | Structured tuples are now a separate feature: a structure called "tuple".
|
---|
| 283 | These are polymorphic structures; an instance should act as a structure, except
|
---|
| 284 | its fields are accessed using indices instead of field names. Experience so far
|
---|
| 285 | is that structured tuples are not used often, but fill in the use cases that
|
---|
| 286 | unstructured tuples no longer support.
|
---|
[7a0e8c8] | 287 |
|
---|
[12c4a5f] | 288 | Note that the underlying implementation might not actually look like this,
|
---|
| 289 | but this is how it should seem to the user.
|
---|
[7a0e8c8] | 290 |
|
---|
| 291 | Changed Features
|
---|
| 292 | ----------------
|
---|
[b0fcd0e] | 293 | Some of the concrete changes to the features of the language.
|
---|
[7a0e8c8] | 294 |
|
---|
[12c4a5f] | 295 | ### Unstructured Tuple / Pack Type
|
---|
| 296 |
|
---|
| 297 | The evolution of our current tuples (called packs in an attempt to make the
|
---|
| 298 | names more distinct). They work like the current tuples except they are
|
---|
| 299 | always flattened. You might still be able to declare nested tuples, but these
|
---|
| 300 | are not meaningful, they are flattened to a single level and, where
|
---|
| 301 | approprate, is inlined into the sounding content.
|
---|
| 302 |
|
---|
| 303 | [bool, [char, int], long] x;
|
---|
| 304 | // Becomes:
|
---|
| 305 | [bool, char, int, long] x;
|
---|
| 306 |
|
---|
| 307 | void f(bool a0, [char, [int, long]] a1, float a2);
|
---|
| 308 | // Becomes:
|
---|
| 309 | void f(bool a0, char a1_0, int a1_1, long a1_2, float a2);
|
---|
| 310 |
|
---|
| 311 | [,] f(int a0, [,] a1, [bool,] a2);
|
---|
| 312 | // Becomes:
|
---|
| 313 | void f(int a0, bool a2);
|
---|
| 314 |
|
---|
| 315 | This actually means that tuples do not participate in overload resolution
|
---|
| 316 | in the way the do currently. Restructuring is always free because they are
|
---|
| 317 | always reduced to the same form as part of type matching.
|
---|
| 318 |
|
---|
| 319 | Packs are still not object types and do not have lifetime functions. They
|
---|
| 320 | cannot be used as object types, nor as any data type, but they can be still
|
---|
| 321 | used as ttypes.
|
---|
| 322 |
|
---|
| 323 | (Unless they need to have lifetime functions for other features.)
|
---|
| 324 |
|
---|
[7a0e8c8] | 325 | ### Structured Tuple Type
|
---|
| 326 |
|
---|
[12c4a5f] | 327 | There is a standard library or built-in type named `tuple`, it does not need
|
---|
| 328 | a special syntax to write types or values. The type definition might need
|
---|
| 329 | some primitive support, but if supported as a regular type would look
|
---|
| 330 | something like this:
|
---|
[7a0e8c8] | 331 |
|
---|
| 332 | forall(Ts...)
|
---|
| 333 | struct tuple {
|
---|
[12c4a5f] | 334 | inline Ts all;
|
---|
[7a0e8c8] | 335 | };
|
---|
| 336 |
|
---|
[12c4a5f] | 337 | This type wraps up an unstructured tuple, in this case a pack of members,
|
---|
| 338 | and gives it "edges" and so structure. It also gives tuples an interface
|
---|
| 339 | more like a normal structure type, for the anonymous structure use case.
|
---|
| 340 |
|
---|
| 341 | The struct tuple does have lifetime functions, can use list initializer
|
---|
| 342 | syntax and otherwise be treated as a normal structure. You can use regular
|
---|
| 343 | member access syntax to convert the struct tuple into a unstructured pack,
|
---|
| 344 | writing `object.all`.
|
---|
| 345 |
|
---|
| 346 | The `inline` modifier is a new specialized feature to allow you use tuple
|
---|
| 347 | index expressions directly on the type. With or without inline you should
|
---|
| 348 | be able to chain a tuple index expression onto the member expression, such
|
---|
| 349 | as `object.all.1`. The inline specifier allows you to skip the middle
|
---|
| 350 | and use `object.1`.
|
---|
| 351 |
|
---|
| 352 | More complex operations can usually be preformed by taking the pack out of
|
---|
| 353 | the struct tuple and acting on it. Polymorphic operations only have to go one
|
---|
| 354 | level down to get to base operations, there is no need for recursive
|
---|
| 355 | construction, nor manually coding different sizes of tuple. This should give
|
---|
| 356 | most of the ease of working with a primitive tuple, because effectively you
|
---|
| 357 | are except it has been given fixed edges.
|
---|
| 358 |
|
---|
| 359 | A tuple struct should also be an object type, and let's you pass multiple
|
---|
| 360 | values through a single polymorphic slot.
|
---|
| 361 |
|
---|
| 362 | A note on the naming here, because there are more pieces that need names
|
---|
| 363 | instead of symbols. The name `tuple` follows because that is the common name,
|
---|
| 364 | although this means this is now the default tuple in some sense. The name of
|
---|
| 365 | the pack was picked to work with inline and make the difference between `.0`
|
---|
| 366 | and `.all` clear. (If inline doesn't work then `get` might be a better name.)
|
---|
[7a0e8c8] | 367 |
|
---|
| 368 | ### Type Qualifier Distribution
|
---|
| 369 |
|
---|
| 370 | Because tuples are no longer object types, applying type modifiers to them,
|
---|
| 371 | such as cv-qualifiers and pointers, no longer makes sense. That syntax is
|
---|
| 372 | now considered distribution, applying the type modifier to each element of
|
---|
| 373 | the tuple.
|
---|
| 374 |
|
---|
[1b770e40] | 375 | Previously `const [int, bool] &` would mean a const reference to a tuple of an
|
---|
| 376 | integer and a boolean. Now it means an alias for `[const int &, const bool &]`,
|
---|
| 377 | a tuple of a reference to a constant integer and a reference to a constant
|
---|
| 378 | boolean. This also applies to polymorphic tuple type packs `Ts &` in
|
---|
| 379 | polymorphic functions.
|
---|
[7a0e8c8] | 380 |
|
---|
[1b770e40] | 381 | This new approach can specify restrictions on tuple variables as for a single
|
---|
| 382 | type variable. For example, this approach can replace the special cased tuple
|
---|
| 383 | operations multi-assignment (N-to-N) and mass-assignment (1-to-N).
|
---|
[7a0e8c8] | 384 |
|
---|
| 385 | // Multi-Assignment
|
---|
| 386 | forall(T, Us... |
|
---|
| 387 | { T & ?=?(T &, T const &); Us & ?=?(Us &, Us const &); })
|
---|
| 388 | [T, Us] & ?=?(T & dst, Us & dsts, T const & src, Us const & srcs) {
|
---|
| 389 | dst = src;
|
---|
| 390 | dsts = srcs;
|
---|
| 391 | return [dst, dsts];
|
---|
| 392 | }
|
---|
| 393 |
|
---|
| 394 | // Mass-Assignment
|
---|
| 395 | forall(T, U, Vs... |
|
---|
| 396 | { U & ?=?(U &, T const &); Vs & ?=?(Vs &, T const &); })
|
---|
| 397 | [U, Vs] & ?=?(U & dst, Vs & dsts, T const & src) {
|
---|
| 398 | dst = src;
|
---|
| 399 | dsts = src;
|
---|
| 400 | return [dst, dsts];
|
---|
| 401 | }
|
---|
| 402 |
|
---|
| 403 | These may not work exactly as given (for one, the copy assignment assertion
|
---|
[b0fcd0e] | 404 | in the first function would likely be redundant/conflicting with the implicit
|
---|
[7a0e8c8] | 405 | assertions on the parameters), but they show the pattern. Multi-assignment
|
---|
[b0fcd0e] | 406 | also would be very hard to write with simple tuple types, because the
|
---|
[7a0e8c8] | 407 | relationship between the two halves of the parameter list does have to line
|
---|
| 408 | up, that cannot be enforced with two different tuples.
|
---|
| 409 |
|
---|
[12c4a5f] | 410 | This example doesn't have to be implemented this way because we do have the
|
---|
| 411 | special case operations for these already.
|
---|
| 412 |
|
---|
[7a0e8c8] | 413 | ### Type Packs
|
---|
| 414 |
|
---|
[1b770e40] | 415 | This approach is not a new feature, but a reframing/extension of existing tuple
|
---|
| 416 | tuple polymorphic parameters as polymorphic type packs. The old `Vars...`
|
---|
| 417 | syntax introduces a pack of types into scope. It can be used in much the same
|
---|
| 418 | way as a tuple, but in some new ways to.
|
---|
[7a0e8c8] | 419 |
|
---|
[1b770e40] | 420 | The primary existing use remains: to use a polymorphic pack in a parameter
|
---|
| 421 | list, both as part of an assertion and in the signature of the main
|
---|
| 422 | function. The difference is that this is not an enclosed tuple, but a series of
|
---|
| 423 | types. The only effective difference this makes is it does not prefer to match
|
---|
| 424 | another tuple/pack.
|
---|
[7a0e8c8] | 425 |
|
---|
| 426 | This pattern continues to a parameter defined with a pack of types, which
|
---|
[b0fcd0e] | 427 | is considered a pack of parameters, and the name it introduces is a pack
|
---|
[7a0e8c8] | 428 | of variables, or a flattened tuple.
|
---|
| 429 |
|
---|
| 430 | forall(Params...)
|
---|
| 431 | void function(Params values);
|
---|
| 432 |
|
---|
[1b770e40] | 433 | New use cases include declarations of members and variables. For example, the
|
---|
| 434 | creation of a structured tuple structure:
|
---|
[7a0e8c8] | 435 |
|
---|
| 436 | forall(Fields...)
|
---|
| 437 | struct tuple {
|
---|
| 438 | Fields get;
|
---|
| 439 | };
|
---|
| 440 |
|
---|
| 441 | This is again, treated as a pack of members. They have the same layout as
|
---|
[b0fcd0e] | 442 | if they were written out by hand. Now, the name get is still accessed as if
|
---|
[7a0e8c8] | 443 | it was a regular, singular, member. The result of that expression is a
|
---|
| 444 | pack expression, a tuple of all the field accesses, which can be used with a
|
---|
| 445 | tuple index expression to access the underlying members. For example:
|
---|
| 446 |
|
---|
| 447 | tuple(bool, char, int) data = { ... };
|
---|
| 448 | // This expression:
|
---|
| 449 | data.get.2;
|
---|
| 450 | // Is the same as (if the fields had these names):
|
---|
| 451 | data.[__anonymous0, __anonymous1, __anonymous2].2;
|
---|
| 452 |
|
---|
| 453 | For local declarations it works similarly, except the name introduced is
|
---|
[12c4a5f] | 454 | directly usable as a tuple instead of going through a member access.
|
---|
[7a0e8c8] | 455 |
|
---|
| 456 | forall(Objects...)
|
---|
| 457 | void function( ??? ) {
|
---|
| 458 | ???
|
---|
| 459 | Objects objs = ???;
|
---|
| 460 | ???
|
---|
| 461 | }
|
---|
| 462 |
|
---|
[12c4a5f] | 463 | ### Tuple Pack Restructuring
|
---|
| 464 |
|
---|
| 465 | Implicit restructuring still occurs, but is not considered as part of
|
---|
| 466 | overload resolution. The structure of tuples is ignored, where individual
|
---|
| 467 | tuples begin or end is not considered, just the sequence of types.
|
---|
| 468 |
|
---|
| 469 | This can be viewed as flattening both sides (the arguments/caller and the
|
---|
| 470 | parameters/callee) before type resolution. Both any nested packs and packs
|
---|
| 471 | into any larger context:
|
---|
| 472 |
|
---|
| 473 | call(false, ['a', [-7,]], 3.14)
|
---|
| 474 | // Inline Nesting:
|
---|
| 475 | call(false, ['a', -7], 3.14)
|
---|
| 476 | // Inline into Context:
|
---|
| 477 | call(false, 'a', -7, 3.14)
|
---|
| 478 |
|
---|
| 479 | The main context where tuples are inlined are parameter lists, in that you
|
---|
| 480 | can consider a parameter list as a single pack. Return types could be viewed
|
---|
| 481 | in this way, but because they are presented in a single type, they are
|
---|
| 482 | already wrapped in a tuple.
|
---|
| 483 |
|
---|
| 484 | [int, [bool, char]] f();
|
---|
| 485 | // Inline Nesting:
|
---|
| 486 | [int, bool, char] f();
|
---|
| 487 |
|
---|
| 488 | This also happens "after" any polymorphic parameters are instantiated.
|
---|
| 489 |
|
---|
| 490 | Empty packs are also collapsed, logically being removed or replaced with
|
---|
| 491 | void as in the following example:
|
---|
| 492 |
|
---|
| 493 | // For example, this function:
|
---|
| 494 | forall(Ts...) Ts example(int first, Ts middle, int last);
|
---|
| 495 | // With Ts = [,], should not be treated as:
|
---|
| 496 | [,] example(int first, [,] middle, int last);
|
---|
| 497 | // But instead:
|
---|
| 498 | void example(int first, int last);
|
---|
| 499 |
|
---|
| 500 | Or closer to it, there are some slight differences between the nullary tuple
|
---|
| 501 | and void, for instance creating some instances for consistency with the other
|
---|
| 502 | tuple types.
|
---|
| 503 |
|
---|
| 504 | The struct tuple is never implicitly restructured.
|
---|
| 505 |
|
---|
[7a0e8c8] | 506 | ### Tuple Declaration/Deconstruction
|
---|
| 507 |
|
---|
| 508 | Declaring a tuple acts as a pack of variable declarations. When this is done
|
---|
[1b770e40] | 509 | with a written out type (as opposed to a polymorphic parameter above), then the
|
---|
[7a0e8c8] | 510 | elements of the tuple can be named.
|
---|
| 511 |
|
---|
[12c4a5f] | 512 | [int quo, int rem] quorem = divmod(a, b);
|
---|
[7a0e8c8] | 513 |
|
---|
| 514 | Here `ret` refers to the tuple, the entire pack, while `quo` and `rem`
|
---|
| 515 | give explicit names to elements of the tuple. Not all the names have to be
|
---|
| 516 | provided, at the very least, any element name can be omitted if the pack name
|
---|
| 517 | is provided, and the pack name can be omitted if all the element names are
|
---|
| 518 | provided. That would ensure every element can be accessed, but it could be
|
---|
| 519 | reduced even more, effectively immediately dropping some values if there is
|
---|
| 520 | no named to access it.
|
---|
| 521 |
|
---|
[12c4a5f] | 522 | This does not add any new functionality, more it is a qualify of life feature
|
---|
| 523 | making it very easy to give elements of a tuple descriptive names
|
---|
| 524 | instead of trying to combine a name for the whole tuple with an index.
|
---|
| 525 | (`quo` and `rem` vs. `quorem.0` and `quorem.1`)
|
---|
[1b770e40] | 526 |
|
---|
[7a0e8c8] | 527 | ### Tuple Casts
|
---|
| 528 |
|
---|
| 529 | Tuple casts are no longer allowed to do any restructuring. Internal
|
---|
| 530 | restructuring would not be useful, as there is no internal structure.
|
---|
| 531 | Dropping tail elements could be added back in but it is a narrow use case
|
---|
| 532 | so it may be replaced with other features (for example: tuple slicing).
|
---|
| 533 |
|
---|
[12c4a5f] | 534 | // Allowed:
|
---|
| 535 | ([bool, short])tuple_bool_char;
|
---|
| 536 | ([int, float])tuple_float_float;
|
---|
[7a0e8c8] | 537 |
|
---|
[12c4a5f] | 538 | // Forbidden:
|
---|
| 539 | ([int, int, int])tuple_int_int;
|
---|
| 540 | ([long, [float, int]])tuple_long_float_int;
|
---|
[7a0e8c8] | 541 |
|
---|
[12c4a5f] | 542 | ### New Tuple Literal Syntax
|
---|
[7a0e8c8] | 543 |
|
---|
[12c4a5f] | 544 | In addition to the main redesign of this proposal, the syntax is updated to
|
---|
| 545 | be more consistant. It is now a "[]" list with "," separators and a ","/comma
|
---|
| 546 | terminator that is optional for tuples with 2 or more elements (those that
|
---|
| 547 | already have a comma in them) and manditory in other tuples.
|
---|
[7a0e8c8] | 548 |
|
---|
[12c4a5f] | 549 | It should also be symmetric between types and value, differing only in that
|
---|
| 550 | an element (ITEM below) of a tuple type is a type and the element of a tuple
|
---|
| 551 | value is a value.
|
---|
[7a0e8c8] | 552 |
|
---|
[12c4a5f] | 553 | - Nullary Tuple: [,]
|
---|
| 554 | - Unary Tuples: [ITEM,]
|
---|
| 555 | - Multiary Tuples: [ITEM, ITEM] [ITEM, ITEM,] [ITEM, ITEM, ITEM] ...
|
---|
[7a0e8c8] | 556 |
|
---|
[12c4a5f] | 557 | Currently, although syntax is provided to write them, but manually writing
|
---|
| 558 | a nullary or unary tuple should be very rare. This is because a nullary tuple
|
---|
| 559 | can either be erased or treated as void, while unary tuples can be treated
|
---|
| 560 | as their element type. Similary for nested tuples, can be treated as the
|
---|
| 561 | flattened tuple.
|
---|
[7a0e8c8] | 562 |
|
---|
[12c4a5f] | 563 | (Update: Orignal nullary tuple proposal was `[]`, but that had some issues.)
|
---|
[7a0e8c8] | 564 |
|
---|
[12c4a5f] | 565 | ### TType Argument Syntax
|
---|
| 566 |
|
---|
| 567 | It would be nice to seamlessly provide packs of types in polymorphic
|
---|
| 568 | argument lists, however doing so requires a new restriction.
|
---|
| 569 |
|
---|
| 570 | As an example of the change, consider providing arguments to a polymorphic
|
---|
| 571 | structure with a otype and a ttype parameter.
|
---|
[7a0e8c8] | 572 |
|
---|
[12c4a5f] | 573 | forall(T, Us...) struct D {...};
|
---|
| 574 | D(int, [,]) // D(int)
|
---|
| 575 | D(bool, [char,]) // D(bool, char)
|
---|
| 576 | D(long, [int, short]) // D(long, int, short)
|
---|
| 577 |
|
---|
| 578 | This is a nice syntaxtic improvement for the common case, it does restrict
|
---|
| 579 | some of the more complex cases, such as cases with two ttypes. There are
|
---|
| 580 | various restrictions on this for functions. If the rules make it unambigous
|
---|
| 581 | then the grouping could be omitted, or a more advanced version, it can be
|
---|
| 582 | ommited only in the indivual cases where it is unambigous.
|
---|
[7a0e8c8] | 583 |
|
---|
| 584 | ### Tuple Slicing (Range Tuple Indexing)
|
---|
| 585 |
|
---|
| 586 | (Disclaimer: this is a bit more impulsive, see end for notes.)
|
---|
| 587 |
|
---|
[1b770e40] | 588 | This is an extension to tuple indexing. Currently, only single location of a
|
---|
| 589 | tuple can be index, extracting the element at that location. By extending the
|
---|
| 590 | index expression to be a list-range a multiple sub-tuples can be extracted.
|
---|
| 591 |
|
---|
[12c4a5f] | 592 | ['a', 'b', 'c', 'd', 'e', 'f'].0~3
|
---|
| 593 | ['a', 'b', 'c', 'd', 'e', 'f'].1~5~2
|
---|
| 594 | // Produces:
|
---|
| 595 | ['a', 'b', 'c']
|
---|
| 596 | ['b', 'd']
|
---|
[7a0e8c8] | 597 |
|
---|
[12c4a5f] | 598 | In words, the first indexes the first 3 tuple elements (0, 1, 2, stopping
|
---|
| 599 | before index 3) and the second indexes elements starting at index 1, stoping
|
---|
| 600 | before index 5 going in steps of 2. This does not allow for arbitary
|
---|
| 601 | selections, but it covers many common slices.
|
---|
[1b770e40] | 602 |
|
---|
| 603 | To get the new tuple, the range has to be constructed and traversed at compile
|
---|
| 604 | time. The size of the range is the size of the result tuple, with each element
|
---|
| 605 | in the result tuple decided by the matching element of the range, which gives
|
---|
| 606 | the index of the original tuple to place there. The type of the indices may be
|
---|
| 607 | an integral type, but all indices must be in range, otherwise it is a compile
|
---|
| 608 | time error.
|
---|
[7a0e8c8] | 609 |
|
---|
| 610 | In terms of the existing range loops, if you could write this dynamically, it
|
---|
| 611 | would look something like this:
|
---|
| 612 |
|
---|
| 613 | // Implementation of: output = input.RANGE
|
---|
| 614 | dyn output = []
|
---|
| 615 | for ( index : RANGE ) { output = [output, input.index] }
|
---|
| 616 |
|
---|
| 617 | The result of the expression is only has to be considered a tuple if the
|
---|
| 618 | range has two or more values, otherwise it can be considered void or the same
|
---|
| 619 | as indexing the single value in the range.
|
---|
| 620 |
|
---|
[1b770e40] | 621 | Some closing notes, this is dependent on generalized range expressions.
|
---|
[7a0e8c8] | 622 | The iterators proposal has a section on some new range types, this feature
|
---|
| 623 | is intended to be built on that feature. Not simply reuse some of the syntax
|
---|
[12c4a5f] | 624 | that is also used in the special for loop. This may add some requirements
|
---|
| 625 | about the compile-time evaluation on range expressions to make sure these
|
---|
| 626 | expressions can be evaluated at compile-time.
|
---|
[1b770e40] | 627 |
|
---|
[7a0e8c8] | 628 | Implementation
|
---|
| 629 | --------------
|
---|
| 630 | An overview of the implementation details of the new proposal.
|
---|
| 631 |
|
---|
[12c4a5f] | 632 | ### Structured Tuple
|
---|
| 633 |
|
---|
| 634 | There is actually a decision point here. If it can be implemented entirely
|
---|
| 635 | in the standard library, that would be good. If it cannot, it can actually
|
---|
| 636 | be implemented as the same primitive implementation as we use now and may be
|
---|
| 637 | how unstructured tuple packs are implemented.
|
---|
| 638 |
|
---|
| 639 | ### Member Pack Implementation
|
---|
| 640 |
|
---|
| 641 | Implementing member packs can use the same pattern as current built-in tuple
|
---|
| 642 | syntax. This is a two stage process, first is by ... then the it can be
|
---|
| 643 | handled by the existing code for polymorphic but not variadic structures.
|
---|
| 644 |
|
---|
| 645 | forall(Ts...)
|
---|
| 646 | struct tuple {
|
---|
| 647 | Ts get;
|
---|
| 648 | };
|
---|
[1b770e40] | 649 |
|
---|
[12c4a5f] | 650 | So when used with diffences length an hidden declaration is created which
|
---|
| 651 | has the pack parameter replaced with the approprate number of object type
|
---|
| 652 | parameters. Such as in the following examples:
|
---|
[1b770e40] | 653 |
|
---|
[12c4a5f] | 654 | struct tuple$0 {
|
---|
| 655 | };
|
---|
[1b770e40] | 656 |
|
---|
[12c4a5f] | 657 | forall(Ts$0)
|
---|
| 658 | struct tuple$1 {
|
---|
| 659 | Ts$0 get$0;
|
---|
| 660 | };
|
---|
[1b770e40] | 661 |
|
---|
[12c4a5f] | 662 | forall(Ts$0, Ts$1)
|
---|
| 663 | struct tuple$2 {
|
---|
| 664 | Ts$0 get$0;
|
---|
| 665 | Ts$1 get$1;
|
---|
| 666 | };
|
---|
[7a0e8c8] | 667 |
|
---|
[12c4a5f] | 668 | Then it can be handled by the regular boxing code. Detection of these cases
|
---|
| 669 | will have to changed because it is no longer a single case. But the pattern
|
---|
| 670 | does work if the structure has additional fields in it, as they are just
|
---|
| 671 | placed before and after the pack.
|
---|
| 672 |
|
---|
| 673 | forall(U, Ts...)
|
---|
| 674 | struct blob {
|
---|
| 675 | int header;
|
---|
| 676 | Ts data;
|
---|
| 677 | U footer;
|
---|
| 678 | };
|
---|
| 679 |
|
---|
| 680 | forall(U, Ts$0, Ts$1)
|
---|
| 681 | struct blob$2 {
|
---|
| 682 | int header;
|
---|
| 683 | Ts$0 data$0;
|
---|
| 684 | Ts$1 data$1;
|
---|
| 685 | U footer;
|
---|
| 686 | };
|
---|
| 687 |
|
---|
| 688 | The `inline` member pack just effects the resolver. It should be expanded out
|
---|
| 689 | to the member access and then the tuple index expression.
|
---|
| 690 |
|
---|
| 691 | ### Local Declaration Pack
|
---|
| 692 |
|
---|
| 693 | Packs of local declarations can be handled very differently if they are
|
---|
| 694 | concrete or not. Concrete packs can be writen out in full (even if some
|
---|
| 695 | individual elements are polymorphic) can be inlined directly, expanding out
|
---|
| 696 | into a series of object declarations.
|
---|
| 697 |
|
---|
| 698 | [A, B] local = func();
|
---|
| 699 | call(local);
|
---|
| 700 |
|
---|
| 701 | A local$0;
|
---|
| 702 | B local$1;
|
---|
| 703 | ?{}([&local$0, &local$1], func());
|
---|
| 704 | call([local$0, local$1]);
|
---|
| 705 |
|
---|
| 706 | In the polymorphic case though, that entire group needs to be reduced to a
|
---|
| 707 | block that can put into a generic function. Here we can use something like
|
---|
| 708 | the currently existing implementation, where the entire tuple an opaque
|
---|
| 709 | bundle, passed to adapters and thunks for handling.
|
---|
| 710 |
|
---|
| 711 | This is also how tuple declarations are implemented. In the concrete case the
|
---|
| 712 | element names are real and the pack name is replaced with a tuple expression
|
---|
| 713 | of the elements. In the polymorphic case the pack name is real and the
|
---|
| 714 | element names are replaced with tuple index expressions.
|
---|
| 715 |
|
---|
| 716 | ### Unstructured Tuple Implementation
|
---|
| 717 |
|
---|
| 718 | Unstructured tuples have two general implementation stratagies. Either they
|
---|
| 719 | are erased and the packs inlined into their context, or they are wrapped
|
---|
| 720 | up as like a structured tuple, generally using the current implementation.
|
---|
| 721 |
|
---|
| 722 | For example, if a concrete pack (size and all the types are known), appears
|
---|
| 723 | in a parameter list the pack can be split and inlined into the argument list.
|
---|
| 724 |
|
---|
| 725 | void function(int a, [bool, char, short] b, float c);
|
---|
| 726 | // Becomes:
|
---|
| 727 | void function(int a, bool b0, char b1, short b2, float c);
|
---|
| 728 |
|
---|
| 729 | On the other hand, if a polymorphic pack appears in the same place, it will
|
---|
| 730 | have to be wrapped up in a dynamic structure so it different instances can
|
---|
| 731 |
|
---|
| 732 | forall(Ts...)
|
---|
| 733 | void function(int a, Ts b, float c);
|
---|
| 734 | // Becomes (after the implicit packing and unpacking operations):
|
---|
| 735 | forall(T)
|
---|
| 736 | void function(int a, T* b, float c);
|
---|
[7a0e8c8] | 737 |
|
---|
| 738 | ### AST Updates
|
---|
[1b770e40] | 739 |
|
---|
[12c4a5f] | 740 | The AST may already be capable of representing most of this. I have only
|
---|
| 741 | identified one feature that definitely will now work in the current AST, and
|
---|
| 742 | that are the object like tuple declarations with named elements.
|
---|
| 743 |
|
---|
| 744 | Particularly, an object declaration cannot name elements of the tuple.
|
---|
| 745 | To this end a new node type, `TupleDecl`,
|
---|
| 746 | should be added to handle tuple deconstruction declarations
|
---|
| 747 | (other cases can still be handled with `ObjectDecl`).
|
---|
[7a0e8c8] | 748 |
|
---|
| 749 | This would act much like a `FunctionDecl` except for tuples, narrowing the
|
---|
| 750 | possible types, to `TupleType` instances instead of `FunctionType` instances,
|
---|
[1b770e40] | 751 | and storing some additional information. In this case, the names of the
|
---|
[7a0e8c8] | 752 | elements of the tuples.
|
---|
| 753 |
|
---|
| 754 | (I'm not actually going to decide the implementation now, but some early
|
---|
| 755 | examination of the problem suggests that it might be better off wrapping a
|
---|
| 756 | series of `ObjectDecl` rather than just some strings to hold the names.)
|
---|
| 757 |
|
---|
| 758 | Related Features in Other Languages
|
---|
| 759 | -----------------------------------
|
---|
[12c4a5f] | 760 | Other languages have similar features. Organized by the related feature,
|
---|
| 761 | then language (this does mean we can and do hit languages multiple times).
|
---|
[7a0e8c8] | 762 |
|
---|
| 763 | (In hindsight, I may have gone overboard with the number of examples.)
|
---|
| 764 |
|
---|
| 765 | ### Structured Tuples
|
---|
| 766 |
|
---|
| 767 | There are many languages with structured tuples. Usually just called tuples,
|
---|
| 768 | but they usually follow the same rules as a polymorphic anonymous structures,
|
---|
[1b770e40] | 769 | that is to say N types are provided to create a new N-arity tuple. They also
|
---|
[7a0e8c8] | 770 | usually have some special syntax to index the tuples, because there are no
|
---|
[1b770e40] | 771 | field names.
|
---|
[7a0e8c8] | 772 |
|
---|
| 773 | #### Rust
|
---|
[1b770e40] | 774 |
|
---|
[7a0e8c8] | 775 | Rust has the standard tuples as a primitive in the language. Each arity of
|
---|
| 776 | tuple is a different polymorphic type.
|
---|
| 777 |
|
---|
[b0fcd0e] | 778 | Tuple types and expressions are written with a parenthesized, comma separated
|
---|
[7a0e8c8] | 779 | list of types or expressions. To avoid confusion with the grouping "(...)",
|
---|
[b0fcd0e] | 780 | one-element tuples must have a trailing comma (and larger tuples may have a
|
---|
[7a0e8c8] | 781 | trailing comma).
|
---|
| 782 |
|
---|
| 783 | const empty: () = ();
|
---|
| 784 | const single: (usize,) = (12,)
|
---|
| 785 | const double: (usize, isize) = (34, -56);
|
---|
| 786 |
|
---|
| 787 | Element access uses similar syntax to field access (that is "."), but uses an
|
---|
[b0fcd0e] | 788 | integer literal instead of a field name (for example: "pair.1"). Tuples
|
---|
[7a0e8c8] | 789 | support pattern matching with similar syntax to their expression form.
|
---|
| 790 |
|
---|
| 791 | Some tuple features only apply up to 12-arity tuples.
|
---|
[1b770e40] | 792 | It is not directly stated, but I believe the difference in the more limited
|
---|
| 793 | features are implemented in the standard library, one for each arity of tuple.
|
---|
[7a0e8c8] | 794 |
|
---|
| 795 | - https://doc.rust-lang.org/std/primitive.tuple.html
|
---|
| 796 | - https://doc.rust-lang.org/reference/types/tuple.html
|
---|
| 797 | - https://doc.rust-lang.org/reference/expressions/tuple-expr.html
|
---|
| 798 |
|
---|
[12c4a5f] | 799 | #### C++
|
---|
[1b770e40] | 800 |
|
---|
[7a0e8c8] | 801 | Implemented as a template type defined in the standard library. No special
|
---|
| 802 | language features exist to support this, due to the power of C++'s template
|
---|
| 803 | meta-programming.
|
---|
| 804 |
|
---|
[b0fcd0e] | 805 | C++ is also one of the few languages with support for variadic polymorphic
|
---|
[7a0e8c8] | 806 | types, so there is one template that defines all the types. It is written
|
---|
| 807 | as `std::tuple<TYPE...>`, where "TYPE..." is any number of comma separated
|
---|
| 808 | types. This is the standard notation for template type instances.
|
---|
| 809 |
|
---|
[1b770e40] | 810 | There is no special syntax for member access of a tuple. A template function is
|
---|
| 811 | used, e.g., `std::get<0>( tuple )`.
|
---|
[7a0e8c8] | 812 |
|
---|
[12c4a5f] | 813 | C++ is an interesting case because it is one of the few cases where no part
|
---|
| 814 | of the tuple system is built-in. It is entirely created in the standard
|
---|
| 815 | library using templates. This also has massive error messages when something
|
---|
| 816 | goes wrong, hopefully our version of struct tuple will reduce the problem.
|
---|
[1b770e40] | 817 |
|
---|
[12c4a5f] | 818 | C++ also has structured binding, a kind of limited pattern matching. In a
|
---|
| 819 | structured binding declaration, you can write an auto typed declaration where
|
---|
| 820 | a list of identifiers in a `[]` replaces the single identifier.
|
---|
[1b770e40] | 821 |
|
---|
[12c4a5f] | 822 | auto [first, second] = getSize2Tuple();
|
---|
[1b770e40] | 823 |
|
---|
[12c4a5f] | 824 | The base type has to be `auto` because the base type for each element varies.
|
---|
| 825 | Still, qualifiers (cv and reference) on the auto will be distributed to types
|
---|
| 826 | on the individual members.
|
---|
[1b770e40] | 827 |
|
---|
[12c4a5f] | 828 | The type bound to the binding may be an array, a "plain" structure or any
|
---|
| 829 | type that implements the tuple-like interface (`std::tuple_size` and
|
---|
| 830 | `std::tuple_element`).
|
---|
[1b770e40] | 831 |
|
---|
[12c4a5f] | 832 | - https://en.cppreference.com/w/cpp/utility/tuple
|
---|
| 833 | - https://en.cppreference.com/w/cpp/language/structured_binding
|
---|
[1b770e40] | 834 |
|
---|
[7a0e8c8] | 835 | #### Haskell
|
---|
[1b770e40] | 836 |
|
---|
[7a0e8c8] | 837 | Haskell has a special syntax for tuples, but otherwise they are completely
|
---|
| 838 | normal polymorphic types. Because of this, tuples have a maximum size.
|
---|
| 839 | Haskell (98) supports tuples of up to 15 elements and the standard library
|
---|
| 840 | has functions for tuples of up to 7 elements.
|
---|
| 841 |
|
---|
[b0fcd0e] | 842 | The syntax for tuples is a comma separated list of elements. Either element
|
---|
[7a0e8c8] | 843 | types to create a tuple type, or element values to create a tuple value. The
|
---|
[1b770e40] | 844 | context decides among them, such as `(6, "six")` or `(Bool, Char, Int)`.
|
---|
[7a0e8c8] | 845 |
|
---|
[1b770e40] | 846 | Also all the elements can be removed, getting an expression like "(,)" or
|
---|
| 847 | "(,,,)", which can be then be used as a function, for a type, or an expression.
|
---|
[7a0e8c8] | 848 |
|
---|
| 849 | Haskell supports pattern matching as the main way to extract values from a
|
---|
| 850 | tuple, although helper functions "fst" and "snd" are provided for field
|
---|
| 851 | access on two element tuples.
|
---|
| 852 |
|
---|
| 853 | Haskell does not have 0 or 1-element tuples. The nil type, written "()" for
|
---|
| 854 | both type and value, is effectively the 0 element tuple. There is also a type
|
---|
| 855 | called "Solo" that is a polymorphic structure with one field and is used as
|
---|
[1b770e40] | 856 | the 1-element tuple, but has regular Haskell data-type syntax.
|
---|
[7a0e8c8] | 857 |
|
---|
| 858 | - https://www.haskell.org/onlinereport/basic.html
|
---|
| 859 |
|
---|
| 860 | #### OCaml
|
---|
| 861 |
|
---|
[1b770e40] | 862 | OCaml only supports multi-element (2 or more) tuples. It does have the `unit`
|
---|
| 863 | type, which has one value, written `()`. Tuple types are written as a '*'
|
---|
| 864 | separated list of types, tuple values are written as ',' separated list of
|
---|
| 865 | expressions. Pattern matching tuples is supported and uses the same syntax as
|
---|
| 866 | values. Parenthesizing these lists is only needed for grouping.
|
---|
[7a0e8c8] | 867 |
|
---|
| 868 | - https://ocaml.org/docs/basic-data-types#tuples
|
---|
| 869 |
|
---|
| 870 | #### Swift
|
---|
| 871 |
|
---|
[1b770e40] | 872 | Swift has tuple types that use the basic parenthesized, comma separated list of
|
---|
| 873 | types or values. It only supports 0 and 2 or more element tuples (the `Void`
|
---|
| 874 | type is an alias for the empty tuple type).
|
---|
| 875 |
|
---|
| 876 | Swift also supports named tuples. Names can be added before the tuple element,
|
---|
| 877 | both for the tuple type and value. The syntax is a name followed by a colon,
|
---|
| 878 | e.g., `(first: int, second: int)`. These names are a fixed part of the type,
|
---|
| 879 | and can be used as part of field access notation (otherwise numbers are used
|
---|
| 880 | in-place of field names `tuple.0` vs. `tuple.first`).
|
---|
[7a0e8c8] | 881 |
|
---|
| 882 | - https://docs.swift.org/swift-book/documentation/the-swift-programming-language/types/#Tuple-Type
|
---|
| 883 |
|
---|
| 884 | #### Python
|
---|
[1b770e40] | 885 |
|
---|
| 886 | In Python tuples are immutable lists. Because they are dynamically typed,
|
---|
[7a0e8c8] | 887 | there is only one tuple type `tuple`.
|
---|
| 888 |
|
---|
[1b770e40] | 889 | It also has various named tuples. The first, namedtuple, allows naming the
|
---|
| 890 | elements of the tuple. The second, NamedTuple, is actually a way of creating a
|
---|
| 891 | typed record in a normally untyped language.
|
---|
[7a0e8c8] | 892 |
|
---|
| 893 | - https://docs.python.org/3/library/stdtypes.html#sequence-types-list-tuple-range
|
---|
| 894 | - https://docs.python.org/3/library/collections.html#collections.namedtuple
|
---|
| 895 | - https://docs.python.org/3/library/typing.html#typing.NamedTuple
|
---|
| 896 |
|
---|
| 897 | #### LISP
|
---|
[1b770e40] | 898 | As LISP is dynamically typed, its `cons` data type is an untyped pair and is
|
---|
| 899 | (perhaps infamously) the main constructor of all compound data types. The
|
---|
| 900 | function `cons` takes two arguments and builds a pair. Functions `car` and
|
---|
| 901 | `cdr` get the first and second elements of the pair.
|
---|
[7a0e8c8] | 902 |
|
---|
| 903 | ### Packs
|
---|
[b0fcd0e] | 904 | Packs (or unstructured tuples) are a much less common feature. In fact, there
|
---|
[7a0e8c8] | 905 | might just be one language, C++, that supports packs. The most common use
|
---|
| 906 | case for unstructured tuples is returning multiple values, so there is a
|
---|
| 907 | comparison to languages that have that as a special feature.
|
---|
| 908 |
|
---|
| 909 | #### Go
|
---|
| 910 | Go does not have built in tuple types, but it has multi-return syntax that
|
---|
| 911 | looks like the tuple syntax of many other languages.
|
---|
| 912 |
|
---|
| 913 | ```
|
---|
| 914 | func main() {
|
---|
| 915 | i, j := returnIntInt()
|
---|
| 916 | ...
|
---|
| 917 | }
|
---|
| 918 |
|
---|
| 919 | func returnIntInt() (int, int) {
|
---|
| 920 | return 12, 34
|
---|
| 921 | }
|
---|
| 922 | ```
|
---|
| 923 |
|
---|
[12c4a5f] | 924 | Go can unpack multiple return values and pass them all to a function, but
|
---|
| 925 | the unpacked tuple must match the parameter list exactly.
|
---|
| 926 |
|
---|
| 927 | ```
|
---|
| 928 | func acceptIntInt(a int, b int) {
|
---|
| 929 | fmt.Println(a, b)
|
---|
| 930 | }
|
---|
| 931 | ```
|
---|
| 932 |
|
---|
[7a0e8c8] | 933 | - https://golangdocs.com/functions-in-golang
|
---|
| 934 | - https://go.dev/src/go/types/tuple.go
|
---|
| 935 |
|
---|
| 936 | #### Lua
|
---|
[1b770e40] | 937 | Lua is a scripting language that is dynamically typed and stack based. Although
|
---|
| 938 | the stack is usually only directly visible in the C-API, it does allow any
|
---|
[12c4a5f] | 939 | function to return any number of values, one, zero or more, from any return
|
---|
| 940 | expression.
|
---|
[7a0e8c8] | 941 |
|
---|
| 942 | ```
|
---|
| 943 | local funcion f()
|
---|
| 944 | return 12, 34
|
---|
| 945 | end
|
---|
| 946 |
|
---|
| 947 | local i, j = f()
|
---|
| 948 | ```
|
---|
[12c4a5f] | 949 |
|
---|
| 950 | The last argument in an argument list can be an expression - function call -
|
---|
| 951 | with multiple results and all the results are passed to the function after
|
---|
| 952 | other arguments.
|
---|
| 953 |
|
---|
| 954 | Because Lua is dynamically typed, multiple return values are allowed anywhere
|
---|
| 955 | and the length of the value listed is adjusted, discarding any extra values
|
---|
| 956 | and filling in new values with the value `nil`. This is also used if a
|
---|
| 957 | function is called with the incorrect number of arguments.
|
---|
| 958 |
|
---|
| 959 | - https://lua.org/
|
---|
| 960 | - https://lua.org/manual/5.4/manual.html#3.4.12
|
---|
| 961 |
|
---|
| 962 | #### C++
|
---|
| 963 |
|
---|
| 964 | We return to C++ to view template parameter packs. These are the symmetric
|
---|
| 965 | with our pack feature (and both are even used to construct the structured
|
---|
| 966 | tuples in the last section).
|
---|
| 967 |
|
---|
| 968 | C++ template parameter packs are a modification that can be applied to any
|
---|
| 969 | template parameter, so the parameter now takes a series of arguments instead
|
---|
| 970 | of just one. These are used in pack expansion,
|
---|
| 971 | which usually expand to a comma separated list, but it can also be a chain of
|
---|
| 972 | boolean binary operator applications. For example, if the parameter
|
---|
| 973 | `typename... Ts` is in scope, then you can declare `std::tuple<Ts...>` to
|
---|
| 974 | introduce a tuple type, if Ts = bool, char, int then the type becomes
|
---|
| 975 | `std::tuple<bool, char, int>`.
|
---|
| 976 |
|
---|
| 977 | A common use is for perfect argument forwarding, which shows some different
|
---|
| 978 | uses of the pattern:
|
---|
| 979 |
|
---|
| 980 | ```
|
---|
| 981 | template<typename inner_t>
|
---|
| 982 | class Outer {
|
---|
| 983 | inner_t inner;
|
---|
| 984 | public:
|
---|
| 985 | template<typename... Args>
|
---|
| 986 | Outer(Args&&... args) : inner(std::forward<Args>(args)...) {}
|
---|
| 987 | };
|
---|
| 988 | ```
|
---|
| 989 |
|
---|
| 990 | In the first application, `Args&&... args` both uses a pack and introduces
|
---|
| 991 | another one. `Arg0&& arg0, Arg1&& arg1, Arg2&& arg2, ...` is the pattern
|
---|
| 992 | it expands too. The `&&` is used to copy the argument type's reference
|
---|
| 993 | qualifier (the default can strip references away).
|
---|
| 994 |
|
---|
| 995 | The second application, `std::forward<Args>(args)...` uses two packs. These
|
---|
| 996 | are expanded in parallel, and must be the same length (in this case they
|
---|
| 997 | always will be). It also helps show that the `...` is actually the bit doing
|
---|
| 998 | the expansion, it is a suffix "operator" to the expansion pattern.
|
---|
| 999 |
|
---|
| 1000 | There are also fold expressions that use binary operators to combine a pack
|
---|
| 1001 | into a single expression. For example, you can write a variadic sum function
|
---|
| 1002 | that compiles down to the primitive additions.
|
---|
| 1003 |
|
---|
| 1004 | ```
|
---|
| 1005 | template<typename... Args>
|
---|
| 1006 | int sum(Args&&... args) {
|
---|
| 1007 | return (args + ... + 0);
|
---|
| 1008 | }
|
---|
| 1009 | ```
|
---|
| 1010 |
|
---|
| 1011 | C++ is about the best you could ask for in this area, but it does a lot of work
|
---|
| 1012 | at compile time to make this happen.
|
---|
| 1013 |
|
---|
| 1014 | - https://en.cppreference.com/w/cpp/language/template_parameters
|
---|
| 1015 | - https://en.cppreference.com/w/cpp/language/parameter_pack
|
---|
| 1016 | - https://en.cppreference.com/w/cpp/language/fold
|
---|