Context Navigation

source: doc/proposals/tuples.md@ 14c31eb

Visit:

Last change on this file since 14c31eb was 12c4a5f, checked in by Andrew Beach <ajbeach@…>, 11 months ago
Folding in feedback to the tuple proposal and rewriting the last rewrite.
Property mode set to `100644`
File size: 37.6 KB

Rev	Line
[7a0e8c8]	1	Tuples
	2	======
	3
[12c4a5f]	4	This is a proposal discusses update to Cforall tuples as they exist after
	5	the work done by Robert Schluntz (after they were added to Cforall by
	6	Rodolfo Esteves and to K-W C by Dave Till). The new features and ideas
	7	discussed here are designed to address problems that have appeared as tuples
	8	have been used in the past 6 years. Some or all of the changes discussed may
	9	already be implemented.
[7a0e8c8]	10
[1b770e40]	11	The core change is breaking the current restructurable tuples into unstructured
	12	and structured tuples. Unstructured tuples handle most of the existing uses,
	13	with structured tuples filling in a few missing use cases.
[7a0e8c8]	14
	15	Current State of Tuples
	16	-----------------------
[12c4a5f]	17	An overview of the current tuple design is the starting place for the proposal.
[7a0e8c8]	18
	19	### Tuple Syntax
	20
[1b770e40]	21	Currently, tuples have three main components: tuple types, tuple
	22	expressions/values (constructing tuples), and tuple index expressions
	23	(deconstructing tuples).
	24
[12c4a5f]	25	Current syntax for tuple types:
[7a0e8c8]	26
	27	- Nullary: [void] or []
[12c4a5f]	28
	29	[void] f();
	30	[] f();
	31
[7a0e8c8]	32	- Unary: [TYPE]
[1b770e40]	33
[12c4a5f]	34	[int] f();
	35	[double] x = 1.5;
[1b770e40]	36
[12c4a5f]	37	- Multiary: [TYPE, TYPE, ...]
[1b770e40]	38
[12c4a5f]	39	[bool, char] f();
	40	[short, int, long] x;
[1b770e40]	41
[12c4a5f]	42	Tuple types can appear anywhere regular types may be used, except for the
	43	nullary tuple which can only appear where void can be used (usually return
	44	types), with the exception of special case when void is used to make an empty
	45	parameter list.
[7a0e8c8]	46
	47	Current syntax for tuple expressions:
	48
[1b770e40]	49	- Nullary: (Same as `void`, use return without an expression.)
	50
[12c4a5f]	51	[] f() { return; }
[1b770e40]	52
[12c4a5f]	53	- Unary: EXPR (in return only) or {EXPR}
[1b770e40]	54
[12c4a5f]	55	[int] f() { return 3; }
	56	[int] x = {3};
[1b770e40]	57
[12c4a5f]	58	- Multiary: [EXPR, EXPR, ...]
[7a0e8c8]	59
[12c4a5f]	60	[int, int] f() { return [3, 4]; }
	61	[bool, char, int] x = [true, 'a', 3];
[1b770e40]	62
[12c4a5f]	63	Tuple expressions should be useable whenever an expression of that type is
	64	expected. Issues with the current state will be discussed later.
[1b770e40]	65
	66	Current syntax for tuple indexing is an integer constant, where its value
	67	selects a tuple member, e.g.:
[7a0e8c8]	68
[12c4a5f]	69	[char, int] tup = ['a', 0];
	70	char ch = tup.0;
	71	int value = tup.1;
[7a0e8c8]	72
	73	### Mass and Multi-Assignment
	74
[1b770e40]	75	Two special forms of assignment can be used to set values in tuples: mass and
	76	multi. Mass assignment assigns every element in the destination tuple to a
	77	single source value.
[7a0e8c8]	78
	79	[int, long, float] dst;
[12c4a5f]	80	int src = 4;
	81	dst = src;
	82	// Becomes: dst.0 = src; dst.1 = src; dst.2 = src;
[7a0e8c8]	83
[1b770e40]	84	Multi-assignment assigns every element in the destination tuple to the
	85	corresponding element in the source tuple. Both tuples must be the same size
[12c4a5f]	86	and the elements pairs must be assignment compatible conversions.
[7a0e8c8]	87
[1b770e40]	88	[long, int, float] dst;
	89	[int, char, double] src = [1, 'a', 300.0];
[12c4a5f]	90	dst = src;
	91	// Becomes: dst.0 = src.0; dst.1 = src.1; dst.2 = src.1;
[7a0e8c8]	92
	93	### Tuple Restructuring
	94
	95	Tuples can be restructured as part of overload resolution. Function calls
[1b770e40]	96	unpack tuples and repack tuples to match signatures. This semantics is a form
	97	of implicit conversion and is considered during overload resolution.
[7a0e8c8]	98
	99	A simple example is matching multiple parameters of a function to a single
[1b770e40]	100	argument expression, where each parameter is bound to a different element of
	101	the returned tuple.
[7a0e8c8]	102
[1b770e40]	103	[int, int] argFunc();
	104	void parmFunc(int a, int b, int c, int d);
	105
	106	parmFunc(argFunc(), argFunc());
[7a0e8c8]	107
	108	// Roughly equivilent to:
[1b770e40]	109	[int, int] fst = argFunc();
	110	[int, int] snd = argFunc();
	111	parmFunc(fst.0, fst.1, snd.0, snd.1);
[7a0e8c8]	112
	113	### Tuple Casts
	114
[b0fcd0e]	115	C-style casts can be used on tuples. These are usually conversion casts (casts
[1b770e40]	116	that perform operations on the cast type, as opposed to reinterpreting the
[7a0e8c8]	117	existing value).
	118
	119	Tuple casts can remove elements from the end of a tuple and apply a recursive
	120	cast to any of the elements. As an example:
	121
	122	[char, char, char] x = ['a', 'b', 'c'];
	123	([int, char])x;
	124
[1b770e40]	125	This casts the first element type of x from a char to an int and drops the last
[b0fcd0e]	126	element. The second line can be replaced with the following code, which creates
[1b770e40]	127	a new tuple by directly casting the kept elements of the old tuple:
[7a0e8c8]	128
	129	[(int)x.0, (char)x.1];
	130
	131	Note, tuple casting is more restricted than the implicit tuple restructuring.
	132	It cannot do any restructuring beyond dropping the trailing elements of
[1b770e40]	133	tuples. For example,
	134
	135	int i; char c; bool b;
	136	[i, b, c, i] = [i, [b, c], i];
	137	[int, [bool, char], int] w;
	138	[i, b, c, i] = w;
[12c4a5f]	139	[i, b, c] = ([int, [bool, char]])w;
	140	// But this does not work:
	141	[i, b, c, i] = ([int, bool, char, int])w;
[7a0e8c8]	142
	143	### Polymorphic Tuple Parameters
	144
	145	One kind of polymorphic parameter is the tuple type parameter, previously
	146	noted by the `ttype` keyword and now marked by a tailing ellipses (`...`).
	147	Although described as a tuple type, it is usually viewed as its flattened
	148	sequence of types.
	149
[12c4a5f]	150	// Fixed Length Max - Base Case:
	151	forall(T \| { int ?>?( T, T ); })
	152	T max(T v1, T v2) { return v1 > v2 ? v1 : v2; }
[1b770e40]	153
[12c4a5f]	154	// Variable Length Max - Recursive:
	155	forall(T, Ts... \| { T max(T, T); T max(Ts); })
[7a0e8c8]	156	T max(T arg, Ts args) {
	157	return max(arg, max(args));
	158	}
	159
[12c4a5f]	160	This feature introduces a type name into scope (Ts). It is used as a type and
	161	the value it declares (args) is a tuple value, and that the second assertion
	162	"T max(Ts);" matches takes a single tuple, but with restructuring it can also
	163	match functions that take a series of non-tuple arguments.
	164
	165	A good way to explain this is to show a partial implementation the following
	166	call `max(int_a, int_b, int_c);` into actual code:
	167
	168	void _thunk0(int v1, int v2) {
	169	return max{?>?}(v1, v2);
	170	}
	171	void _thunk1([int, int] tup) {
	172	return max{?>?}(tup.0, tup.1);
	173	}
	174	max{_thunk0, _thunk1}(int_a, [int_b, int_c]);
	175
	176	The part to highlight is that "_thunk1", one of the functions created to
	177	make an inexact match that can be used as a call site into an exact match
	178	so the function can be passed through function parameters.
	179	Although, the tuple type `[int, int]` matches a parameter list `int, int`,
	180	just with a bit of low lever restructuring.
[1b770e40]	181
[12c4a5f]	182	In larger cases (with four or more parameters in the first case), the
	183	recursive case will work similarly, creating an intermidate function to
	184	restructure the tuple into a new call such as:
	185
	186	void _thunk2([int, int, int] tup) {
	187	return max{_thunk0, _thunk1}(tup.0, [tup.1, tup.2]);
	188	}
[7a0e8c8]	189
	190	Issues with the Current State
	191	-----------------------------
[1b770e40]	192	There are a variety of problems with the current implementation which need to
	193	be fixed.
[7a0e8c8]	194
	195	### Tuples are not Objects
	196
[1b770e40]	197	Spoilers: this proposal actually takes them even further away from being
	198	objects, but illustrates why the current version is not as useful is it could
	199	be.
	200
[12c4a5f]	201	Tuples do not have the lifetime operators (copy construction, copy assignment
	202	and destructor) that define an object type in Cforall. This means they are
	203	not polymorphic otypes, they are objects in C's usage of the word. This means
	204	that tuples cannot be used as a single object in
	205
	206	There are several reasons for this. The fluid nature of tuples (flattening
	207	and restructuring) means that matching against a tuple type is fluid in some
	208	inconvent ways.
[7a0e8c8]	209
[12c4a5f]	210	forall(T, U) void ?{}([T, U] & this;
	211	void ?{}([int, int, int] & this);
	212	// What constructor(s) should be used here?
	213	[int, [int, int]] pair;
[7a0e8c8]	214
[12c4a5f]	215	Should the polymorpic constructor be applied twice or should the tuple be
	216	restructured and the three element constructor be used? This is not really
	217	something a user should be wondering about (especially since tuples should
	218	always be simple containers wrapping their elements).
[7a0e8c8]	219
[b0fcd0e]	220	### Providing TType Arguments is Inconsistent
[7a0e8c8]	221
[1b770e40]	222	The syntax for ttype arguments is slightly inconsistent. It has not come up
	223	much yet, because you do not directly provide ttype polymorphic arguments to
[b0fcd0e]	224	functions and there are very few existing use-cases for ttype structures.
[7a0e8c8]	225
[12c4a5f]	226	forall(T, Us...)
	227	void function(T head, Us tail) {}
	228
	229	forall(T, Us...)
	230	struct Type {};
	231
	232	int a; bool b; char c;
	233	function(a, b, c);
	234	function(a, [b, c]);
	235	// Doesn't work: Type(int, bool, char) x;
	236	Type(int, [bool, char]) x;
	237
	238	The `function` case works either way because the tuple value can be
	239	restructured if it does not match exactly. The `Type` case though does not
	240	allow restructuring when the type is passed as a type value. This may be
	241	the most correct form, but seems surprising compared to the common use.
[7a0e8c8]	242
[12c4a5f]	243	### Unary Tuple Value Syntax
	244
	245	Unary tuples don't use the standard tuple syntax and one of the two alternate
	246	syntax options your may use instead doesn't work consistently. Solving both
	247	of these issues would help with consistency.
[7a0e8c8]	248
	249	### Syntax Conflict
	250
[12c4a5f]	251	The tuple syntax conflicts with designators and the C23 (C++-style) attribute
[1b770e40]	252	syntax.
	253
[12c4a5f]	254	int a[10] = { [2] = 3 };
	255
	256	Here [2] = 3 is an array designator, but has the same syntax as an assignment
	257	to a tuple of references, it isn't until the resolver determains that 2 is
	258	not a reference that that case can be ruled out.
	259
	260	[[ unused ]] [[int, int], int] x;
	261
	262	Here the opening `[[` could be a nested tuple or the beginning of an
	263	attribute, the parser can't figure out until later.
[1b770e40]	264
	265	These conflicts break C compatibility goals of Cforall. Designators had to
	266	their syntax change and Cforall cannot parse the new attributes.
[12c4a5f]	267	The behaviour of these cases, if they could be parsed, should be unchanged.
[7a0e8c8]	268
	269	Change in Model
	270	---------------
	271	This proposal modifies the existing tuples to better handle its main use
[12c4a5f]	272	cases. There are some use cases that are no longer handled, so in addition
	273	to changing the existing "restructured tuples" to "unstructured tuples"
	274	(or packs) a new "structured tuple" is added in the form of struct tuple.
[7a0e8c8]	275
[12c4a5f]	276	The new tuples is even more "unstructured" than before. New tuples are
[1b770e40]	277	considered a sequence of types or typed entities. These packs are then unpacked
	278	into the surrounding context. Put differently, tuples are now flattened as much
	279	as possible, with some places (like parameter lists) being treated as an
	280	implicit tuple and the tuple being flattened into that.
[7a0e8c8]	281
[1b770e40]	282	Structured tuples are now a separate feature: a structure called "tuple".
	283	These are polymorphic structures; an instance should act as a structure, except
	284	its fields are accessed using indices instead of field names. Experience so far
	285	is that structured tuples are not used often, but fill in the use cases that
	286	unstructured tuples no longer support.
[7a0e8c8]	287
[12c4a5f]	288	Note that the underlying implementation might not actually look like this,
	289	but this is how it should seem to the user.
[7a0e8c8]	290
	291	Changed Features
	292	----------------
[b0fcd0e]	293	Some of the concrete changes to the features of the language.
[7a0e8c8]	294
[12c4a5f]	295	### Unstructured Tuple / Pack Type
	296
	297	The evolution of our current tuples (called packs in an attempt to make the
	298	names more distinct). They work like the current tuples except they are
	299	always flattened. You might still be able to declare nested tuples, but these
	300	are not meaningful, they are flattened to a single level and, where
	301	approprate, is inlined into the sounding content.
	302
	303	[bool, [char, int], long] x;
	304	// Becomes:
	305	[bool, char, int, long] x;
	306
	307	void f(bool a0, [char, [int, long]] a1, float a2);
	308	// Becomes:
	309	void f(bool a0, char a1_0, int a1_1, long a1_2, float a2);
	310
	311	[,] f(int a0, [,] a1, [bool,] a2);
	312	// Becomes:
	313	void f(int a0, bool a2);
	314
	315	This actually means that tuples do not participate in overload resolution
	316	in the way the do currently. Restructuring is always free because they are
	317	always reduced to the same form as part of type matching.
	318
	319	Packs are still not object types and do not have lifetime functions. They
	320	cannot be used as object types, nor as any data type, but they can be still
	321	used as ttypes.
	322
	323	(Unless they need to have lifetime functions for other features.)
	324
[7a0e8c8]	325	### Structured Tuple Type
	326
[12c4a5f]	327	There is a standard library or built-in type named `tuple`, it does not need
	328	a special syntax to write types or values. The type definition might need
	329	some primitive support, but if supported as a regular type would look
	330	something like this:
[7a0e8c8]	331
	332	forall(Ts...)
	333	struct tuple {
[12c4a5f]	334	inline Ts all;
[7a0e8c8]	335	};
	336
[12c4a5f]	337	This type wraps up an unstructured tuple, in this case a pack of members,
	338	and gives it "edges" and so structure. It also gives tuples an interface
	339	more like a normal structure type, for the anonymous structure use case.
	340
	341	The struct tuple does have lifetime functions, can use list initializer
	342	syntax and otherwise be treated as a normal structure. You can use regular
	343	member access syntax to convert the struct tuple into a unstructured pack,
	344	writing `object.all`.
	345
	346	The `inline` modifier is a new specialized feature to allow you use tuple
	347	index expressions directly on the type. With or without inline you should
	348	be able to chain a tuple index expression onto the member expression, such
	349	as `object.all.1`. The inline specifier allows you to skip the middle
	350	and use `object.1`.
	351
	352	More complex operations can usually be preformed by taking the pack out of
	353	the struct tuple and acting on it. Polymorphic operations only have to go one
	354	level down to get to base operations, there is no need for recursive
	355	construction, nor manually coding different sizes of tuple. This should give
	356	most of the ease of working with a primitive tuple, because effectively you
	357	are except it has been given fixed edges.
	358
	359	A tuple struct should also be an object type, and let's you pass multiple
	360	values through a single polymorphic slot.
	361
	362	A note on the naming here, because there are more pieces that need names
	363	instead of symbols. The name `tuple` follows because that is the common name,
	364	although this means this is now the default tuple in some sense. The name of
	365	the pack was picked to work with inline and make the difference between `.0`
	366	and `.all` clear. (If inline doesn't work then `get` might be a better name.)
[7a0e8c8]	367
	368	### Type Qualifier Distribution
	369
	370	Because tuples are no longer object types, applying type modifiers to them,
	371	such as cv-qualifiers and pointers, no longer makes sense. That syntax is
	372	now considered distribution, applying the type modifier to each element of
	373	the tuple.
	374
[1b770e40]	375	Previously `const [int, bool] &` would mean a const reference to a tuple of an
	376	integer and a boolean. Now it means an alias for `[const int &, const bool &]`,
	377	a tuple of a reference to a constant integer and a reference to a constant
	378	boolean. This also applies to polymorphic tuple type packs `Ts &` in
	379	polymorphic functions.
[7a0e8c8]	380
[1b770e40]	381	This new approach can specify restrictions on tuple variables as for a single
	382	type variable. For example, this approach can replace the special cased tuple
	383	operations multi-assignment (N-to-N) and mass-assignment (1-to-N).
[7a0e8c8]	384
	385	// Multi-Assignment
	386	forall(T, Us... \|
	387	{ T & ?=?(T &, T const &); Us & ?=?(Us &, Us const &); })
	388	[T, Us] & ?=?(T & dst, Us & dsts, T const & src, Us const & srcs) {
	389	dst = src;
	390	dsts = srcs;
	391	return [dst, dsts];
	392	}
	393
	394	// Mass-Assignment
	395	forall(T, U, Vs... \|
	396	{ U & ?=?(U &, T const &); Vs & ?=?(Vs &, T const &); })
	397	[U, Vs] & ?=?(U & dst, Vs & dsts, T const & src) {
	398	dst = src;
	399	dsts = src;
	400	return [dst, dsts];
	401	}
	402
	403	These may not work exactly as given (for one, the copy assignment assertion
[b0fcd0e]	404	in the first function would likely be redundant/conflicting with the implicit
[7a0e8c8]	405	assertions on the parameters), but they show the pattern. Multi-assignment
[b0fcd0e]	406	also would be very hard to write with simple tuple types, because the
[7a0e8c8]	407	relationship between the two halves of the parameter list does have to line
	408	up, that cannot be enforced with two different tuples.
	409
[12c4a5f]	410	This example doesn't have to be implemented this way because we do have the
	411	special case operations for these already.
	412
[7a0e8c8]	413	### Type Packs
	414
[1b770e40]	415	This approach is not a new feature, but a reframing/extension of existing tuple
	416	tuple polymorphic parameters as polymorphic type packs. The old `Vars...`
	417	syntax introduces a pack of types into scope. It can be used in much the same
	418	way as a tuple, but in some new ways to.
[7a0e8c8]	419
[1b770e40]	420	The primary existing use remains: to use a polymorphic pack in a parameter
	421	list, both as part of an assertion and in the signature of the main
	422	function. The difference is that this is not an enclosed tuple, but a series of
	423	types. The only effective difference this makes is it does not prefer to match
	424	another tuple/pack.
[7a0e8c8]	425
	426	This pattern continues to a parameter defined with a pack of types, which
[b0fcd0e]	427	is considered a pack of parameters, and the name it introduces is a pack
[7a0e8c8]	428	of variables, or a flattened tuple.
	429
	430	forall(Params...)
	431	void function(Params values);
	432
[1b770e40]	433	New use cases include declarations of members and variables. For example, the
	434	creation of a structured tuple structure:
[7a0e8c8]	435
	436	forall(Fields...)
	437	struct tuple {
	438	Fields get;
	439	};
	440
	441	This is again, treated as a pack of members. They have the same layout as
[b0fcd0e]	442	if they were written out by hand. Now, the name get is still accessed as if
[7a0e8c8]	443	it was a regular, singular, member. The result of that expression is a
	444	pack expression, a tuple of all the field accesses, which can be used with a
	445	tuple index expression to access the underlying members. For example:
	446
	447	tuple(bool, char, int) data = { ... };
	448	// This expression:
	449	data.get.2;
	450	// Is the same as (if the fields had these names):
	451	data.[__anonymous0, __anonymous1, __anonymous2].2;
	452
	453	For local declarations it works similarly, except the name introduced is
[12c4a5f]	454	directly usable as a tuple instead of going through a member access.
[7a0e8c8]	455
	456	forall(Objects...)
	457	void function( ??? ) {
	458	???
	459	Objects objs = ???;
	460	???
	461	}
	462
[12c4a5f]	463	### Tuple Pack Restructuring
	464
	465	Implicit restructuring still occurs, but is not considered as part of
	466	overload resolution. The structure of tuples is ignored, where individual
	467	tuples begin or end is not considered, just the sequence of types.
	468
	469	This can be viewed as flattening both sides (the arguments/caller and the
	470	parameters/callee) before type resolution. Both any nested packs and packs
	471	into any larger context:
	472
	473	call(false, ['a', [-7,]], 3.14)
	474	// Inline Nesting:
	475	call(false, ['a', -7], 3.14)
	476	// Inline into Context:
	477	call(false, 'a', -7, 3.14)
	478
	479	The main context where tuples are inlined are parameter lists, in that you
	480	can consider a parameter list as a single pack. Return types could be viewed
	481	in this way, but because they are presented in a single type, they are
	482	already wrapped in a tuple.
	483
	484	[int, [bool, char]] f();
	485	// Inline Nesting:
	486	[int, bool, char] f();
	487
	488	This also happens "after" any polymorphic parameters are instantiated.
	489
	490	Empty packs are also collapsed, logically being removed or replaced with
	491	void as in the following example:
	492
	493	// For example, this function:
	494	forall(Ts...) Ts example(int first, Ts middle, int last);
	495	// With Ts = [,], should not be treated as:
	496	[,] example(int first, [,] middle, int last);
	497	// But instead:
	498	void example(int first, int last);
	499
	500	Or closer to it, there are some slight differences between the nullary tuple
	501	and void, for instance creating some instances for consistency with the other
	502	tuple types.
	503
	504	The struct tuple is never implicitly restructured.
	505
[7a0e8c8]	506	### Tuple Declaration/Deconstruction
	507
	508	Declaring a tuple acts as a pack of variable declarations. When this is done
[1b770e40]	509	with a written out type (as opposed to a polymorphic parameter above), then the
[7a0e8c8]	510	elements of the tuple can be named.
	511
[12c4a5f]	512	[int quo, int rem] quorem = divmod(a, b);
[7a0e8c8]	513
	514	Here `ret` refers to the tuple, the entire pack, while `quo` and `rem`
	515	give explicit names to elements of the tuple. Not all the names have to be
	516	provided, at the very least, any element name can be omitted if the pack name
	517	is provided, and the pack name can be omitted if all the element names are
	518	provided. That would ensure every element can be accessed, but it could be
	519	reduced even more, effectively immediately dropping some values if there is
	520	no named to access it.
	521
[12c4a5f]	522	This does not add any new functionality, more it is a qualify of life feature
	523	making it very easy to give elements of a tuple descriptive names
	524	instead of trying to combine a name for the whole tuple with an index.
	525	(`quo` and `rem` vs. `quorem.0` and `quorem.1`)
[1b770e40]	526
[7a0e8c8]	527	### Tuple Casts
	528
	529	Tuple casts are no longer allowed to do any restructuring. Internal
	530	restructuring would not be useful, as there is no internal structure.
	531	Dropping tail elements could be added back in but it is a narrow use case
	532	so it may be replaced with other features (for example: tuple slicing).
	533
[12c4a5f]	534	// Allowed:
	535	([bool, short])tuple_bool_char;
	536	([int, float])tuple_float_float;
[7a0e8c8]	537
[12c4a5f]	538	// Forbidden:
	539	([int, int, int])tuple_int_int;
	540	([long, [float, int]])tuple_long_float_int;
[7a0e8c8]	541
[12c4a5f]	542	### New Tuple Literal Syntax
[7a0e8c8]	543
[12c4a5f]	544	In addition to the main redesign of this proposal, the syntax is updated to
	545	be more consistant. It is now a "[]" list with "," separators and a ","/comma
	546	terminator that is optional for tuples with 2 or more elements (those that
	547	already have a comma in them) and manditory in other tuples.
[7a0e8c8]	548
[12c4a5f]	549	It should also be symmetric between types and value, differing only in that
	550	an element (ITEM below) of a tuple type is a type and the element of a tuple
	551	value is a value.
[7a0e8c8]	552
[12c4a5f]	553	- Nullary Tuple: [,]
	554	- Unary Tuples: [ITEM,]
	555	- Multiary Tuples: [ITEM, ITEM] [ITEM, ITEM,] [ITEM, ITEM, ITEM] ...
[7a0e8c8]	556
[12c4a5f]	557	Currently, although syntax is provided to write them, but manually writing
	558	a nullary or unary tuple should be very rare. This is because a nullary tuple
	559	can either be erased or treated as void, while unary tuples can be treated
	560	as their element type. Similary for nested tuples, can be treated as the
	561	flattened tuple.
[7a0e8c8]	562
[12c4a5f]	563	(Update: Orignal nullary tuple proposal was `[]`, but that had some issues.)
[7a0e8c8]	564
[12c4a5f]	565	### TType Argument Syntax
	566
	567	It would be nice to seamlessly provide packs of types in polymorphic
	568	argument lists, however doing so requires a new restriction.
	569
	570	As an example of the change, consider providing arguments to a polymorphic
	571	structure with a otype and a ttype parameter.
[7a0e8c8]	572
[12c4a5f]	573	forall(T, Us...) struct D {...};
	574	D(int, [,]) // D(int)
	575	D(bool, [char,]) // D(bool, char)
	576	D(long, [int, short]) // D(long, int, short)
	577
	578	This is a nice syntaxtic improvement for the common case, it does restrict
	579	some of the more complex cases, such as cases with two ttypes. There are
	580	various restrictions on this for functions. If the rules make it unambigous
	581	then the grouping could be omitted, or a more advanced version, it can be
	582	ommited only in the indivual cases where it is unambigous.
[7a0e8c8]	583
	584	### Tuple Slicing (Range Tuple Indexing)
	585
	586	(Disclaimer: this is a bit more impulsive, see end for notes.)
	587
[1b770e40]	588	This is an extension to tuple indexing. Currently, only single location of a
	589	tuple can be index, extracting the element at that location. By extending the
	590	index expression to be a list-range a multiple sub-tuples can be extracted.
	591
[12c4a5f]	592	['a', 'b', 'c', 'd', 'e', 'f'].0~3
	593	['a', 'b', 'c', 'd', 'e', 'f'].1~5~2
	594	// Produces:
	595	['a', 'b', 'c']
	596	['b', 'd']
[7a0e8c8]	597
[12c4a5f]	598	In words, the first indexes the first 3 tuple elements (0, 1, 2, stopping
	599	before index 3) and the second indexes elements starting at index 1, stoping
	600	before index 5 going in steps of 2. This does not allow for arbitary
	601	selections, but it covers many common slices.
[1b770e40]	602
	603	To get the new tuple, the range has to be constructed and traversed at compile
	604	time. The size of the range is the size of the result tuple, with each element
	605	in the result tuple decided by the matching element of the range, which gives
	606	the index of the original tuple to place there. The type of the indices may be
	607	an integral type, but all indices must be in range, otherwise it is a compile
	608	time error.
[7a0e8c8]	609
	610	In terms of the existing range loops, if you could write this dynamically, it
	611	would look something like this:
	612
	613	// Implementation of: output = input.RANGE
	614	dyn output = []
	615	for ( index : RANGE ) { output = [output, input.index] }
	616
	617	The result of the expression is only has to be considered a tuple if the
	618	range has two or more values, otherwise it can be considered void or the same
	619	as indexing the single value in the range.
	620
[1b770e40]	621	Some closing notes, this is dependent on generalized range expressions.
[7a0e8c8]	622	The iterators proposal has a section on some new range types, this feature
	623	is intended to be built on that feature. Not simply reuse some of the syntax
[12c4a5f]	624	that is also used in the special for loop. This may add some requirements
	625	about the compile-time evaluation on range expressions to make sure these
	626	expressions can be evaluated at compile-time.
[1b770e40]	627
[7a0e8c8]	628	Implementation
	629	--------------
	630	An overview of the implementation details of the new proposal.
	631
[12c4a5f]	632	### Structured Tuple
	633
	634	There is actually a decision point here. If it can be implemented entirely
	635	in the standard library, that would be good. If it cannot, it can actually
	636	be implemented as the same primitive implementation as we use now and may be
	637	how unstructured tuple packs are implemented.
	638
	639	### Member Pack Implementation
	640
	641	Implementing member packs can use the same pattern as current built-in tuple
	642	syntax. This is a two stage process, first is by ... then the it can be
	643	handled by the existing code for polymorphic but not variadic structures.
	644
	645	forall(Ts...)
	646	struct tuple {
	647	Ts get;
	648	};
[1b770e40]	649
[12c4a5f]	650	So when used with diffences length an hidden declaration is created which
	651	has the pack parameter replaced with the approprate number of object type
	652	parameters. Such as in the following examples:
[1b770e40]	653
[12c4a5f]	654	struct tuple$0 {
	655	};
[1b770e40]	656
[12c4a5f]	657	forall(Ts$0)
	658	struct tuple$1 {
	659	Ts$0 get$0;
	660	};
[1b770e40]	661
[12c4a5f]	662	forall(Ts$0, Ts$1)
	663	struct tuple$2 {
	664	Ts$0 get$0;
	665	Ts$1 get$1;
	666	};
[7a0e8c8]	667
[12c4a5f]	668	Then it can be handled by the regular boxing code. Detection of these cases
	669	will have to changed because it is no longer a single case. But the pattern
	670	does work if the structure has additional fields in it, as they are just
	671	placed before and after the pack.
	672
	673	forall(U, Ts...)
	674	struct blob {
	675	int header;
	676	Ts data;
	677	U footer;
	678	};
	679
	680	forall(U, Ts$0, Ts$1)
	681	struct blob$2 {
	682	int header;
	683	Ts$0 data$0;
	684	Ts$1 data$1;
	685	U footer;
	686	};
	687
	688	The `inline` member pack just effects the resolver. It should be expanded out
	689	to the member access and then the tuple index expression.
	690
	691	### Local Declaration Pack
	692
	693	Packs of local declarations can be handled very differently if they are
	694	concrete or not. Concrete packs can be writen out in full (even if some
	695	individual elements are polymorphic) can be inlined directly, expanding out
	696	into a series of object declarations.
	697
	698	[A, B] local = func();
	699	call(local);
	700
	701	A local$0;
	702	B local$1;
	703	?{}([&local$0, &local$1], func());
	704	call([local$0, local$1]);
	705
	706	In the polymorphic case though, that entire group needs to be reduced to a
	707	block that can put into a generic function. Here we can use something like
	708	the currently existing implementation, where the entire tuple an opaque
	709	bundle, passed to adapters and thunks for handling.
	710
	711	This is also how tuple declarations are implemented. In the concrete case the
	712	element names are real and the pack name is replaced with a tuple expression
	713	of the elements. In the polymorphic case the pack name is real and the
	714	element names are replaced with tuple index expressions.
	715
	716	### Unstructured Tuple Implementation
	717
	718	Unstructured tuples have two general implementation stratagies. Either they
	719	are erased and the packs inlined into their context, or they are wrapped
	720	up as like a structured tuple, generally using the current implementation.
	721
	722	For example, if a concrete pack (size and all the types are known), appears
	723	in a parameter list the pack can be split and inlined into the argument list.
	724
	725	void function(int a, [bool, char, short] b, float c);
	726	// Becomes:
	727	void function(int a, bool b0, char b1, short b2, float c);
	728
	729	On the other hand, if a polymorphic pack appears in the same place, it will
	730	have to be wrapped up in a dynamic structure so it different instances can
	731
	732	forall(Ts...)
	733	void function(int a, Ts b, float c);
	734	// Becomes (after the implicit packing and unpacking operations):
	735	forall(T)
	736	void function(int a, T* b, float c);
[7a0e8c8]	737
	738	### AST Updates
[1b770e40]	739
[12c4a5f]	740	The AST may already be capable of representing most of this. I have only
	741	identified one feature that definitely will now work in the current AST, and
	742	that are the object like tuple declarations with named elements.
	743
	744	Particularly, an object declaration cannot name elements of the tuple.
	745	To this end a new node type, `TupleDecl`,
	746	should be added to handle tuple deconstruction declarations
	747	(other cases can still be handled with `ObjectDecl`).
[7a0e8c8]	748
	749	This would act much like a `FunctionDecl` except for tuples, narrowing the
	750	possible types, to `TupleType` instances instead of `FunctionType` instances,
[1b770e40]	751	and storing some additional information. In this case, the names of the
[7a0e8c8]	752	elements of the tuples.
	753
	754	(I'm not actually going to decide the implementation now, but some early
	755	examination of the problem suggests that it might be better off wrapping a
	756	series of `ObjectDecl` rather than just some strings to hold the names.)
	757
	758	Related Features in Other Languages
	759	-----------------------------------
[12c4a5f]	760	Other languages have similar features. Organized by the related feature,
	761	then language (this does mean we can and do hit languages multiple times).
[7a0e8c8]	762
	763	(In hindsight, I may have gone overboard with the number of examples.)
	764
	765	### Structured Tuples
	766
	767	There are many languages with structured tuples. Usually just called tuples,
	768	but they usually follow the same rules as a polymorphic anonymous structures,
[1b770e40]	769	that is to say N types are provided to create a new N-arity tuple. They also
[7a0e8c8]	770	usually have some special syntax to index the tuples, because there are no
[1b770e40]	771	field names.
[7a0e8c8]	772
	773	#### Rust
[1b770e40]	774
[7a0e8c8]	775	Rust has the standard tuples as a primitive in the language. Each arity of
	776	tuple is a different polymorphic type.
	777
[b0fcd0e]	778	Tuple types and expressions are written with a parenthesized, comma separated
[7a0e8c8]	779	list of types or expressions. To avoid confusion with the grouping "(...)",
[b0fcd0e]	780	one-element tuples must have a trailing comma (and larger tuples may have a
[7a0e8c8]	781	trailing comma).
	782
	783	const empty: () = ();
	784	const single: (usize,) = (12,)
	785	const double: (usize, isize) = (34, -56);
	786
	787	Element access uses similar syntax to field access (that is "."), but uses an
[b0fcd0e]	788	integer literal instead of a field name (for example: "pair.1"). Tuples
[7a0e8c8]	789	support pattern matching with similar syntax to their expression form.
	790
	791	Some tuple features only apply up to 12-arity tuples.
[1b770e40]	792	It is not directly stated, but I believe the difference in the more limited
	793	features are implemented in the standard library, one for each arity of tuple.
[7a0e8c8]	794
	795	- https://doc.rust-lang.org/std/primitive.tuple.html
	796	- https://doc.rust-lang.org/reference/types/tuple.html
	797	- https://doc.rust-lang.org/reference/expressions/tuple-expr.html
	798
[12c4a5f]	799	#### C++
[1b770e40]	800
[7a0e8c8]	801	Implemented as a template type defined in the standard library. No special
	802	language features exist to support this, due to the power of C++'s template
	803	meta-programming.
	804
[b0fcd0e]	805	C++ is also one of the few languages with support for variadic polymorphic
[7a0e8c8]	806	types, so there is one template that defines all the types. It is written
	807	as `std::tuple<TYPE...>`, where "TYPE..." is any number of comma separated
	808	types. This is the standard notation for template type instances.
	809
[1b770e40]	810	There is no special syntax for member access of a tuple. A template function is
	811	used, e.g., `std::get<0>( tuple )`.
[7a0e8c8]	812
[12c4a5f]	813	C++ is an interesting case because it is one of the few cases where no part
	814	of the tuple system is built-in. It is entirely created in the standard
	815	library using templates. This also has massive error messages when something
	816	goes wrong, hopefully our version of struct tuple will reduce the problem.
[1b770e40]	817
[12c4a5f]	818	C++ also has structured binding, a kind of limited pattern matching. In a
	819	structured binding declaration, you can write an auto typed declaration where
	820	a list of identifiers in a `[]` replaces the single identifier.
[1b770e40]	821
[12c4a5f]	822	auto [first, second] = getSize2Tuple();
[1b770e40]	823
[12c4a5f]	824	The base type has to be `auto` because the base type for each element varies.
	825	Still, qualifiers (cv and reference) on the auto will be distributed to types
	826	on the individual members.
[1b770e40]	827
[12c4a5f]	828	The type bound to the binding may be an array, a "plain" structure or any
	829	type that implements the tuple-like interface (`std::tuple_size` and
	830	`std::tuple_element`).
[1b770e40]	831
[12c4a5f]	832	- https://en.cppreference.com/w/cpp/utility/tuple
	833	- https://en.cppreference.com/w/cpp/language/structured_binding
[1b770e40]	834
[7a0e8c8]	835	#### Haskell
[1b770e40]	836
[7a0e8c8]	837	Haskell has a special syntax for tuples, but otherwise they are completely
	838	normal polymorphic types. Because of this, tuples have a maximum size.
	839	Haskell (98) supports tuples of up to 15 elements and the standard library
	840	has functions for tuples of up to 7 elements.
	841
[b0fcd0e]	842	The syntax for tuples is a comma separated list of elements. Either element
[7a0e8c8]	843	types to create a tuple type, or element values to create a tuple value. The
[1b770e40]	844	context decides among them, such as `(6, "six")` or `(Bool, Char, Int)`.
[7a0e8c8]	845
[1b770e40]	846	Also all the elements can be removed, getting an expression like "(,)" or
	847	"(,,,)", which can be then be used as a function, for a type, or an expression.
[7a0e8c8]	848
	849	Haskell supports pattern matching as the main way to extract values from a
	850	tuple, although helper functions "fst" and "snd" are provided for field
	851	access on two element tuples.
	852
	853	Haskell does not have 0 or 1-element tuples. The nil type, written "()" for
	854	both type and value, is effectively the 0 element tuple. There is also a type
	855	called "Solo" that is a polymorphic structure with one field and is used as
[1b770e40]	856	the 1-element tuple, but has regular Haskell data-type syntax.
[7a0e8c8]	857
	858	- https://www.haskell.org/onlinereport/basic.html
	859
	860	#### OCaml
	861
[1b770e40]	862	OCaml only supports multi-element (2 or more) tuples. It does have the `unit`
	863	type, which has one value, written `()`. Tuple types are written as a '*'
	864	separated list of types, tuple values are written as ',' separated list of
	865	expressions. Pattern matching tuples is supported and uses the same syntax as
	866	values. Parenthesizing these lists is only needed for grouping.
[7a0e8c8]	867
	868	- https://ocaml.org/docs/basic-data-types#tuples
	869
	870	#### Swift
	871
[1b770e40]	872	Swift has tuple types that use the basic parenthesized, comma separated list of
	873	types or values. It only supports 0 and 2 or more element tuples (the `Void`
	874	type is an alias for the empty tuple type).
	875
	876	Swift also supports named tuples. Names can be added before the tuple element,
	877	both for the tuple type and value. The syntax is a name followed by a colon,
	878	e.g., `(first: int, second: int)`. These names are a fixed part of the type,
	879	and can be used as part of field access notation (otherwise numbers are used
	880	in-place of field names `tuple.0` vs. `tuple.first`).
[7a0e8c8]	881
	882	- https://docs.swift.org/swift-book/documentation/the-swift-programming-language/types/#Tuple-Type
	883
	884	#### Python
[1b770e40]	885
	886	In Python tuples are immutable lists. Because they are dynamically typed,
[7a0e8c8]	887	there is only one tuple type `tuple`.
	888
[1b770e40]	889	It also has various named tuples. The first, namedtuple, allows naming the
	890	elements of the tuple. The second, NamedTuple, is actually a way of creating a
	891	typed record in a normally untyped language.
[7a0e8c8]	892
	893	- https://docs.python.org/3/library/stdtypes.html#sequence-types-list-tuple-range
	894	- https://docs.python.org/3/library/collections.html#collections.namedtuple
	895	- https://docs.python.org/3/library/typing.html#typing.NamedTuple
	896
	897	#### LISP
[1b770e40]	898	As LISP is dynamically typed, its `cons` data type is an untyped pair and is
	899	(perhaps infamously) the main constructor of all compound data types. The
	900	function `cons` takes two arguments and builds a pair. Functions `car` and
	901	`cdr` get the first and second elements of the pair.
[7a0e8c8]	902
	903	### Packs
[b0fcd0e]	904	Packs (or unstructured tuples) are a much less common feature. In fact, there
[7a0e8c8]	905	might just be one language, C++, that supports packs. The most common use
	906	case for unstructured tuples is returning multiple values, so there is a
	907	comparison to languages that have that as a special feature.
	908
	909	#### Go
	910	Go does not have built in tuple types, but it has multi-return syntax that
	911	looks like the tuple syntax of many other languages.
	912
	913	```
	914	func main() {
	915	i, j := returnIntInt()
	916	...
	917	}
	918
	919	func returnIntInt() (int, int) {
	920	return 12, 34
	921	}
	922	```
	923
[12c4a5f]	924	Go can unpack multiple return values and pass them all to a function, but
	925	the unpacked tuple must match the parameter list exactly.
	926
	927	```
	928	func acceptIntInt(a int, b int) {
	929	fmt.Println(a, b)
	930	}
	931	```
	932
[7a0e8c8]	933	- https://golangdocs.com/functions-in-golang
	934	- https://go.dev/src/go/types/tuple.go
	935
	936	#### Lua
[1b770e40]	937	Lua is a scripting language that is dynamically typed and stack based. Although
	938	the stack is usually only directly visible in the C-API, it does allow any
[12c4a5f]	939	function to return any number of values, one, zero or more, from any return
	940	expression.
[7a0e8c8]	941
	942	```
	943	local funcion f()
	944	return 12, 34
	945	end
	946
	947	local i, j = f()
	948	```
[12c4a5f]	949
	950	The last argument in an argument list can be an expression - function call -
	951	with multiple results and all the results are passed to the function after
	952	other arguments.
	953
	954	Because Lua is dynamically typed, multiple return values are allowed anywhere
	955	and the length of the value listed is adjusted, discarding any extra values
	956	and filling in new values with the value `nil`. This is also used if a
	957	function is called with the incorrect number of arguments.
	958
	959	- https://lua.org/
	960	- https://lua.org/manual/5.4/manual.html#3.4.12
	961
	962	#### C++
	963
	964	We return to C++ to view template parameter packs. These are the symmetric
	965	with our pack feature (and both are even used to construct the structured
	966	tuples in the last section).
	967
	968	C++ template parameter packs are a modification that can be applied to any
	969	template parameter, so the parameter now takes a series of arguments instead
	970	of just one. These are used in pack expansion,
	971	which usually expand to a comma separated list, but it can also be a chain of
	972	boolean binary operator applications. For example, if the parameter
	973	`typename... Ts` is in scope, then you can declare `std::tuple<Ts...>` to
	974	introduce a tuple type, if Ts = bool, char, int then the type becomes
	975	`std::tuple<bool, char, int>`.
	976
	977	A common use is for perfect argument forwarding, which shows some different
	978	uses of the pattern:
	979
	980	```
	981	template<typename inner_t>
	982	class Outer {
	983	inner_t inner;
	984	public:
	985	template<typename... Args>
	986	Outer(Args&&... args) : inner(std::forward<Args>(args)...) {}
	987	};
	988	```
	989
	990	In the first application, `Args&&... args` both uses a pack and introduces
	991	another one. `Arg0&& arg0, Arg1&& arg1, Arg2&& arg2, ...` is the pattern
	992	it expands too. The `&&` is used to copy the argument type's reference
	993	qualifier (the default can strip references away).
	994
	995	The second application, `std::forward<Args>(args)...` uses two packs. These
	996	are expanded in parallel, and must be the same length (in this case they
	997	always will be). It also helps show that the `...` is actually the bit doing
	998	the expansion, it is a suffix "operator" to the expansion pattern.
	999
	1000	There are also fold expressions that use binary operators to combine a pack
	1001	into a single expression. For example, you can write a variadic sum function
	1002	that compiles down to the primitive additions.
	1003
	1004	```
	1005	template<typename... Args>
	1006	int sum(Args&&... args) {
	1007	return (args + ... + 0);
	1008	}
	1009	```
	1010
	1011	C++ is about the best you could ask for in this area, but it does a lot of work
	1012	at compile time to make this happen.
	1013
	1014	- https://en.cppreference.com/w/cpp/language/template_parameters
	1015	- https://en.cppreference.com/w/cpp/language/parameter_pack
	1016	- https://en.cppreference.com/w/cpp/language/fold

Note: See TracBrowser for help on using the repository browser.

Download in other formats: