source: doc/theses/mike_brooks_MMath/list.tex@ 3c1e432

Last change on this file since 3c1e432 was 0d41e600, checked in by Michael Brooks <mlbrooks@…>, 5 months ago

Linked-list chapter writing, pushing work in progress

  • Property mode set to 100644
File size: 13.2 KB
Line 
1\chapter{Linked List}
2
3I wrote a linked-list library for \CFA. This chapter describes it.
4
5The library provides a doubly-linked list that
6attaches links intrusively,
7supports multiple link directions,
8integrates with user code via the type system,
9treats its ends uniformly, and
10identifies a list using an explicit head.
11
12TODO: more summary
13
14
15
16\section{Features}
17
18\subsection{Core Design Issues}
19
20This section reviews how a user experiences my \CFA list library's position on the issues of Section~\ref{toc:lst:issue}.
21The library provides a doubly-linked list that
22attaches links intrusively,
23supports multiple link directions,
24integrates with user code via the type system,
25treats its ends uniformly, and
26identifies a list using an explicit head.
27
28The \CFA list library's version of the running @req@ example is in Figure~\ref{fig:lst-features-intro}.
29Its link attachment is intrusive and the resulting memory layout is pure-stack, just as for the LQ version of Figure~\ref{f:Intrusive}.
30The framework-provided type @dlink(...)@ provides the links.
31The user inserts the links into the @req@ structure by using \CFA inline-inheritance (TODO: reference introduction).
32Inline inheritance means the type of the field is @dlink(req)@, the field is unnamed, a reference to a @req@ is implicitly convertible to @dlink@.\footnote{
33 The \CFA list examples elide the \lstinline{P9_EMBEDDED} annotations that (TODO: xref P9E future work) proposes to obviate.
34 Thus, these examples illustrate a to-be state, free of what is to be historic clutter.
35 The elided portions are immaterial to the discussion and the examples work with the annotations provided.
36 The \CFA test suite (TODO:cite?) includes equivalent demonstrations, with the annotations included.}
37These links have a nontrivial, user-specified location within the @req@ structure;
38this convention encapsulates the implied pointer arithmetic safely.
39
40\begin{figure}
41 \lstinput{20-32}{lst-features-intro.run.cfa}
42 \caption[Multiple link directions in \CFA list library]{
43 Demonstration of the running \lstinline{req} example, done using the \CFA list library.
44 This example does the same job that Figure~\ref{fig:lst-issues-attach} shows three ways.
45 }
46 \label{fig:lst-features-intro}
47\end{figure}
48
49\begin{figure}
50\centering
51\begin{tabular}{@{}ll@{}}
52\begin{tabular}{@{}l@{}}
53 \lstinput{20-25}{lst-features-multidir.run.cfa} \\
54 \lstinput{40-67}{lst-features-multidir.run.cfa}
55 \end{tabular}
56 &
57 \lstinput[language=C++]{20-60}{lst-issues-multi-static.run.c}
58 \end{tabular}
59
60\caption{
61 Demonstration of multiple static link directions done in the \CFA list library.
62 This example does the same job as Figure~\ref{fig:lst-issues-multi-static}.
63 }
64 \label{fig:lst-features-multidir}
65\end{figure}
66
67Figure~\ref{fig:lst-features-multidir} shows how the \CFA library supports multi-static link directionality.
68The declaration of @req@ now has two inline-inheriting @dlink@ occurrences.
69The first of these lines gives a type named @req.by_pri@, @req@ inherits from it, and it inherits from @dlink@.
70The second line @req.by_rqr@ is similar to @req.by_pri@.
71Thus, there is a diamond, non-virtual, inheritance from @req@ to @dlink@, with @by_pri@ and @by_rqr@ being the mid-level types.
72Disambiguation occurs in the declarations of the list-head objects.
73The type of the variable @reqs_pri_global@ is @dlist(req, req.by_pri)@,
74meaning operations called on @reqs_pri_global@ are implicitly disambiguated.
75In the example, the calls @insert_first(reqs_pri_global, ...)@ imply, ``here, we are working by priority.''
76
77The \CFA library also supports the common case, of single directionality, more naturally than LQ.
78Figure~\ref{fig:lst-features-intro} shows a single-direction list done with no contrived name for the link direction,
79where Figure~\ref{f:Intrusive} adds the unnecessary name, @x@.
80In \CFA, a user doing a single direction (Figure~\ref{fig:lst-features-intro})
81sets up a simple inheritance with @dlink@, and declares a list head to have the simpler type @dlist(...)@.
82While a user doing multiple link directions (Figure~\ref{fig:lst-features-multidir})
83sets up a diamond inheritance with @dlink@, and declares a list head to have the more-informed type @dlist(..., DIR)@.
84
85The directionality issue also has an advanced corner-case that needs treatment.
86When working with multiple directions, while calls like @insert_first@ benefit from implicit direction disambiguation, other calls like @insert_after@ still require explicit disambiguation.
87It happens because the call
88\begin{cfa}
89insert_after(r1, r2);
90\end{cfa}
91does not have enough information to clarify which of a request's simultaneous list directions is intended.
92Is @r2@ supposed to be the next-priority request after @r1@, or is @r2@ supposed to join the same-requestor list of @r1@?
93As such, the \CFA compiler gives an ambiguity error for this call.
94To let the programmer resolve the ambiguity, the list library provides a hook for applying the \CFA language's scoping and priority rules.
95It applies as:
96\begin{cfa}
97with ( DLINK_VIA(req, req.pri) ) insert_after(r1, r2);
98\end{cfa}
99Here, importing (using the @with@ syntax on) the @DLINK_VIA@ result causes one of the list directions to become a more attrictive candidate to \CFA's overload resolution.
100This boost applies within the scope of the the import, which is a single line in the example just given, but could also be a custom block or an entire function body.
101
102\noindent
103\begin{tabular}{ll}
104\begin{cfa}
105void f() with ( DLINK_VIA(req, req.pri) ) {
106 ...
107
108 insert_after(r1, r2);
109
110 ...
111}
112\end{cfa}
113&
114\begin{cfa}
115void f() {
116 ...
117 with ( DLINK_VIA(req, req.pri) ) {
118 ...
119 }
120 ...
121}
122\end{cfa}
123\end{tabular}
124
125\noindent
126By doing in on a larger scope, a user can put code within that acts as if there is only one list direction.
127This boost is needed only when operating on a list with several directions, using operations that do not take list heads.
128Otherwise, the sole applicable list direction ``just works.''
129
130The \CFA library offers a type-system mediated integration with user code.
131The examples presented do not use preprocessor macros.
132The touchpoints @dlink@ and @dlist@ are ordinary types.
133Even though they are delivered as header-included static-inline implementations,
134the \CFA compiler typechecks the list library code separately from user code.
135Errors in user code are reported only with mention of the library's declarations.
136
137The \CFA library works in headed and headless modes. TODO: elaborate.
138
139
140
141\subsection{Iteration}
142
143Many languages offer an iterator interface for collections to implement, and a corresponding for-each loop syntax for consuming the items through implicit interface calls.
144\CFA does not yet have a general-purpose form of such a feature, though it has a form that addresses some use cases.
145This section shows why the incumbemt \CFA pattern does not work for linked lists and gives the alternative now offered by the linked-list libary.
146Chapter 5 [TODO: deal with optimism here] presents a design that satisfies both uses and accommodates even more complex collections.
147
148The incumbent \CFA extensible loop syntax is:
149\begin{cfa}
150for( elem; begin ~ end )
151for( elem; end )
152\end{cfa}
153Many derived forms of @begin ~ end@ exist, but they of primaty interest to defining numeric ranges, so they are excluded from the linked-list discussion.
154This ``two-place for'' syntax abbreviates, respectively:
155\begin{cfa}
156for( typeof(end) elem = begin; elem < end; elem += 1 )
157for( typeof(end) elem = 0; elem < end; elem += 1 )
158\end{cfa}
159From these calls, letting @E@ be @typeof(end)@, the implied interface is:
160\begin{itemize}
161\item
162 case-appropriate construction: one of @void ?{}( E &, typeof(begin) )@, or @void ?{}( E &, zero_t )@, depending on the specific loop form
163\item
164 comparison, for termination testing: @int ?<?( const E &, const E & )@
165\item
166 increment, for ``next item'': @E & ?+=?( E &, one_t )@
167\end{itemize}
168The shortened loop works well for using a numeric range to step through an array:
169\begin{cfa}
170for ( i; n ) total += a[i];
171\end{cfa}
172
173When used for array-stepping, the loop is acting like JavaScript's @for...in@ construct,
174\begin{cfa}
175for ( i in a ) total += a[i];
176\end{cfa}
177albeit with different mechanisms for expressing the array's length.
178But the similarity is that the loop binds the declared identifier (@i@) to the array's \emph{indeces}, not its contained values.
179A linked list has only values, no keys.
180So, to seek iteration of a linked list via the existing loop syntax is to ask whether this syntax can also do double-duty for iterating values.
181That is, to be an analog of JavaScript's @for..of@ syntax:
182\begin{cfa}
183for ( e of a ) total += e;
184\end{cfa}
185
186The \CFA team will likely implement an extension of this functionality that moves the @~@ syntax from being part of the loop, to being a first-class operator (with associated multi-pace operators for the elided derived forms).
187With this change, both @begin ~ end@ and @end@ (in context of the latter ``two-place for'' expression) parse as \emph{ranges}, and the loop syntax becomes, simply:
188\begin{cfa}
189 for( elem; rangeExpr )
190\end{cfa}
191The expansion and underlying API are under discussion.
192TODO: explain pivot from ``is at done?'' to ``has more?''
193Advantages of this change include being able to pass ranges to functions, for example, projecting a numerically regular subsequence of array entries, and being able to use the loop syntax to cover more collection types, such as looping over the keys of a hashtable.
194
195
196
197
198When iteratig an empty list, the question, ``Is there a further element?'' needs to be posed once, receiving the answer, ``no.''
199When iterating an $n$-item list, the same question gets $n$ ``yes'' answers (one for each element), plus one ``no'' answer, once there are no more elements; the question is posed $n+1$ times.
200
201When iteratig an empty list, the question, ``What is the value of the current element?'' is never posed, nor is the command, ``Move to the next element,'' issued. When iterating an $n$-item list, each happens $n$ times.
202
203So, asking about the existence of an element happens once more than retrieving an element's value and advancing the position.
204
205Many iteration APIs deal with this fact by splitting these steps across different functions, and relying on the user's knowledge of iterator state to know when to call each. In Java, the function @hasNext@ should be called $n+1$ times and @next@ should be called $n$ times (doing the double duty of advancing the iteration and returning a value). In \CC, the jobs are split among the three actions, @it != end@, @it++@ and @*it@, the latter two being called one time more than the first.
206
207TODO: deal with simultaneous axes: @DLINK_VIA@ just works
208
209TODO: deal with spontaneous simultaneity, like a single-axis req, put into an array: which ``axis'' is @&req++@ navigating: array-adjacency vs link dereference. It should sick according to how you got it in the first place: navigating dlist(req, req.pri) vs navigating array(req, 42). (prob. future work)
210
211\section{Implementation}
212
213\subsection{Links}
214
215\VRef[Figure]{fig:lst-impl-links} continues the running @req@ example, now showing the \CFA list library's internal representation.
216
217\begin{figure}
218 \centering
219 \includegraphics{lst-impl-links.pdf}
220 \caption{
221 \CFA list library representations for the cases under discussion.
222 }
223 \label{fig:lst-impl-links}
224\end{figure}
225
226The @dlink@ structure contains exactly two pointers: @next@ and @prev@. This @dlink@ content is opaque to the user.
227
228Even though the user-facing list model is ordered (linear), the CFA library implements all listing as circular.
229This choice helps achieve uniform end treatment and TODO finish summarizing benefit.
230
231A link pointer targets a neighbouring @dlink@ structure, rather than a neighbouring @req@.
232(Recall, the running example has the user putting a @dlink@ within a @req@.)
233
234Link pointers are tagged according to whether the link is user-visible.
235Links among user-requested neighbours are left natural, with the tag bit not set.
236System-added links, which produce the circular implementation, have the tag bit set.
237Iteration reports ``has more elements'' when crossing natural links, and ``no more elements'' upon reaching a tagged link.
238
239In a headed list, the list head (@dlist(req)@) acts as an extra element in the implementation-level circularly-linked list.
240The content of a @dlist@ is a (private) @dlink@, with the @next@ pointer purposed for the first element, and the @prev@ pointer purposed for the last element.
241Since the head wraps a @dlink@, since a @req@ does too, and since a link-pointer targets a @dlink@, the resulting cycle is among @dlink@ strucures, situated inside of other things.
242The tags on the links say what the wrapper is: untagged (user link) means the targeted @dlink@ is within a @req@, while tagged (system link) means the targeted @dlink@ is within a list head.
243
244In a headless list, the cicular backing list is only among @dlink@s within @req@s. The tags are set on the links that a user cannot navigate.
245
246No distinction is made between an unlisted item under a headed model and a singleton list under a headless model. Both are represented as an item referring to itself, with both tags set.
247
248
249
250\section{Future Work}
251\label{toc:lst:futwork}
252
253
254TODO: deal with: A doubly linked list is being designed.
255
256TODO: deal with: Link fields are system-managed.
257Links in GDB.
258
Note: See TracBrowser for help on using the repository browser.