list.tex

Timestamp:

Apr 28, 2026, 8:56:08 AM (14 hours ago)

Author:

Peter A. Buhr <pabuhr@…>

Branches:

master

Parents:

bf8112b

Message:

spelling corrections, and final proofreading

File:

1 edited

doc/theses/mike_brooks_MMath/list.tex (modified) (14 diffs)

doc/theses/mike_brooks_MMath/list.tex

-              rbf8112b
+              r5faa3a5
 Therefore, the duration's response to size is not a steady worsening as size increases.
 Often, each size-independent configuration responds to size increases in steps of slowdown.
 Occasionally a slowdown step is followed by some perforamnce increase, where an incurred penalty begins to amortize away.
+Occasionally a slowdown step is followed by some performance increase, where an incurred penalty begins to amortize away.
 Hence, performance results can have interesting jitter as size increases.
 The analysis treats these behaviours as incidental.
 …
 Rather, size zones are picked, specific effects inside of a zone are averaged away, and the story at one zone is compared to that at another.
 % It is true, but perhaps not obvious, that buildind and destroying long lists is slower than building and destroying short lists.
+% It is true, but perhaps not obvious, that building and destroying long lists is slower than building and destroying short lists.
 % Obviously, indeed, it takes longer to fuse and divide a hundred neighbours than five.
 % But the key metric in this work, AII, is about a single link--unlink.
 …
 % So, duration's response to size is not a steady worsening as size increases.
 % Rather, each size-independent configuration often responds to size increases with leaps of worsening.
 % Occasionally a leap is even followed size-run of retrograde response, where a suddenly incurred penalty has a chance to ammortize away.
+% Occasionally a leap is even followed size-run of retrograde response, where a suddenly incurred penalty has a chance to amortize away.
 % The frameworks tend to leapfrog over each other, at different points, as size increases.
+%
 …
+        }
   \end{tabular}
   \caption[Variety of IR duration responses to list length, at small--medium lengths]{Variety of IR duration responses to list length, at small--medium lengths.  Two example use cases are shown: I, stack movement with head-only access (plot a); IX, queue movement with element-oriented removal access (plot b); both use cases have insert-first polarity.  One example is run on each machine: UC-I on AMD (ploat a); UC-IX on Intel (plot b).  Lower is better.}
+  \caption[Variety of IR duration responses to list length, at small--medium lengths]{Variety of IR duration responses to list length, at small--medium lengths.  Two example use cases are shown: I, stack movement with head-only access (plot a); IX, queue movement with element-oriented removal access (plot b); both use cases have insert-first polarity.  One example is run on each machine: UC-I on AMD (plot a); UC-IX on Intel (plot b).  Lower is better.}
   \label{fig:plot-list-zoomin-abs}
 \end{figure}
 …
 \subsubsection{Recap and Master Legend}
 For experiments performed in later sections, there are 12 use cases, which are all combinations of 2 movents, 2 polarities and 3 accessors.
 There are 4 pysical contexts, which are all combinations of 2 machines and 2 size (length) zones.
+For experiments performed in later sections, there are 12 use cases, which are all combinations of 2 movements, 2 polarities and 3 accessors.
+There are 4 physical contexts, which are all combinations of 2 machines and 2 size (length) zones.
 There are 3.25 frameworks.
 This accounting considers how LQ-list supports only the movement--polarity combination ``stack, insert first.''
 …
+        }
   \end{tabular}
   \caption[IR duration, transformed for general anaysis]{
         IR duration, transformed for general anaysis.
+  \caption[IR duration, transformed for general analysis]{
+        IR duration, transformed for general analysis.
         The analysis follows the single example setup of \VRef[Figure]{f:zoomin-abs-i-swift}, \ie Use Case I on AMD; there, IR is given as absolute duration.
         Plot (a) transforms the source dataset by conditioning on specific size.
 …
 The effect of conditioning on specific size erases the fact that \VRef[Figure]{fig:plot-list-zoomin-abs} shows, aside from the coarse hops, all frameworks getting smoothly slower as size increases.
 This effect is partiularly relevant \emph{within} a size zone, most noticeable as the data lines all going up across the Small box.
+This effect is particularly relevant \emph{within} a size zone, most noticeable as the data lines all going up across the Small box.
 Now, with size conditioned, \VRef[Figure]{f:zoomin-rel-i-swift} has the trends inside a zone box being flat.
 This flatness gives \subref*{f:histo-rel-i-swift} nicely separated histograms.
 …
 Specific size is the only factor conditioned in this view.
 This choice was made to keep the relationship between \VRef[Figures]{f:zoomin-abs-i-swift} and \VRef[]{:zoomin-rel-i-swift} perceptible.
 By contrast, general comparisons like \VRef[Figure]{fig:plot-list-1ord} condition on more, generally, everyting not presented.
+By contrast, general comparisons like \VRef[Figure]{fig:plot-list-1ord} condition on more, generally, everything not presented.
 Its physical-factor breakdown conditions on use case and framework, but not on physical factors; its other two breakdowns are defined similarly.
 The noteworthy performance hop, in this example, is LQ-@list@, which \VRef[Figures]{f:zoomin-abs-i-swift} has as consistently slow in the Small range, and consistently fast in the Medium range.
 Therefore, in \VRef{f:zoomin-histo-i-swift}, its two per-size-zone histograms are far apart, and its cross-size-zone histogram is bimdal.
+Therefore, in \VRef{f:zoomin-histo-i-swift}, its two per-size-zone histograms are far apart, and its cross-size-zone histogram is bimodal.
 Hops and distribution contributions, like this one, are common.
 They are attention-grabbing curiosities when comparing nearly any pair of individual configurations.
 …
 A condensed graphing style is used in subsequent plots to present this amount of data.
 \VRef[Figure]{fig:plot-list-rel} shows how the condensed graphing style is generated from indvidual-configuration measures.
+\VRef[Figure]{fig:plot-list-rel} shows how the condensed graphing style is generated from individual-configuration measures.
 \VRef[Figure]{f:zoomin-rel-i-swift} is formed from the data in \VRef[Figure]{f:zoomin-abs-i-swift}, transformed on the Y-axis to show duration relative to the mean across all four frameworks, at each specific size.
 \VRef[Figure]{f:zoomin-histo-i-swift} consenses the interesting data within the two boxes (Small/Medium) and their combination (Both).
+\VRef[Figure]{f:zoomin-histo-i-swift} condenses the interesting data within the two boxes (Small/Medium) and their combination (Both).
 This graph plots a vertical histogram for each of the 4 frameworks.
 A data point on \VRef[Figure]{f:zoomin-abs-i-swift} is one-to-one with a point on \VRef[Figure]{f:zoomin-rel-i-swift}; each gives one IC.
 Repeatability of the experimet being established previously, retry variance information (error bar on an individual-configuration point) is discarded, and only the expected performance of an IC (mean of its middle three out of five trials) is promoted into the histograms.
+Repeatability of the experiment being established previously, retry variance information (error bar on an individual-configuration point) is discarded, and only the expected performance of an IC (mean of its middle three out of five trials) is promoted into the histograms.
 Each histogram bin (light-shaded area) counts the number of ICs whose expected performance falls in the bin's range.
 A histogram's girth indicates the diversity of its qualifying configurations' performance expectations.
 …
 The second breakdown, movement and polarity (middle), the responses are more subdued.
 Note, LQ-list has no represntation in these comparisons because it only supports stacks that push and pop with the first element.
+Note, LQ-list has no representation in these comparisons because it only supports stacks that push and pop with the first element.
 \CFA is completely stable under movement and polarity changes.
 \uCpp and LQ show modest responses favouring queues and insertion at last.
 …
 Speculation is that \CFA's increased data dependency, a result of the tagging scheme, pairs poorly with the aliasing implied by queue movement.
 The aliasing, at length 1, is: the head's first element is the head's last element.
 With stack movement, one of these names for the first element is reaused for both insert and remove.
+With stack movement, one of these names for the first element is reused for both insert and remove.
 While with queue movement, both names are used in alternation.
 …
 The insight for contextualizing this issue was to inspect both length and width.
 The issue is seen as practically mitigated by noticing that the difficutly fades away as width increases.
+The issue is seen as practically mitigated by noticing that the difficulty fades away as width increases.
 This effect is seen both in \VRef[Figure]{fig:plot-list-short}'s easement across the top triangle rows, and, zoomed farther out, in \VRef[Figure]{fig:plot-list-wide}.
 Increasing the width matters to the aliasing hypothesis.
 In a narrow experiment, one element's insert and remove happen in rapid succession.
 So, the two aliases are exercied closer together, making a data hazard (that lacks ideal hardware treatment) stretch the instruction-pipeline schedule more significantly.
+So, the two aliases are exercised closer together, making a data hazard (that lacks ideal hardware treatment) stretch the instruction-pipeline schedule more significantly.
 Increasing the width adds harness-induced gaps between the uses of each alias, behind which a potential hazard can hide.
 In the practical scenario that judges length-1 performance as relevant, width 1 is contrived.
 A thread putting itself on an often-empty waiters' list is not doing so on one such list repeatedly, at least not without taking other situation-iduced pauses.
+A thread putting itself on an often-empty waiters' list is not doing so on one such list repeatedly, at least not without taking other situation-induced pauses.
 Thus, the congestion at low width + length comes from the harness using repetition (in order to obtain a measurable time).
 It does not reflect the situation that motivates the legitimate desire for good length-1 performance.
 There likely is a real hazard, unique to the \CFA framework, when a queue movement is repeated on a tiny list \emph{without other interventing action}.
+There likely is a real hazard, unique to the \CFA framework, when a queue movement is repeated on a tiny list \emph{without other intervening action}.
 Doing so is believed to occur only in contrived situations.
 …
 From the perspective of assessing winning/losing frameworks, these physical effects are noise.
 So, subsequent analysis conditions on the phisical effects.
+So, subsequent analysis conditions on the physical effects.
 That is, it supposes you are put into an unknown physical situation (that is one of the four being tested), then presents all the ways your outcome could change as a result of non-physical factors, assuming that the physical situation is kept constant.
 It does do by presenting results relative to the mean of the physical quadrant (\VRef[fig]{fig:plot-list-mchn-szz} histogram) to which it belogs.
 With this adjustment, absolute duration values (in nonsecods) are lost.
+It does do by presenting results relative to the mean of the physical quadrant (\VRef[fig]{fig:plot-list-mchn-szz} histogram) to which it belongs.
+With this adjustment, absolute duration values (in nanoseconds) are lost.
 In return, the physical quadrants are re-combined, enabling assessment of the non-physical factors.
 \end{comment}
 …
   This state is how LQ leaves a removed element; LQ does not offer an is-listed query.
 \item[no-iter(ation)] Removing support for well-terminating iteration.
   The \CFA list uses bit-manipulation tagging on link poiters (rather than \eg null links) to express, ``No more elements this way.''
+  The \CFA list uses bit-manipulation tagging on link pointers (rather than \eg null links) to express, ``No more elements this way.''
   This tagging has the cost of submitting a retrieved value to the ALU, and awaiting this operation's completion, before dereferencing a link pointer.
   In some cases, the is-terminating bit is transferred from one link to another, or has a similar influence on a resulting link value; this logic adds register pressure and more data dependency.
