Changeset 68af77b

Timestamp:

Apr 13, 2026, 12:21:18 PM (7 weeks ago)

Author:

Peter A. Buhr <pabuhr@…>

Branches:

Children:

Parents:

Message:

changes to first-order effects

Location:

doc/theses/mike_brooks_MMath

Files:

2 edited

list.tex (modified) (5 diffs)
plots/list-1ord.gp (modified) (1 diff)

Legend:

: Unmodified
: Added
: Removed

doc/theses/mike_brooks_MMath/list.tex

-              r0f9c67bf
+              r68af77b
 \end{itemize}
 In the result analysis, a where list length is a performance-influencing factor, once truly ``large'' lengths have been dismissed, these zones are identified as representing different patterns:
+In the result analysis, where list length is a performance-influencing factor, once ``large'' lengths are dismissed, these zones are identified as representing different patterns:
 \begin{description}
         \item[size zone ``small''] lists of 4--16 elements
 …
 \end{description}
 Each zone buckets four specific sizes at which trials are run.
 \subsubsection{Experiment setup}
 …
 The preceding result shows the intrusive implementations have better performance than the wrapped lists for small to medium sized lists.
 This analysis covers the experiment position taken in \VRef{s:AddRemovePerformance} for movement, polarity, and accessor.
 \VRef[Figure]{f:ExperimentOperations} shows the experiment operations tested, which results in 12 experiments for comparing intrusive implementations.
+\VRef[Figure]{f:ExperimentOperations} shows the experiment operations tested, which results in 12 experiments (I--XII) for comparing intrusive implementations.
 To preclude hardware interference, only list sizes below 150 are examined to differentiate among the intrusive implementations,
 The data is selected from the start of \VRef[Figures]{f:Linear-swift}--\subref*{f:Linear-java}, but the start of \VRef[Figures]{f:Random-swift}--\subref*{f:Random-java} is largely the same.
 …
         X:      &  queue, insert last, I-head / R-head \\
         XI:     & queue, insert last, iI-list / R-head \\
         XII:    & queue, insert last, I-head / R-list \\
+        XII:& queue, insert last, I-head / R-list \\
         \end{tabular}
 \end{tabular}
 …
 \end{figure}
-\VRef[Figure]{fig:plot-list-1ord} gives the first-order effects.
-Its first breakdown, Machine--Size-Zone, shows the effects of an insert/remove's physical situation.
-The Intel runs faster than the AMD; the small zone runs faster than the medium zone.
-The size effect is more pronounced on the AMD than it is on the Intel.
 \begin{figure}
   \centering
   \includegraphics{plot-list-1ord.pdf}
   \caption{Histogram of operation durations, decomposed by all first-order effects.
   Each of the three breakdowns divides the entire population of test results into its mutually disjoint constituents.}
+  Each of the three breakdowns divides the entire population of test results into its mutually disjoint constituents. Higher in column is better.}
   \label{fig:plot-list-1ord}
 \end{figure}
+These facts stated, you will not be choosing between these particular machines or whether to run at one of these specific size zones.
+The key takeaway from the physical comparison is the context it establishes for interpreting the framework comparison following.
+Both the particulars of the machine's cache design, and a list length's effect on the program's cache friendliness, affect add/remove speed in the manner illustrated in this breakdown.
+\VRef[Figure]{fig:plot-list-1ord} gives the first-order effects.
+The first breakdown, architecture/size-zone (left), shows the overall performance of all 12 experiments on the two different hardware architectures.
+The relative experiment duration for each experiment is shown as a bar in each column and the black bar in that column shows the average of all 12 experiments.
+By inspection, Intel runs faster than AMD.
+As well, the small zone (lists of 4--16 elements) runs faster than the medium zone (lists of 50--200 elements).
+The size effect is more pronounced on the AMD with its smaller L3 cache than it is on the Intel.
+(No NUMA effects for these list sizes.)
 Specifically, a 20\% standard deviation exists here, between the means of the four physical-effect categories.
+The key takeaway for this comparison is the context it establishes for interpreting the following framework comparisons.
+Both the particulars of the machine's cache design, and a list length's effect on the program's cache friendliness, affect insert/remove speed in the manner illustrated in this breakdown.
 That is, if you are running on an unknown machine, at a scale above anomaly-prone individuals, and below where major LLC caching effects take over the general intrusive-list advantage, but with an unknown relationship to the sizing of your fickle low-level caches, you are likely to experience an unpredictable speed impact on the order of 20\%.

doc/theses/mike_brooks_MMath/plots/list-1ord.gp

r0f9c67bf	r68af77b
29	29
30	30	set xrange [-5.5:17.5];
31		set xlabel "Machine, Size Zone; Operation; Framework; \nPrevalence Prevalence Prevalence"
	31	set xlabel "Architecture, Size Zone; Operation; Framework; \nPrevalence Prevalence Prevalence"
32	32	set xtics ( \
33	33	"AMD, sm" -5, \
