Changeset 68af77b
- Timestamp:
- Apr 13, 2026, 12:21:18 PM (3 days ago)
- Branches:
- master
- Parents:
- 0f9c67bf
- Location:
- doc/theses/mike_brooks_MMath
- Files:
-
- 2 edited
-
list.tex (modified) (5 diffs)
-
plots/list-1ord.gp (modified) (1 diff)
Legend:
- Unmodified
- Added
- Removed
-
doc/theses/mike_brooks_MMath/list.tex
r0f9c67bf r68af77b 706 706 \end{itemize} 707 707 708 In the result analysis, a where list length is a performance-influencing factor, once truly ``large'' lengths have beendismissed, these zones are identified as representing different patterns:708 In the result analysis, where list length is a performance-influencing factor, once ``large'' lengths are dismissed, these zones are identified as representing different patterns: 709 709 \begin{description} 710 710 \item[size zone ``small''] lists of 4--16 elements … … 712 712 \end{description} 713 713 Each zone buckets four specific sizes at which trials are run. 714 714 715 715 716 \subsubsection{Experiment setup} … … 978 979 The preceding result shows the intrusive implementations have better performance to the wrapped lists for small to medium sized lists. 979 980 This analysis covers the experiment position taken in \VRef{s:AddRemovePerformance} for movement, polarity, and accessor. 980 \VRef[Figure]{f:ExperimentOperations} shows the experiment operations tested, which results in 12 experiments for comparing intrusive implementations.981 \VRef[Figure]{f:ExperimentOperations} shows the experiment operations tested, which results in 12 experiments (I--XII) for comparing intrusive implementations. 981 982 To preclude hardware interference, only list sizes below 150 are examined to differentiate among the intrusive implementations, 982 983 The data is selected from the start of \VRef[Figures]{f:Linear-swift}--\subref*{f:Linear-java}, but the start of \VRef[Figures]{f:Random-swift}--\subref*{f:Random-java} is largely the same. … … 1036 1037 X: & queue, insert last, I-head / R-head \\ 1037 1038 XI: & queue, insert last, iI-list / R-head \\ 1038 XII: & queue, insert last, I-head / R-list \\1039 XII:& queue, insert last, I-head / R-list \\ 1039 1040 \end{tabular} 1040 1041 \end{tabular} … … 1043 1044 \end{figure} 1044 1045 1045 \VRef[Figure]{fig:plot-list-1ord} gives the first-order effects.1046 Its first breakdown, Machine--Size-Zone, shows the effects of an insert/remove's physical situation.1047 The Intel runs faster than the AMD; the small zone runs faster than the medium zone.1048 The size effect is more pronounced on the AMD than it is on the Intel.1049 1050 1046 \begin{figure} 1051 1047 \centering 1052 1048 \includegraphics{plot-list-1ord.pdf} 1053 1049 \caption{Histogram of operation durations, decomposed by all first-order effects. 1054 Each of the three breakdowns divides the entire population of test results into its mutually disjoint constituents. }1050 Each of the three breakdowns divides the entire population of test results into its mutually disjoint constituents. Higher in column is better} 1055 1051 \label{fig:plot-list-1ord} 1056 1052 \end{figure} 1057 1053 1058 These facts stated, you will not be chosing between these particular mahines or whether to run at one of these specific size zones. 1059 The key takeaway from the physical comparison is the context it establishes for interpreting the framework comparison following. 1060 Both the particulars of a the machine's cache design, and a list length's effect on the program's cache friendliness, affect add/remove speed in the manner illlustrated in this breakdown. 1054 \VRef[Figure]{fig:plot-list-1ord} gives the first-order effects. 1055 The first breakdown, architecture/size-zone (left), showing the overall performance of all 12 experiment on the two different hardware architectures. 1056 The relative experiment duration for each experiment is shown as a bar in each column and the black bar in that column shows the average of all 12 experiments. 1057 By inspection, Intel runs faster than AMD. 1058 As well, the small zone (lists of 4--16 elements) runs faster than the medium zone (lists of 50--200 elements). 1059 The size effect is more pronounced on the AMD with its smaller L3 cache than it is on the Intel. 1060 (No NUMA effects for these list sizes.) 1061 1061 Specifically, a 20\% standard deviation exists here, between the means four physical-effect categories. 1062 The key takeaway for this comparison is the context it establishes for interpreting the following framework comparisons. 1063 Both the particulars of a the machine's cache design, and a list length's effect on the program's cache friendliness, affect insert/remove speed in the manner illlustrated in this breakdown. 1062 1064 That is, if you are running on an unknown machine, at a scale above anomaly-prone individuals, and below where major LLC caching effects take over the general intrusive-list advantage, but with an unknown relationship to the sizing of your fickle low-level caches, you are likely to experience an unpredictable speed impact on the order of 20\%. 1063 1065 -
doc/theses/mike_brooks_MMath/plots/list-1ord.gp
r0f9c67bf r68af77b 29 29 30 30 set xrange [-5.5:17.5]; 31 set xlabel " Machine, Size Zone; Operation; Framework; \nPrevalence Prevalence Prevalence"31 set xlabel "Architecture, Size Zone; Operation; Framework; \nPrevalence Prevalence Prevalence" 32 32 set xtics ( \ 33 33 "AMD, sm" -5, \
Note:
See TracChangeset
for help on using the changeset viewer.