Context Navigation

← Previous Change
Next Change →

Changeset 57c7e6c4 for doc/theses

Timestamp:

May 3, 2025, 12:46:23 AM (6 months ago)

Author:

Fangren Yu <f37yu@…>

Branches:

Children:

Parents:

Message:

proofreading fix as suggested by Ondrej

Location:

doc/theses/fangren_yu_MMath

Files:

: 5 edited

background.tex (modified) (1 diff)
features.tex (modified) (9 diffs)
future.tex (modified) (2 diffs)
intro.tex (modified) (6 diffs)
resolution.tex (modified) (13 diffs)

Legend:

: Unmodified
: Added
: Removed

doc/theses/fangren_yu_MMath/background.tex

ref05cf0	r57c7e6c4
21	21	Furthermore, Cyclone's polymorphic functions and types are restricted to abstraction over types with the same layout and calling convention as @void *@, \ie only pointer types and @int@.
22	22	In \CFA terms, all Cyclone polymorphism must be dtype-static.
23		While the Cyclone design provides the efficiency benefits discussed in ~~Section~\ref{sec:generic-apps~~} for dtype-static polymorphism, it is more restrictive than \CFA's general model.
	23	While the Cyclone design provides the efficiency benefits discussed in~\VRef{s:GenericImplementation} for dtype-static polymorphism, it is more restrictive than \CFA's general model.
24	24	Smith and Volpano~\cite{Smith98} present Polymorphic C, an ML dialect with polymorphic functions, C-like syntax, and pointer types;
25	25	it lacks many of C's features, most notably structure types, and hence, is not a practical C replacement.

doc/theses/fangren_yu_MMath/features.tex

-              ref05cf0
+              r57c7e6c4
 \CFA adopts a uniform policy between pointers and references where mutability is a separate property made at the declaration.
 The following examples shows how pointers and references are treated uniformly in \CFA.
+The following examples show how pointers and references are treated uniformly in \CFA.
 \begin{cfa}[numbers=left,numberblanklines=false]
 int x = 1, y = 2, z = 3;$\label{p:refexamples}$
 …
 @&@r3 = @&@y; @&&@r3 = @&&@r4;                          $\C{// change r1, r2}$
 \end{cfa}
 Like pointers, reference can be cascaded, \ie a reference to a reference, \eg @&& r2@.\footnote{
+Like pointers, references can be cascaded, \ie a reference to a reference, \eg @&& r2@.\footnote{
 \CC uses \lstinline{&&} for rvalue reference, a feature for move semantics and handling the \lstinline{const} Hell problem.}
 Usage of a reference variable automatically performs the same number of dereferences as the number of references in its declaration, \eg @r2@ becomes @**r2@.
 …
 Interestingly, C does not give a warning/error if a @const@ pointer is not initialized, while \CC does.
 Hence, type @& const@ is similar to a \CC reference, but \CFA does not preclude initialization with a non-variable address.
 For example, in system's programming, there are cases where an immutable address is initialized to a specific memory location.
+For example, in systems programming, there are cases where an immutable address is initialized to a specific memory location.
 \begin{cfa}
 int & const mem_map = *0xe45bbc67@p@; $\C{// hardware mapped registers ('p' for pointer)}$
 …
 \end{cfa}
 While it is possible to write a reference type as the argument to a generic type, it is disallowed in assertion checking, if the generic type requires the object trait \see{\VPageref{p:objecttrait}} for the type argument, a fairly common use case.
 Even if the object trait can be made optional, the current type system often misbehaves by adding undesirable auto-dereference on the referenced-to value rather than the reference variable itself, as intended.
+Even if the object trait can be made optional, the current compiler implementation often misbehaves by adding undesirable auto-dereference on the referenced-to value rather than the reference variable itself, as intended.
 Some tweaks are necessary to accommodate reference types in polymorphic contexts and it is unclear what can or cannot be achieved.
 Currently, there are contexts where the \CFA programmer is forced to use a pointer type, giving up the benefits of auto-dereference operations and better syntax with reference types.
 …
 @[x, y, z]@ = foo( 3, 4 );  // return 3 values into a tuple
 \end{cfa}
 Along with making returning multiple values a first-class feature, tuples were extended to simplify a number of other common context that normally require multiple statements and/or additional declarations, all of which reduces coding time and errors.
+Along with making returning multiple values a first-class feature, tuples were extended to simplify a number of other common contexts that normally require multiple statements and/or additional declarations, all of which reduces coding time and errors.
 \begin{cfa}
 [x, y, z] = 3; $\C[2in]{// x = 3; y = 3; z = 3, where types may be different}$
 …
 \end{cfa}
 The type resolver only has the tuple return types to resolve the call to @bar@ as the @foo@ parameters are identical.
 The resultion involves unifying the flattened @foo@ return values with @bar@'s parameter list.
+The resulution involves unifying the flattened @foo@ return values with @bar@'s parameter list.
 However, no combination of @foo@s is an exact match with @bar@'s parameters;
 thus, the resolver applies C conversions to obtain a best match.
 …
 \section{Tuple Implementation}
 As noted, tradition languages manipulate multiple values by in/out parameters and/or structures.
+As noted, traditional languages manipulate multiple values by in/out parameters and/or structures.
 K-W C adopted the structure for tuple values or variables, and as needed, the fields are extracted by field access operations.
 As well, for the tuple-assignment implementation, the left-hand tuple expression is expanded into assignments of each component, creating temporary variables to avoid unexpected side effects.
 …
 Unfortunately, packing the variadic arguments into a rigid @struct@ type and generating all the required wrapper functions is significant work and largely wasted because most are never called.
 Interested readers can refer to pages 77-80 of Robert Schluntz's thesis to see how verbose the translator output is to implement a simple variadic call with 3 arguments.
 As the number of arguments increases, \eg a call with 5 arguments, the translator generates a concrete @struct@ types for a 4-tuple and a 3-tuple along with all the polymorphic type data for them.
+As the number of arguments increases, \eg a call with 5 arguments, the translator generates concrete @struct@ types for a 4-tuple and a 3-tuple along with all the polymorphic type data for them.
 An alternative approach is to put the variadic arguments into an array, along with an offset array to retrieve each individual argument.
 This method is similar to how the C @va_list@ object is used (and how \CFA accesses polymorphic fields in a generic type), but the \CFA variadics generate the required type information to guarantee type safety (like the @printf@ format string).
 …
 While \CC has no direct syntax to disambiguate @x@, \ie @d.B.x@ or @d.C.x@, it is possible with casts, @((B)d).x@ or @((C)d).x@.
 Like \CC, \CFA compiles the Plan-9 version and provides direct qualification and casts to disambiguate @x@.
 While ambiguous definitions are allowed, duplicate field names is poor practice and should be avoided if possible.
+While ambiguous definitions are allowed, duplicate field names are poor practice and should be avoided if possible.
 However, when a programmer does not control all code, this problem can occur and a naming workaround must exist.

doc/theses/fangren_yu_MMath/future.tex

-              ref05cf0
+              r57c7e6c4
 \section{Associated Types}
 The analysis presented in \VRef{s:AssertionSatisfaction} shows if all type parameters have to be bound before assertion resolution, the complexity of resolving assertions become much lower as every assertion parameter can be resolved independently.
+The analysis presented in \VRef{s:AssertionSatisfaction} shows if all type parameters have to be bound before assertion resolution, the complexity of resolving assertions becomes much lower as every assertion parameter can be resolved independently.
 That is, by utilizing information from higher up the expression tree for return value overloading, most of the type bindings can be resolved.
 However, there are scenarios where some intermediate types need to be involved in certain operations, which are neither input nor output types.
 …
 Note that the type @list *@ satisfies both @pointer_like( list *, int )@ and @pointer_like( list *,@ @list )@ (the latter by the built-in pointer dereference operator) and the expression @*it@ can be either a @struct list@ or an @int@.
 Requiring associated types to be unique makes the @pointer_like@ trait not applicable to @list *@, which is undesirable.
 I have not attempted to implement associated types in \CFA compiler, but based on the above discussions, one option is to make associated type resolution and return type overloading coexist:
+I have not attempted to implement associated types in the \CFA compiler, but based on the above discussions, one option is to make associated type resolution and return type overloading coexist:
 when the associated type appears in returns, it is deduced from the context and then verify the trait with ordinary assertion resolution;
 when it does not appear in the returns, the type is required to be uniquely determined by the expression that defines the associated type.

doc/theses/fangren_yu_MMath/intro.tex

-              ref05cf0
+              r57c7e6c4
 \end{quote}
 Overloading allows programmers to use the most meaningful names without fear of name clashes within a program or from external sources, like include files.
 Experience from \CC and \CFA developers shows the type system can implicitly and correctly disambiguates the majority of overloaded names, \ie it is rare to get an incorrect selection or ambiguity, even among hundreds of overloaded (variables and) functions.
+Experience from \CC and \CFA developers shows the type system can implicitly and correctly disambiguate the majority of overloaded names, \ie it is rare to get an incorrect selection or ambiguity, even among hundreds of overloaded (variables and) functions.
 In many cases, a programmer is unaware of name clashes, as they are silently resolved, simplifying the development process.
 …
 f( 'A' );                               $\C{// select (2)}\CRT$
 \end{cfa}
 The type system examines each call size and first looks for an exact match and then a best match using conversions.
+The type system examines each call site and first looks for an exact match and then a best match using conversions.
 Ada, Scala, and \CFA type-systems also use the return type in resolving a call, to pinpoint the best overloaded name.
 Essentailly, the return types are \emph{reversed curried} into output parameters of the function.
+Essentially, the return types are \emph{reversed curried} into output parameters of the function.
 For example, in many programming languages with overloading, the following functions are ambiguous without using the return type.
 \begin{cfa}
 …
 For example, if a change is made in an initialization expression, it can cascade type changes producing many other changes and/or errors.
 At some point, a variable's type needs to remain constant and the initializing expression needs to be modified or be in error when it changes.
 Often type-inferencing systems allow restricting (\newterm{branding}) a variable or function type, so the complier can report a mismatch with the constant initialization.
+Often type-inferencing systems allow restricting (\newterm{branding}) a variable or function type, so the compiler can report a mismatch with the constant initialization.
 \begin{cfa}
 void f( @int@ x, @int@ y ) {  // brand function prototype
 …
 \end{tabular}
 \end{cquote}
 Traits are implemented by flatten them at use points, as if written in full by the programmer.
+Traits are implemented by flattening them at use points, as if written in full by the programmer.
 Flattening often results in overlapping assertions, \eg operator @+@.
 Hence, trait names play no part in type equivalence.
 …
 \end{tabular}
 \end{cquote}
+\label{s:GenericImplementation}
 \CFA generic types are \newterm{fixed} or \newterm{dynamic} sized.
 Fixed-size types have a fixed memory layout regardless of type parameters, whereas dynamic types vary in memory layout depending on the type parameters.
 …
 \end{swift}
 To make a universal function useable, an abstract description is needed for the operations used on the parameters within the function body.
 Type matching these operations can occur by discover using techniques like \CC template expansion, or explicit stating, \eg interfaces, subtyping (inheritance), assertions (traits), type classes, type bounds.
+Type matching these operations can be done by using techniques like \CC template expansion, or explicit stating, \eg interfaces, subtyping (inheritance), assertions (traits), type classes, type bounds.
 The mechanism chosen can affect separate compilation or require runtime type information (RTTI).
 \begin{description}

doc/theses/fangren_yu_MMath/resolution.tex

-              ref05cf0
+              r57c7e6c4
 \begin{enumerate}[leftmargin=*]
 \item \textbf{Unsafe} cost representing a narrowing conversion of arithmetic types, \eg @int@ to @short@, and qualifier-dropping conversions for pointer and reference types.
 Narrowing conversions have the potential to lose (truncation) data.
+Narrowing conversions have the potential to lose (truncate) data.
 A programmer must decide if the computed data-range can safely be shorted in the smaller storage.
 Warnings for unsafe conversions are helpful.
 …
 \item \textbf{Specialization} cost counting the number of restrictions introduced by type assertions.
 Fewer restriction means fews parametric variables passed at the function call giving better performance.
+Fewer restriction means fewer parametric variables passed at the function call giving better performance.
 \begin{cfa}
 forall( T | { T ?+?( T, T ) } ) void f( T ); $\C[3.25in]{// 1}$
 …
 Therefore, at each resolution step, the arguments are already given unique interpretations, so the ordering only needs to compare different sets of conversion targets (function parameter types) on the same set of input.
 In \CFA, trying to use such a system is problematic because of the presence of return-type overloading of functions and variable.
+In \CFA, trying to use such a system is problematic because of the presence of return-type overloading of functions and variables.
 Specifically, \CFA expression resolution considers multiple interpretations of argument subexpressions with different types, \eg:
 so it is possible that both the selected function and the set of arguments are different, and cannot be compared with a partial-ordering system.
 …
 \end{quote}
 However, I was unable to generate any Ada example program that demonstrates this preference.
 In contrast, the \CFA overload resolution-system is at the other end of the spectrum, as it tries to order every legal interpretations of an expression and chooses the best one according to cost, occasionally giving unexpected results rather than an ambiguity.
+In contrast, the \CFA overload resolution-system is at the other end of the spectrum, as it tries to order all legal interpretations of an expression and chooses the best one according to cost, occasionally giving unexpected results rather than an ambiguity.
 Interestingly, the \CFA cost-based model can sometimes make expression resolution too permissive because it always attempts to select the lowest cost option, and only when there are multiple options tied at the lowest cost does it report the expression is ambiguous.
 …
 \end{itemize}
 In this example, option 1 produces the prototype @void f( int )@, which gives an exact match and therefore takes priority.
 The \CC resolution rules effectively makes option 2 a specialization that only applies to type @long@ exactly,\footnote{\CC does have explicit template specializations, however they do not participate directly in overload resolution and can sometimes lead to unintuitive results.} while the current \CFA rules make option 2 apply for all integral types below @long@.
+The \CC resolution rules effectively make option 2 a specialization that only applies to type @long@ exactly,\footnote{\CC does have explicit template specializations, however they do not participate directly in overload resolution and can sometimes lead to unintuitive results.} while the current \CFA rules make option 2 apply for all integral types below @long@.
 This difference could be explained as compensating for \CFA polymorphic functions being separately compiled versus template inlining;
 hence, calling them requires passing type information and assertions increasing the runtime cost.
 …
 Although it is true that both the sequence 1, 2 and 1, 3, 4 are increasingly more constrained on the argument types, option 2 is not comparable to either of option 3 or 4;
 they actually describe independent constraints on the two arguments.
 Specifically, option 2 says the two arguments must have the same type, while option 3 states the second argument must have type @int@,
+Specifically, option 2 says the two arguments must have the same type, while option 3 states the second argument must have type @int@.
 Because two constraints can independently be satisfied, neither should be considered a better match when trying to resolve a call to @f@ with argument types @(int, int)@;
 reporting such an expression as ambiguous is more appropriate.
 …
 \end{enumerate}
 These inconsistencies are not easily solvable in the current cost-model, meaning the currently \CFA codebase has to workaround these defects.
+These inconsistencies are not easily solvable in the current cost-model, meaning the current \CFA codebase has to workaround these defects.
 One potential solution is to mix the conversion cost and \CC-like partial ordering of specializations.
 For example, observe that the first three elements (unsafe, polymorphic and safe conversions) in the \CFA cost-tuple are related to the argument/parameter types, while the other two elements (polymorphic variable and assertion counts) are properties of the function declaration.
 …
 Here, the unsafe cost of signed to unsigned is factored into the ranking, so the safe conversion is selected over an unsafe one.
 Furthermore, an integral option is taken before considering a floating option.
 This model locally matches the C approach, but provides an ordering when there are many overloaded alternative.
+This model locally matches the C approach, but provides an ordering when there are many overload alternatives.
 However, as Moss pointed out overload resolution by total cost has problems, \eg handling cast expressions.
 \begin{cquote}
 …
 \section{Type Unification}
 Type unification is the algorithm that assigns values to each (free) type parameters such that the types of the provided arguments and function parameters match.
+Type unification is the algorithm that assigns values to each (free) type parameter such that the types of the provided arguments and function parameters match.
 \CFA does not attempt to do any type \textit{inference} \see{\VRef{s:IntoTypeInferencing}}: it has no anonymous functions (\ie lambdas, commonly found in functional programming and also used in \CC and Java), and the variable types must all be explicitly defined (no auto typing).
 …
 A function operates on the call-site arguments together with any local and global variables.
 When the function is polymorphic, the types are inferred at each call site.
 On each invocation, the types to be operate on are determined from the arguments provided, and therefore, there is no need to pass a polymorphic function pointer, which can take any type in principle.
+On each invocation, the types to be operated on are determined from the arguments provided, and therefore, there is no need to pass a polymorphic function pointer, which can take any type in principle.
 For example, consider a polymorphic function that takes one argument of type @T@ and polymorphic function pointer.
 \begin{cfa}
 …
 In many cases, these problems can be avoided by examining other assertions that provide insight on the desired type binding: if one assertion parameter can only be matched by a unique option, the type bindings can be updated confidently without the need for backtracking.
 The Moss algorithm currently used in \CFA was developed using a simplified type-simulator that capture most of \CFA type-system features.
+The Moss algorithm currently used in \CFA was developed using a simplified type system that captures most of \CFA type system features.
 The simulation results were then ported back to the actual language.
 The simulator used a mix of breadth- and depth-first search in a staged approach.
 …
 A type variable introduced by the @forall@ clause of function declaration can appear in parameter types, return types and assertion variables.
 If it appears in parameter types, it can be bound when matching the arguments to parameters at the call site.
 If it only appears in the return type, it can be eventually be determined from the call-site context.
+If it only appears in the return type, it can be eventually determined from the call-site context.
 Currently, type resolution cannot do enough return-type inferencing while performing eager assertion resolution: the return type information is unknown before the parent expression is resolved, unless the expression is an initialization context where the variable type is known.
 By delaying the assertion resolution until the return type becomes known, this problem can be circumvented.
 …
+}
 \end{cfa}
 This case is rare so forcing every type variable to appear at least once in parameter or return types limits does not limit the expressiveness of \CFA type system to a significant extent.
+This case is rare so forcing every type variable to appear at least once in parameter or return types does not limit the expressiveness of \CFA type system to a significant extent.
 The next section presents a proposal for including type declarations in traits rather than having all type variables appear in the trait parameter list, which is provides equivalent functionality to an unbound type parameter in assertion variables, and also addresses some of the variable cost issue discussed in \VRef{s:ExpressionCostModel}.

Note: See TracChangeset for help on using the changeset viewer.

Download in other formats: