Context Navigation

← Previous Changeset
Next Changeset →

Changeset 8b3109b

Timestamp:

May 19, 2025, 11:23:15 AM (14 months ago)

Author:

Peter A. Buhr <pabuhr@…>

Branches:

master, stuck-waitfor-destruct

Children:

Parents:

Message:

proofread background chapter

Location:

doc/theses/fangren_yu_MMath

Files:

: 3 edited

background.tex (modified) (1 diff)
features.tex (modified) (2 diffs)
intro.tex (modified) (1 diff)

Legend:

: Unmodified
: Added
: Removed

doc/theses/fangren_yu_MMath/background.tex

-              r4d542db
+              r8b3109b
 ML~\cite{ML} was the first language to support parametric polymorphism.
+Like \CFA, it supports universal type parameters, but not the use of assertions and traits to constrain type arguments.
+Haskell~\cite{Haskell10} combines ML-style polymorphism, polymorphic data types, and type inference with the notion of type classes, collections of overloadable methods that correspond in intent to traits in \CFA.
+Unlike \CFA, Haskell requires an explicit association between types and their classes that specifies the implementation of operations.
+These associations determine the functions that are assertion arguments for particular combinations of class and type, in contrast to \CFA where the assertion arguments are selected at function call sites based upon the set of operations in scope at that point.
+Haskell also severely restricts the use of overloading: an overloaded name can only be associated with a single class, and methods with overloaded names can only be defined as part of instance declarations.
+Like \CFA, it supports unconstrained, universal, type parameters.
+\CFA differs by adding assertions and traits to constrain type arguments.
+Haskell~\cite{Haskell10} combines ML-style polymorphism, polymorphic data types, and type inference with the notion of type classes, collections of overloadable methods.
+The class/type-class association constrain type arguments by indicating the set of functions that become implicit assertion arguments and specify the implementation of these operations.
+As pointed out \see{\VRef[Figure]{f:ImplicitExplicitTraitInferencing}}, Haskell requires an explicit association between types and constrains (type-class).
+Otherwise, Haskell does not provide general overloading.
+\CFA differs by allowing general overloading and constraining type arguments with traits.
+Most importantly, \CFA automates selection of the assertion arguments at the function call-sites based on the set of operations in scope at that point to generalize reuse.
 \CC provides three disjoint polymorphic extensions to C: overloading, inheritance, and templates.
+The overloading is restricted because resolution does not use the return type, inheritance requires learning object-oriented programming and coping with a restricted nominal-inheritance hierarchy, templates cannot be separately compiled resulting in compilation/code bloat and poor error messages, and determining how these mechanisms interact and which to use is confusing.
+In contrast, \CFA has a single facility for polymorphic code supporting type-safe separate compilation of polymorphic functions and generic (opaque) types, which uniformly leverage the C procedural paradigm.
+The key mechanism to support separate compilation is \CFA's \emph{explicit} use of assumed type properties.
+Until \CC concepts~\cite{C++Concepts} are standardized (anticipated for \CCtwenty), \CC provides no way of specifying the requirements of a generic function beyond compilation errors during template expansion;
+furthermore, \CC concepts are restricted to template polymorphism.
+Selecting among them and understanding how they interact is part of the challenge in \CC program development.
+General overloading is available, subtyping inheritance can be single or multiple, templates are typed macro-expansion over a universal type, which precludes separate compilation.
+Universal template types are constrained using @concept@s~\cite{C++Concepts}, but only as a guideline, as the template expansion can still discover additional constrains.
+Template expansion can result in code bloat and poor error messages.
+Type inferencing is available using \lstinline[language=C++]{auto}, precluding using the return type for overload selection.
+\CFA differs by providing a simplified, uniform facility for polymorphic code, eliminating subtyping polymorphic, and encompassing overloading among parametric functions and universal (generic) types, all of which are separately compilable.
+Overload resolution uses the return type and arithmetic conversions to make precise function selections versus generating ambiguities, at the cost of type inferencing.
+Both \CFA call-site inferencing and \CC template expansion search in the local environment to satisfy explicit assertions or to find named functions to complete the template, respectively.
 Cyclone~\cite{Grossman06} also provides capabilities for polymorphic functions and existential types, similar to \CFA's @forall@ functions and generic types.
 Cyclone existential types can include function pointers in a construct similar to a virtual function table, but these pointers must be explicitly initialized at some point in the code, which is a tedious and potentially error-prone process.
+Cyclone's existential types can include function pointers in a construct similar to a virtual function table, but these pointers must be explicitly initialized at some point in the code, which is potentially error prone.
 Furthermore, Cyclone's polymorphic functions and types are restricted to abstraction over types with the same layout and calling convention as @void *@, \ie only pointer types and @int@.
+In \CFA terms, all Cyclone polymorphism must be dtype-static.
+While the Cyclone design provides the efficiency benefits discussed in~\VRef{s:GenericImplementation} for dtype-static polymorphism, it is more restrictive than \CFA's general model.
+Smith and Volpano~\cite{Smith98} present Polymorphic C, an ML dialect with polymorphic functions, C-like syntax, and pointer types;
+it lacks many of C's features, most notably structure types, and hence, is not a practical C replacement.
+In \CFA terms, all Cyclone polymorphism must be an incomplete data-type @forall( T * )@ \see{\VRef{s:PolymorphicFunction}}, which provides the efficiency benefits of fixed-size types \see{\VPageref{s:GenericImplementation}}.
+\CFA differs by adding object and variadic kinds of polymorphism, which provide more expressive reuse forms.
+Smith and Volpano~\cite{Smith98} present Polymorphic C, an ML dialect with polymorphic functions, C-like syntax, and pointer types.
+While these language purport to be C replacements, they are significantly different from C, and hence, not a practical C replacement.
+\CFA differs in providing better imperative-style polymorphism, while retaining backwards syntax and semantic compatibility with C.
+Objective-C~\cite{obj-c-book} is an industrially successful extension to C.
+However, Objective-C is a radical departure from C, using an object-oriented model with message passing.
+Objective-C did not support type-checked generics until recently \cite{xcode7}, historically using less-efficient runtime checking of object types.
+The GObject~\cite{GObject} framework also adds object-oriented programming with runtime type-checking and reference-counting garbage collection to C;
+these features are more intrusive additions than those provided by \CFA, in addition to the runtime overhead of reference counting.
+Vala~\cite{Vala} compiles to GObject-based C, adding the burden of learning a separate language syntax to the aforementioned demerits of GObject as a modernization path for existing C code bases.
+Java~\cite{Java8} included generic types in Java~5, which are type checked at compilation and type erased at runtime, similar to \CFA's.
+However, in Java, each object carries its own table of method pointers, whereas \CFA passes the method pointers separately to maintain a C-compatible layout.
+Java is also a garbage-collected, object-oriented language, with the associated resource usage and C-interoperability burdens.
+Objective-C~\cite{obj-c-book} and its successor Swift~\cite{Swift} are used in iOS phone applications.
+Swift has polymorphic functions and generics, object subtyping, traits via @extensions@, general function/method overloading, including parameter names and return type.
+Objective-C communication is via message passing rather than function call, while swift uses function call unless interacting with Objective-C.
+Swift and \CFA's type-systems are very similar, minus the object-oriented subtyping in \CFA, which is felt to be unnecessary.
+The GObject~\cite{GObject} framework adds object-oriented programming to C with runtime type-checking and reference-counting garbage collection, as a modernization path for existing C code-bases.
+Vala~\cite{Vala} compiles to GObject-based C, but requires additional language syntax over GObject.
+\CFA's path for modernizing works with the existing C type system and runtime, \ie not adding object-oriented types or garbage collection.
+Java~\cite{Java8} has object-oriented subtyping, generic @interface@s that act like traits, which are type checked at compilation and type erased at runtime similar to \CFA's, and general overloading on methods.
+However, in Java, each object carries its own table of method pointers, whereas \CFA passes trait pointers at call-site maintaining a C-compatible layout.
+Java is also garbage-collected.
+D~\cite{D}, Go, and Rust~\cite{Rust} are modern compiled languages with abstraction features similar to \CFA traits, \emph{interfaces} in D and Go, and \emph{traits} in Rust.
+However, each language represents a significant departure from C in terms of language model, and none has the same level of compatibility with C as \CFA.
+D and Go are garbage-collected languages, imposing the associated runtime overhead.
+The necessity of accounting for data transfer between managed runtimes and the unmanaged C runtime complicates foreign-function interfaces to C.
+Furthermore, while generic types and functions are available in Go, they are limited to a small fixed set provided by the compiler, with no language facility to define more.
+D restricts garbage collection to its own heap by default, whereas Rust is not garbage collected and, thus, has a lighter-weight runtime more interoperable with C.
+Rust also possesses much more powerful abstraction capabilities for writing generic code than Go.
+On the other hand, Rust's borrow checker provides strong safety guarantees but is complex and difficult to learn and imposes a distinctly idiomatic programming style.
+\CFA, with its more modest safety features, allows direct ports of C code while maintaining the idiomatic style of the original source.
+D~\cite{D}, Go, and Rust~\cite{Rust} are compiled languages with generic functions and types using traits similar to \CFA: \emph{interfaces} in D and Go, and \emph{traits} in Rust.
+D and Go are garbage-collected languages;
+Rust is not garbage collected.
+Go's generic types and functions are limited to a small fixed-set provided by the compiler, with no language facility to define more.
+Rust also possesses more powerful abstraction capabilities for writing generic code than Go.
+While Rust's borrow checker provides strong safety guarantees, it is complex and difficult to learn and imposes a distinctly idiomatic programming style different than C.
+\CFA, with its modest safety features, has a comparable type-system to Rust's, while maintaining C backwards compatibility, providing a modernization path for existing C code-bases.
 \section{Tuples/variadics}
+\section{Tuples/Variadics}
-\vspace*{-5pt}
 Many programming languages have some form of tuple construct and/or variadic functions, \eg SETL, C, KW-C, \CC, D, Go, Java, ML, and Scala.
 SETL~\cite{SETL} is a high-level mathematical programming language, with tuples being one of the primary data types.
 Tuples in SETL allow subscripting, dynamic expansion, and multiple assignment.
+KW-C~\cite{Buhr94a}, a predecessor of \CFA, introduced tuples to C as an extension of the C syntax, taking much of its inspiration from SETL.
+This work added multiple return value functions (MRVF), tuple mass and multiple assignment, and record-member access, giving unstructured tuples.
+Structured tuples (tuple variables) are also introduced including multiple coercions between structured and unstructured tuples.
+Like \CC, D provides tuples through a library variadic-template structure.
+Go does not have tuples but supports MRVF.
+Tuples are a fundamental abstraction in most functional programming languages, such as Standard ML~\cite{sml}, Haskell, and Scala~\cite{Scala}, which decompose tuples using pattern matching.
+From KW-C unstructured tuples, \CFA took MRVF, mass and multiple assignment, and record-member access.
+While \CFA has some structured-tuple capabilities, \VRef{s:TupleImplementation}, my analysis suggests this feature might be removed.
+An alternative to a tuple type is of variadic (variable argument) functions or type.
 C provides variadic functions through @va_list@ objects, but the programmer is responsible for managing the number of arguments and their types;
 thus, the mechanism is type unsafe.
+KW-C~\cite{Buhr94a}, a predecessor of \CFA, introduced tuples to C as an extension of the C syntax, taking much of its inspiration from SETL.
+The main contributions of that work were adding MRVF, tuple mass and multiple assignment, and record-member access.
+\CCeleven introduced @std::tuple@ as a library variadic-template structure.
+Tuples are a generalization of @std::pair@, in that they allow for arbitrary length, fixed-size aggregation of heterogeneous values.
+\CC{11} introduced @std::tuple@ as a library variadic-template structure.
+Tuples are a generalization of @std::pair@ allowing for arbitrary length, fixed-size aggregation of heterogeneous values.
 Operations include @std::get<N>@ to extract values, @std::tie@ to create a tuple of references used for assignment, and lexicographic comparisons.
 \CCseventeen proposes \emph{structured bindings}~\cite{Sutter15} to eliminate predeclaring variables and the use of @std::tie@ for binding the results.
+\CC{17} proposes \emph{structured bindings}~\cite{Sutter15} to eliminate predeclaring variables and the use of @std::tie@ for binding the results.
 This extension requires the use of @auto@ to infer the types of the new variables; hence, complicated expressions with a nonobvious type must be documented with some other mechanism.
 Furthermore, structured bindings are not a full replacement for @std::tie@, as it always declares new variables.
+Like \CC, D provides tuples through a library variadic-template structure.
+Go does not have tuples but supports MRVF.
+Java's variadic functions appear similar to C's but are type safe using homogeneous arrays, which are less useful than \CFA's heterogeneously typed variadic functions.
+Tuples are a fundamental abstraction in most functional programming languages, such as Standard ML~\cite{sml}, Haskell, and Scala~\cite{Scala}, which decompose tuples using pattern matching.
+Java's variadic functions appear similar to C's but are type safe using homogeneous arrays.
+\CFA's heterogeneous variadic-functions \see{\VPageref{p:VariadicFunctions}} provide a type-safe version of C variadic, although limited in features, which fits into the overall \CFA type-system design.
 \section{Type Resolution}
 \CFA expression resolution must deal with extensive overloading and inference of polymorphic types with assertions.
 The goal is to keep the base algorithm used in type unification simple enough so resolving a complicated expression can still be done reasonably fast.
 The following is work that handles the aforementioned bidirectional subtyping relations concisely.
+% \CFA expression resolution must deal with extensive overloading and inference of polymorphic types with assertions.
+% The goal is to keep the base algorithm used in type unification simple enough so resolving a complicated expression can still be done reasonably fast.
+% The following is work that handles the aforementioned bidirectional subtyping relations concisely.
 Melo~\etal~\cite{Melo17} developed PsycheC, a tool built for inferencing missing type and variable declarations of incomplete C programs, which can also be viewed as a dialect of C with type inferencing.
+Psyche-C~\cite{Melo17} is a tool built for inferencing missing type and variable declarations of incomplete C programs, which can also be viewed as a dialect of C with type inferencing.
 As PsycheC is built for analyzing standard C programs, it does not have any kind of overloading or polymorphism.
 Instead, all top-level variables and function parameters may have indeterminate types.

doc/theses/fangren_yu_MMath/features.tex

-              r4d542db
+              r8b3109b
 \section{Tuple Implementation}
+\label{s:TupleImplementation}
 As noted, traditional languages manipulate multiple values by in/out parameters and/or structures.
 …
 \end{comment}
+\label{p:VariadicFunctions}
 Finally, a type-safe variadic argument signature was added by Robert Schluntz~\cite[\S~4.1.2]{Schluntz17} using @forall@ and a new tuple parameter-type, denoted by the keyword @ttype@ in Schluntz's implementation, but changed to the ellipsis syntax similar to \CC's template parameter pack.
 For C variadics, \eg @va_list@, the number and types of the arguments must be conveyed in some way, \eg @printf@ uses a format string indicating the number and types of the arguments.

doc/theses/fangren_yu_MMath/intro.tex

r4d542db	r8b3109b
711	711
712	712	\subsection{Polymorphic Function}
	713	\label{s:PolymorphicFunction}
713	714
714	715	The signature feature of the \CFA type-system is parametric-polymorphic functions~\cite{forceone:impl,Cormack90,Duggan96}, generalized using a @forall@ clause (giving the language its name).

Note: See TracChangeset for help on using the changeset viewer.

Download in other formats: