Context Navigation

← Previous Changeset
Next Changeset →

Changeset b1b513d

Timestamp:

Apr 10, 2025, 5:27:36 PM (8 months ago)

Author:

Michael Brooks <mlbrooks@…>

Branches:

Children:

Parents:

6174ecc (diff), bb506e0 (diff)
Note: this is a merge changeset, the changes displayed below correspond to the merge itself.
Use the (diff) links above to see all the changes relative to each parent.

Message:

Merge branch 'master' of plg.uwaterloo.ca:software/cfa/cfa-cc

Files:

: 10 edited

doc/theses/fangren_yu_MMath/conclusion.tex (modified) (1 diff)
doc/theses/fangren_yu_MMath/resolution.tex (modified) (1 diff)
doc/theses/fangren_yu_MMath/uw-ethesis-frontpgs.tex (modified) (1 diff)
doc/theses/mike_brooks_MMath/string.tex (modified) (5 diffs)
libcfa/src/collections/string.cfa (modified) (5 diffs)
libcfa/src/collections/string.hfa (modified) (6 diffs)
libcfa/src/collections/string_res.cfa (modified) (3 diffs)
libcfa/src/collections/string_res.hfa (modified) (4 diffs)
tests/collections/.expect/string-api-coverage.txt (modified) (1 diff)
tests/collections/string-api-coverage.cfa (modified) (1 diff)

Legend:

: Unmodified
: Added
: Removed

doc/theses/fangren_yu_MMath/conclusion.tex

-              r6174ecc
+              rb1b513d
 \chapter{Conclusion}
+The goal of this thesis is to ...
+In the past few years of development, \CFA has gone from a proof-of-concept prototype to an actual experimental language, with a few medium-sized projects written completely in \CFA included in the language's libraries ($\approx$\,45,000 lines of code).\footnote{In Fall 2024, two amazing CS343 students completed all 6 concurrent assignments in \CFA. Many small language problems were uncovered and missing features discovered; these issues are being fixed for Fall 2025.}
+The work done in this thesis is motivated by real needs arising from the development and testing of these projects, which often pushes the limits of \CFA's type system and compiler capabilities.
+While most of the previous \CFA language feature and compiler developments were done either in isolation or with limited testing, getting them to work together and with real projects is presenting significant new challenges.
+These challenges could have been foreseen before development and testing began in earnest.
+This work aims to identify and fix a number of practical issues of multiple \CFA type-system features and their interactions.
+In particular, the inclusion of reference types, tuple types, and generic structures together with rich overloading in the language makes the complexity of expression resolution much higher than in other programming languages.
+I significantly reworked the abstract syntax-tree representation and resolution algorithm to push the \CFA compilation time down to a practical level.
+The expression-cost system was also revised multiple times to make overload selection more predictable and match programmer's intuition and expectation in the majority of cases.
+Still, fundamental problems remain and fixing them will require significant changes to the language type-system, possibly from the ground up.
+As per the \CFA project motto ``describe not prescribe,'' \CFA's type system is designed to have a lot of flexibility and give programmers freedom in the usage of overloading and polymorphism.
+With such a complex type system, it is very difficult (sometimes even impossible) to try to have the compiler accept all the intuitively valid \CFA programs.
+As has been demonstrated, the \CFA programming language is still far from complete, and the primary future goal is to expand \CFA's type-resolution capability while maintaining, expressibility, decent compile-time, and excellent run-time performance.
+Stealing some theoretical insights of parametric polymorphism from functional programming, may also prove to be useful.

doc/theses/fangren_yu_MMath/resolution.tex

-              r6174ecc
+              rb1b513d
 Specifically, the resolution algorithms used in \CC and Java are greedy, selecting the best match for each subexpression without considering the higher-level ones (bottom-up).
 Therefore, at each resolution step, the arguments are already given unique interpretations, so the ordering only needs to compare different sets of conversion targets (function parameter types) on the same set of input.
-\begin{cfa}
-@generate a C++ example here@
-read more
-\end{cfa}
 In \CFA, trying to use such a system is problematic because of the presence of return-type overloading of functions and variable.

doc/theses/fangren_yu_MMath/uw-ethesis-frontpgs.tex

-              r6174ecc
+              rb1b513d
 \begin{center}\textbf{Abstract}\end{center}
+Type resolution ...
+\CFA (C-for-all) is an evolutionary extension of C programming language, which introduces many modern programming language features to C.
+\CFA has a type system built around parametric polymorphism, and the polymorphic functions are prefixed by a @forall@ declaration of type parameters, giving the language its name.
+This thesis presents a series of work on type resolution in \CFA. Every function, including the built-in C operators, can be overloaded in \CFA, therefore resolving function overloads and generic type parameters is at the heart of \CFA expression analysis. This thesis focuses on the interactions of various \CFA language features such as reference and generic types in type resolution, analyzes the known issues and presents improvements to the type system that fix those problems. Ideas for future work are also given for further improving the consistency of \CFA type system at a language design level.
 \cleardoublepage

doc/theses/mike_brooks_MMath/string.tex

-              r6174ecc
+              rb1b513d
 \section{String Operations}
+% https://en.wikipedia.org/wiki/Comparison_of_programming_languages_(string_functions)
 \VRef[Figure]{f:StrApiCompare} shows a general comparison of string APIs for C, \CC, Java and \CFA.
 …
 @strlen@                                & @length@, @size@              & @length@                      & @size@        \\
 @[ ]@                                   & @[ ]@                                 & @charAt@          & @[ ]@     \\
 @strncpy@                               & @substr@                              & @substring@       & @( )@     \\
 @strncpy@                               & @replace@                             & @replace@         & @=@ \emph{(on a substring)}\\
+@strncpy@                               & @substr@                              & @substring@       & @( )@ RHS @=@     \\
+@strncpy@                               & @replace@                             & @replace@         & @( )@ LHS @=@ \\
 @strstr@                                & @find@                                & @indexOf@         & @find@ \\
 @strcspn@                               & @find_first_of@               & @matches@         & @include@ \\
 …
 \begin{cquote}
 \sf
 \begin{tabular}{@{}rrrrrl@{}}
 \small\tt a & \small\tt b & \small\tt c & \small\tt d & \small\tt e \\
+\begin{tabular}{@{}rrrrll@{}}
+\small\tt "a & \small\tt b & \small\tt c & \small\tt d & \small\tt e" \\
 & 1 & 2 & 3 & 4 & left to right index \\
 -5 & -4 & -3 & -2 & -1 & right to left index
 …
 \begin{cfa}
 #include @<string.hfa>@
 @string@ s = "abcde", name = "MIKE", digit, alpha, punctuation, ifstmt;
+@string@ s = "abcde", name = "MIKE", digit = "0123456789";
 const char cs[] = "abc";
 int i;
+digit  = "0123456789";
+punctuation = "().,";
+ifstmt = "IF (A > B) {";
+\end{cfa}
+Note, the include file @string.hfa@ to access type @string@.
+\end{cfa}
+Note, the include file @<string.hfa>@ to access type @string@.
 …
 Extending the pattern to a regular expression is a possible extension.
+\subsection{Searching}
+The @index@ operation
+\begin{cfa}
+int index( const string & key, int start = 1, occurrence occ = first );
+\end{cfa}
+returns the position of the first or last occurrence of the @key@ (depending on the occurrence indicator @occ@ that is either @first@ or @last@) in the current string starting the search at position @start@.
+If the @key@ does not appear in the current string, the length of the current string plus one is returned.
+%If the @key@ has zero length, the value 1 is returned regardless of what the current string contains.
+A negative starting position is a specification from the right end of the string.
+The replace operation returns a string in which all occurrences of a substring are replaced by another string.
 \begin{cquote}
 \setlength{\tabcolsep}{15pt}
 \begin{tabular}{@{}l|l@{}}
 \begin{cfa}
+i = find( digit, "567" );
+i = find( digit, "567", 7 );
+i = digit.index( "567", -1, last );
+i = name.index( "E", 5, last );
+\end{cfa}
+&
+\begin{cfa}
+s = replace( "PETER", "E", "XX" );
+s = replace( "PETER", "ET", "XX" );
+s = replace( "PETER", "W", "XX" );
+\end{cfa}
+&
+\begin{cfa}
+"PXXTXXR"
+"PXXER"
+"PETER"
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The replacement is done left-to-right and substituted text is not examined for replacement.
+\subsection{Searching}
+The find operation returns the position of the first occurrence of a key string in a string.
+If the key does not appear in the current string, the length of the current string plus one is returned.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+i = find( digit, '3' );
+i = "45" ^ digit; // python style "45" in digit
+string x = "567";
+i = find( digit, x );
+\end{cfa}
+&
+\begin{cfa}
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The next two string operations test a string to see if it is or is not composed completely of a particular class of characters.
+For example, are the characters of a string all alphabetic or all numeric?
+Use of these operations involves a two step operation.
+First, it is necessary to create an instance of type @strmask@ and initialize it to a string containing the characters of the particular character class, as in:
+\begin{cfa}
+strmask digitmask = digit;
+strmask alphamask = string( "abcdefghijklmnopqrstuvwxyz" );
+\end{cfa}
+Second, the character mask is used in the functions @include@ and @exclude@ to check a string for compliance of its characters with the characters indicated by the mask.
+The @include@ operation
+\begin{cfa}
+int include( const strmask &, int = 1, occurrence occ = first );
+\end{cfa}
+returns the position of the first or last character (depending on the occurrence indicator, which is either @first@ or @last@) in the current string that does not appear in the @mask@ starting the search at position @start@;
+hence it skips over characters in the current string that are included (in) the @mask@.
+The characters in the current string do not have to be in the same order as the @mask@.
+If all the characters in the current string appear in the @mask@, the length of the current string plus one is returned, regardless of which occurrence is being searched for.
+A negative starting position is a specification from the right end of the string.
+\begin{cfa}
+i = name.include( digitmask );          $\C{// i is assigned 1}$
+i = name.include( alphamask );          $\C{// i is assigned 6}$
+\end{cfa}
+The @exclude@ operation
+\begin{cfa}
+int exclude( string &mask, int start = 1, occurrence occ = first )
+\end{cfa}
+returns the position of the first or last character (depending on the occurrence indicator, which is either @first@ or @last@) in the current string that does appear in the @mask@ string starting the search at position @start@;
+hence it skips over characters in the current string that are excluded from (not in) in the @mask@ string.
+The characters in the current string do not have to be in the same order as the @mask@ string.
+If all the characters in the current string do NOT appear in the @mask@ string, the length of the current string plus one is returned, regardless of which occurrence is being searched for.
+A negative starting position is a specification from the right end of the string.
+\begin{cfa}
+i = name.exclude( digitmask );          $\C{// i is assigned 6}$
+i = ifstmt.exclude( strmask( punctuation ) ); $\C{// i is assigned 4}$
+\end{cfa}
+The @includeStr@ operation:
+\begin{cfa}
+string includeStr( strmask &mask, int start = 1, occurrence occ = first )
+\end{cfa}
+returns the longest substring of leading or trailing characters (depending on the occurrence indicator, which is either @first@ or @last@) of the current string that ARE included in the @mask@ string starting the search at position @start@.
+A negative starting position is a specification from the right end of the string.
+\begin{cfa}
+s = name.includeStr( alphamask );       $\C{// s is assigned "MIKE"}$
+s = ifstmt.includeStr( alphamask );     $\C{// s is assigned "IF"}$
+s = name.includeStr( digitmask );       $\C{// s is assigned ""}$
+\end{cfa}
+The @excludeStr@ operation:
+\begin{cfa}
+string excludeStr( strmask &mask, int start = 1, occurrence = first )
+\end{cfa}
+returns the longest substring of leading or trailing characters (depending on the occurrence indicator, which is either @first@ or @last@) of the current string that are excluded (NOT) in the @mask@ string starting the search at position @start@.
+A negative starting position is a specification from the right end of the string.
+\begin{cfa}
+s = name.excludeStr( digitmask);        $\C{// s is assigned "MIKE"}$
+s = ifstmt.excludeStr( strmask( punctuation ) ); $\C{// s is assigned "IF "}$
+s = name.excludeStr( alphamask);        $\C{// s is assigned ""}$
+\end{cfa}
+\subsection{Miscellaneous}
+The @trim@ operation
+\begin{cfa}
+string trim( string &mask, occurrence occ = first )
+\end{cfa}
+returns a string in that is the longest substring of leading or trailing characters (depending on the occurrence indicator, which is either @first@ or @last@) which ARE included in the @mask@ are removed.
+\begin{cfa}
+// remove leading blanks
+s = string( "   ABC" ).trim( " " );     $\C{// s is assigned "ABC",}$
+// remove trailing blanks
+s = string( "ABC   " ).trim( " ", last ); $\C{// s is assigned "ABC",}$
+\end{cfa}
+The @translate@ operation
+\begin{cfa}
+string translate( string &from, string &to )
+\end{cfa}
+returns a string that is the same length as the original string in which all occurrences of the characters that appear in the @from@ string have been translated into their corresponding character in the @to@ string.
+Translation is done on a character by character basis between the @from@ and @to@ strings; hence these two strings must be the same length.
+If a character in the original string does not appear in the @from@ string, then it simply appears as is in the resulting string.
+\begin{cfa}
+// upper to lower case
+name = name.translate( "ABCDEFGHIJKLMNOPQRSTUVWXYZ", "abcdefghijklmnopqrstuvwxyz" );
+                        // name is assigned "name"
+s = ifstmt.translate( "ABCDEFGHIJKLMNOPQRSTUVWXYZ", "abcdefghijklmnopqrstuvwxyz" );
+                        // ifstmt is assigned "if (a > b) {"
+// lower to upper case
+name = name.translate( "abcdefghijklmnopqrstuvwxyz", "ABCDEFGHIJKLMNOPQRSTUVWXYZ" );
+                        // name is assigned "MIKE"
+\end{cfa}
+The @replace@ operation
+\begin{cfa}
+string replace( string &from, string &to )
+\end{cfa}
+returns a string in which all occurrences of the @from@ string in the current string have been replaced by the @to@ string.
+\begin{cfa}
+s = name.replace( "E", "XX" );          $\C{// s is assigned "PXXTXXR"}$
+\end{cfa}
+The replacement is done left-to-right.
+When an instance of the @from@ string is found and changed to the @to@ string, it is NOT examined again for further replacement.
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The character-class operations indicates if a string is composed completely of a particular class of characters, \eg, alphabetic, numeric, vowels, \etc.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+charclass vowels{ "aeiouy" };
+i = include( "aaeiuyoo", vowels );
+i = include( "aabiuyoo", vowels );
+\end{cfa}
+&
+\begin{cfa}
+  // compliant
+  // b non-compliant
+\end{cfa}
+\end{tabular}
+\end{cquote}
+@vowels@ defines a character class and function @include@ checks if all characters in the string are included in the class (compliance).
+The position of the last character plus 1 is return if the string is compliant or the position of the first non-compliant character.
+There is no relationship between the order of characters in the two strings.
+Function @exclude@ is the reverse of @include@, checking if all characters in the string are excluded from the class (compliance).
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+i = exclude( "cdbfghmk", vowels );
+i = exclude( "cdyfghmk", vowels );
+\end{cfa}
+&
+\begin{cfa}
+  // compliant
+  // y non-compliant
+\end{cfa}
+\end{tabular}
+\end{cquote}
+Both forms can return the longest substring of compliant characters.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+s = include( "aaeiuyoo", vowels );
+s = include( "aabiuyoo", vowels );
+s = exclude( "cdbfghmk", vowels );
+s = exclude( "cdyfghmk", vowels );
+\end{cfa}
+&
+\begin{cfa}
+"aaeiuyoo"
+"aa"
+"cdbfghmk"
+"cd"
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The test operation checks if each character in a string is in one of the C character classes.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+i = test( "1FeC34aB", @isxdigit@ );
+i = test( ".,;'!\"", @ispunct@ );
+i = test( "XXXx", @isupper@ );
+\end{cfa}
+&
+\begin{cfa}
+   // compliant
+   // compliant
+   // non-compliant
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The position of the last character plus 1 is return if the string is compliant or the position of the first non-compliant character.
+Combining substring and search allows actions like trimming whitespace from the start of a line.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+string line = "  \t  xxx yyy zzz";
+string trim = line( test( line, isspace ) );
+\end{cfa}
+&
+\begin{cfa}
+"xxx yyy zzz"
+\end{cfa}
+\end{tabular}
+\end{cquote}
+The translate operation returns a string with each character transformed by one of the C character transformation functions.
+\begin{cquote}
+\setlength{\tabcolsep}{15pt}
+\begin{tabular}{@{}l|l@{}}
+\begin{cfa}
+s = translate( "abc", @toupper@ );
+s = translate( "ABC", @tolower@ );
+int tospace( int c ) { return isspace( c ) ? ' ' : c; }
+s = translate( "X X\tX\nX", @tospace@ );
+\end{cfa}
+&
+\begin{cfa}
+"ABC"
+"abc"
+"X X X X"
+\end{cfa}
+\end{tabular}
+\end{cquote}

libcfa/src/collections/string.cfa

-              r6174ecc
+              rb1b513d
 // Created On       : Fri Sep 03 11:00:00 2021
 // Last Modified By : Peter A. Buhr
 // Last Modified On : Sat Apr  5 15:18:30 2025
 // Update Count     : 318
+// Last Modified On : Wed Apr  9 22:27:40 2025
+// Update Count     : 368
 //
 …
 bool ?<? ( const char * s1, const string & s2 ) { return s1 <  *s2.inner; }
-////////////////////////////////////////////////////////
-// Getter
-size_t len( const string & s ) {
-        return len( *s.inner );
+}
 ////////////////////////////////////////////////////////
 // Concatenation
 …
+}
 int find( const string & s, size_t start, size_t len, const string & key, size_t kstart, size_t klen ) {
+size_t find( const string & s, size_t start, size_t len, const string & key, size_t kstart, size_t klen ) {
         if ( start < 0 ) { start += len( s ); }
         if ( len < 0 ) { len = -len; start -= len; }
 …
         if ( kstart >= len( key ) ) return 0;
         if ( kstart + klen > len( key ) ) klen = len( key ) - kstart;
         return findFrom( *s.inner, start, *key.inner );
+}
 int find( const string & s, char key ) {
+size_t find( const string & s, char key ) {
         return find( *s.inner, key );
+}
 int find( const string & s, const string & key ) {
+size_t find( const string & s, const string & key ) {
         return find( *s.inner, *key.inner );
+}
 int find( const string & s, const char * key ) {
+size_t find( const string & s, const char * key ) {
         return find( *s.inner, key );
+}
 int find( const string & s, const char * key, size_t keysize ) {
+size_t find( const string & s, const char * key, size_t keysize ) {
         return find( *s.inner, key, keysize );
+}
 int find( const string & s, size_t start, char key ) {
+size_t find( const string & s, size_t start, char key ) {
         return findFrom( *s.inner, start, key );
+}
 int find( const string & s, size_t start, const char * key ) {
+size_t find( const string & s, size_t start, const char * key ) {
         return findFrom( *s.inner, start, key );
+}
 int find( const string & s, size_t start, const char * key, size_t keysize ) {
+size_t find( const string & s, size_t start, const char * key, size_t keysize ) {
         return findFrom( *s.inner, start, key, keysize );
+}
 …
+}
+int exclude( const string & s, const charclass & mask ) {
+size_t exclude( const string & s, const charclass & mask ) {
         return exclude( *s.inner, *mask.inner );
+}
+/*
+StrSlice exclude( string & s, const charclass & mask ) {
+}
+*/
+int include( const string & s, const charclass & mask ) {
+size_t include( const string & s, const charclass & mask ) {
         return include( *s.inner, *mask.inner );
+}
+/*
+StrSlice include( string & s, const charclass & mask ) {
+}
+*/
+size_t test( const string & s, int (*f)( int ) ) {
+        size_t l = len( s );
+        for ( i; l ) {
+                if ( ! f( s[i] ) ) return i;
+        } // for
+        return l;
+}
+string replace( string & s, const string & from, const string & to ) {
+        ssize_t pos;
+    string r;
+    pos = find( s, from );
+    if ( pos < len( s ) ) {
+                r = s( 0, pos ) + to + replace( s( pos + (ssize_t)len( from ) ), from, to );
+                string front = s( 0, pos );
+                string back = s( pos + (ssize_t)len( from ) );
+                r = front + to + replace( back, from, to );
+    } else {
+                r = s;
+    } // if
+    return r;
+}
+string translate( const string & s, int (*f)( int ) ) {
+        string r = s;
+        size_t l = len( r );
+        for ( i; l ) {
+                r[i] = (char)f( r[i] );
+        } // for
+        return r;
+}

libcfa/src/collections/string.hfa

-              r6174ecc
+              rb1b513d
 // Created On       : Fri Sep 03 11:00:00 2021
 // Last Modified By : Peter A. Buhr
 // Last Modified On : Sat Apr  5 15:16:23 2025
 // Update Count     : 180
+// Last Modified On : Wed Apr  9 22:27:41 2025
+// Update Count     : 259
 //
 …
 #include <fstream.hfa>
+// in string_res.hfa
+struct string_res;
+struct charclass_res;
+#include <string_res.hfa>
 struct string {
 …
 // Getters
+static inline size_t len( const string & s ) { return len( *s.inner ); }
 static inline size_t len( const char * cs ) { return strlen( cs ); };
-size_t len( const string & s );
 static inline size_t strlen( const string & s ) { return len( s ); }
 …
 bool contains( const string & s, char ch );                             // single character
+int find( const string & s, char key );
+static inline int ?^? ( const string & s, char key ) { return find( s, key ); }
+int find( const string & s, const char * key );
+static inline int ?^? ( const string & s, const char * key ) { return find( s, key ); }
+int find( const string & s, const string & key );
+static inline int ?^? ( const string & s, const string & key ) { return find( s, key ); }
+int find( const string & s, const char * key, size_t keysize );
+int find( const string & s, size_t start, char key );
+int find( const string & s, size_t start, const string & key );
+int find( const string & s, size_t start, const char * key );
+int find( const string & s, size_t start, const char * key, size_t keysize );
+//int find( const string & s, size_t start, size_t len, const string & key, size_t kstart, size_t klen );
+size_t find$( const string_res & s, size_t start, size_t len, const string & key_res, size_t kstart, size_t klen );
+size_t find( const string & s, char key );
+size_t find( const string & s, const char * key );
+size_t find( const string & s, const string & key );
+size_t find( const string & s, const char * key, size_t keysize );
+size_t find( const string & s, size_t start, char key );
+size_t find( const string & s, size_t start, const string & key );
+size_t find( const string & s, size_t start, const char * key );
+size_t find( const string & s, size_t start, const char * key, size_t keysize );
+static inline ?^?( const string & key, const string & s ) { return find( s, key ); }
+static inline ?^?( const char * key, const string & s ) { return find( s, key ); }
 bool includes( const string & s, const string & mask );
 …
 // Slicing
+string ?()( string & s, ssize_t start, ssize_t len );           // TODO const?
+string ?()( string & s, ssize_t start, ssize_t len );
+static inline string ?()( const string & s, ssize_t start, ssize_t len ) { string & w = (string &)s; return w( start, len ); } // FIX ME
 string ?()( string & s, ssize_t start );
+static inline string ?()( const string & s, ssize_t start ) { string & w = (string &)s; return w( start ); } // FIX ME
 static inline string ?()( string & s, char m ) { return s( find( s, m ), 1 )`share; }
+static inline string ?()( const string & s, char m ) { string & w = (string &)s; return w( find( s, m ), 1 )`share; } // FIX ME
 static inline string ?()( string & s, const char * m ) { return s( find( s, m ), len( m ) )`share; }
+static inline string ?()( const string & s, const char * m ) { string & w = (string &)s; return w( find( s, m ), len( m ) )`share; } // FIX ME
 static inline string ?()( string & s, const string & m ) { return s( find( s, m ), len( m ) )`share; }
+// Modifiers
+void padStart( string & s, size_t n );
+void padStart( string & s, size_t n, char padding );
+void padEnd( string & s, size_t n );
+void padEnd( string & s, size_t n, char padding );
+static inline string ?()( const string & s, const string & m ) { string & w = (string &)s; return w( find( s, m ), len( m ) )`share; } // FIX ME
 struct charclass {
 …
 void ^?{}( charclass & );
+int include( const string & s, const charclass & mask );
+int exclude( const string & s, const charclass & mask );
+/*
+What to do with?
+StrRet include( string & s, const charclass & mask );
+StrRet exclude( string & s, const charclass & mask );
+*/
+size_t include( const string & s, const charclass & mask );
+static inline size_t include( const char * s, const charclass & mask ) { string temp = s; return include( temp, mask ); }
+static inline string include( const string & s, const charclass & mask ) { ssize_t i = include( s, mask ); return s( 0, i )`share; }
+static inline string include( const char * s, const charclass & mask ) { string temp = s; ssize_t i = include( temp, mask ); return temp( 0, i ); }
+size_t exclude( const string & s, const charclass & mask );
+static inline size_t exclude( const char * s, const charclass & mask ) { string temp = s; return exclude( temp, mask ); }
+static inline string exclude( const string & s, const charclass & mask ) { ssize_t i = exclude( s, mask ); return s( 0, i )`share; }
+static inline string exclude( const char * s, const charclass & mask ) { string temp = s; ssize_t i = exclude( temp, mask ); return temp( 0, i ); }
+size_t test( const string & s, int (*f)( int ) );
+static inline size_t test( const char * c, int (*f)( int ) ) {
+        const string S = c;
+        return test( S, f );
+}
+string replace( string & s, const string & from, const string & to );
+static inline string replace( const char * s, const char * from, const char * to ) {
+        string S = s, From = from, To = to;
+        return replace( S, From, To );
+}
+static inline string replace( string & s, const char * from, const char * to ) {
+        string From = from, To = to;
+        return replace( s, From, To );
+}
+static inline string replace( string & s, const char * from, const string & to ) {
+        string From = from;
+        return replace( s, From, to );
+}
+static inline string replace( string & s, string & from, const char * to ) {
+        string To = to;
+        return replace( s, from, To );
+}
+string translate( const string & s, int (*f)( int ) );
+static inline string translate( const char * c, int (*f)( int ) ) {
+        const string S = c;
+        return translate( S, f );
+}

libcfa/src/collections/string_res.cfa

-              r6174ecc
+              rb1b513d
 // Created On       : Fri Sep 03 11:00:00 2021
 // Last Modified By : Peter A. Buhr
 // Last Modified On : Sun Apr  6 07:38:02 2025
 // Update Count     : 111
+// Last Modified On : Wed Apr  9 08:44:17 2025
+// Update Count     : 128
 //
 …
 const char * DEBUG_string_heap_start( VbyteHeap * heap ) {
         return heap->StartVbyte;
+}
-// Returns the size of the string in bytes
-size_t len(const string_res & s) with(s) {
-        return Handle.lnth;
+}
 …
+}
+int find(const string_res & s, char search) {
+// int find$( const string_res & s, ssize_t start, ssize_t len, const string_res & k, ssize_t kstart, ssize_t klen ) {
+//     if ( start < 0 ) start = s.Handle.lnth + start;          // adjust negative starting locations
+//     if ( kstart < 0 ) kstart = k.Handle.lnth + kstart;
+//      if ( start + len > s.Handle.lnth ) return start + 1;  // cannot be there
+//      if ( kstart + len > k.Handle.lnth ) return start + 1;
+//      if ( klen > len ) return start + 1;
+//      int i, r;
+//      for ( i = max( start, 1 ); ; i += 1 ) {
+//              if ( i > s.Handle.lnth - k.Handle.lnth + 1 ) {
+//                      r = s.Handle.lnth + 1;
+//                      break;
+//          } // exit
+//              if ( HeapArea->ByteCmp( s.Handle.s, i, k.Handle.lnth, k.Handle.s, 1, k.Handle.lnth ) == 0 ) {
+//                      r = i;
+//                      break;
+//          } // exit
+//      } // for
+//      return r;
+// }
+int find( const string_res & s, char search ) {
         return findFrom(s, 0, search);
+}
 int findFrom(const string_res & s, size_t fromPos, char search) {
+int findFrom( const string_res & s, size_t fromPos, char search ) {
         // FIXME: This paricular overload (find of single char) is optimized to use memchr.
         // The general overload (find of string, memchr applying to its first character) and `contains` should be adjusted to match.

libcfa/src/collections/string_res.hfa

-              r6174ecc
+              rb1b513d
 // Created On       : Fri Sep 03 11:00:00 2021
 // Last Modified By : Peter A. Buhr
 // Last Modified On : Sun Apr  6 07:35:44 2025
 // Update Count     : 70
+// Last Modified On : Wed Apr  9 15:16:29 2025
+// Update Count     : 76
 //
 …
 // Getters
 size_t len( const string_res & s);
+static inline size_t len( const string_res & s ) { return s.Handle.lnth; }
 // Constructors, Assignment Operators, Destructor
 …
 bool contains( const string_res & s, char ch); // single character
+int find$( const string_res & s, ssize_t start, ssize_t len, const string_res & key, ssize_t kstart, ssize_t klen );
 int find( const string_res & s, char search);
 int find( const string_res & s, const string_res & search);
 …
 int exclude( const string_res & s, const charclass_res & mask);
-// Modifiers
-void padStart(string_res & s, size_t n);
-void padStart(string_res & s, size_t n, char padding);
-void padEnd(string_res & s, size_t n);
-void padEnd(string_res &s, size_t n, char padding);

tests/collections/.expect/string-api-coverage.txt

r6174ecc	rb1b513d
85	85	true false true false true true true false true false
86	86	3 0 0 11 26 0
	87	abc abcdefghijk abcdefghijklmnopqrstuvwxyz

tests/collections/string-api-coverage.cfa

-              r6174ecc
+              rb1b513d
     sout
+        | include( alphabet, cc_cba )  // 3
+        | exclude( alphabet, cc_cba )  // 0
+        | include( alphabet, cc_onml )  // 0
+        | exclude( alphabet, cc_onml )  // 11
+        | include( alphabet, cc_alphabet )  // 26
+        | exclude( alphabet, cc_alphabet ); // 0
+        | (return size_t)include( alphabet, cc_cba )  // 3
+        | (return size_t)exclude( alphabet, cc_cba )  // 0
+        | (return size_t)include( alphabet, cc_onml )  // 0
+        | (return size_t)exclude( alphabet, cc_onml )  // 11
+        | (return size_t)include( alphabet, cc_alphabet )  // 26
+        | (return size_t)exclude( alphabet, cc_alphabet ); // 0
+    sout
+        | (return string)include( alphabet, cc_cba )  // "abc"
+        | (return string)exclude( alphabet, cc_cba )  // ""
+        | (return string)include( alphabet, cc_onml )  // ""
+        | (return string)exclude( alphabet, cc_onml )  // "abcdefghijk"
+        | (return string)include( alphabet, cc_alphabet )  // "abcdefghijklmnopqrstuvwxyz"
+        | (return string)exclude( alphabet, cc_alphabet ); // ""
+}

Note: See TracChangeset for help on using the changeset viewer.

Download in other formats: