Changeset 2d373440
- Timestamp:
- Nov 27, 2023, 1:06:19 PM (13 months ago)
- Branches:
- master
- Children:
- 5ddb8bf, da5ac2e
- Parents:
- 69e06ff
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
doc/proposals/enum.tex
r69e06ff r2d373440 289 289 \end{lstlisting} 290 290 291 \subsection{Runtime Enumeration}292 293 The companion structure definition is visible to users, and users can create an instance of companion object themselves, which effectively constructs a \textit{Runtime Enumeration}.294 \begin{lstlisting}[ label=lst:runtime_enum ]295 const char values[$\,$] = { "Hello", "World" };296 const char labels[$\,$] = { "First", "Second" };297 Companion(char *) MyEnum = { .values: values, .labels: labels, .length: 2 };298 \end{lstlisting}299 A runtime enumeration can be used to call enumeration functions.300 \begin{lstlisting}[ label=lst:runtime_enum_usage ]301 sout | charatstic_string( MyEnum, 1 );302 >>> Label: Second; Value: World303 \end{lstlisting}304 However, a runtime enumeration cannot create an enumeration instance, and it does not support enum-qualified syntax.305 \begin{lstlisting}[ label=lst:runtime_enum_usage ]306 MyEnum e = MyEnum.First; // Does not work: cannot create an enumeration instance e,307 // and MyEnum.First is not recognizable308 \end{lstlisting}309 During the compilation, \CFA adds enumeration declarations to an enumeration symbol table and creates specialized function definitions for \CFA enumeration.310 \CFA does not recognize runtime enumeration during compilation and would not add them to the enumeration symbol table, resulting in a lack of supports for runtime enumeration.311 312 \PAB{Not sure how useful this feature is.}313 314 \section{Enumeration Features}291 % \subsection{Runtime Enumeration} 292 293 % The companion structure definition is visible to users, and users can create an instance of companion object themselves, which effectively constructs a \textit{Runtime Enumeration}. 294 % \begin{lstlisting}[ label=lst:runtime_enum ] 295 % const char values[$\,$] = { "Hello", "World" }; 296 % const char labels[$\,$] = { "First", "Second" }; 297 % Companion(char *) MyEnum = { .values: values, .labels: labels, .length: 2 }; 298 % \end{lstlisting} 299 % A runtime enumeration can be used to call enumeration functions. 300 % \begin{lstlisting}[ label=lst:runtime_enum_usage ] 301 % sout | charatstic_string( MyEnum, 1 ); 302 % >>> Label: Second; Value: World 303 % \end{lstlisting} 304 % However, a runtime enumeration cannot create an enumeration instance, and it does not support enum-qualified syntax. 305 % \begin{lstlisting}[ label=lst:runtime_enum_usage ] 306 % MyEnum e = MyEnum.First; // Does not work: cannot create an enumeration instance e, 307 % // and MyEnum.First is not recognizable 308 % \end{lstlisting} 309 % During the compilation, \CFA adds enumeration declarations to an enumeration symbol table and creates specialized function definitions for \CFA enumeration. 310 % \CFA does not recognize runtime enumeration during compilation and would not add them to the enumeration symbol table, resulting in a lack of supports for runtime enumeration. 311 312 % \PAB{Not sure how useful this feature is.} 313 314 % \section{Enumeration Features} 315 315 316 316 A trait is a collection of constraints in \CFA that can be used to describe types. 317 317 The \CFA standard library defines traits to categorize types with related enumeration features. 318 318 319 \section{Enumerator Initialization} 320 An enumerator must have a deterministic immutable value, either be explicitly initialized in the enumeration definition, or implicitly initialized by rules. 321 322 \subsection{C Enumeration Rule} 323 A C enumeration has an integral type. If not initialized, the first enumerator implicitly has the integral value 0, and other enumerators have a value equal to its $predecessor + 1$. 324 319 325 \subsection{Auto Initializable} 320 326 \label{s:AutoInitializable} 321 327 322 TODO: make the initialization rule a separate section. 323 324 If no explicit initializer is given to an enumeration constant, C initializes the first enumeration constant with value 0, and the next enumeration constant has a value equal to its $predecessor + 1$. 328 325 329 \CFA enumerations have the same rule in enumeration constant initialization. 326 330 However, only \CFA types that have defined traits for @zero_t@, @one_t@, and an addition operator can be automatically initialized by \CFA. … … 356 360 }; 357 361 \end{lstlisting} 358 Note ,there is no mechanism to prevent an even value for the direct initialization, such as @C = 6@.362 Note that there is no mechanism to prevent an even value for the direct initialization, such as @C = 6@. 359 363 360 364 In \CFA, character, integral, float, and imaginary types are all @AutoInitialiable@. … … 367 371 >>> F, o, z 368 372 \end{lstlisting} 369 373 \section{Enumeration Features} 370 374 \subsection{Iteration and Range} 371 375 372 It is convenient to iterate over a \CFA enumeration , e.g.:376 It is convenient to iterate over a \CFA enumeration value, e.g.: 373 377 \begin{lstlisting}[label=lst:range_functions] 374 for ( Alphabet alph; Alphabet ) { 375 printf( "%d ", alph ); 376 } 377 >>> A B C ... 378 for ( Alphabet alph; Alphabet ) { sout | alph; } 379 >>> A B C ... D 378 380 \end{lstlisting} 379 381 The for-loop uses the enumeration type @Alphabet@ its range, and iterates through all enumerators in the order defined in the enumeration. 380 382 @alph@ is the iterating enumeration object, which returns the value of an @Alphabet@ in this context according to the precedence rule. 381 383 382 \ CFA offers a shorthand for iterating all enumeration constants:384 \textbullet\ \CFA offers a shorthand for iterating all enumeration constants: 383 385 \begin{lstlisting}[label=lst:range_functions] 384 for ( Alphabet alph ) { 385 printf( "%d ", alph ); 386 } 387 >>> A B C ... 388 \end{lstlisting} 389 The following different loop-control syntax is supported: 390 \begin{lstlisting}[label=lst:range_functions] 391 for ( Alphabet.D ) 392 for ( alph; Alphabet.g ~ Alphabet.z ) 393 for ( Alphabet alph; Alphabet.R ~ Alphabet.X ~ 2 ) 394 \end{lstlisting} 395 \PAB{Explain what each loop does.} 396 Notably, the meaning of ``step'' for an iteration has changed for enumeration. 397 Consider the following example: 398 \begin{lstlisting}[label=lst:range_functions] 399 enum(int) Sequence { 400 A = 10, B = 12, C = 14; 401 } 402 for ( s; Sequence.A ~ Sequence.C ) { 403 printf( "%d ", s ); 404 } 405 >>> 10 12 14 406 407 for ( s; Sequence.A ~ Sequence.A ~ 2 ) { 408 printf( "%d ", s ); 409 } 410 >>> 10 14 411 \end{lstlisting} 412 The range iteration of enumeration does not return the @current_value++@ until it reaches the upper bound. 413 The semantics is to return the next enumeration constant. 414 If a stepping is specified, 2 for example, it returns the 2 enumeration constant after the current one, rather than the @current+2@. 386 for ( Alphabet alph ) { sout | alph; } 387 >>> A B C ... D 388 \end{lstlisting} 389 390 The following are examples for constructing for-control using an enumeration. Note that the type declaration of the iterating variable is optional, because \CFA can infer the type as EnumInstType based on the range expression, and possibly convert it to one of its attribute types. 391 392 \textbullet\ H is implicit up-to exclusive range [0, H). 393 \begin{lstlisting}[label=lst:range_function_1] 394 for ( alph; Alphabet.D ) { sout | alph; } 395 >>> A B C 396 \end{lstlisting} 397 398 \textbullet\ ~= H is implicit up-to inclusive range [0,H]. 399 \begin{lstlisting}[label=lst:range_function_2] 400 for ( alph; ~= Alphabet.D ) { sout | alph; } 401 >>> A B C D 402 \end{lstlisting} 403 404 \textbullet\ L ~ H is explicit up-to exclusive range [L,H). 405 \begin{lstlisting}[label=lst:range_function_3] 406 for ( alph; Alphabet.B ~ Alphabet.D ) { sout | alph; } 407 // for ( Alphabet alph = Alphabet.B; alph < Alphabet.D; alph += 1 ); 1 is one_t 408 >>> B C 409 \end{lstlisting} 410 411 \textbullet\ L ~= H is explicit up-to inclusive range [L,H]. 412 \begin{lstlisting}[label=lst:range_function_4] 413 for ( alph; Alphabet.B ~= Alphabet.D ) { sout | alph; } 414 >>> B C D 415 \end{lstlisting} 416 417 \textbullet\ L -~ H is explicit down-to exclusive range [H,L), where L and H are implicitly interchanged to make the range down-to. 418 \begin{lstlisting}[label=lst:range_function_5] 419 for ( alph; Alphabet.D -~ Alphabet.B ) { sout | alph; } 420 >>> D C 421 \end{lstlisting} 422 423 \textbullet\ L -~= H is explicit down-to exclusive range [H,L], where L and H are implicitly interchanged to make the range down-to. 424 \begin{lstlisting}[label=lst:range_function_6] 425 for ( alph; Alphabet.D -~= Alphabet.B ) { sout | alph; } 426 >>> D C B 427 \end{lstlisting} 428 429 A user can specify the ``step size'' of an iteration. There are two different stepping schemes of enumeration for-loop. 430 \begin{lstlisting}[label=lst:range_function_stepping] 431 enum(int) Sequence { A = 10, B = 12, C = 14, D = 16, D = 18 }; 432 for ( s; Sequence.A ~= Sequence.D ~ 1 ) { sout | alph; } 433 >>> 10 12 14 16 18 434 for ( s; Sequence.A ~= Sequence.D; s+=1 ) { sout | alph; } 435 >>> 10 11 12 13 14 15 16 17 18 436 \end{lstlisting} 437 The first syntax is stepping to the next enumeration constant, which is the default stepping scheme if not explicitly specified. The second syntax, on the other hand, is to call @operator+=@ @one_type@ on the @value( s )@. Therefore, the second syntax is equivalent to 438 \begin{lstlisting}[label=lst:range_function_stepping_converted] 439 for ( typeof( value(Sequence.A) ) s=value( Sequence.A ); s <= Sequence.D; s+=1 ) { sout | alph; } 440 >>> 10 11 12 13 14 15 16 17 18 441 \end{lstlisting} 442 443 % \PAB{Explain what each loop does.} 415 444 416 445 It is also possible to iterate over an enumeration's labels, implicitly or explicitly: … … 424 453 \end{lstlisting} 425 454 455 456 % \subsection{Non-uniform Type} 457 % TODO: Working in Progress, might need to change other sections. Conflict with the resolution right now. 458 459 % \begin{lstlisting} 460 % enum T( int, char * ) { 461 % a=42, b="Hello World" 462 % }; 463 % \end{lstlisting} 464 % The enum T declares two different types: int and char *. The enumerators of T hold values of one of the declared types. 465 466 \subsection{Enumeration Inheritance} 467 468 \begin{lstlisting}[label=lst:EnumInline] 469 enum( char * ) Name { Jack = "Jack", Jill = "Jill" }; 470 enum /* inferred */ Name2 { inline Name, Sue = "Sue", Tom = "Tom" }; 471 \end{lstlisting} 472 \lstinline{Inline} allows Enumeration Name2 to inherit enumerators from Name1 by containment, and a Name enumeration is a subtype of enumeration Name2. An enumeration instance of type Name can be used where an instance of Name2 is expected. 473 \begin{lstlisting}[label=lst:EnumInline] 474 Name Fred; 475 void f( Name2 ); 476 f( Fred ); 477 \end{lstlisting} 478 If enumeration A declares @inline B@ in its enumeration body, enumeration A is the "inlining enum" and enumeration B is the "inlined enum". 479 480 An enumeration can inline at most one other enumeration. The inline declaration must be placed before the first enumerator of the inlining enum. The inlining enum has all the enumerators from the inlined enum, with the same labels, values, and position. 481 \begin{lstlisting}[label=lst:EnumInline] 482 enum /* inferred */ Name2 { inline Name, Sue = "Sue", Tom = "Tom" }; 483 // is equivalent to enum Name2 { Jack = "Jack", Jill="Jill", Sue = "Sue", Tom = "Tom" }; 484 \end{lstlisting} 485 Name.Jack is equivalent to Name2.Jack. Their attributes are all identical. Opening both Name and Name2 in the same scope will not introduce ambiguity. 486 \begin{lstlisting}[label=lst:EnumInline] 487 with( Name, Name2 ) { Jack; } // Name.Jack and Name2.Jack are equivalent. No ambiguity 488 \end{lstlisting} 489 426 490 \section{Implementation} 427 491 428 \CFA places the definition of Companion structure and non-parameterized Companion functions in the prelude, visible globally. 492 \subsection{Compiler Representation} 493 The definition of an enumeration is represented by an internal type called @EnumDecl@. At the minimum, it stores all the information needed to construct the companion object. Therefore, an @EnumDecl@ can be represented as the following: 494 \begin{lstlisting}[label=lst:EnumDecl] 495 forall(T) 496 class EnumDecl { 497 T* values; 498 char** label; 499 }; 500 \end{lstlisting} 501 502 The internal representation of an enumeration constant is @EnumInstType@. 503 An @EnumInstType@ has a reference to the \CFA-enumeration declaration and the position of the enumeration constant. 504 \begin{lstlisting}[label=lst:EnumInstType] 505 class EnumInstType { 506 EnumDecl enumDecl; 507 int position; 508 }; 509 \end{lstlisting} 510 In the later discussion, we will use @EnumDecl<T>@ to symbolize a @EnumDecl@ parameterized by type T, and @EnumInstType<T>@ is a declared instance of @EnumDecl<T>@. 511 512 % \subsection{Preluede} 513 % \CFA places the definition of Companion structure and non-parameterized Companion functions in the prelude, visible globally. 429 514 430 515 \subsection{Declaration} … … 479 564 If the @aggregation_name@ is identified as a \CFA enumeration, the compiler checks if @field@ presents in the declared \CFA enumeration. 480 565 481 \subsection{\lstinline{with} Statement} 482 483 \emph{Work in Progress} 484 566 \subsection{\lstinline{with} Clause/Statement} 485 567 Instead of qualifying an enumeration expression every time, the @with@ can be used to expose enumerators to the current scope, making them directly accessible. 568 \begin{lstlisting}[label=lst:declaration] 569 enum Color( char * ) { Red="R", Green="G", Blue="B" }; 570 enum Animal( int ) { Cat=10, Dog=20 }; 571 with ( Color, Animal ) { 572 char * red_string = Red; // value( Color.Red ) 573 int cat = Cat; // value( Animal.Cat ) 574 } 575 \end{lstlisting} 576 The \lstinline{with} might introduce ambiguity to a scope. Consider the example: 577 \begin{lstlisting}[label=lst:declaration] 578 enum Color( char * ) { Red="R", Green="G", Blue="B" }; 579 enum RGB( int ) { Red=0, Green=1, Blue=2 }; 580 with ( Color, RGB ) { 581 // int red = Red; 582 } 583 \end{lstlisting} 584 \CFA will not try to resolve the expression with ambiguity. It would report an error. In this case, it is necessary to qualify @Red@ even inside of the \lstinline{with} clause. 486 585 487 586 \subsection{Instance Declaration} 488 587 489 \emph{Work in Progress} 490 491 \begin{lstlisting}[label=lst:declaration] 588 589 \begin{lstlisting}[label=lst:var_declaration] 492 590 enum Sample s1; 493 Sample s2; 494 \end{lstlisting} 495 A declaration of \CFA enumeration instance that has no difference than a C enumeration or other \CFA aggregation. 496 The compiler recognizes the type of a variable declaration by searching the name in all possible types. 497 The @enum@ keyword is not necessary but helps to disambiguate types (questionable). 498 The generated code for a \CFA enumeration declaration is utterly an integer, which is meant to store the position. 499 \begin{lstlisting}[label=lst:declaration] 591 \end{lstlisting} 592 593 The declaration \CFA-enumeration variable has the same syntax as the C-enumeration. Internally, such a variable will be represented as an EnumInstType. 594 \begin{lstlisting}[label=lst:declaration_code] 500 595 int s1; 501 int s2; 502 \end{lstlisting} 503 504 \subsection{Compiler Representation} 505 506 \emph{Work in Progress} 507 508 The internal representation of an enumeration constant is @EnumInstType@. 509 The minimum information an @EnumInstType@ stores is a reference to the \CFA-enumeration declaration and the position of the enumeration constant. 510 \begin{lstlisting}[label=lst:EnumInstType] 511 class EnumInstType { 512 EnumDecl enumDecl; 513 int position; 514 }; 515 \end{lstlisting} 596 \end{lstlisting} 597 The generated code for an enumeration instance is simply an int. It is to hold the position of an enumeration. And usage of variable @s1@ will be converted to return one of its attributes: label, value, or position, with respect to the @Unification@ rule 516 598 517 599 \subsection{Unification and Resolution } 518 600 519 \emph{Work in Progress}520 601 521 602 \begin{lstlisting} … … 604 685 @unification( EnumInstType<Colour>, int )@: @position( EnumInstType< Colour > )@ 605 686 \item 606 return the enumeration constant at theposition 1687 return the enumeration constant at position 1 607 688 \end{enumerate} 608 689 \begin{lstlisting} … … 623 704 The downside of the precedence rule: @EnumInstType@ $\Rightarrow$ @int ( value )@ $\Rightarrow$ @EnumInstType@ may return a different @EnumInstType@ because the value can be repeated and there is no way to know which one is expected $\Rightarrow$ want uniqueness 624 705 706 \subsection{Casting} 707 Casting an EnumInstType to some other type T works similarly to unify the EnumInstType with T. For example: 708 \begin{lstlisting} 709 enum( int ) Foo { A = 10, B = 100, C = 1000 }; 710 (int) Foo.A; 711 \end{lstlisting} 712 The \CFA-compiler unifies @EnumInstType<int>@ with int, with returns @value( Foo.A )@, which has statically known value 10. In other words, \CFA-compiler is aware of a cast expression, and it forms the context for EnumInstType resolution. The expression with type @EnumInstType<int>@ can be replaced by the compile with a constant expression 10, and optionally discard the cast expression. 713 714 \subsection{Value Conversion} 715 As discussed in section~\ref{lst:var_declaration}, \CFA only saves @position@ as the necessary information. It is necessary for \CFA to generate intermediate code to retrieve other attributes. 716 717 \begin{lstlisting} 718 Foo a; // int a; 719 int j = a; 720 char * s = a; 721 \end{lstlisting} 722 Assume stores a value x, which cannot be statically determined. When assigning a to j in line 2, the compiler @Unify@ j with a, and returns @value( a )@. The generated code for the second line will be 723 \begin{lstlisting} 724 int j = value( Foo, a ) 725 \end{lstlisting} 726 Similarly, the generated code for the third line is 727 \begin{lstlisting} 728 char * j = label( Foo, a ) 729 \end{lstlisting} 730 625 731 \end{document} 626 732
Note: See TracChangeset
for help on using the changeset viewer.