Context Navigation

← Previous Changeset
Next Changeset →

Changeset 5a48d79

Timestamp:

Apr 11, 2017, 9:37:48 AM (9 years ago)

Author:

Rob Schluntz <rschlunt@…>

Branches:

ADT, aaron-thesis, arm-eh, ast-experimental, cleanup-dtors, deferred_resn, demangler, enum, forall-pointer-decay, jacob/cs343-translation, jenkins-sandbox, master, new-ast, new-ast-unique-expr, new-env, no_list, persistent-indexer, pthread-emulation, qualifiedEnum, resolv-new, stuck-waitfor-destruct, with_gc

Children:

4b0f997, e6dceef

Parents:

2ccb93c (diff), b39e3dae (diff)
Note: this is a merge changeset, the changes displayed below correspond to the merge itself.
Use the (diff) links above to see all the changes relative to each parent.

Message:

Merge branch 'master' of plg.uwaterloo.ca:/u/cforall/software/cfa/cfa-cc

Location:

Files:

: 10 edited

bibliography/cfa.bib (modified) (1 diff)
generic_types/evaluation/Makefile (modified) (1 diff)
generic_types/evaluation/bench.h (modified) (2 diffs)
generic_types/evaluation/bench.hpp (modified) (2 diffs)
generic_types/evaluation/cfa-stack.c (modified) (1 diff)
generic_types/evaluation/cpp-vstack.cpp (modified) (1 diff)
generic_types/generic_types.tex (modified) (5 diffs)
proposals/concurrency/concurrency.tex (modified) (1 diff)
proposals/concurrency/style.tex (modified) (1 diff)
proposals/concurrency/version (modified) (1 diff)

Legend:

: Unmodified
: Added
: Removed

doc/bibliography/cfa.bib

r2ccb93c	r5a48d79
5463	5463	contributer = {pabuhr@plg},
5464	5464	title = {The Programming Language {Ada}: Reference Manual},
	5465	author = {Ada},
5465	5466	organization= {United States Department of Defense},
5466	5467	edition = {{ANSI/MIL-STD-1815A-1983}},

doc/generic_types/evaluation/Makefile

-              r2ccb93c
+              r5a48d79
 run-c: c-bench
+        @echo
         @echo '## C ##'
         @./c-bench
         @printf 'source_size:\t%7d lines\n' `cat c-bench.c bench.h c-stack.h c-stack.c | wc -l`
         @printf 'binary_size:\t%7d bytes\n' `wc -c < c-bench`
+        @/usr/bin/time -f 'max_memory:\t%M kilobytes' ./c-bench
+        @printf 'source_size:\t%8d lines\n' `cat c-bench.c bench.h c-stack.h c-stack.c | wc -l`
+        @printf 'binary_size:\t%8d bytes\n' `stat -c %s c-bench`
 run-cfa: cfa-bench
+        @echo
         @echo '## Cforall ##'
         @./cfa-bench
         @printf 'source_size:\t%7d lines\n' `cat cfa-bench.c bench.h cfa-stack.h cfa-stack.c | wc -l`
         @printf 'binary_size:\t%7d bytes\n' `wc -c < cfa-bench`
+        @/usr/bin/time -f 'max_memory:\t %M kilobytes' ./cfa-bench
+        @printf 'source_size:\t%8d lines\n' `cat cfa-bench.c bench.h cfa-stack.h cfa-stack.c | wc -l`
+        @printf 'binary_size:\t%8d bytes\n' `stat -c %s cfa-bench`
 run-cpp: cpp-bench
+        @echo
         @echo '## C++ ##'
         @./cpp-bench
         @printf 'source_size:\t%7d lines\n' `cat cpp-bench.cpp bench.hpp cpp-stack.hpp | wc -l`
         @printf 'binary_size:\t%7d bytes\n' `wc -c < cpp-bench`
+        @/usr/bin/time -f 'max_memory:\t %M kilobytes' ./cpp-bench
+        @printf 'source_size:\t%8d lines\n' `cat cpp-bench.cpp bench.hpp cpp-stack.hpp | wc -l`
+        @printf 'binary_size:\t%8d bytes\n' `stat -c %s cpp-bench`
 run-cppv: cpp-vbench
+        @echo
         @echo '## C++ virtual ##'
         @./cpp-vbench
         @printf 'source_size:\t%7d lines\n' `cat cpp-vbench.cpp bench.hpp object.hpp cpp-vstack.hpp cpp-vstack.cpp | wc -l`
         @printf 'binary_size:\t%7d bytes\n' `wc -c < cpp-vbench`
+        @/usr/bin/time -f 'max_memory:\t%M kilobytes' ./cpp-vbench
+        @printf 'source_size:\t%8d lines\n' `cat cpp-vbench.cpp bench.hpp object.hpp cpp-vstack.hpp cpp-vstack.cpp | wc -l`
+        @printf 'binary_size:\t%8d bytes\n' `stat -c %s cpp-vbench`
 run: run-c run-cfa run-cpp run-cppv

doc/generic_types/evaluation/bench.h

-              r2ccb93c
+              r5a48d79
 #include <time.h>
+ #define N 100000000
+#define N 100000000
 long ms_between(clock_t start, clock_t end) {
 …
         code \
         _end = clock(); \
         printf("%s:\t%7ld ms\n", name, ms_between(_start, _end)); \
+        printf("%s:\t%8ld ms\n", name, ms_between(_start, _end)); \
+}

doc/generic_types/evaluation/bench.hpp

-              r2ccb93c
+              r5a48d79
 #include <time.h>
+ #define N 100000000
+static const int N = 100000000;
 long ms_between(clock_t start, clock_t end) {
 …
         code \
         _end = clock(); \
         std::cout << name << ":\t" << std::setw(7) << ms_between(_start, _end) << std::setw(0) << " ms" << std::endl; \
+        std::cout << name << ":\t" << std::setw(8) << ms_between(_start, _end) << std::setw(0) << " ms" << std::endl; \
+}

doc/generic_types/evaluation/cfa-stack.c

-              r2ccb93c
+              r5a48d79
         s->head = n->next;
         T x = n->value;
+        delete(n);
+        ^n{};
+        free(n);
         return x;
+}

doc/generic_types/evaluation/cpp-vstack.cpp

r2ccb93c	r5a48d79
25	25	delete crnt;
26	26	}
	27	head = nullptr;
27	28	}
28	29

doc/generic_types/generic_types.tex

-              r2ccb93c
+              r5a48d79
 (3) \CFA code must be at least as portable as standard C code;
 (4) Extensions introduced by \CFA must be translated in the most efficient way possible.
+These goals ensure existing C code-bases can be converted to \CFA incrementally with minimal effort, and C programmers can productively generate \CFA code without training beyond the features being used. In its current implementation, \CFA is compiled by translating it to the GCC-dialect of C~\citep{GCCExtensions}, allowing it to leverage the portability and code optimizations provided by GCC, meeting goals (1)-(3). Ultimately, a compiler is necessary for advanced features and optimal performance.
+These goals ensure existing C code-bases can be converted to \CFA incrementally with minimal effort, and C programmers can productively generate \CFA code without training beyond the features being used.
+We claim \CC is diverging from C, and hence, incremental additions of language features require significant effort and training, while suffering from historically poor design choices.
+\CFA is currently implemented as a source-to-source translator from \CFA to the GCC-dialect of C~\citep{GCCExtensions}, allowing it to leverage the portability and code optimizations provided by GCC, meeting goals (1)-(3). Ultimately, a compiler is necessary for advanced features and optimal performance.
 This paper identifies shortcomings in existing approaches to generic and variadic data types in C-like languages and presents a design for generic and variadic types avoiding those shortcomings. Specifically, the solution is both reusable and type-checked, as well as conforming to the design goals of \CFA with ergonomic use of existing C abstractions. The new constructs are empirically compared with both standard C and \CC; the results show the new design is comparable in performance.
 …
 The @identity@ function above can be applied to any complete \emph{object type} (or @otype@). The type variable @T@ is transformed into a set of additional implicit parameters encoding sufficient information about @T@ to create and return a variable of that type. The \CFA implementation passes the size and alignment of the type represented by an @otype@ parameter, as well as an assignment operator, constructor, copy constructor and destructor. If this extra information is not needed, \eg for a pointer, the type parameter can be declared as a \emph{data type} (or @dtype@).
 Here, the runtime cost of polymorphism is spread over each polymorphic call, due to passing more arguments to polymorphic functions; preliminary experiments have shown this overhead is similar to \CC virtual function calls. An advantage of this design is that, unlike \CC template functions, \CFA polymorphic functions are compatible with C \emph{separate} compilation, preventing code bloat.
+In \CFA, the polymorphism runtime-cost is spread over each polymorphic call, due to passing more arguments to polymorphic functions; preliminary experiments show this overhead is similar to \CC virtual-function calls. An advantage of this design is that, unlike \CC template-functions, \CFA polymorphic-functions are compatible with C \emph{separate compilation}, preventing compilation and code bloat.
 Since bare polymorphic-types provide only a narrow set of available operations, \CFA provides a \emph{type assertion} mechanism to provide further type information, where type assertions may be variable or function declarations that depend on a polymorphic type-variable. For example, the function @twice@ can be defined using the \CFA syntax for operator overloading:
 …
 int val = twice( twice( 3.7 ) );
 \end{lstlisting}
 which works for any type @T@ with a matching addition operator. The polymorphism is achieved by creating a wrapper function for calling @+@ with @T@ bound to @double@, then passing this function to the first call of @twice@. There is now the option of using the same @twice@ and converting the result to @int@ on assignment, or creating another @twice@ with type parameter @T@ bound to @int@ because \CFA uses the return type~\cite{Ada} in its type analysis. The first approach has a late conversion from @int@ to @double@ on the final assignment, while the second has an eager conversion to @int@. \CFA minimizes the number of conversions and their potential to lose information, so it selects the first approach, which corresponds with C-programmer intuition.
+which works for any type @T@ with a matching addition operator. The polymorphism is achieved by creating a wrapper function for calling @+@ with @T@ bound to @double@, then passing this function to the first call of @twice@. There is now the option of using the same @twice@ and converting the result to @int@ on assignment, or creating another @twice@ with type parameter @T@ bound to @int@ because \CFA uses the return type (as in~\cite{Ada}) in its type analysis. The first approach has a late conversion from @int@ to @double@ on the final assignment, while the second has an eager conversion to @int@. \CFA minimizes the number of conversions and their potential to lose information, so it selects the first approach, which corresponds with C-programmer intuition.
 Crucial to the design of a new programming language are the libraries to access thousands of external software features.
 \CFA inherits a massive compatible library-base, where other programming languages must rewrite or provide fragile inter-language communication with C.
+Like \CC, \CFA inherits a massive compatible library-base, where other programming languages must rewrite or provide fragile inter-language communication with C.
 A simple example is leveraging the existing type-unsafe (@void *@) C @bsearch@ to binary search a sorted floating-point array:
 \begin{lstlisting}
 …
 int posn = bsearch( 5.0, vals, 10 );
 \end{lstlisting}
 The nested routine @comp@ provides the hidden interface from typed \CFA to untyped (@void *@) C, plus the cast of the result.
+The nested routine @comp@ (impossible in \CC as lambdas do not use C calling conventions) provides the hidden interface from typed \CFA to untyped (@void *@) C, plus the cast of the result.
 As well, an alternate kind of return is made available: position versus pointer to found element.
 \CC's type-system cannot disambiguate between the two versions of @bsearch@ because it does not use the return type in overload resolution, nor can \CC separately compile a templated @bsearch@.
 …
 forall( otype T `| summable( T )` ) T sum( T a[$\,$], size_t size ) {  // use trait
         `T` total = { `0` };                                    $\C{// instantiate T from 0 by calling its constructor}$
+        for ( unsigned int i = 0; i < size; i += 1 )
+                total `+=` a[i];                                        $\C{// select appropriate +}$
+        for ( unsigned int i = 0; i < size; i += 1 ) total `+=` a[i]; $\C{// select appropriate +}$
         return total; }
 \end{lstlisting}
+A trait name plays no part in type equivalence; it is solely a macro for a list of assertions.
+Traits may overlap assertions without conflict, and therefore, do not form a hierarchy.
 In fact, the set of operators is incomplete, \eg no assignment, but @otype@ is syntactic sugar for the following implicit trait:

doc/proposals/concurrency/concurrency.tex

-              r2ccb93c
+              r5a48d79
 \subsection{Internal scheduling} \label{insched}
+Monitors also need to schedule waiting threads internally as a mean of synchronization. Internal scheduling is one of the simple examples of such a feature. It allows users to declare condition variables and have threads wait and signaled from them. Here is a simple example of such a technique :
+\begin{lstlisting}
+        mutex struct A {
+                condition e;
+        }
+        void foo(A & mutex a) {
+                //...
+                wait(a.e);
+                //...
+        }
+        void bar(A & mutex a) {
+                signal(a.e);
+        }
+\end{lstlisting}
+Note that in \CFA, \code{condition} have no particular need to be stored inside a monitor, beyond any software engineering reasons. Here routine \code{foo} waits for the \code{signal} from \code{bar} before making further progress, effectively ensuring a basic ordering.
+As for simple mutual exclusion, these semantics must also be extended to include \gls{group-acquire} :
 \begin{center}
 \begin{tabular}{ c @{\hskip 0.65in} c }
+Thread 1 & Thread 2 \\
+\begin{lstlisting}
+void foo(A & mutex a,
+           A & mutex b) {
+        //...
+        wait(a.e);
+        //...
+}
+foo(a, b);
+\end{lstlisting} &\begin{lstlisting}
+void bar(A & mutex a,
+           A & mutex b) {
+        signal(a.e);
+}
+bar(a, b);
+\begin{lstlisting}[language=Pseudo]
+acquire A
+        wait A
+release A
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire A
+        signal A
+release A
 \end{lstlisting}
 \end{tabular}
 \end{center}
+To define the semantics of internal scheduling, it is important to look at nesting and \gls{group-acquire}. Indeed, beyond concerns about lock ordering, without scheduling the two following pseudo codes are mostly equivalent. In fact, if we assume monitors are ordered alphabetically, these two pseudo codes would probably lead to exactly the same implementation :
+\begin{table}[h!]
+\centering
+\begin{tabular}{c c}
+\begin{lstlisting}[language=pseudo]
+monitor A, B, C
+Easy : like uC++
+\begin{center}
+\begin{tabular}{ c @{\hskip 0.65in} c }
+\begin{lstlisting}[language=Pseudo]
 acquire A
+        acquire B & C
+                        //Do stuff
+        release B & C
+        acquire B
+                wait B
+        release B
 release A
+\end{lstlisting} &\begin{lstlisting}[language=pseudo]
+monitor A, B, C
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire A
+        acquire B
+                signal B
+        release B
+release A
+\end{lstlisting}
+\end{tabular}
+\end{center}
+Also easy : like uC++
+\begin{center}
+\begin{tabular}{ c @{\hskip 0.65in} c }
+\begin{lstlisting}[language=Pseudo]
+acquire A & B
+        wait A & B
+release A & B
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire A & B
+        signal A & B
+release A & B
+\end{lstlisting}
+\end{tabular}
+\end{center}
+Simplest extension : can be made like uC++ by tying B to A
+\begin{center}
+\begin{tabular}{ c @{\hskip 0.65in} c }
+\begin{lstlisting}[language=Pseudo]
+acquire A
+        // Code Section 1
+        acquire B
+                // Code Section 2
+                wait A & B
+                // Code Section 3
+        release B
+        // Code Section 4
+release A
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire A
+        // Code Section 5
+        acquire B
+                // Code Section 6
+                signal A & B
+                // Code Section 7
+        release B
+        // Code Section 8
+release A
+\end{lstlisting}
+\end{tabular}
+\end{center}
+Hard extension :
+Incorrect options for the signal :
+\begin{description}
+ \item[-] Release B and baton pass after Code Section 8 : Passing b without having it
+ \item[-] Keep B during Code Section 8 : Can lead to deadlocks since we secretly keep a lock longer than specified by the user
+ \item[-] Instead of release B transfer A and B to waiter then try to reacquire A before running Code Section 8 : This allows barging
+\end{description}
+Since we don't want barging we need to pass A \& B and somehow block and get A back.
+\begin{center}
+\begin{tabular}{ c @{\hskip 0.65in} c }
+\begin{lstlisting}[language=Pseudo]
 acquire A
         acquire B
                 acquire C
+                        //Do stuff
+                release C
+        release B
+release A
+                        wait A & B & C
+: release C
+: release B
+: release A
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire A
+        acquire B
+                acquire C
+                        signal A & B & C
+: release C
+: release B
+: release A
 \end{lstlisting}
 \end{tabular}
+\end{table}
+Once internal scheduling is introduce however, semantics of \gls{group-acquire} become relevant. For example, let us look into the semantics of the following pseudo-code :
+\end{center}
+To prevent barging :
+\begin{description}
+ \item[-] When the signaller hits 4 : pass A, B, C to waiter
+ \item[-] When the waiter hits 2 : pass A, B to signaller
+ \item[-] When the signaller hits 5 : pass A to waiter
+\end{description}
+\begin{center}
+\begin{tabular}{ c @{\hskip 0.65in} c }
 \begin{lstlisting}[language=Pseudo]
-: monitor A, B, C
-: condition c1
-:
-: acquire A
-:              acquire A & B & C
-:                              signal c1
-:              release A & B & C
-: release A
-\end{lstlisting}
-Without \gls{group-acquire} signal simply baton passes the monitor lock on the next release. In the case above, we therefore need to indentify the next release. If line 8 is picked at the release point, then the signal will attempt to pass A \& B \& C, without having ownership of B \& C. Since this violates mutual exclusion, we conclude that line 7 is the only valid location where signalling can occur. The traditionnal meaning of signalling is to transfer ownership of the monitor(s) and immediately schedule the longest waiting task. However, in the discussed case, the signalling thread expects to maintain ownership of monitor A. This can be expressed in two differents ways : 1) the thread transfers ownership of all locks and reacquires A when it gets schedulled again or 2) it transfers ownership of all three monitors and then expects the ownership of A to be transferred back.
-However, the question is does these behavior motivate supporting acquireing non-disjoint set of monitors. Indeed, if the previous example was modified to only acquire B \& C at line 5 (an release the accordingly) then in respects to scheduling, we could add the simplifying constraint that all monitors in a bulk will behave the same way, simplifying the problem back to a single monitor problem which has already been solved. For this constraint to be acceptble however, we need to demonstrate that in does not prevent any meaningful possibilities. And, indeed, we can look at the two previous interpretation of the above pseudo-code and conclude that supporting the acquiring of non-disjoint set of monitors does not add any expressiveness to the language.
-Option 1 reacquires the lock after the signal statement, this can be rewritten as follows without the need for non-disjoint sets :
-\begin{lstlisting}[language=Pseudo]
-monitor A, B, C
-condition c1
-acquire A & B & C
-        signal c1
-release A & B & C
 acquire A
+release A
+\end{lstlisting}
+This pseudo code has almost exaclty the same semantics as the code acquiring intersecting sets of monitors.
+Option 2 uses two-way lock ownership transferring instead of reacquiring monitor A. Two-way monitor ownership transfer is normally done using signalBlock semantics, which immedietely transfers ownership of a monitor before getting the ownership back when the other thread no longer needs the monitor. While the example pseudo-code for Option 2 seems toe transfer ownership of A, B and C and only getting A back, this is not a requirement. Getting back all 3 monitors and releasing B and C differs only in performance. For this reason, the second option could arguably be rewritten as :
+\begin{lstlisting}[language=Pseudo]
+monitor A, B, C
+condition c1
+acquire A
+        acquire B & C
+                signalBlock c1
+        release B & C
+release A
+\end{lstlisting}
+Obviously, the difference between these two snippets of pseudo code is that the first one transfers ownership of A, B and C while the second one only transfers ownership of B and C. However, this limitation can be removed by allowing user to release extra monitors when using internal scheduling, referred to as extended internal scheduling (pattent pending) from this point on. Extended internal scheduling means the two following pseudo-codes are functionnaly equivalent :
+\begin{table}[h!]
+\centering
+\begin{tabular}{c @{\hskip 0.65in} c}
+\begin{lstlisting}[language=pseudo]
+monitor A, B, C
+condition c1
+acquire A
+        acquire B & C
+                signalBlock c1 with A
+        release B & C
+release A
+\end{lstlisting} &\begin{lstlisting}[language=pseudo]
+monitor A, B, C
+condition c1
+acquire A
+        acquire A & B & C
+                signal c1
+        release A & B & C
+release A
+        acquire C
+                acquire B
+                        wait A & B & C
+: release B
+: release C
+: release A
+\end{lstlisting}&\begin{lstlisting}[language=Pseudo]
+acquire B
+        acquire A
+                acquire C
+                        signal A & B & C
+: release C
+: release A
+: release B
 \end{lstlisting}
 \end{tabular}
+\end{table}
+It must be stated that the extended internal scheduling only makes sense when using wait and signalBlock, since they need to prevent barging, which cannot be done in the context of signal since the ownership transfer is strictly one-directionnal.
+One critic that could arise is that extended internal schedulling is not composable since signalBlock must be explicitly aware of which context it is in. However, this argument is not relevant since acquire A, B and C in a context where a subset of them is already acquired cannot be achieved without spurriously releasing some locks or having an oracle aware of all monitors. Therefore, composability of internal scheduling is no more an issue than composability of monitors in general.
+The main benefit of using extended internal scheduling is that it offers the same expressiveness as intersecting monitor set acquiring but greatly simplifies the selection of a leader (or representative) for a group of monitor. Indeed, when using intersecting sets, it is not obvious which set intersects with other sets which means finding a leader representing only the smallest scope is a hard problem. Where as when using disjoint sets, any monitor that would be intersecting must be specified in the extended set, the leader can be chosen as any monitor in the primary set.
+\end{center}
+To prevent barging : When the signaller hits 4 : pass A, B, C to waiter. When the waiter hits 1 it must release B,
+\begin{description}
+ \item[-]
+ \item[-] When the waiter hits 1 : pass A, B to signaller
+ \item[-] When the signaller hits 5 : pass A, B to waiter
+ \item[-] When the waiter hits 2 : pass A to signaller
+\end{description}
+% Monitors also need to schedule waiting threads internally as a mean of synchronization. Internal scheduling is one of the simple examples of such a feature. It allows users to declare condition variables and have threads wait and signaled from them. Here is a simple example of such a technique :
+% \begin{lstlisting}
+%       mutex struct A {
+%               condition e;
+%       }
+%       void foo(A & mutex a) {
+%               //...
+%               wait(a.e);
+%               //...
+%       }
+%       void bar(A & mutex a) {
+%               signal(a.e);
+%       }
+% \end{lstlisting}
+% Note that in \CFA, \code{condition} have no particular need to be stored inside a monitor, beyond any software engineering reasons. Here routine \code{foo} waits for the \code{signal} from \code{bar} before making further progress, effectively ensuring a basic ordering.
+% As for simple mutual exclusion, these semantics must also be extended to include \gls{group-acquire} :
+% \begin{center}
+% \begin{tabular}{ c @{\hskip 0.65in} c }
+% Thread 1 & Thread 2 \\
+% \begin{lstlisting}
+% void foo(A & mutex a,
+%            A & mutex b) {
+%       //...
+%       wait(a.e);
+%       //...
+% }
+% foo(a, b);
+% \end{lstlisting} &\begin{lstlisting}
+% void bar(A & mutex a,
+%            A & mutex b) {
+%       signal(a.e);
+% }
+% bar(a, b);
+% \end{lstlisting}
+% \end{tabular}
+% \end{center}
+% To define the semantics of internal scheduling, it is important to look at nesting and \gls{group-acquire}. Indeed, beyond concerns about lock ordering, without scheduling the two following pseudo codes are mostly equivalent. In fact, if we assume monitors are ordered alphabetically, these two pseudo codes would probably lead to exactly the same implementation :
+% \begin{table}[h!]
+% \centering
+% \begin{tabular}{c c}
+% \begin{lstlisting}[language=pseudo]
+% monitor A, B, C
+% acquire A
+%       acquire B & C
+%                       //Do stuff
+%       release B & C
+% release A
+% \end{lstlisting} &\begin{lstlisting}[language=pseudo]
+% monitor A, B, C
+% acquire A
+%       acquire B
+%               acquire C
+%                       //Do stuff
+%               release C
+%       release B
+% release A
+% \end{lstlisting}
+% \end{tabular}
+% \end{table}
+% Once internal scheduling is introduce however, semantics of \gls{group-acquire} become relevant. For example, let us look into the semantics of the following pseudo-code :
+% \begin{lstlisting}[language=Pseudo]
+% 1: monitor A, B, C
+% 2: condition c1
+% 3:
+% 4: acquire A
+% 5:            acquire A & B & C
+% 6:                            signal c1
+% 7:            release A & B & C
+% 8: release A
+% \end{lstlisting}
+% Without \gls{group-acquire} signal simply baton passes the monitor lock on the next release. In the case above, we therefore need to indentify the next release. If line 8 is picked at the release point, then the signal will attempt to pass A \& B \& C, without having ownership of B \& C. Since this violates mutual exclusion, we conclude that line 7 is the only valid location where signalling can occur. The traditionnal meaning of signalling is to transfer ownership of the monitor(s) and immediately schedule the longest waiting task. However, in the discussed case, the signalling thread expects to maintain ownership of monitor A. This can be expressed in two differents ways : 1) the thread transfers ownership of all locks and reacquires A when it gets schedulled again or 2) it transfers ownership of all three monitors and then expects the ownership of A to be transferred back.
+% However, the question is does these behavior motivate supporting acquireing non-disjoint set of monitors. Indeed, if the previous example was modified to only acquire B \& C at line 5 (an release the accordingly) then in respects to scheduling, we could add the simplifying constraint that all monitors in a bulk will behave the same way, simplifying the problem back to a single monitor problem which has already been solved. For this constraint to be acceptble however, we need to demonstrate that in does not prevent any meaningful possibilities. And, indeed, we can look at the two previous interpretation of the above pseudo-code and conclude that supporting the acquiring of non-disjoint set of monitors does not add any expressiveness to the language.
+% Option 1 reacquires the lock after the signal statement, this can be rewritten as follows without the need for non-disjoint sets :
+% \begin{lstlisting}[language=Pseudo]
+% monitor A, B, C
+% condition c1
+% acquire A & B & C
+%       signal c1
+% release A & B & C
+% acquire A
+% release A
+% \end{lstlisting}
+% This pseudo code has almost exaclty the same semantics as the code acquiring intersecting sets of monitors.
+% Option 2 uses two-way lock ownership transferring instead of reacquiring monitor A. Two-way monitor ownership transfer is normally done using signalBlock semantics, which immedietely transfers ownership of a monitor before getting the ownership back when the other thread no longer needs the monitor. While the example pseudo-code for Option 2 seems toe transfer ownership of A, B and C and only getting A back, this is not a requirement. Getting back all 3 monitors and releasing B and C differs only in performance. For this reason, the second option could arguably be rewritten as :
+% \begin{lstlisting}[language=Pseudo]
+% monitor A, B, C
+% condition c1
+% acquire A
+%       acquire B & C
+%               signalBlock c1
+%       release B & C
+% release A
+% \end{lstlisting}
+% Obviously, the difference between these two snippets of pseudo code is that the first one transfers ownership of A, B and C while the second one only transfers ownership of B and C. However, this limitation can be removed by allowing user to release extra monitors when using internal scheduling, referred to as extended internal scheduling (pattent pending) from this point on. Extended internal scheduling means the two following pseudo-codes are functionnaly equivalent :
+% \begin{table}[h!]
+% \centering
+% \begin{tabular}{c @{\hskip 0.65in} c}
+% \begin{lstlisting}[language=pseudo]
+% monitor A, B, C
+% condition c1
+% acquire A
+%       acquire B & C
+%               signalBlock c1 with A
+%       release B & C
+% release A
+% \end{lstlisting} &\begin{lstlisting}[language=pseudo]
+% monitor A, B, C
+% condition c1
+% acquire A
+%       acquire A & B & C
+%               signal c1
+%       release A & B & C
+% release A
+% \end{lstlisting}
+% \end{tabular}
+% \end{table}
+% It must be stated that the extended internal scheduling only makes sense when using wait and signalBlock, since they need to prevent barging, which cannot be done in the context of signal since the ownership transfer is strictly one-directionnal.
+% One critic that could arise is that extended internal schedulling is not composable since signalBlock must be explicitly aware of which context it is in. However, this argument is not relevant since acquire A, B and C in a context where a subset of them is already acquired cannot be achieved without spurriously releasing some locks or having an oracle aware of all monitors. Therefore, composability of internal scheduling is no more an issue than composability of monitors in general.
+% The main benefit of using extended internal scheduling is that it offers the same expressiveness as intersecting monitor set acquiring but greatly simplifies the selection of a leader (or representative) for a group of monitor. Indeed, when using intersecting sets, it is not obvious which set intersects with other sets which means finding a leader representing only the smallest scope is a hard problem. Where as when using disjoint sets, any monitor that would be intersecting must be specified in the extended set, the leader can be chosen as any monitor in the primary set.
 % We need to make sure the semantics for internally scheduling N monitors are a natural extension of the single monitor semantics. For this reason, we introduce the concept of \gls{mon-ctx}. In terms of context internal scheduling means "releasing a \gls{mon-ctx} and waiting for an other thread to acquire the same \gls{mon-ctx} and baton-pass it back to the initial thread". This definitions requires looking into what a \gls{mon-ctx} is and what the semantics of waiting and baton-passing are.

doc/proposals/concurrency/style.tex

r2ccb93c	r5a48d79
1	1	\input{common} % bespoke macros used in the document
2	2
3		\CFADefaultStyle
	3	% \CFADefaultStyle
4	4
5	5	\lstset{

doc/proposals/concurrency/version

r2ccb93c	r5a48d79
1		0.7.134
	1	0.7.141

Note: See TracChangeset for help on using the changeset viewer.

Download in other formats: