Analysis of Concurrent and Distributed Programs
CS 4405, Spring 2022
Course description:
Software systems are becoming highly concurrent and distributed to utilize modern multicore architectures and increasing speed and bandwidth in networks. Shared-memory concurrency in multicore programs and message-passing concurrency in distributed programs share many common abstractions and problems.
In the multicore era, all performance-critical software employs some form of concurrent programming; typically shared memory concurrency. In this setting, programmers use a number of primitives to develop efficient and correct concurrent programs. To do so the programmers have to understand the behaviors of the primitives and reason about them. It is also important to match the programming paradigms and underlying architectures. For instance, traditionally programmers have assumed that a multithreaded program executed simply by interleaving the executions of its threads—a model known as sequential consistency (SC). This assumption is, however, invalidated both by mainstream multicore architectures, which often execute instructions out of order, and by compilers, whose optimizations affect the outcomes of concurrent programs. As a result, concurrent programs have more outcomes than SC allows.
In the distributed setting, the units of concurrency are independent processes that do not share memory but communicate by exchanging asynchronous messages. The execution of such a system involves two main sources of nondeterminism: concurrency and partial failures. As the processes run concurrently, the exchanged messages can be delivered and processed in many different orderings. The distributed set of processes is also prone to network of process failures. The trade-off between the system’s availability in the existence of failures and the consistency between the processes gives rise to a spectrum of weak consistency notions. It is important to reason about concurrency, possible failures, and consistency guarantees to implement distributed programs correctly and understand their behavior.
This course aims to explore analysis techniques for concurrent and distributed programs.
Course organization
- 5 ECTS: You need to devote at least 140 hours of study for this course.
- Online meetings: The course consists of 14 2-hour meetings. You are not required, but you are strongly encouraged, to attend.
- Assignments: Two homework assignments ((2 x 10%) of the course grade)
- Project: Project implementation, report, and a presentation (80% of the course grade)
- Teams: The students are responsible to form teams and communicate them to the course TAs.
Study goals:
By the end of this course, you should be able to:
- Describe the fundamental concurrency models in multicore and distributed systems
- Explain the concurrency nondeterminism in the executions of concurrent and distributed programs
- Discover concurrency bugs in multicore and distributed programs by program analysis and testing techniques
- Evaluate the pros and cons of program analysis and testing techniques for specific multicore and distributed programs
Teachers:
Teaching assistants
Dennis Sprokholt
Course schedule:
- Feb 09
- Introduction to concurrency
- SC
- Feb 11
- Primitives for concurrent programs
- SC
- Feb 16
- Symptoms of concurrency bugs
- SC
- Feb 18
- Concurrency analysis for multithreaded programs
- SC
- Feb 23
- Lock-free synchronization
- SC
- Feb 25
- Weak-memory consistency – I
- SC
- Mar 02
- Weak-memory consistency – II
- SC
- Mar 04
- Many flavors of concurrency
- BKO
- Mar 09
- Distributed concurrency
- BKO
- Mar 11
- More on distributed concurrency
- BKO
- Mar 16
- Concurrency analysis for distributed systems
- BKO
- Mar 18
- Strong consistency: SC and Linearizability
- BKO
- Mar 23-25
- In-progress project evaluation
- Mar 30
- CAP theorem & weak consistency
- BKO
- Apr 01
- Active research directions
- SC, BKO
- Apr 06-08
- No lectures
- Apr 13-15
- Final project presentations
The schedule is subject to small changes during the term.
Course projects:
Data Race Detection in C/C++ Concurrent Programs: In this project, you will detect data races in the executions of C11 concurrent programs. More information
Checking linearizability or sequential consistency of a distributed database system: In this project, you will test a distributed database of your choice and check whether its executions satisfy linearizability or sequential consistency properties. More information
Alternatively, you can suggest your own project that involves replicating an existing work or proposing a new approach to an existing problem. If you have exciting ideas for the course project, contact the TA’s and the teachers to determine the context of your project.