Skip to content

Commit

Permalink
update schedulue
Browse files Browse the repository at this point in the history
  • Loading branch information
neilernst committed Nov 2, 2021
1 parent 3b96225 commit 26aedbf
Show file tree
Hide file tree
Showing 2 changed files with 56 additions and 22 deletions.
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,10 +16,11 @@ See the schedule on Brightspace for quizzes and project milestones. All submissi
| 5 | [Architecture Views & Styles](https://github.com/SENG350/course/blob/master/lectures/5-modules.md)[Views on Architecture - C&C](https://github.com/SENG350/course/blob/master/lectures/6-cc.md) |[SEI view example](https://wiki.sei.cmu.edu/sad/index.php/OPC_Module_Uses_View) | text chapter 22.1-4 |
| 6 | [Architecture and Design](https://github.com/SENG350/course/blob/master/lectures/7-design.md)[OO Principles](lectures/ooprinciples.md) | [What is Architectural Design](http://www.informit.com/articles/article.aspx?p=2738304&seqNum=2) | [HomeAssistant sections 0-5](home_assistant_arch.pdf) • text chapter 20 |
| 7 | [Design Patterns](lectures/patterns.md) | [Head-first design patterns](https://www-oreilly-com.ezproxy.library.uvic.ca/library/view/head-first-design/9781492077992/) | [DesignPatterns paper](http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.44.9717) • source code |
| 7 | [Programming Styles]() | | Lopes ["Exercises in Programming Style"](https://learning-oreilly-com.ezproxy.library.uvic.ca/library/view/exercises-in-programming/9781482227376/cover.xhtml) (Uvic access required)|
| 8 | [Architecture analysis](https://github.com/SENG350/course/blob/master/lectures/12-analysis.md) | | text chapter 21 |
| A4 | [Architecture Tactics - Performance](lectures/performance.md) | | Text chapter 9 |
| 8 | [Programming Styles](lectures/prog-styles.md) | | Lopes ["Exercises in Programming Style"](https://learning-oreilly-com.ezproxy.library.uvic.ca/library/view/exercises-in-programming/9781482227376/cover.xhtml) (Uvic access required)|
| A5 | [Architecture Tactics - Safety](lectures/safety.md) || Text ch. 10 |

For module 7, readings are
For the Programming Styles lectures, readings are

* Prologue
* Chapter 3, Monolithic
Expand All @@ -39,8 +40,7 @@ We will also cover these specific architecture tactics if time permits.
| A1 | [Architecture Tactics-Availability](lectures/avail.md) | | text chapter 4, quality attribute chapters 5 |
| A2 | [Architecture Tactics - Modifiability](lectures/modifiability.md) | [Ambler CRC](http://www.agilemodeling.com/artifacts/crcModel.htm) | text chapter 8 |
| A3 | [Architecture Tactics - Privacy](lectures/privacy.md) | [Privacy Patterns](https://privacypatterns.org/) | none |
| A4 | [Architecture Tactics - Performance](lectures/performance.md) | | Text chapter 9 |
| A5 | [Architecture Tactics - Safety](lectures/safety.md) || Text ch. 10 |
| 8 | [Architecture analysis](https://github.com/SENG350/course/blob/master/lectures/12-analysis.md) | | text chapter 21 |


### Lab Schedule
Expand Down
68 changes: 51 additions & 17 deletions lectures/performance.md
Original file line number Diff line number Diff line change
@@ -1,73 +1,103 @@
----
Title: Performance Tactics
Author: Neil Ernst
---
title: Performance Tactics
author: Neil Ernst
marp: true
---

# SKA Telescope Performance

Scenario: When a schedule block is being processed, the Central Signal Processor will send the raw visibilities to SDP to process into observations. SDP shoul d be able to handle 0.4 Tb/s ingest without problem.
Scenario: When a schedule block is being processed, the Central Signal Processor will send the raw visibilities to SDP to process into observations. SDP should be able to handle 0.4 Tb/s ingest without problem.

We walked through the SKA SDP architecture.
- The SKA SDP architecture.

## Performance Scenarios
----
# Performance Scenarios

In HomeAssistant.

In your applications.

### Performance Measures
## Performance Measures
* time taken to do X
* amount of resource consumed (CPU/bandwidth/data...)


----
# Tactics - Summary
* consider in the context of QAS

----
# Performance Tactics

Events Arrive → (tactics) → response generated within time constraints

(Latency): the time taken to generate the response.

After an event arrives, either the system is processing on that event or the processing is blocked for some reason. This leads to the two basic contributors to the response time: **resource consumption** and **blocked time.**
After an event arrives, either the system is processing on that event or the processing is blocked for some reason.

This leads to the two basic contributors to the response time: **resource consumption** and **blocked time.**

1. **Resource consumption.** Resources include CPU, data stores, network communication bandwidth, and memory, but it can also include entities defined by the particular system under design. For example, buffers must be managed and access to critical sections must be made sequential. Events can be of varying types (as just enumerated), and each type goes through a processing sequence. For example, a message is generated by one component, is placed on the network, and arrives at another component. It is then placed in a buffer; transformed in some fashion, and processed according to some algorithm; transformed for output; placed in an output buffer; and sent onward to another component, another system, or the user. Each of these phases contributes to the overall latency of the processing of that event.
2. **Blocked time.** A computation can be blocked from using a resource because of contention for it, because the resource is unavailable, or because the computation depends on the result of other computations that are not yet available.
- \*Contention for resources.\* Events may be in a single stream or in multiple streams. Multiple streams vying for the same resource or different events in the same stream vying for the same resource contribute to latency. In general, the more contention for a resource, the more likelihood of latency being introduced. However, this depends on how the contention is arbitrated and how individual requests are treated by the arbitration mechanism.
- *Availability of resources.\* Even in the absence of contention, computation cannot proceed if a resource is unavailable. Unavailability may be caused by the resource being offline or by failure of the component or for some other reason. In any case, the architect must identify places where resource unavailability might cause a significant contribution to overall latency.
- *Dependency on other computation.\* A computation may have to wait because it must synchronize with the results of another computation or because it is waiting for the results of a computation that it initiated. For example, it may be reading information from two different sources, if these two sources are read sequentially, the latency will be higher than if they are read in parallel.
----

![](img/05fig07.gif)
## Resource consumption
Resources include CPU, data stores, network communication bandwidth, and memory, but it can also include entities defined by the particular system under design.

### RESOURCE DEMAND
For example, a message is generated by one component, is placed on the network, and arrives at another component. It is then placed in a buffer; transformed in some fashion, and processed according to some algorithm; transformed for output; placed in an output buffer; and sent onward to another component, another system, or the user. Each of these phases contributes to the overall latency of the processing of that event.

----
## Blocked time
A computation can be **blocked** from using a resource because of contention for it, because the resource is unavailable, or because the computation depends on the result of other computations that are not yet available.
- **Contention for resources.** Events may be in a single stream or in multiple streams. Multiple streams vying for the same resource or different events in the same stream vying for the same resource contribute to latency. In general, the more contention for a resource, the more likelihood of latency being introduced.
- **Availability of resources** Even in the absence of contention, computation cannot proceed if a resource is unavailable. Unavailability may be caused by the resource being offline or by failure of the component or for some other reason.
- **Dependency on other computation.** A computation may have to wait because it must synchronize with the results of another computation or because it is waiting for the results of a computation that it initiated.

----
![width:900px](img/05fig07.jpg.gif)

----
# RESOURCE DEMAND

Event streams are the source of resource demand. Two characteristics of demand are the time between events in a resource stream (how **often** a request is made in a stream) and how **much** of a resource is consumed by each request.

One tactic for reducing latency is to reduce the resources required for processing an event stream. Ways to do this include the following.
One tactic for reducing latency is to reduce the resources required for processing an event stream.

----
Ways to do this include the following.

- *Increase computational efficiency.* One step in the processing of an event or a message is applying some algorithm. Improving the algorithms used in critical areas will decrease latency. Sometimes one resource can be traded for another. For example, intermediate data may be kept in a repository or it may be regenerated depending on time and space resource availability. This tactic is usually applied to the processor but is also effective when applied to other resources such as a disk.
- *Reduce computational overhead.* If there is no request for a resource, processing needs are reduced. The use of intermediaries (so important for modifiability) increases the resources consumed in processing an event stream, and so removing them improves latency. This is a classic modifiability/performance tradeoff.

----
Another tactic for reducing latency is to reduce the number of events processed. This can be done in one of two fashions.

- *Manage event rate.* If it is possible to reduce the sampling frequency at which environmental variables are monitored, demand can be reduced. Sometimes this is possible if the system was overengineered. Other times an unnecessarily high sampling rate is used to establish harmonic periods between multiple streams. That is, some stream or streams of events are oversampled so that they can be synchronized.
- *Control frequency of sampling.* If there is no control over the arrival of externally generated events, queued requests can be sampled at a lower frequency, possibly resulting in the loss of requests.

----
Other tactics for reducing or managing demand involve controlling the use of resources.

- *Bound execution times.* Place a limit on how much execution time is used to respond to an event. Sometimes this makes sense and sometimes it does not. For iterative, data-dependent algorithms, limiting the number of iterations is a method for bounding execution times.
- *Bound queue sizes.* This controls the maximum number of queued arrivals and consequently the resources used to process the arrivals.

----
### RESOURCE MANAGEMENT

Even though the demand for resources may not be controllable, the management of these resources affects response times. Some resource management tactics are:

- *Introduce concurrency.* If requests can be processed in parallel, the blocked time can be reduced. Concurrency can be introduced by processing different streams of events on different threads or by creating additional threads to process different sets of activities. Once concurrency has been introduced, appropriately allocating the threads to resources (load balancing) is important in order to maximally exploit the concurrency.

----
- *Maintain multiple copies of either data or computations.* Clients in a client-server pattern are replicas of the computation. The purpose of replicas is to reduce the contention that would occur if all computations took place on a central server. Caching is a tactic in which data is replicated, either on different speed repositories or on separate repositories, to reduce contention. Since the data being cached is usually a copy of existing data, keeping the copies consistent and synchronized becomes a responsibility that the system must assume.
- *Increase available resources.* Faster processors, additional processors, additional memory, and faster networks all have the potential for reducing latency. Cost is usually a consideration in the choice of resources, but increasing the resources is definitely a tactic to reduce latency.

----
### RESOURCE ARBITRATION

Whenever there is contention for a resource, the resource must be scheduled. Processors are scheduled, buffers are scheduled, and networks are scheduled. The architect's goal is to **understand the characteristics of each resource's use and choose the scheduling strategy** that is compatible with it.

A scheduling policy conceptually has two parts: a **priority assignment** and **dispatching**. All scheduling policies assign priorities. In some cases the assignment is as simple as first-in/first-out. In other cases, it can be tied to the deadline of the request or its semantic importance. Competing criteria for scheduling include optimal resource usage, request importance, minimizing the number of resources used, minimizing latency, maximizing throughput, preventing starvation to ensure fairness, and so forth. The architect needs to be aware of these possibly conflicting criteria and the effect that the chosen tactic has on meeting them.

----
A high-priority event stream can be dispatched only if the resource to which it is being assigned is available. Sometimes this depends on pre-empting the current user of the resource.

Possible preemption options are as follows:
Expand All @@ -76,27 +106,31 @@ Possible preemption options are as follows:
- can occur only at specific pre-emption points; and
- executing processes cannot be pre-empted.

----
Some common scheduling policies are:

- *First-in/First-out*. FIFO queues treat all requests for resources as equals and satisfy them in turn. One possibility with a FIFO queue is that one request will be stuck behind another one that takes a long time to generate a response. As long as all of the requests are truly equal, this is not a problem, but if some requests are of higher priority than others, it is problematic.
- *Fixed-priority scheduling.* Fixed-priority scheduling assigns each source of resource requests a particular priority and assigns the resources in that priority order. This strategy insures better service for higher-priority requests but admits the possibility of a low-priority, but important, request taking an arbitrarily long time to be serviced because it is stuck behind a series of higher-priority requests.

----
Three common prioritization strategies are
- *semantic importance.* Each stream is assigned a priority statically according to some domain characteristic of the task that generates it. This type of scheduling is used in mainframe systems where the domain characteristic is the time of task initiation.

- *deadline monotonic.* Deadline monotonic is a static priority assignment that assigns higher priority to streams with shorter deadlines. This scheduling policy is used when streams of different priorities with real-time deadlines are to be scheduled.

- *rate monotonic.* Rate monotonic is a static priority assignment for periodic streams that assigns higher priority to streams with shorter periods. This scheduling policy is a special case of deadline monotonic but is better known and more likely to be supported by the operating system.

----
*Dynamic priority scheduling*
- *round robin.* Round robin is a scheduling strategy that orders the requests and then, at every assignment possibility, assigns the resource to the next request in that order. A special form of round robin is a cyclic executive where assignment possibilities are at fixed time intervals.
- *earliest deadline first.* Earliest deadline first assigns priorities based on the pending requests with the earliest deadline.
- *Static scheduling.* A cyclic executive schedule is a scheduling strategy where the pre-emption points and the sequence of assignment to the resource are determined offline.

----
# Further Reading

These notes are drawn from Chapter 8 in the book. More on rate-monotonic and other scheduling approaches at https://resources.sei.cmu.edu/library/asset-view.cfm?assetid=11337

The Apollo priority scheduler is described in detail at http://klabs.org/history/apollo_11_alarms/eyles_2004/eyles_2004.htm

Daliuge is at https://daliuge.readthedocs.io/en/latest/dataflow.html
Daliuge is at https://daliuge.readthedocs.io/en/latest/dataflow.html and https://www.sciencedirect.com/science/article/abs/pii/S2213133716301214?via%3Dihub

0 comments on commit 26aedbf

Please sign in to comment.