We give an overview of the clock-based approach we use to model a distributed system.

Environmental Assumptions

We plan to start from simple controllers and environmental specifications.

We currently model asynchrony in the environment.

We do not model:

crashes
network message loss
network partition
staleness on the Kubernetes side (this will be implemented by adding API server hosts into distributed system part)

Abstraction

The abstract state includes:

a map where the key is the object key (i.e., combination of resource type, namespace, name) and the value is the corresponding kubernetes object
a clock for each controller

We have predicates on the abstract state that should always be true:

if the clock is C, cm1 and cm2 both exist in the map
if cm2 exists in the map, cm1 exists in the map

Distributed System

The distributed system comprises the following:

controller host
database host
network

We start from having just one controller host. We also need a few client events to trigger the distributed system, like sending a message to the database host to create the custom resource.

Controller Host

The controller host can take the following actions:

receive a state update message from the database host and push it to a queue
pop out a message from the queue and start a reconcile
run one step of the reconcile logic that ends with one state object update or query -- this is the controller logic function
receive an OK and the object from the database and unblock the next step
end a reconcile (this only happens when all the reconcile logic steps are done)

We start from assuming controller has no local indexer for caching states across reconciles. We do not allow concurrent reconciles. That is, once a reconcile starts, the controller host has to finish all the steps and ends the reconcile before starting another one. After each step, the controller has to wait for the reply from the database to continue to next step.

Database Host

We start from assuming there is only etcd hosting the entire cluster state and ignore the API servers for now. The database host can take the following actions:

receive a state query message and send an OK and the object back
receive a state update message from the controller, update the object state, send the object back with an OK and mark the object as fresh
send a message for one object marked as fresh to the controller and mark it as unfresh

Note that we are assuming (1) only the single controller cares about the object and (2) the single controller cares about every object. These two might not be true but we can stay with them for now.

Network

The network can take the following actions:

deliver a message to a destination if the message has been sent to the network

Note that this implicitly means we are considering network message duplication and reordering.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

framework_design.md

framework_design.md

Environmental Assumptions

Abstraction

Distributed System

Controller Host

Database Host

Network

Files

framework_design.md

Latest commit

History

framework_design.md

File metadata and controls

Environmental Assumptions

Abstraction

Distributed System

Controller Host

Database Host

Network