The goal is to discover the governing equations of nonlinear dynamical systems from (possibly noisy) measurement data. Mathematically speaking, we consider systems that evolve as
The vector
A nonlinear system is one in which a change in the output is not proportional to the change in the input. The relationship between input and output may therefore be an unknown function involving, for example, polynomial, sinusoidal, or logarithmic terms. When a system is described by multiple variables, it is governed by a set of such nonlinear equations.
Most real-world systems are described by nonlinear equations; examples include financial markets, the spread of disease, and climate patterns. Understanding how these systems evolve matters because it tells us what to expect and helps us make decisions that better prepare us for the predicted outcomes.
The paper introduces a method known as Sparse Identification of Nonlinear Dynamics (SINDy). The methodology leverages the fact that most physical systems are governed by only a few relevant terms, and it tries to find these terms using sparsity-promoting regression.
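The idea of picking a few active terms out of a larger set of candidate functions can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the helper names `build_library` and `thresholded_least_squares`, the degree-2 polynomial candidates, the threshold of 0.05, and the damped linear oscillator used as test data are all assumptions made for the example. Thresholded least squares is used here as one simple sparse regression choice.

```python
import numpy as np

def build_library(X):
    """Candidate terms for a two-state system: 1, x, y, x^2, x*y, y^2."""
    x, y = X[:, 0], X[:, 1]
    return np.column_stack([np.ones_like(x), x, y, x**2, x * y, y**2])

def thresholded_least_squares(library, dXdt, threshold=0.05, n_iter=10):
    """Alternate between zeroing small coefficients and refitting the rest."""
    coeffs = np.linalg.lstsq(library, dXdt, rcond=None)[0]
    for _ in range(n_iter):
        small = np.abs(coeffs) < threshold
        coeffs[small] = 0.0
        for k in range(dXdt.shape[1]):  # one regression per state variable
            active = ~small[:, k]
            if active.any():
                coeffs[active, k] = np.linalg.lstsq(
                    library[:, active], dXdt[:, k], rcond=None)[0]
    return coeffs

# Illustrative data (assumed values): damped linear oscillator dx/dt = A x,
# sampled from its closed-form solution with initial state (2, 0).
A = np.array([[-0.1, 2.0], [-2.0, -0.1]])
t = np.arange(0.0, 25.0, 0.01)
decay = 2.0 * np.exp(-0.1 * t)
X = np.column_stack([decay * np.cos(2.0 * t), -decay * np.sin(2.0 * t)])

rng = np.random.default_rng(0)
dXdt = X @ A.T + 1e-3 * rng.standard_normal(X.shape)  # slightly noisy derivatives

coeffs = thresholded_least_squares(build_library(X), dXdt)
print(np.round(coeffs, 3))  # rows: candidate terms, columns: state equations
```

Each column of `coeffs` describes one state equation, and only the rows for the linear terms should remain nonzero. The threshold has to sit below the smallest true coefficient (0.1 here) and above the noise-induced ones, which is a tuning choice in practice.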
The matrices
Next, they take arbitrary sets of nonlinear functions and construct
Here, the columns of $\Sigma = \begin{bmatrix} \zeta_1 & \zeta_2 & \dots & \zeta_n\end{bmatrix}$ are the sparse vectors of coefficients that activate nonlinear terms in
For each column of
Often, only
Generally, physical systems are represented by partial differential equations (PDEs). If data is collected from numerical discretization or from experimental measurements, the dimension
The authors demonstrated the algorithm on canonical systems, ranging from simple linear and nonlinear oscillators to noisy measurements of the chaotic Lorenz system, and also extended the method to nonlinear partial differential equations and high-dimensional data.
In each example, this paper explores the ability to identify the dynamics from state measurements alone, without access to derivatives.
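When only state measurements are available, the derivatives have to be estimated numerically before any regression can be run. A minimal sketch, assuming uniformly sampled data and using second-order finite differences via `np.gradient`, is shown below. The helper name `estimate_derivatives` and the sine/cosine test signal are hypothetical choices for illustration; the summary above does not say which differentiation scheme the authors used.

```python
import numpy as np

def estimate_derivatives(X, dt):
    """Second-order finite differences along the time axis (rows are samples)."""
    return np.gradient(X, dt, axis=0, edge_order=2)

# Illustrative signal with a known derivative, sampled every dt time units.
dt = 0.01
t = np.arange(0.0, 10.0, dt)
X = np.column_stack([np.sin(t), np.cos(t)])
dXdt_true = np.column_stack([np.cos(t), -np.sin(t)])

dXdt_est = estimate_derivatives(X, dt)
max_error = np.max(np.abs(dXdt_est - dXdt_true))
print(f"largest derivative error: {max_error:.2e}")
```

Finite differences amplify measurement noise, so with noisy data some smoothing or a regularized derivative estimate is typically needed before the result is usable.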
The images below show the model prediction for simple dynamic systems
In the Lorenz example, the ability to capture dynamics on the attractor is more important than the ability to predict an individual trajectory, because chaos quickly causes trajectories with small differences in initial conditions or model coefficients to diverge exponentially. The algorithm not only identifies the correct terms in the dynamics but also determines the coefficients to within 0.03% of the true values. The correct dynamics are identified, and the attractor is preserved for surprisingly large noise values.
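The sensitivity to initial conditions mentioned above can be seen in a small experiment: integrate the Lorenz system twice from starting points that differ by 1e-8 and track the distance between the two trajectories. This is a sketch under stated assumptions, namely the standard parameter values (sigma = 10, rho = 28, beta = 8/3), a hand-written fixed-step RK4 integrator, and an arbitrary initial state; none of these are specified in the summary.

```python
import numpy as np

def lorenz(state, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Right-hand side of the Lorenz system (standard parameters assumed)."""
    x, y, z = state
    return np.array([sigma * (y - x), x * (rho - z) - y, x * y - beta * z])

def rk4_trajectory(f, x0, dt, n_steps):
    """Fixed-step fourth-order Runge-Kutta integration."""
    traj = np.empty((n_steps + 1, len(x0)))
    traj[0] = x0
    for i in range(n_steps):
        x = traj[i]
        k1 = f(x)
        k2 = f(x + 0.5 * dt * k1)
        k3 = f(x + 0.5 * dt * k2)
        k4 = f(x + dt * k3)
        traj[i + 1] = x + dt / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return traj

dt, n_steps = 0.01, 4000                      # 40 time units
x0 = np.array([-8.0, 8.0, 27.0])              # illustrative initial state
perturbed = x0 + np.array([1e-8, 0.0, 0.0])   # tiny change in the first coordinate

a = rk4_trajectory(lorenz, x0, dt, n_steps)
b = rk4_trajectory(lorenz, perturbed, dt, n_steps)
separation = np.linalg.norm(a - b, axis=1)

print(f"initial separation: {separation[0]:.1e}")
print(f"largest separation: {separation.max():.1e}")
```

The two runs stay close at first and then drift apart until the gap is comparable to the size of the attractor. This is why a model is better judged on whether it reproduces the attractor than on pointwise trajectory error.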
Limitations:
- The sparse identification method relies on a fortunate choice of coordinates and function basis that facilitate sparse representation of the dynamics.
- The algorithm fails to identify the dynamics of the Lorenz system in nonlinear coordinates.
- The algorithm identified the dynamics of only some of the variables of the glycolytic oscillator model.
SINDy is a powerful technique for identifying nonlinear dynamical systems from data without assumptions on the form of the governing equations. It builds on prior work in symbolic regression, and its innovations in sparse regression allow the algorithms to scale to high-dimensional systems. The method may be applied in the many fields where data is ample but governing equations are absent, including neuroscience, climate science, epidemiology, and financial markets. Further, SINDy algorithms can be applied to chemical kinetic models in the human body, plasma dynamics, nonlinear optical systems, and complex mechanical systems.
A few challenges to the successful implementation of the SINDy algorithm are listed below:
- It relies on having enough clean, high-quality data that is well sampled in time.
- Correct choice of measurement coordinates; how do we know that the variables we measure admit a sparse dynamical system?
- Choice of sparsifying function basis; given the proper collection of data and the selection of the right coordinates, how do we define a library of functions that provides a system that is sparse and accurate?
- Of the available sparse optimization algorithms, which one do we use to get the sparsest possible model?
The algorithm fails to identify an accurate sparse model when the measurement coordinates and function basis are not amenable to a sparse representation. Even so, there is some hope of finding a suitable coordinate system and function basis without knowing the solution ahead of time, because we often know something about the physics that can guide the choice of function space. The successes discussed in the paper are mainly in systems whose physics was known beforehand. Regardless, the method has considerable scope given its robustness, and if its challenges can be overcome, it offers a key to decoding systems where data is abundant but physical laws remain elusive.