---
title: "How to Estimate Floating Point Errors Using Automatic Differentiation"
layout: post
excerpt: "This tutorial demonstrates how automatic differentiation can be used
 to estimate floating point errors of algorithms."
sitemap: false
permalink: /tutorials/fp_error_estimation_clad_tutorial/
date: 2021-08-24
author: Garima Singh
---

*Tutorial level: Intro*

For the purpose of this tutorial, we will look at a very simple estimation
model that is a slight modification of clad’s in-built Taylor approximation
model. The first step in translating your model to code is to understand how
to represent it mathematically as a formula. For example, the mathematical
formula behind the in-built Taylor approximation model is:

$$
\begin{align*}
A_f = \sum\limits_{i=1}^n \frac{\partial f}{\partial x_i} \ast |x_i| \ast |\varepsilon_M|
\end{align*}
$$

Where,
- **A<sub>f</sub>** is the absolute error in the function that you want to analyse.
  We will refer to this as the “target function”.
- **∂f/∂x<sub>i</sub>** is the partial derivative of the target function
  with respect to the i<sup>th</sup> variable. This is where clad comes in:
  it provides us with the derivatives we need in our models.
- **x<sub>i</sub>** is the i<sup>th</sup> variable of interest; here, all
  floating point variables in the function are our variables of interest.
  All values, including temporary ones with limited scope, have to be
  analysed to get the most accurate estimate.
- **ε<sub>M</sub>** is the machine epsilon, a system-dependent value giving
  the maximum relative error in the representation of a floating point number.
  Because it is a relative error, it has to be scaled by the value it applies
  to before it means anything, hence the multiplication by |x<sub>i</sub>|.

Each term in the above summation is the error contributed to the total error
by the i<sup>th</sup> assignment. Summing all of those contributions gives us
the final absolute floating point error in the target function.
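
As a quick worked example (the formula applied by hand, nothing clad
generates), take a target function f(x, y) = x * y. Its partial derivatives
are y and x, so the model predicts:

$$
\begin{align*}
A_f = |y| \ast |x| \ast |\varepsilon_M| + |x| \ast |y| \ast |\varepsilon_M| = 2\,|xy|\,|\varepsilon_M|
\end{align*}
$$

For IEEE double precision, the machine epsilon is about 2.22e-16, so with
x = 3 and y = 4 the bound comes out to roughly 5.3e-15; tiny here, but it
grows with both the magnitude of the values and the number of operations.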

That is what clad implements by default, and, as you might be able to tell,
because it uses the machine epsilon, the estimates we get from the default
model are strict upper bounds on the error in the function. While this means
we can say that our target function will never exceed that amount of floating
point error, it is sometimes necessary to get even tighter bounds. To achieve
that, we can go one step further and incorporate the actual errors in the
values themselves: we subtract from each value the same value at lower
precision and use that difference as the error metric. So, our modified Taylor
approximation model would look something like this:

$$
\begin{align*}
A_f = \sum\limits_{i=1}^n \frac{\partial f}{\partial x_i} \ast |x_i - (float)x_i|
\end{align*}
$$

Here we have cast x<sub>i</sub> down to `float` from `double`. This model,
however, imposes the constraint that all our floating point variables must be
of type `double` or higher precision.
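
To see what this error metric means in plain C++ (this is standalone
illustration code, not something clad generates), the per-value term is just
the difference between a `double` and its `float`-rounded copy:

```cpp
#include <cmath>
#include <cstdio>

int main() {
  // A value of interest, stored in double precision.
  double x = 1.0 / 3.0;
  // |x_i - (float)x_i|: the precision lost when x is rounded down to float.
  double err = std::fabs(x - static_cast<float>(x));
  std::printf("x = %.17g, cast-down error = %g\n", x, err);
  return 0;
}
```

Because the metric needs a lower-precision copy to compare against, every
analysed variable has to be stored in `double` (or wider), which is exactly
the constraint mentioned above.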

Once you have a solid formula for each value of interest, the only thing that
remains is translating it into a function. We have just derived such a formula
in the previous section, so before we begin writing the function, it is
important to understand how clad internally generates error estimation code.

Clad works on the very simple concept of a *Recursive AST Visitor*.
For brevity's sake, we will not be discussing ASTs in depth in this tutorial.
In a nutshell, these visitors recursively visit each statement in a program in
a depth-first manner. To illustrate the visitation, consider the following
simple example.

Let’s suppose we have a semantically correct statement in our program as follows:

$$
\begin{align*}
x = a + b
\end{align*}
$$

The AST visitation of the above statement will look like this:

<div align=center style="max-width:1095px; margin:0 auto;">
  <img src="/images/tutorials/fp_error_estimation_clad_tutorial/ast.gif"
  style="max-width:90%;"><br/>
  <p align="center">
  </p>
</div>

Now, clad will generate error-estimation-specific code on the leftmost leaf
node of the above tree using the `AssignError()` function, which we will be
overriding later in the tutorial. This also implies that clad will only
generate error estimates for assignment operations; more specifically, it will
generate error contributions for the LHS of every assignment, as sketched in
the example below.
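
As a concrete (hypothetical) target function, the comments below mark where,
per the rule just described, error contributions would be attached; this is an
annotated sketch, not clad's actual generated output:

```cpp
// Hypothetical target function for illustration only.
double scale(double a, double b) {
  double x = 0;  // declaration: its error is initialized separately (see SetError() below)
  x = a + b;     // assignment: an error contribution is generated for 'x'
  x = x * a;     // assignment: another contribution for 'x'
  return x;      // not an assignment: no contribution added here
}
```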

Now that we have an idea of how the error estimation framework works, all we
have to do is:

1. Inherit from the `clad::EstimationModel` class
2. Override the `AssignError()` and `SetError()` functions
3. Register our derived class with the
   `clad::plugin::ErrorEstimationModelRegistry`
4. Compile our new library into a shared object

And we will be ready to use our custom estimation model!

First, let us make a header and a `.cpp` file to get started. For this
tutorial, let us refer to our custom model as `CustomModel`. Once you have
created the files, populate them with the following code:

**CustomModel.h**
```cpp
#include "clad/Differentiator/EstimationModel.h"
#include "clad/tools/ClangPlugin.h"

class CustomModel : public clad::EstimationModel {
public:
  clang::Expr* AssignError(clad::StmtDiff refExpr);

  clang::Expr* SetError(clad::VarDeclDiff declStmt);
};
```

**CustomModel.cpp**
```cpp
#include "CustomModel.h"

clang::Expr* CustomModel::AssignError(clad::StmtDiff refExpr) {
  // Populate here with your code
}

clang::Expr* CustomModel::SetError(clad::VarDeclDiff declStmt) {
  // Populate here with your code
}

static clad::plugin::ErrorEstimationModelRegistry::Add<CustomModel>
CX("customModel", "Custom model for error estimation in clad");
```

Before we start filling in the definitions, let’s look at the last statement
in `CustomModel.cpp`:

```cpp
static clad::plugin::ErrorEstimationModelRegistry::Add<CustomModel>
CX("customModel", "Custom model for error estimation in clad");
```

This statement tells clad to register our custom model. Without it, clad will
be unable to locate our derived class and its functions, and will throw a
symbol lookup error when the model is used.

Now, the only thing left to do is to populate the two functions with our
custom error estimation logic.

First, let’s look at `SetError`, which is used to initialize the error
associated with each variable of interest. This function is useful if we know
of an intrinsic error in all the variables and want it to be a part of the
final error. For now, let us return a `nullptr`, which clad interprets as a 0
literal; that is, clad will initialize all error values to 0.

```cpp
clang::Expr* CustomModel::SetError(clad::VarDeclDiff declStmt) {
  return nullptr;
}
```
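
If you did want every variable to start out with a small intrinsic error, you
could return an expression for that value here instead. The sketch below
builds a `double` literal with plain clang APIs; the chosen constant (1e-8)
and the invalid source location for the synthesized node are illustrative
assumptions, not part of the tutorial's model:

```cpp
// Hypothetical variant: start every variable with an intrinsic error of 1e-8.
clang::Expr* CustomModel::SetError(clad::VarDeclDiff declStmt) {
  return clang::FloatingLiteral::Create(m_Context, llvm::APFloat(1e-8),
                                        /*isexact=*/true, m_Context.DoubleTy,
                                        clang::SourceLocation());
}
```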

Now, let’s look at the `AssignError` function. Recall that we want to translate
the following mathematical formula into code:

$$
\begin{align*}
\Delta x_i = \frac{\partial f}{\partial x_i} \ast |x_i - (float)x_i|
\end{align*}
$$

First, we can obtain the partial derivative with respect to x<sub>i</sub>
using `getExpr_dx()`. Here, `refExpr` refers to x<sub>i</sub>:

```cpp
clang::Expr* CustomModel::AssignError(clad::StmtDiff refExpr) {
  // Get the df/dx term
  auto derivative = refExpr.getExpr_dx();
}
```

Next, we need to build the subtraction expression with the `float` cast: we
take x<sub>i</sub>, cast it down to `float`, and then subtract that from the
original value. First, let's look at how to build the cast expression using
`BuildCStyleCastExpr`:

```cpp
clang::Expr* CustomModel::AssignError(clad::StmtDiff refExpr) {
  // Get the df/dx term
  auto derivative = refExpr.getExpr_dx();
  auto expr = refExpr.getExpr();
  // Get the cast expression
  auto castExpr = m_Sema
                      .BuildCStyleCastExpr(
                          expr->getBeginLoc(),
                          m_Context.getTrivialTypeSourceInfo(m_Context.FloatTy),
                          expr->getEndLoc(), expr)
                      .get();
}
```

Now, let’s use the `BuildOp` shorthand to build the subtraction expression.
Note that the first argument to `BuildOp` is the kind of operation, and the
next two are the LHS and RHS of that operation.

```cpp
clang::Expr* CustomModel::AssignError(clad::StmtDiff refExpr) {
  // Get the df/dx term
  auto derivative = refExpr.getExpr_dx();
  auto expr = refExpr.getExpr();
  // Get the cast expression
  auto castExpr = m_Sema
                      .BuildCStyleCastExpr(
                          expr->getBeginLoc(),
                          m_Context.getTrivialTypeSourceInfo(m_Context.FloatTy),
                          expr->getEndLoc(), expr)
                      .get();
  // Build subtraction operator
  auto subExpr = BuildOp(BO_Sub, expr, castExpr);
}
```

Since clad internally builds the assignment, we only need to return the
multiplication expression. We can use the same `BuildOp` helper to get that
done!

```cpp
clang::Expr* CustomModel::AssignError(clad::StmtDiff refExpr) {
  // Get the df/dx term
  auto derivative = refExpr.getExpr_dx();
  auto expr = refExpr.getExpr();
  // Get the cast expression
  auto castExpr = m_Sema
                      .BuildCStyleCastExpr(
                          expr->getBeginLoc(),
                          m_Context.getTrivialTypeSourceInfo(m_Context.FloatTy),
                          expr->getEndLoc(), expr)
                      .get();
  // Build subtraction operator
  auto subExpr = BuildOp(BO_Sub, expr, castExpr);
  // Return the final multiplication operation
  return BuildOp(BO_Mul, derivative, subExpr);
}
```

And voila! We are all done with writing our custom estimation model!

Lastly, we have to compile our source into a shared object. To simplify
following the rest of the tutorial, first set up some environment variables.
In a terminal, run the following:

```bash
export CLAD_INST=$PWD/../inst;
export CLAD_BASE=$PWD/../clad;
```

Then execute the following:

```bash
$CLAD_INST/bin/clang -I$CLAD_BASE/include -fPIC -shared -fno-rtti -Wl,-undefined -Wl,suppress ./CustomModel.cpp -o libCustomModel.so
```

The above command creates a shared library called `libCustomModel.so`. To use
this shared library with clad, simply pass it on the command line when
invoking clad, as follows:

```bash
-Xclang -plugin-arg-clad -Xclang -fcustom-estimation-model -Xclang -plugin-arg-clad -Xclang ./libCustomModel.so
```

That is, a typical clad invocation will now look like this:

```bash
$CLAD_INST/bin/clang -Xclang -add-plugin -Xclang clad -Xclang -load -Xclang $CLAD_INST/clad.so -I$CLAD_BASE/include -x c++ -lstdc++ -Xclang -plugin-arg-clad -Xclang -fcustom-estimation-model -Xclang -plugin-arg-clad -Xclang ./libCustomModel.so some.cpp
```

Yay! You have successfully generated code instrumented with your custom
estimation logic!
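
For completeness, `some.cpp` above stands for whatever translation unit
requests error estimation. A minimal sketch is shown below; it assumes clad's
`clad::estimate_error` interface, and the exact `execute` signature (adjoint
pointers followed by a final-error reference) is taken from the clad
documentation and may differ between clad versions:

```cpp
#include "clad/Differentiator/Differentiator.h"
#include <iostream>

// Toy target function whose floating point error we want to estimate.
double func(double x, double y) { return x * y; }

int main() {
  // Generate the error estimation code for 'func'; with the plugin arguments
  // above, the error terms come from our custom model.
  auto df = clad::estimate_error(func);
  double dx = 0, dy = 0, final_error = 0;
  df.execute(2.0, 3.0, &dx, &dy, final_error);
  std::cout << "Estimated error: " << final_error << "\n";
}
```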

To learn more about what the error estimation framework is capable of, check
out the docs [here](https://clad.readthedocs.io/en/latest/index.html).