Cauchy kernel post & security fix

Jonathan Moussa · Jonathan Moussa · commit 0201a29d1f3a · 2019-09-16T21:27:51.000-04:00
diff --git a/Gemfile b/Gemfile
@@ -22,6 +22,9 @@ group :jekyll_plugins do
   gem "jekyll-feed", "~> 0.6"
 end
 
+# To deal with security vulnerability:
+gem "nokogiri", ">= 1.10.4"
+
 # Windows does not include zoneinfo files, so bundle the tzinfo-data gem
 gem "tzinfo-data", platforms: [:mingw, :mswin, :x64_mingw, :jruby]
 
diff --git a/Gemfile.lock b/Gemfile.lock
@@ -205,7 +205,7 @@ GEM
       jekyll-seo-tag (~> 2.1)
     minitest (5.11.3)
     multipart-post (2.0.0)
-    nokogiri (1.10.2)
+    nokogiri (1.10.4)
       mini_portile2 (~> 2.4.0)
     octokit (4.14.0)
       sawyer (~> 0.8.0, >= 0.5.3)
@@ -244,8 +244,8 @@ PLATFORMS
 DEPENDENCIES
   github-pages
   jekyll-feed (~> 0.6)
-  jekyll-sitemap
   minima (~> 2.0)
+  nokogiri (>= 1.10.4)
   tzinfo-data
 
 BUNDLED WITH
diff --git a/_posts/2019-09-16-cauchy-kernel-triage.md b/_posts/2019-09-16-cauchy-kernel-triage.md
@@ -0,0 +1,113 @@
+---
+layout: post
+title: "Triage of my Cauchy kernel project"
+categories: research
+---
+
+This is a post about my latest paper, ["Minimax separation of the Cauchy kernel"](https://arxiv.org/abs/1909.06911),
+ which I have just posted to the arXiv and submitted to the
+ [SIAM Journal on Numerical Analysis](https://www.siam.org/Publications/Journals/SIAM-Journal-on-Numerical-Analysis-SINUM).
+I have wanted to write and publish a math paper for a long time, to signify an increased emphasis in my research
+ on the mathematical underpinnings of atomistic simulation.
+It has taken me longer than expected to reach this goal because it has been difficult to focus on mathematical work
+ that is entirely unrelated to work obligations,
+ and my attempts to align employment with this sort of research activity have repeatedly failed.
+However, I am an extremely stubborn person,
+ and I will persist in my more mathematical work even if my pace is slower than I'd like.
+For example, I had to heavily triage my original plan to complete this already massively delayed paper in a timely manner
+  by dropping a set of planned applications and saving them for follow-up papers.
+
+I have always enjoyed math a lot, since early in grade school.
+It was my favorite subject all throughout high school (I was underwhelmed by my science classes until college),
+ and was even on several competitive high school math teams.
+I would have been a math major in college (rather than the hedge of a dual math/physics major),
+ but I wanted to pursue academic research and was repeatedly told that a math degree could only lead to an actuarial career.
+While I strayed from a more mathematical career by going to graduate school for physics,
+ my research interests in physics have always been of a highly mathematical nature.
+I like to view computational physics as a kind of experimental math,
+ where approximations and algorithms are tested for their relevance to physical simulation.
+My overly simple grad-school perspective on quantum mechanics was as a physical manifestation of linear algebra,
+ which meant that computational condensed matter physics was just a very complicated numerical linear algebra problem.
+However, working in quantum information theory for five years have given me a broader perspective
+ on the statistical and complexity-theoretic aspects of quantum mechanics.
+
+I discussed the background on this project a bit in a [previous post]({% post_url 2019-05-12-linear-scaling-too-little %}).
+Around 2007, I was working on a banded Hermitian eigensolver that was inspired by the physics concept of Wannier functions
+ (a.k.a. Boys orbitals in quantum chemistry).
+These functions demonstrated that simultaneous spatial and spectral locality was possible,
+ which allowed for more flexibility in divide-and-conquer strategies for eigensolving beyond the
+ [one that had already been established at the time](https://doi.org/10.1137/S0895479892241287).
+In the course of these numerical experiments,
+it became necessary to fit low-rank approximations of the Cauchy kernel with a general form
+
+$$ \frac{1}{x - y} \approx \sum_{i=1}^{r} \frac{f_i(y)}{x - y_i} $$
+
+where I was free to choose both $$y_i$$ and $$f_i(y)$$.
+For a fixed value of $$y$$, this was a standard linear minimax optimization problem.
+I had previously messed around with linear and nonlinear minimax optimization problems
+ that I thought would be useful in electronic structure
+ (for example, [my paper on rational approximations of Fermi-Dirac functions](https://doi.org/10.1063/1.4965886) originated from earlier attempts during grad school),
+ so I already had some experience with problems of this form.
+I proceeded as I had in the past, with an ad-hoc implementation of the Remez algorithm.
+This time, I could make use of the analytical expressions for Cauchy matrix inverses and determinants
+ in fitting the approximant at a set of max-residual points.
+It turned out that the relative error was what would lead to the tightest error bounds in
+ my intended application, so I adapted my solution to minimize relative error.
+I then noticed something very surprising: the location of the maximum residual points did not change when I changed $$y$$.
+Because of the Cauchy matrix inverse formula,
+ this meant that the optimal $$f_i(y)$$ were rational functions rather than something more general and arbitrary.
+Also, I was observing much more regular trends in the solutions than in other problems that I had solved,
+ such as the maximum residual points and optimal $$y_i$$ values for different $$r$$ values collapsing
+ onto a common function just like with Chebyshev nodes.
+
+I don't keep detailed research records (I've mostly operated under the assumption that published papers are record enough),
+ but at some point either in late grad school or early in my academic post-doc at UT Austin,
+ I decided that this result was probably optimal with respect to maximum relative error from a basic dimension-counting argument
+ (I couldn't see how a more general functional form could reduce the maximum error) and I wanted to prove it.
+I'm not really big into proving things because it's hard and you aren't really rewarded for doing so as a physicist.
+However, I do think electronic structure simulations need a more rigorous underpinning,
+ and there is probably a small set of core results waiting to be discovered that are important enough for a formal proof.
+I decided that this was one such result.
+My basic proof strategy was to prove separate upper and lower bounds, whereby optimality was proven if the bounds were equal.
+I proved the upper bound very quickly because it was a very straightforward task of restricting a minimization to a simple subset of the original domain.
+However, the lower bound just kicked my ass and I got nowhere in explaining all the special structure that I observed, so I eventually gave up and put it aside.
+
+I finally put together the lower-bound proof while at Sandia,
+ but only after returning to it multiple times for brief but intense bursts of activity.
+Around summer 2012, I finally learned about Zolotarev's work in applying elliptic integrals/functions to rational approximation.
+This was very exciting at the time, because it explained all of the special structure that I had observed numerically.
+I can't imagine how anyone could make the connection from those numerical observations to the theory of elliptical integrals,
+ and indeed Zolotarev had worked in the other direction - his adviser, Chebyshev, had instructed him to look for useful applications of elliptic integrals.
+Unfortunately, my excitement died down when I realized that none of this made a lower-bound proof any easier,
+ although I could now rule out the previously-unexplained structure as being relevant to a proof.
+I did eventually cobble together a lower-bound proof as I was exposed to the necessary mathematical ideas while at Sandia.
+I worked on an idea for [relaxing classical stat mech problems into linear programs](https://arxiv.org/abs/1603.05180),
+ which was itself a catastrophic failure, but it increased my familiarity with convex optimization and linear programming (two linear programs being essential steps in my proof).
+Also, I was exposed to many lower-bound proofs of classical computational complexity in quantum information theory
+ that were a part of complexity-theoretic separation proofs between classical and quantum computing power.
+While none of those proofs were directly relevant, I learned about the wide variety of clever tricks and strategies
+ that are often employed in producing lower bounds for optimization problems.
+I finally had a serviceable draft of a proof in summer 2017, which I polished up a few months ago for this paper.
+I probably could have wrapped this up a lot sooner if Sandia was supportive of my research interests, but it was not,
+ and research tends to go a lot slower when you are being paid to do something else as your full-time job.
+
+This is my oldest research project that has been brought to fruition,
+ and hopefully it is a sign of maturity and a precursor to other results following suit (albeit very slowly).
+My initial proof in summer 2017 was tied to a paper on banded Hermitian eigensolvers,
+ and it collapsed under the weight of trying to accomplish too much at once.
+Up until three weeks ago, this paper was tied to three applications that would have further delayed it by many months if I hadn't cut them.
+I have a natural tendency to make big plans for papers,
+ and I need to actively fight that by breaking up projects into smaller publishable pieces.
+Having shorter, more numerous papers is advantageous for many reasons,
+ and it is almost essential for surviving the rat race of modern academia.
+
+My immediate next research priority is finishing up the first delayed application of this paper to electronic structure calculations.
+However, I need to do a better job of mixing blog posts about active research papers
+ with other research discussions about other interests that are in a more passive mode.
+I'd like to start discuss quantum computing on this blog soon, and I have two quantum computing papers lined up as #2 and #4 in my publication queue.
+I also continue to refine and develop my semiempirical electronic structure plans,
+ and I will discuss that at some point as well.
+I am happy to have finally finished this long-delayed paper,
+ but my research agenda and career are very deep in a hole
+ and it will take a lot of work before I can escape that hole.
+There will be a lot more research triage in my future.
diff --git a/assets/CV.aux b/assets/CV.aux
@@ -23,23 +23,21 @@
 \@writefile{toc}{\contentsline {section}{Job Experience}{1}{section*.5}}
 \@writefile{toc}{\contentsline {section}{H-index}{1}{section*.6}}
 \@writefile{toc}{\contentsline {section}{Publications}{2}{section*.7}}
-\gdef\etaremune@i{22}
-\gdef\etaremune@ii{11}
+\gdef\etaremune@i{23}
 \@writefile{toc}{\contentsline {section}{Unpublished Work}{3}{section*.8}}
+\gdef\etaremune@ii{10}
 \gdef\etaremune@iii{1}
-\gdef\etaremune@iv{1}
-\gdef\etaremune@v{5}
+\gdef\etaremune@iv{5}
 \@writefile{toc}{\contentsline {section}{Awarded Grants}{4}{section*.9}}
-\@writefile{toc}{\contentsline {section}{Proposed Grants}{4}{section*.10}}
-\@writefile{toc}{\contentsline {section}{Invited Talks}{4}{section*.11}}
-\@writefile{toc}{\contentsline {section}{Contributed Talks}{4}{section*.12}}
-\gdef\etaremune@vi{16}
-\gdef\etaremune@vii{1}
-\@writefile{toc}{\contentsline {section}{Patents}{5}{section*.13}}
-\@writefile{toc}{\contentsline {section}{Referee for Journals}{5}{section*.14}}
-\@writefile{toc}{\contentsline {section}{Awards}{5}{section*.15}}
-\@writefile{toc}{\contentsline {section}{Teaching Experience}{5}{section*.16}}
-\@writefile{toc}{\contentsline {section}{Programming Skills}{5}{section*.17}}
+\@writefile{toc}{\contentsline {section}{Invited Talks}{4}{section*.10}}
+\@writefile{toc}{\contentsline {section}{Contributed Talks}{4}{section*.11}}
+\gdef\etaremune@v{16}
+\gdef\etaremune@vi{1}
+\@writefile{toc}{\contentsline {section}{Patents}{5}{section*.12}}
+\@writefile{toc}{\contentsline {section}{Referee for Journals}{5}{section*.13}}
+\@writefile{toc}{\contentsline {section}{Awards}{5}{section*.14}}
+\@writefile{toc}{\contentsline {section}{Teaching Experience}{5}{section*.15}}
+\@writefile{toc}{\contentsline {section}{Programming Skills}{5}{section*.16}}
 \newlabel{LastPage}{{}{5}{}{page.5}{}}
 \xdef\lastpage@lastpage{5}
 \xdef\lastpage@lastpageHy{5}
diff --git a/assets/CV.log b/assets/CV.log
@@ -1,4 +1,4 @@
-This is pdfTeX, Version 3.14159265-2.6-1.40.19 (TeX Live 2018) (preloaded format=pdflatex 2018.4.16)  19 APR 2019 16:24
+This is pdfTeX, Version 3.14159265-2.6-1.40.19 (TeX Live 2018) (preloaded format=pdflatex 2018.4.16)  31 JUL 2019 11:02
 entering extended mode
  restricted \write18 enabled.
  file:line:error style messages enabled.
@@ -342,55 +342,61 @@ LaTeX Font Info:    Font shape `OMS/cmr/m/n' in size <10> not available
  [1
 
 {/usr/local/texlive/2018/texmf-var/fonts/map/pdftex/updmap/pdftex.map}]
-Overfull \hbox (31.05458pt too wide) in paragraph at lines 325--327
+Overfull \hbox (12.6987pt too wide) in paragraph at lines 325--327
+\OT1/cmr/m/n/10 rithms for elec-tronic struc-ture. \OT1/cmr/m/it/10 Elec-tronic
+ Struc-ture \OT1/cmr/m/n/10 1:033001 (2019). doi:[][]10.1088/2516-
+ []
+
+
+Overfull \hbox (31.05458pt too wide) in paragraph at lines 328--330
 \OT1/cmr/m/n/10 Ge/SiGe quan-tum wells. \OT1/cmr/m/it/10 Nan-otech-nol-ogy \OT1
 /cmr/m/n/10 30:215202 (2019). doi:[][]10.1088/1361-6528/ab061e[][]. 
  []
 
 
-Overfull \hbox (35.2604pt too wide) in paragraph at lines 328--330
+Overfull \hbox (35.2604pt too wide) in paragraph at lines 331--333
 \OT1/cmr/m/n/10 ma-nium be-yond the dif-fu-sive regime. \OT1/cmr/m/it/10 Nanosc
 ale \OT1/cmr/m/n/10 10:20559 (2018). doi:[][]10.1039/C8NR05677C[][]. 
  []
 
 
-Overfull \hbox (0.54602pt too wide) in paragraph at lines 346--348
+Overfull \hbox (0.54602pt too wide) in paragraph at lines 349--351
 \OT1/cmr/m/n/10 con-strained mixed dis-crete op-ti-miza-tion with an adi-a-bati
 c quan-tum op-ti-mizer. \OT1/cmr/m/it/10 Phys-
  []
 
 [2]
-Overfull \hbox (5.19522pt too wide) in paragraph at lines 392--395
+Overfull \hbox (5.19522pt too wide) in paragraph at lines 395--398
 \OT1/cmr/m/n/10 consistency in lay-ered quan-tum semi-con-duc-tor struc-tures. 
 \OT1/cmr/m/it/10 Jour-nal of Ap-plied Physics\OT1/cmr/m/n/10 ,
  []
 
 
-Overfull \hbox (18.81572pt too wide) in paragraph at lines 408--410
+Overfull \hbox (18.81572pt too wide) in paragraph at lines 411--413
 []\OT1/cmr/bx/n/10 Moussa, J. E.\OT1/cmr/m/n/10 . Measurement-Based Quan-tum Me
 tropo-lis Al-go-rithm. [][]arXiv:1903.01451[][]
  []
 
 [3] [4] 
 AED: lastpage setting LastPage
 [5]
-Package atveryend Info: Empty hook `BeforeClearDocument' on input line 549.
-Package atveryend Info: Empty hook `AfterLastShipout' on input line 549.
+Package atveryend Info: Empty hook `BeforeClearDocument' on input line 543.
+Package atveryend Info: Empty hook `AfterLastShipout' on input line 543.
  (./CV.aux)
-Package atveryend Info: Executing hook `AtVeryEndDocument' on input line 549.
-Package atveryend Info: Executing hook `AtEndAfterFileList' on input line 549.
+Package atveryend Info: Executing hook `AtVeryEndDocument' on input line 543.
+Package atveryend Info: Executing hook `AtEndAfterFileList' on input line 543.
 Package rerunfilecheck Info: File `CV.out' has not changed.
-(rerunfilecheck)             Checksum: BE3DC9CDB36F31C0003BA94B89A77F3F;863.
-Package atveryend Info: Empty hook `AtVeryVeryEnd' on input line 549.
+(rerunfilecheck)             Checksum: EA64E58B17EAB629346B2DA401AA005D;810.
+Package atveryend Info: Empty hook `AtVeryVeryEnd' on input line 543.
  ) 
 Here is how much of TeX's memory you used:
- 5854 strings out of 492649
- 86874 string characters out of 6129622
- 190465 words of memory out of 5000000
- 9685 multiletter control sequences out of 15000+600000
+ 5851 strings out of 492649
+ 86837 string characters out of 6129622
+ 190462 words of memory out of 5000000
+ 9683 multiletter control sequences out of 15000+600000
  5200 words of font info for 19 fonts, out of 8000000 for 9000
  1141 hyphenation exceptions out of 8191
- 37i,14n,68p,243b,474s stack positions out of 5000i,500n,10000p,200000b,80000s
+ 37i,13n,68p,243b,474s stack positions out of 5000i,500n,10000p,200000b,80000s
 </usr/local/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmbx10.pfb
 ></usr/local/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmbx12.pfb>
 </usr/local/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmcsc10.pfb>
@@ -401,10 +407,10 @@ sr/local/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmr10.pfb></usr
 cal/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmsy10.pfb></usr/loc
 al/texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmsy7.pfb></usr/local
 /texlive/2018/texmf-dist/fonts/type1/public/amsfonts/cm/cmti10.pfb>
-Output written on CV.pdf (5 pages, 141217 bytes).
+Output written on CV.pdf (5 pages, 141002 bytes).
 PDF statistics:
- 196 PDF objects out of 1000 (max. 8388607)
- 177 compressed objects within 2 object streams
- 23 named destinations out of 1000 (max. 500000)
- 137 words of extra memory for PDF output out of 10000 (max. 10000000)
+ 193 PDF objects out of 1000 (max. 8388607)
+ 174 compressed objects within 2 object streams
+ 22 named destinations out of 1000 (max. 500000)
+ 129 words of extra memory for PDF output out of 10000 (max. 10000000)
 
diff --git a/assets/CV.out b/assets/CV.out
@@ -7,11 +7,10 @@
 \BOOKMARK [1][-]{section*.7}{Publications}{}% 7
 \BOOKMARK [1][-]{section*.8}{Unpublished Work}{}% 8
 \BOOKMARK [1][-]{section*.9}{Awarded Grants}{}% 9
-\BOOKMARK [1][-]{section*.10}{Proposed Grants}{}% 10
-\BOOKMARK [1][-]{section*.11}{Invited Talks}{}% 11
-\BOOKMARK [1][-]{section*.12}{Contributed Talks}{}% 12
-\BOOKMARK [1][-]{section*.13}{Patents}{}% 13
-\BOOKMARK [1][-]{section*.14}{Referee for Journals}{}% 14
-\BOOKMARK [1][-]{section*.15}{Awards}{}% 15
-\BOOKMARK [1][-]{section*.16}{Teaching Experience}{}% 16
-\BOOKMARK [1][-]{section*.17}{Programming Skills}{}% 17
+\BOOKMARK [1][-]{section*.10}{Invited Talks}{}% 10
+\BOOKMARK [1][-]{section*.11}{Contributed Talks}{}% 11
+\BOOKMARK [1][-]{section*.12}{Patents}{}% 12
+\BOOKMARK [1][-]{section*.13}{Referee for Journals}{}% 13
+\BOOKMARK [1][-]{section*.14}{Awards}{}% 14
+\BOOKMARK [1][-]{section*.15}{Teaching Experience}{}% 15
+\BOOKMARK [1][-]{section*.16}{Programming Skills}{}% 16
diff --git a/assets/CV.synctex.gz b/assets/CV.synctex.gz