
New default hyperparameters and miscellaneous bug fixes for Kriging. #374

Merged
8 commits merged into SciML:master, Jul 12, 2022

Conversation

@archermarx (Contributor) commented Jul 11, 2022

Addresses #369 and #368. This PR defines new default hyperparameters for the Kriging surrogate. Along the way, I found that the implementation of the 1D kriging surrogate was incorrect and did not work properly for theta not equal to 1, so I've fixed that as well.

To summarize, the default hyperparameters for Kriging used to be p = ones(d), theta = ones(d), where d is the problem dimension. These are very bad choices. I have changed them to

$$p_i = 2, i \in [1, d]$$

$$\theta_i = \frac{1}{2}\,\mathrm{std}(\mathbf{X}_i)^{-p_i}, \quad i \in [1, d]$$

The latter comes from A Practical Guide to Gaussian Processes and the interpretation $\theta_i = \frac{1}{2 l_i^2}$, where $l_i$ is a characteristic correlation length scale for variable $i$. Scaling the length scale with the standard deviation of the samples makes the hyperparameter initialization independent of the box size, which is a very desirable property. With this initialization, here are some comparisons to the old defaults:
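For reference, here is a minimal Julia sketch of how these defaults could be computed. The helper name `default_kriging_hyperparameters` and its signature are illustrative only, not the actual Surrogates.jl internals:

```julia
using Statistics

# Illustrative sketch of the new default hyperparameters described above.
# `x` is a vector of sample points, each with `d` components.
function default_kriging_hyperparameters(x, d)
    p = fill(2.0, d)  # p_i = 2 in every dimension
    # θ_i = (1/2) * std(X_i)^(-p_i): tying the length scale to the per-dimension
    # standard deviation makes the initialization independent of the box size.
    θ = [0.5 / std(getindex.(x, i))^p[i] for i in 1:d]
    return p, θ
end

# Example: 20 random samples in [0, 10] × [0, 5]
x = [(10rand(), 5rand()) for _ in 1:20]
p, θ = default_kriging_hyperparameters(x, 2)
```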

Sphere function
Old: [sphere_old_defaults plot]
New: [sphere_new_defaults plot]

L1-norm function
Old: [l1norm_old_defaults plot]
New: [l1norm_new_defaults plot]

Branin function
Old: [branin_old_defaults plot]
New: [branin_new_defaults plot]

Rosenbrock function
Old: [rosenbrock_old_defaults plot]
New: [rosenbrock_new_defaults plot]

Clearly, the new hyperparameters are a significant improvement. During implementation, I found that the 1D kriging surrogate only worked properly when theta = 1, so I have fixed that as well:

Old (with old default hyperparams): [plot_169]
Old (with new default hyperparams): [plot_171]
New (with new default hyperparams): [plot_170]

Still to do: update the Kriging documentation to make use of the better defaults, since the current documentation doesn't make Kriging look very effective.

@codecov (bot) commented Jul 11, 2022

Codecov Report

Merging #374 (a862299) into master (aba57f3) will decrease coverage by 0.13%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #374      +/-   ##
==========================================
- Coverage   78.99%   78.85%   -0.14%     
==========================================
  Files          16       16              
  Lines        2266     2280      +14     
==========================================
+ Hits         1790     1798       +8     
- Misses        476      482       +6     
Impacted Files         Coverage Δ
src/Kriging.jl         94.44% <100.00%> (+0.82%) ⬆️
src/GEKPLS.jl          91.20% <0.00%> (-0.93%) ⬇️
src/Optimization.jl    72.06% <0.00%> (-0.37%) ⬇️


@archermarx (Contributor, Author) commented Jul 11, 2022

I have added nugget regularization to address the problem where the correlation matrix becomes singular when points are sufficiently close together. The way it currently works is to add a small "nugget" value to the main diagonal of the correlation matrix to keep the condition number less than 1e8. This has the same effect as a tiny noise variance, and prevents the matrix from becoming singular at the cost of slightly relaxing the interpolation condition.
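In equation form, choosing the nugget $\eta$ so that the regularized condition number of $R + \eta I$ hits the cap $\kappa_{\max} = 10^8$ gives

$$\frac{\lambda_{\max} + \eta}{\lambda_{\min} + \eta} = \kappa_{\max} \quad\Longrightarrow\quad \eta = \frac{\lambda_{\max} - \kappa_{\max}\,\lambda_{\min}}{\kappa_{\max} - 1},$$

with $\eta = 0$ whenever the condition number is already below the cap (i.e. $\lambda_{\max} \le \kappa_{\max}\,\lambda_{\min}$).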

@archermarx marked this pull request as ready for review, July 11, 2022 22:15
@@ -21,27 +18,45 @@ a = 2
b = 6

#Using Kriging
my_k_SRBF1 = Kriging(x, y, lb, ub)
surrogate_optimize(objective_function, SRBF(), a, b, my_k_SRBF1, UniformSample())
begin
Member

what's the purpose of the blocks here? It's fine but a little odd.

src/Kriging.jl Outdated
Comment on lines 190 to 204
# Estimate nugget based on maximum allowed condition number.
# This regularizes R to allow for points being close to each other without R
# becoming singular, at the cost of slightly relaxing the interpolation condition.
λ = eigen(R).values

λmax = λ[end]
λmin = λ[1]

κmax = 1e8
λdiff = λmax - κmax * λmin
if λdiff ≥ 0
    nugget = λdiff / (κmax - 1)
else
    nugget = 0.0
end
Member

Comment on the source for this?

@archermarx (Contributor, Author) commented Jul 12, 2022 via email

@ChrisRackauckas (Member)

yeah probably best to remove the blocks then.

@ChrisRackauckas merged commit 5bd2810 into SciML:master, Jul 12, 2022
@ChrisRackauckas (Member)

amazing, thanks!
