Commit: Restructure documentation to something more standard (user guide, examples, APIs, ...). Use gallery for examples. Use autosummary for API documentation. Add documentation of internal APIs.
Showing 79 changed files with 2,503 additions and 4,458 deletions.
@@ -0,0 +1,29 @@
.. only:: html

   .. image:: ../_static/gurobi_light.png
      :width: 220
      :target: https://www.gurobi.com
      :alt: Gurobi
      :align: right
      :class: float-right, only-light

   .. image:: ../_static/gurobi_dark.png
      :width: 220
      :target: https://www.gurobi.com
      :alt: Gurobi
      :align: right
      :class: float-right, only-dark

Examples
########

.. rst-class:: clear-both
|
|

The examples in this section showcase specific applications where Gurobi
Machine Learning can be used. Each example is self-contained and
illustrates different aspects of using the package.
@@ -0,0 +1,248 @@
r""" | ||
Surrogate Models | ||
================ | ||
Some industrial applications require modeling complex processes that can | ||
result either in highly nonlinear functions or functions defined by a | ||
simulation process. In those contexts, optimization solvers often | ||
struggle. The reason may be that relaxations of the nonlinear functions | ||
are not good enough to make the solver prove an acceptable bound in a | ||
reasonable amount of time. Another issue may be that the solver is not | ||
able to represent the functions. | ||
An approach that has been proposed in the literature is to approximate | ||
the problematic nonlinear functions via neural networks with ReLU | ||
activation and use MIP technology to solve the constructed approximation | ||
(see e.g. \ `Heneao Maravelias | ||
2011 <https://doi.org/https://doi.org/10.1002/aic.12341>`__\ , | ||
`Schweitdmann et.al. 2022 <https://arxiv.org/abs/2207.12722>`__\ ). This | ||
use of neural networks can be motivated by their ability to provide a | ||
universal approximation (see e.g. \ `Lu et.al. | ||
2017 <https://proceedings.neurips.cc/paper/2017/file/32cbf687880eb1674a07bf717761dd3a-Paper.pdf>`__\ ). | ||
This use of ML models to replace complex processes is often referred to | ||
as *surrogate models*. | ||
In the following example, we approximate a nonlinear function via | ||
``Scikit-learn`` ``MLPRegressor`` and then solve an optimization problem | ||
that uses the approximation of the nonlinear function with Gurobi. | ||
The purpose of this example is solely illustrative and doesn’t relate to | ||
any particular application. | ||
The function we approximate is the `2D peaks | ||
function <https://www.mathworks.com/help/matlab/ref/peaks.html#mw_46aeee28-390e-4373-aa47-e4a52447fc85>`__. | ||
The function is given as | ||
.. math:: | ||
\begin{aligned} f(x) = & 3 \cdot (1-x_1)^2 \cdot \exp(-x_1^2 - (x_2+1)^2) - | ||
\\ | ||
& 10 \cdot (\frac{x_1}{5} - x_1^3 - x_2^5) \cdot \exp(-x_1^2 - x_2^2) - \\ | ||
& \frac{1}{3} \cdot \exp(-(x_1+1)^2 - x_2^2). | ||
\end{aligned} | ||
In this example, we want to find the minimum of :math:`f` over the | ||
interval :math:`[-2, 2]^2`: | ||
.. math:: y = \min \{f(x) : x \in [-2,2]^2\}. | ||
The `global minimum of this problem can be found | ||
numerically <https://www.math.uwaterloo.ca/~hwolkowi/henry/reports/talks.d/t09talks.d/09waterloomatlab.d/optimTipsWebinar/html/optimTipsTricksWalkthrough.html#18>`__ | ||
to have value :math:`-6.55113` at the point :math:`(0.2283, -1.6256)`. | ||
Here to find this minimum of :math:`f`, we approximate :math:`f(x)` | ||
through a neural network function :math:`g(x)` to obtain a MIP and solve | ||
.. math:: \hat y = \min \{g(x) : x \in [-2,2]^2\} \approx y. | ||
First import the necessary packages. Before applying the neural network, | ||
we do a preprocessing to extract polynomial features of degree 2. | ||
Hopefully this will help us to approximate the smooth function. Besides, | ||
``gurobipy``, ``numpy`` and the appropriate ``sklearn`` objects, we also | ||
use ``matplotlib`` to plot the function, and its approximation. | ||
""" | ||

import gurobipy as gp
import numpy as np
from gurobipy import GRB
from matplotlib import cm
from matplotlib import pyplot as plt
from sklearn import metrics
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

from gurobi_ml import add_predictor_constr

######################################################################
# Define the nonlinear function of interest
# -----------------------------------------
#
# We define the 2D peaks function as a Python function.
#


def peak2d(x1, x2):
    return (
        3 * (1 - x1) ** 2.0 * np.exp(-(x1**2) - (x2 + 1) ** 2)
        - 10 * (x1 / 5 - x1**3 - x2**5) * np.exp(-(x1**2) - x2**2)
        - 1 / 3 * np.exp(-((x1 + 1) ** 2) - x2**2)
    )
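
######################################################################
# As a quick sanity check, we can evaluate ``peak2d`` at the known
# global minimizer quoted above; the value should be close to
# :math:`-6.55113`.
#

print(f"peak2d at the known minimizer: {peak2d(0.2283, -1.6256):.5f}")
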

######################################################################
# To train the neural network, we take a uniform sample of the domain of
# the function in the region of interest using ``numpy``\ ’s ``arange``
# function.
#
# We then plot the function with ``matplotlib``.
#

x1, x2 = np.meshgrid(np.arange(-2, 2, 0.01), np.arange(-2, 2, 0.01))
y = peak2d(x1, x2)

fig, ax = plt.subplots(subplot_kw={"projection": "3d"})
# Plot the surface.
surf = ax.plot_surface(x1, x2, y, cmap=cm.coolwarm, linewidth=0.01, antialiased=False)
# Add a color bar which maps values to colors.
fig.colorbar(surf, shrink=0.5, aspect=5)
plt.show()

######################################################################
# Approximate the function
# ------------------------
#
# To fit a model, we need to reshape our data. We concatenate the values
# of ``x1`` and ``x2`` in an array ``X`` and make ``y`` one-dimensional.
#

X = np.concatenate([x1.ravel().reshape(-1, 1), x2.ravel().reshape(-1, 1)], axis=1)
y = y.ravel()

######################################################################
# To approximate the function, we use a ``Pipeline`` with polynomial
# features and a neural-network regressor. We use a relatively small
# neural network.
#

# Run our regression
layers = [30] * 2
regression = MLPRegressor(hidden_layer_sizes=layers, activation="relu")
pipe = make_pipeline(PolynomialFeatures(), regression)
pipe.fit(X=X, y=y)
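
######################################################################
# Note that training ``MLPRegressor`` is stochastic, so results vary
# between runs. If reproducibility matters, the seed can be fixed with
# the ``random_state`` parameter (an optional suggestion, not something
# the example requires)::
#
#    regression = MLPRegressor(
#        hidden_layer_sizes=layers, activation="relu", random_state=1
#    )
#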

######################################################################
# To test the accuracy of the approximation, we take a random sample of
# points, and we print the :math:`R^2` value and the maximal error.
#

X_test = np.random.random((100, 2)) * 4 - 2

r2_score = metrics.r2_score(peak2d(X_test[:, 0], X_test[:, 1]), pipe.predict(X_test))
max_error = metrics.max_error(peak2d(X_test[:, 0], X_test[:, 1]), pipe.predict(X_test))
print("R2 value {}, maximal error {}".format(r2_score, max_error))

######################################################################
# While the :math:`R^2` value is good, the maximal error is quite high.
# For the purpose of this example we still deem it acceptable. We plot
# the approximated function.
#

fig, ax = plt.subplots(subplot_kw={"projection": "3d"})
# Plot the surface.
surf = ax.plot_surface(
    x1,
    x2,
    pipe.predict(X).reshape(x1.shape),
    cmap=cm.coolwarm,
    linewidth=0.01,
    antialiased=False,
)
# Add a color bar which maps values to colors.
fig.colorbar(surf, shrink=0.5, aspect=5)
plt.show()

######################################################################
# Visually, the approximation looks close enough to the original function.
#
# Build and Solve the Optimization Model
# --------------------------------------
#
# We now turn to the optimization model. For this model, we want to find
# the minimal value of ``y_approx``, the approximation of :math:`f`
# given by our pipeline, over the interval.
#
# Note that in this simple example, we don’t use matrix variables but
# regular Gurobi variables instead; an equivalent matrix-variable
# formulation is sketched below.
#

m = gp.Model()

x = m.addVars(2, lb=-2, ub=2, name="x")
y_approx = m.addVar(lb=-GRB.INFINITY, name="y")
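
######################################################################
# For illustration, an equivalent formulation with gurobipy matrix
# variables might look as follows (a sketch only; the example itself
# sticks to regular variables)::
#
#    x = m.addMVar(2, lb=-2, ub=2, name="x")
#    y_approx = m.addMVar((1,), lb=-GRB.INFINITY, name="y")
#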

m.setObjective(y_approx, gp.GRB.MINIMIZE)
# add "surrogate constraint" | ||
pred_constr = add_predictor_constr(m, pipe, x, y_approx) | ||
|
||
pred_constr.print_stats() | ||
|
||
|
||
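
######################################################################
# Beyond the statistics printed above, we can also query the size of the
# full model directly (an optional check, added here for illustration).
#

m.update()
print(f"Model has {m.NumVars} variables and {m.NumConstrs} constraints")
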

######################################################################
# Now call ``optimize``. Since we use polynomial features, the resulting
# model is a non-convex quadratic problem. In Gurobi, we need to set the
# parameter ``NonConvex`` to 2 to be able to solve it.
#

m.Params.TimeLimit = 20
m.Params.MIPGap = 0.1
m.Params.NonConvex = 2

m.optimize()
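
######################################################################
# Before querying the solution, it is good practice to make sure the
# solver actually found one within the time limit (an optional check,
# added here for illustration).
#

if m.SolCount == 0:
    raise RuntimeError("No feasible solution found within the time limit")
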

######################################################################
# After solving the model, we check the error in the estimate of the
# Gurobi solution.
#

print(
    "Maximum error in approximating the regression {:.6}".format(
        np.max(pred_constr.get_error())
    )
)

######################################################################
# Finally, we look at the solution and the objective value found.
#

print(
    f"solution point of the approximated problem ({x[0].X:.4}, {x[1].X:.4}), "
    + f"objective value {m.ObjVal:.6}."
)
print(
    f"Function value at the solution point {peak2d(x[0].X, x[1].X):.6} "
    + f"error {abs(peak2d(x[0].X, x[1].X) - m.ObjVal):.6}."
)

######################################################################
# The difference between the function and the approximation at the
# computed solution point is noticeable, but the point we found is
# reasonably close to the actual global minimum. Depending on the use
# case this might be deemed acceptable. Of course, training a larger
# network should result in a better approximation; a sketch follows.
#
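
######################################################################
# A sketch of what retraining with a larger network could look like;
# the layer sizes below are illustrative and were not tuned or tested
# here::
#
#    layers = [64] * 4
#    regression = MLPRegressor(hidden_layer_sizes=layers, activation="relu")
#    pipe = make_pipeline(PolynomialFeatures(), regression)
#    pipe.fit(X=X, y=y)
#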

######################################################################
# Copyright © 2023 Gurobi Optimization, LLC
#