ScipyKrylov#
ScipyKrylov is an iterative linear solver that wraps the methods found in scipy.sparse.linalg. The default method is "gmres", or the Generalized Minimal RESidual method. Support for other scipy.sparse.linalg solvers will be added over time. This linear solver is capable of handling any system topology very effectively. It also solves all subsystems below it in the hierarchy, so assigning different solvers to subsystems will have no effect on the solution at this level.

This is a serial solver, so it should never be used under MPI; use PETScKrylov instead.
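At its simplest, ScipyKrylov is assigned to a group's linear_solver attribute, usually alongside an iterative nonlinear solver. The snippet below is only a minimal sketch of that pattern on an empty model; the full examples on this page use the Sellar problem.

import openmdao.api as om

# Minimal assignment sketch; the model here is empty and purely illustrative.
prob = om.Problem()
prob.model.nonlinear_solver = om.NewtonSolver(solve_subsystems=False)
prob.model.linear_solver = om.ScipyKrylov()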
Here, we calculate the total derivatives across the Sellar system.
SellarDis1withDerivatives class definition

class SellarDis1withDerivatives(SellarDis1):
    """
    Component containing Discipline 1 -- derivatives version.
    """

    def setup_partials(self):
        # Analytic Derivs
        self.declare_partials(of='*', wrt='*')

    def compute_partials(self, inputs, partials):
        """
        Jacobian for Sellar discipline 1.
        """
        partials['y1', 'y2'] = -0.2
        partials['y1', 'z'] = np.array([[2.0 * inputs['z'][0], 1.0]])
        partials['y1', 'x'] = 1.0
SellarDis2withDerivatives class definition

class SellarDis2withDerivatives(SellarDis2):
    """
    Component containing Discipline 2 -- derivatives version.
    """

    def setup_partials(self):
        # Analytic Derivs
        self.declare_partials(of='*', wrt='*')

    def compute_partials(self, inputs, J):
        """
        Jacobian for Sellar discipline 2.
        """
        y1 = inputs['y1']
        if y1.real < 0.0:
            y1 *= -1
        if y1.real < 1e-8:
            y1 = 1e-8

        J['y2', 'y1'] = .5*y1**-.5
        J['y2', 'z'] = np.array([[1.0, 1.0]])
import numpy as np

import openmdao.api as om
from openmdao.test_suite.components.sellar import SellarDis1withDerivatives, SellarDis2withDerivatives

prob = om.Problem()
model = prob.model

model.add_subsystem('d1', SellarDis1withDerivatives(), promotes=['x', 'z', 'y1', 'y2'])
model.add_subsystem('d2', SellarDis2withDerivatives(), promotes=['z', 'y1', 'y2'])

model.add_subsystem('obj_cmp', om.ExecComp('obj = x**2 + z[1] + y1 + exp(-y2)',
                                           z=np.array([0.0, 0.0]), x=0.0),
                    promotes=['obj', 'x', 'z', 'y1', 'y2'])

model.add_subsystem('con_cmp1', om.ExecComp('con1 = 3.16 - y1'), promotes=['con1', 'y1'])
model.add_subsystem('con_cmp2', om.ExecComp('con2 = y2 - 24.0'), promotes=['con2', 'y2'])

model.nonlinear_solver = om.NonlinearBlockGS()
model.linear_solver = om.ScipyKrylov()

prob.setup()

prob.set_val('x', 1.)
prob.set_val('z', np.array([5.0, 2.0]))

prob.run_model()

wrt = ['z']
of = ['obj']

J = prob.compute_totals(of=of, wrt=wrt, return_format='flat_dict')
NL: NLBGS Converged in 8 iterations
ScipyKrylov Options#
Option | Default | Acceptable Values | Acceptable Types | Description
--- | --- | --- | --- | ---
assemble_jac | False | [True, False] | ['bool'] | Activates use of assembled jacobian by this solver.
atol | 1e-12 | N/A | N/A | absolute error tolerance
err_on_non_converge | False | [True, False] | ['bool'] | When True, AnalysisError will be raised if we don't converge.
iprint | 1 | N/A | ['int'] | whether to print output
maxiter | 1000 | N/A | ['int'] | maximum number of iterations
restart | 20 | N/A | ['int'] | Number of iterations between restarts. Larger values increase iteration cost, but may be necessary for convergence. This option applies only to gmres.
rhs_checking | False | N/A | ['bool', 'dict'] | If True, check RHS vs. cache and/or zero to avoid some solves. Can also be set to a dict of options for the LinearRHSChecker to allow finer control over it. Allowed options are: ('check_zero', 'rtol', 'atol', 'max_cache_entries', 'collect_stats', 'auto', 'verbose')
rtol | 1e-10 | N/A | N/A | relative error tolerance
solver | gmres | ['gmres'] | N/A | function handle for actual solver
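All of these options are set through the solver's options dictionary (or passed to the constructor, as described in the next section). The snippet below is only a sketch; the option values are illustrative, not recommendations.

import openmdao.api as om

prob = om.Problem()

lin = om.ScipyKrylov()
lin.options['maxiter'] = 200               # cap the number of GMRES iterations
lin.options['restart'] = 50                # GMRES restart length
lin.options['atol'] = 1e-12                # absolute residual tolerance
lin.options['err_on_non_converge'] = True  # raise AnalysisError on a failed solve

prob.model.linear_solver = lin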
ScipyKrylov Constructor#
The call signature for the ScipyKrylov constructor is:

- ScipyKrylov.__init__(**kwargs)

  Declare the solver option.
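As with other OpenMDAO solvers, any option from the table above can also be passed directly as a keyword argument to the constructor instead of being set on the options dictionary afterward. A short sketch (values illustrative):

import openmdao.api as om

# Equivalent to setting each entry on the solver's options dictionary.
prob = om.Problem()
prob.model.linear_solver = om.ScipyKrylov(maxiter=500, atol=1e-12)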
ScipyKrylov Option Examples#
maxiter

maxiter lets you specify the maximum number of GMRES iterations to apply. The default maximum is 1000, which is much higher than for the other linear solvers because each multiplication by the system Jacobian is considered to be an iteration. You may have to decrease this value if you have a coupled system that is converging very slowly. (Of course, in such a case, it may be better to add a preconditioner.) Alternatively, you may have to raise it if you have an extremely large number of components in your system (a 1000-component ring would need 1000 iterations just to make it around once).

This example shows what happens if you set maxiter too low. Since iteration is terminated after only 3 iterations, the solver does not converge and the derivatives will not be correct.
import numpy as np

import openmdao.api as om
from openmdao.test_suite.components.sellar import SellarDis1withDerivatives, SellarDis2withDerivatives

prob = om.Problem()
model = prob.model

model.add_subsystem('d1', SellarDis1withDerivatives(), promotes=['x', 'z', 'y1', 'y2'])
model.add_subsystem('d2', SellarDis2withDerivatives(), promotes=['z', 'y1', 'y2'])

model.add_subsystem('obj_cmp', om.ExecComp('obj = x**2 + z[1] + y1 + exp(-y2)',
                                           z=np.array([0.0, 0.0]), x=0.0),
                    promotes=['obj', 'x', 'z', 'y1', 'y2'])

model.add_subsystem('con_cmp1', om.ExecComp('con1 = 3.16 - y1'), promotes=['con1', 'y1'])
model.add_subsystem('con_cmp2', om.ExecComp('con2 = y2 - 24.0'), promotes=['con2', 'y2'])

model.nonlinear_solver = om.NonlinearBlockGS()
model.linear_solver = om.ScipyKrylov()
model.linear_solver.options['maxiter'] = 3

prob.setup()

prob.set_val('x', 1.)
prob.set_val('z', np.array([5.0, 2.0]))

prob.run_model()

wrt = ['z']
of = ['obj']

J = prob.compute_totals(of=of, wrt=wrt, return_format='flat_dict')
NL: NLBGS Converged in 8 iterations
LN: SCIPYSolver 'LN: SCIPY' on system '' failed to converge in 3 iterations.
print(J['obj', 'z'][0][0])
print(J['obj', 'z'][0][1])
9.265405444309767
1.8724662355885655
Depending on your situation, you may want to set the err_on_non_converge option to True, so that this will raise an AnalysisError.
import numpy as np

import openmdao.api as om
from openmdao.api import AnalysisError
from openmdao.test_suite.components.sellar import SellarDis1withDerivatives, SellarDis2withDerivatives

prob = om.Problem()
model = prob.model

model.add_subsystem('d1', SellarDis1withDerivatives(), promotes=['x', 'z', 'y1', 'y2'])
model.add_subsystem('d2', SellarDis2withDerivatives(), promotes=['z', 'y1', 'y2'])

model.add_subsystem('obj_cmp', om.ExecComp('obj = x**2 + z[1] + y1 + exp(-y2)',
                                           z=np.array([0.0, 0.0]), x=0.0),
                    promotes=['obj', 'x', 'z', 'y1', 'y2'])

model.add_subsystem('con_cmp1', om.ExecComp('con1 = 3.16 - y1'), promotes=['con1', 'y1'])
model.add_subsystem('con_cmp2', om.ExecComp('con2 = y2 - 24.0'), promotes=['con2', 'y2'])

model.nonlinear_solver = om.NonlinearBlockGS()
model.linear_solver = om.ScipyKrylov()
model.linear_solver.options['maxiter'] = 3
model.linear_solver.options['err_on_non_converge'] = True

prob.setup()

prob.set_val('x', 1.)
prob.set_val('z', np.array([5.0, 2.0]))

prob.run_model()

wrt = ['z']
of = ['obj']

try:
    J = prob.compute_totals(of=of, wrt=wrt, return_format='flat_dict')
except AnalysisError:
    # Special handling for a failure to converge when computing derivatives
    print("===\nLinear Solver failed to converge, derivatives are not valid.\n===")
NL: NLBGS Converged in 8 iterations
LN: SCIPYSolver 'LN: SCIPY' on system '' failed to converge in 3 iterations.
===
Linear Solver failed to converge, derivatives are not valid.
===
atol

Here, we set the absolute tolerance to a much tighter value (the default is 1.0e-12) to show what happens. In practice, the tolerance serves a dual role in GMRES. In addition to being a termination criterion, the tolerance also defines what GMRES considers to be tiny. Tiny numbers are replaced by zero when the argument vector is normalized at the start of each new matrix-vector product. The end result here is that we iterate longer to get a marginally better answer.

You may need to adjust this setting if you have abnormally large or small values in your global Jacobian.
import numpy as np

import openmdao.api as om
from openmdao.test_suite.components.sellar import SellarDis1withDerivatives, SellarDis2withDerivatives

prob = om.Problem()
model = prob.model

model.add_subsystem('d1', SellarDis1withDerivatives(), promotes=['x', 'z', 'y1', 'y2'])
model.add_subsystem('d2', SellarDis2withDerivatives(), promotes=['z', 'y1', 'y2'])

model.add_subsystem('obj_cmp', om.ExecComp('obj = x**2 + z[1] + y1 + exp(-y2)',
                                           z=np.array([0.0, 0.0]), x=0.0),
                    promotes=['obj', 'x', 'z', 'y1', 'y2'])

model.add_subsystem('con_cmp1', om.ExecComp('con1 = 3.16 - y1'), promotes=['con1', 'y1'])
model.add_subsystem('con_cmp2', om.ExecComp('con2 = y2 - 24.0'), promotes=['con2', 'y2'])

model.nonlinear_solver = om.NonlinearBlockGS()
model.linear_solver = om.ScipyKrylov()
model.linear_solver.options['atol'] = 1.0e-20

prob.setup()

prob.set_val('x', 1.)
prob.set_val('z', np.array([5.0, 2.0]))

prob.run_model()

wrt = ['z']
of = ['obj']

J = prob.compute_totals(of=of, wrt=wrt, return_format='flat_dict')
NL: NLBGS Converged in 8 iterations
print(J['obj', 'z'][0][0])
print(J['obj', 'z'][0][1])
9.610010556989941
1.7844853356313644
rtol
The ‘rtol’ setting is not supported by SciPy GMRES for versions earlier than SciPy 1.12.0.
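With a sufficiently new SciPy, rtol is set the same way as the other options. A short sketch; the value here is illustrative only.

import openmdao.api as om

lin = om.ScipyKrylov()
# 'rtol' only takes effect with SciPy 1.12.0 or newer (see the note above).
lin.options['rtol'] = 1e-12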
Specifying a Preconditioner#
You can specify a preconditioner to improve the convergence of the iterative linear solution by setting the precon attribute. The motivation for using a preconditioner is the observation that iterative methods have better convergence properties if the linear system has a smaller condition number, so the goal of the preconditioner is to improve the condition number in part or all of the Jacobian.

Here, we add a Gauss-Seidel preconditioner to a problem that contains two subgroups, each with an implicit component that implements a quadratic equation. These are solved together by a Newton solver at the top. The goal of the preconditioner here is to solve the smaller linear systems for the quadratic components independently, and to use that solution to precondition the full system solution. This is accomplished by setting up the linear solver hierarchy so that the preconditioner is LinearBlockGS and the subsystems sub1 and sub2 each contain a DirectSolver.

Note that the number of GMRES iterations is lower when using the preconditioner.
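Before the full example, here is a condensed sketch of just the solver hierarchy it sets up: ScipyKrylov at the top with a LinearBlockGS preconditioner, and a DirectSolver in each subgroup, assigned after setup as in the example. The empty subgroups here are placeholders for the quadratic subgroups defined below.

import openmdao.api as om

prob = om.Problem()
model = prob.model

# Placeholder subgroups; in the example below they contain the QuadraticComp instances.
model.add_subsystem('sub1', om.Group())
model.add_subsystem('sub2', om.Group())

model.linear_solver = om.ScipyKrylov()

prob.setup()

# Preconditioner for the top-level Krylov solve, plus direct solves within each subgroup.
model.linear_solver.precon = om.LinearBlockGS()
model.sub1.linear_solver = om.DirectSolver()
model.sub2.linear_solver = om.DirectSolver()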
QuadraticComp class definition

class QuadraticComp(om.ImplicitComponent):
    """
    A Simple Implicit Component representing a Quadratic Equation.

    R(a, b, c, x) = ax^2 + bx + c

    Solution via Quadratic Formula:
    x = (-b + sqrt(b^2 - 4ac)) / 2a
    """

    def setup(self):
        self.add_input('a', val=1.)
        self.add_input('b', val=1.)
        self.add_input('c', val=1.)
        self.add_output('x', val=0.)

    def setup_partials(self):
        self.declare_partials(of='x', wrt='*')

    def apply_nonlinear(self, inputs, outputs, residuals):
        a = inputs['a']
        b = inputs['b']
        c = inputs['c']
        x = outputs['x']
        residuals['x'] = a * x ** 2 + b * x + c

    def solve_nonlinear(self, inputs, outputs):
        a = inputs['a']
        b = inputs['b']
        c = inputs['c']
        outputs['x'] = (-b + (b ** 2 - 4 * a * c) ** 0.5) / (2 * a)

    def linearize(self, inputs, outputs, partials):
        a = inputs['a']
        b = inputs['b']
        x = outputs['x']

        partials['x', 'a'] = x ** 2
        partials['x', 'b'] = x
        partials['x', 'c'] = 1.0
        partials['x', 'x'] = 2 * a * x + b

        self.inv_jac = 1.0 / (2 * a * x + b)
import openmdao.api as om
from openmdao.test_suite.components.quad_implicit import QuadraticComp
prob = om.Problem()
model = prob.model
sub1 = model.add_subsystem('sub1', om.Group())
sub1.add_subsystem('q1', QuadraticComp())
sub1.add_subsystem('z1', om.ExecComp('y = -6.0 + .01 * x'))
sub2 = model.add_subsystem('sub2', om.Group())
sub2.add_subsystem('q2', QuadraticComp())
sub2.add_subsystem('z2', om.ExecComp('y = -6.0 + .01 * x'))
model.connect('sub1.q1.x', 'sub1.z1.x')
model.connect('sub1.z1.y', 'sub2.q2.c')
model.connect('sub2.q2.x', 'sub2.z2.x')
model.connect('sub2.z2.y', 'sub1.q1.c')
model.nonlinear_solver = om.NewtonSolver(solve_subsystems=False)
model.linear_solver = om.ScipyKrylov()
prob.setup()
model.sub1.linear_solver = om.DirectSolver()
model.sub2.linear_solver = om.DirectSolver()
model.linear_solver.precon = om.LinearBlockGS()
prob.set_solver_print(level=2)
prob.run_model()
NL: Newton 0 ; 10 1
| | precon: LNBGS 0 ; 10 1
| | precon: LNBGS 1 ; 6.9399 0.69399
| | precon: LNBGS 2 ; 0.00069399 6.9399e-05
| | precon: LNBGS 3 ; 6.93990003e-08 6.93990003e-09
| | precon: LNBGS 4 ; 6.93933799e-12 6.93933799e-13
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 6.93933799e-12 1
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 2.86880433e-16 1
NL: Newton 1 ; 49.9085269 4.99085269
| | precon: LNBGS 0 ; 148.641386 1
| | precon: LNBGS 1 ; 6.91321409 0.046509349
| | precon: LNBGS 2 ; 4.16646906e-06 2.80303432e-08
| | precon: LNBGS 3 ; 2.50822592e-12 1.68743443e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 247.507374 1
| | precon: LNBGS 1 ; 13.853804 0.0559732979
| | precon: LNBGS 2 ; 8.349437e-06 3.37340939e-08
| | precon: LNBGS 3 ; 5.03064257e-12 2.0325223e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 0 0
NL: Newton 2 ; 10.5986027 1.05986027
| | precon: LNBGS 0 ; 18.1127188 1
| | precon: LNBGS 1 ; 0.0172701646 0.000953482735
| | precon: LNBGS 2 ; 3.14865982e-08 1.73836952e-09
| | precon: LNBGS 3 ; 5.7738429e-14 3.18772846e-15
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 46.8240402 1
| | precon: LNBGS 1 ; 0.044645914 0.000953482735
| | precon: LNBGS 2 ; 8.13974843e-08 1.73836952e-09
| | precon: LNBGS 3 ; 1.49213975e-13 3.18669585e-15
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 0 0
NL: Newton 3 ; 1.44424034 0.144424034
| | precon: LNBGS 0 ; 6.26588166 1
| | precon: LNBGS 1 ; 0.00821261904 0.0013106885
| | precon: LNBGS 2 ; 2.83220809e-08 4.52004721e-09
| | precon: LNBGS 3 ; 9.76996262e-14 1.55923191e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 13.9760037 1
| | precon: LNBGS 1 ; 0.0183181873 0.0013106885
| | precon: LNBGS 2 ; 6.31721964e-08 4.52004721e-09
| | precon: LNBGS 3 ; 2.17603713e-13 1.55698094e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 0 0
NL: Newton 4 ; 0.0506752521 0.00506752521
| | precon: LNBGS 0 ; 1.29221459 1
| | precon: LNBGS 1 ; 0.00182151595 0.00140960795
| | precon: LNBGS 2 ; 7.26768965e-09 5.62421266e-09
| | precon: LNBGS 3 ; 2.89976376e-14 2.24402649e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 2.63510443 1
| | precon: LNBGS 1 ; 0.00371446414 0.00140960795
| | precon: LNBGS 2 ; 1.48203876e-08 5.62421262e-09
| | precon: LNBGS 3 ; 5.90639693e-14 2.24142803e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 2.39466931e-15 1
NL: Newton 5 ; 7.21617369e-05 7.21617369e-06
| | precon: LNBGS 0 ; 0.0504587669 1
| | precon: LNBGS 1 ; 7.13302261e-05 0.00141363395
| | precon: LNBGS 2 ; 2.86232687e-10 5.67260567e-09
| | precon: LNBGS 3 ; 1.14859701e-15 2.27630811e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0.100989696 1
| | precon: LNBGS 1 ; 0.000142762462 0.00141363395
| | precon: LNBGS 2 ; 5.7287472e-10 5.67260567e-09
| | precon: LNBGS 3 ; 2.29677388e-15 2.27426558e-14
| | precon:LN: LNBGS Converged
| | precon: LNBGS 0 ; 0 0
| | precon:LN: LNBGS Converged
| LN: SCIPY 0 ; 0 0
NL: Newton 6 ; 1.47167278e-10 1.47167278e-11
NL: Newton Converged
print(prob.get_val('sub1.q1.x'))
print(prob.get_val('sub2.q2.x'))
[1.9960048]
[1.9960048]