
Best way to define gradient for linear black box function

See original GitHub issue

Dear JAX team,

I’d like to use a black-box function in JAX with grad, where the function is a linear operator: a function f(x) that computes A.dot(x), where A is a matrix too large to fit in memory and is therefore computed on the fly.

I guess I could define its gradient with jax.custom_gradient, but I assume there are performance benefits to telling JAX that the operation is linear? Either way, I would have to provide another black-box function that computes A.T.dot(x).

Is there a way in JAX to do this? Or should I just provide the gradient rule via jax.custom_gradient?
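For readers landing here: a minimal sketch of the custom-gradient route, using jax.custom_vjp (the public successor to the older jax.custom_gradient API). The functions apply_A and apply_AT are hypothetical stand-ins for the black-box matvec and its transpose; a small in-memory matrix is used purely for illustration.

```python
import numpy as np
import jax
import jax.numpy as jnp

# Hypothetical stand-ins for the opaque operator: in the real setting,
# these would compute A @ x and A.T @ y on the fly without storing A.
A = np.arange(9.0).reshape(3, 3)

def apply_A(x):
    return jnp.dot(A, x)    # black-box matvec

def apply_AT(y):
    return jnp.dot(A.T, y)  # black-box transposed matvec

@jax.custom_vjp
def matvec(x):
    return apply_A(x)

def matvec_fwd(x):
    # No residuals are needed: the operator itself is fixed.
    return matvec(x), None

def matvec_bwd(_, ct):
    # The VJP of the linear map x -> A @ x is ct -> A.T @ ct.
    return (apply_AT(ct),)

matvec.defvjp(matvec_fwd, matvec_bwd)

x = jnp.ones(3)
print(jax.grad(lambda v: jnp.sum(matvec(v)))(x))  # equals A.T @ ones(3)
```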

Issue Analytics

  • State: closed
  • Created: 4 years ago
  • Comments: 15 (12 by maintainers)

Top GitHub Comments

1 reaction
shoyer commented, Oct 21, 2019

Are the non-zero entries of the sparse matrices all in memory at some point? Or can the operators be totally opaque black-box functions?

The idea is to make everything work based on black-box functions for computing matrix-vector products.

(We don’t have support for explicit sparse matrices in JAX yet.)
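To make the fully opaque case concrete, a matrix-free operator might look like the following sketch (a hypothetical periodic 1-D Laplacian): only the action of A on a vector is ever computed, and the matrix itself is never materialized. This particular A is symmetric, so the transposed matvec is the same function.

```python
import jax.numpy as jnp

def apply_A(x):
    # Acts like A @ x for the periodic 1-D Laplacian (tridiagonal with
    # -2 on the diagonal and 1 on the off-diagonals, wrapped at the
    # ends), without ever forming A. A is symmetric, so apply_AT is
    # the same function.
    return jnp.roll(x, 1) + jnp.roll(x, -1) - 2.0 * x
```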

1 reaction
jekbradbury commented, Oct 21, 2019

If you’re willing to define your function as a core.Primitive, you can set it up for AD using ad.deflinear(primitive, transpose_rule), where transpose_rule is a function that behaves like A.T.dot(x). This notebook may be helpful for elucidating the current (internal/unstable) API surface for primitives.
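A sketch of what that looks like, with the caveat from the comment that this primitive API is internal and unstable: the names and signatures below match the 2019-era releases discussed in this thread and may have changed since. apply_A and apply_AT are the same hypothetical black-box stand-ins as in the earlier sketch, and a square A is assumed so the output shape matches the input.

```python
import numpy as np
import jax.numpy as jnp
from jax import core, grad
from jax.interpreters import ad

A = np.arange(9.0).reshape(3, 3)      # stand-in for the opaque operator
apply_A = lambda x: jnp.dot(A, x)     # black-box matvec
apply_AT = lambda y: jnp.dot(A.T, y)  # black-box transposed matvec

matvec_p = core.Primitive("black_box_matvec")
matvec_p.def_impl(apply_A)
# A is square here, so the output aval equals the input aval.
matvec_p.def_abstract_eval(lambda x: x)
# deflinear derives the JVP from linearity and registers the transpose
# rule, which maps an output cotangent to cotangents for each input.
ad.deflinear(matvec_p, lambda ct: [apply_AT(ct)])

def matvec(x):
    return matvec_p.bind(x)

print(grad(lambda v: jnp.sum(matvec(v)))(jnp.ones(3)))  # == A.T @ ones(3)
```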


Top Results From Across the Web

linear-gradient() - CSS: Cascading Style Sheets | MDN
The linear-gradient() CSS function creates an image consisting of a progressive transition between two or more colors along a straight line.
Black box function optimization - Isaac Leonard
Gradient descent. To find the lowest point on a loss surface, one technique which comes to mind is to simply place a ball...
How to find the gradient when a black box I/O function is ...
Now you can use gradient descent to train the neural network NN. You will use gradient descent to minimize the loss function. This...
Treating Linear Regression like More Than Just a Black Box
In a nutshell, Gradient Descent algorithm finds the optimum solution for a linear regression problem by tweaking model parameters θ (the ...
Explicit Gradient Learning for Black-Box Optimization - ICML
Definition: A Black-Box Function f : Ω → ℝ, Ω ⊆ ℝⁿ is a Black-Box function if one can sample y =...
