
Switch to DifferentiationInterface #111

Open
wants to merge 3 commits into main

Conversation

@gdalle commented Jul 29, 2024

Hi @olivierlabayle,
My new package DifferentiationInterface.jl is pretty much becoming the successor of AbstractDifferentiation.jl, so I took the liberty of opening this small PR to help you make the switch, if you're interested.

@gdalle (Author) commented Jul 30, 2024

Okay, this is a bit more complicated than I thought, and I can't do it blindly. Can you tell me a little more about the objects f and point_estimates for which you compute derivatives? Apparently the output of f is either a scalar or a vector? What is the type of the individual point estimates?

@olivierlabayle (Member) commented:

> Okay, this is a bit more complicated than I thought, and I can't do it blindly. Can you tell me a little more about the objects f and point_estimates for which you compute derivatives? Apparently the output of f is either a scalar or a vector? What is the type of the individual point estimates?

Hi @gdalle, thank you for taking the time to open this PR. Could you tell me a bit more about your new DifferentiationInterface.jl compared to AbstractDifferentiation.jl, and the problems it solves? I can't see any deprecation or migration notes on its GitHub.

The f function is any differentiable function taking a vector of real numbers point_estimates and outputting a real number or a vector thereof. I have mostly resorted to splatting to pass the inputs for now, since point_estimates is usually not very high-dimensional in the use cases of this package.
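For illustration, here is a minimal Julia sketch of the pattern described above; the functions f and g and the sample values are hypothetical stand-ins for user-provided estimands, and only the splatting convention is taken from the comment:

```julia
# Hypothetical examples of user-provided functions over point estimates.
f(x, y, z) = y - x            # scalar output
g(x, y, z) = [y - x, z - y]   # vector output

point_estimates = [1.0, 2.0, 3.5]
f(point_estimates...)  # splat the vector into the arguments -> 1.0
g(point_estimates...)  # -> [1.0, 1.5]
```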

@gdalle (Author) commented Jul 30, 2024

DI takes most of its inspiration from AbstractDifferentiation, while learning from its few design shortcomings. The main improvements with respect to AbstractDifferentiation are the breadth of coverage (a dozen supported backends), the caching mechanism and support for mutation, the solid testing and benchmarking infrastructure, as well as the reliance on ADTypes for backend specification. The main limitation of DI right now is that it only supports a single argument, but that should be fine for your use case if we put the estimates into a vector.

DI is already being adopted by the SciML ecosystem, and aims to become a central component of the Julia package ecosystem. Part of this involves me spontaneously asking users (like yourself) what they need 😉

Is there a way to know a priori whether f outputs a number or a vector? In DI, the relevant operators have different names: gradient and Jacobian, respectively.
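As a sketch of what that distinction looks like on the DifferentiationInterface side (the gradient/jacobian entry points and the AutoForwardDiff backend type are assumed from the DI and ADTypes documentation, with ForwardDiff chosen here only as an example backend):

```julia
using DifferentiationInterface
using ADTypes: AutoForwardDiff
import ForwardDiff  # provides the implementation behind AutoForwardDiff()

backend = AutoForwardDiff()

# Scalar output -> gradient
f(v) = v[2] - v[1]
gradient(f, backend, [1.0, 2.0, 3.5])  # ≈ [-1.0, 1.0, 0.0]

# Vector output -> Jacobian
g(v) = [v[2] - v[1], v[3] - v[2]]
jacobian(g, backend, [1.0, 2.0, 3.5])  # 2×3 matrix
```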

@olivierlabayle (Member) commented:

Thanks, that looks very promising, and I'm looking forward to integrating DifferentiationInterface.

The problem I see at the moment with single-argument functions is that it would be breaking, since users would have to define f as a single-argument function. Most of the functions f provided by users are pretty simple (e.g. f(x, y, z) = [y - x, z - y]) and the Jacobian could easily be computed by hand. The interface is more a convenience than a necessity, and changing it would make the function less readable and less easy to write for a user.

Unfortunately, I don't know of an easy way to look into f's output; it is a priori completely unknown to this package.

So I'd say supporting multiple-argument functions and a seamless way to compute a Jacobian or gradient would be ideal for the change :-)

@gdalle (Author) commented Jul 31, 2024

Multiple-argument support is not really an issue, because we can always collect these arguments into a Vector before differentiating.
The uncertainty about the output type of f is more problematic, because the gradient and the Jacobian are fundamentally different objects (even though in the scalar-output case one is the transpose of the other). What do those two situations correspond to in your package?
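One way to picture the "collect into a Vector" idea, sketched here under the assumption that the wrapper name f_vec is free to choose: keep the multi-argument f for users and differentiate a single-argument wrapper over the collected vector.

```julia
using DifferentiationInterface
using ADTypes: AutoForwardDiff
import ForwardDiff

f(x, y, z) = [y - x, z - y]   # user-facing, multi-argument
f_vec(v) = f(v...)            # single-argument wrapper over a vector

jacobian(f_vec, AutoForwardDiff(), [1.0, 2.0, 3.5])  # 2×3 Jacobian
```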

@olivierlabayle (Member) commented:

I'm sorry, I'm not sure I understand the question, but basically the purpose of this package is to estimate statistical quantities that can be multidimensional, e.g. [x, y, z]. Once this is done, further quantities can be of interest, e.g. differences. One could be interested in only one difference, f(x, y, z) = y - z, or in all of them, f(x, y, z) = [y - x, z - y]. The function f is really specified dynamically as a post-analysis step, and even though we could force the user to return a vector, f(x, y, z) = [y - z], this is not very natural. Does that make sense?

@gdalle (Author) commented Aug 1, 2024

I think I have a way to handle this, which is to turn everything into a vector under the hood and always compute a Jacobian. Will update the PR accordingly.
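A possible shape for that idea, given here only as a sketch and not as the actual PR code (the helper names as_vector and estimate_jacobian are hypothetical): wrap f so that a scalar output becomes a one-element vector, and let a single code path compute the Jacobian in both cases.

```julia
using DifferentiationInterface
using ADTypes: AutoForwardDiff
import ForwardDiff

as_vector(y::Real) = [y]          # scalar output -> 1-element vector
as_vector(y::AbstractVector) = y  # vector output passes through unchanged

function estimate_jacobian(f, point_estimates; backend=AutoForwardDiff())
    f_vec(v) = as_vector(f(v...))  # splat the estimates, vectorize the output
    return jacobian(f_vec, backend, point_estimates)
end

estimate_jacobian((x, y, z) -> y - z, [1.0, 2.0, 3.5])           # 1×3 Jacobian
estimate_jacobian((x, y, z) -> [y - x, z - y], [1.0, 2.0, 3.5])  # 2×3 Jacobian
```

With such a wrapper, the user-facing convention (multi-argument f, scalar or vector output) stays unchanged while the differentiation backend only ever sees a vector-to-vector function.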
