BFGS with nalgebra Vector2 #337

amfaber · 2023-03-12T13:22:53Z

Hi, amazing crate! As someone looking to get further into numerical optimization, just the fact that all of these algorithms are gathered in one place, with trait bounds explaining what each algorithm needs is just amazing, so thanks for that :)

I've been trying to get the BFGS example to work on my system and have rewritten it in an attempt to have it work on nalgebra's Vector2 (importantly not DVector, as I would like the performance benefits of stack allocating my params). This is my current code

use argmin::{
    core::{CostFunction, Executor, Error, Gradient},
    solver::{
        neldermead::NelderMead,
        quasinewton::BFGS,
        linesearch::MoreThuenteLineSearch,
    },
};
use argmin_testfunctions::{rosenbrock_2d, rosenbrock_2d_derivative};
use nalgebra::{Vector2, Matrix2};

struct Rosenbrock{
    a: f64,
    b: f64,
}

impl CostFunction for Rosenbrock{
    type Param = Vector2<f64>;

    type Output = f64;

    fn cost(&self, param: &Self::Param) -> Result<Self::Output, Error> {
        Ok(rosenbrock_2d(param.as_slice(), self.a, self.b))
    }
}

impl Gradient for Rosenbrock{
    type Param = Vector2<f64>;

    type Gradient = Vector2<f64>;

    fn gradient(&self, param: &Self::Param) -> Result<Self::Gradient, Error> {
        Ok(Vector2::from_iterator(rosenbrock_2d_derivative(param.as_slice(), self.a, self.b).into_iter()))
    }
}

fn bfgs(){
    let cost = Rosenbrock{ a: 1.0, b: 100.0 };
    let line = MoreThuenteLineSearch::new().with_c(1e-4, 0.9).unwrap();
    let solver = BFGS::new(line);
    let init_param = Vector2::new(1.2, 1.0);
    let init_hess = Matrix2::<f32>::identity();
    let res = Executor::new(cost, solver)
        .configure(|state| 
            state
                .param(init_param)
                .inv_hess(init_hess)
                .max_iters(60))
        .run()
        .unwrap();
}

However, I'm getting the following compiler error

error[E0277]: the trait bound `f64: argmin_math::ArgminEye` is not satisfied
  --> gauss_fitting\src\main.rs:58:35
   |
58 |     let res = Executor::new(cost, solver)
   |               -------------       ^^^^^^ the trait `argmin_math::ArgminEye` is not implemented for `f64`
   |               |
   |               required by a bound introduced by this call
   |
   = help: the following other types implement trait `argmin_math::ArgminEye`:
             Matrix<N, R, C, <DefaultAllocator as nalgebra::allocator::Allocator<N, R, C>>::Buffer>
             Vec<Vec<f32>>
             Vec<Vec<f64>>
             Vec<Vec<i16>>
             Vec<Vec<i32>>
             Vec<Vec<i64>>
             Vec<Vec<i8>>
             Vec<Vec<isize>>
           and 17 others
   = note: required for `BFGS<MoreThuenteLineSearch<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, f64>, f64>` to implement `Solver<Rosenbrock, IterState<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, (), f64, f64>>`
note: required by a bound in `Executor::<O, S, I>::new`
  --> C:\Users\andre\.cargo\registry\src\github.com-1ecc6299db9ec823\argmin-0.8.1\src\core\executor.rs:38:8
   |
38 |     S: Solver<O, I>,
   |        ^^^^^^^^^^^^ required by this bound in `Executor::<O, S, I>::new`

error[E0599]: the method `configure` exists for struct `Executor<Rosenbrock, BFGS<MoreThuenteLineSearch<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, f64>, f64>, IterState<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, (), f64, f64>>`, but its trait bounds were not satisfied
  --> gauss_fitting\src\main.rs:59:10
   |
59 |         .configure(|state|
   |          ^^^^^^^^^ method cannot be called due to unsatisfied trait bounds
   |
  ::: C:\Users\andre\.cargo\registry\src\github.com-1ecc6299db9ec823\argmin-0.8.1\src\solver\quasinewton\bfgs.rs:51:1
   |
51 | pub struct BFGS<L, F> {
   | --------------------- doesn't satisfy `_: Solver<Rosenbrock, IterState<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, (), f64, f64>>`
   |
   = note: the following trait bounds were not satisfied:
           `BFGS<MoreThuenteLineSearch<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, f64>, f64>: Solver<Rosenbrock, IterState<Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, Matrix<f64, Const<2>, Const<1>, ArrayStorage<f64, 2, 1>>, (), f64, f64>>`

It's unclear to me why ArgminEye should be required for f64 in this case? What can I do to get BFGS to work with the statically sized types of ndalgebra for that sweet, sweet stack allocation?

Once again, thanks for the amazing work and the time to help me with my issue :)

The text was updated successfully, but these errors were encountered:

stefan-k · 2023-03-14T08:44:03Z

Hi,

apologies for the late reply! Thanks for the kind words, I am always happy to hear when people find argmin useful :)

Admittedly, I'm not an expert on nalgebra. Most work on the nalgebra backend was done by collaborators, but from a first glance most traits were implemented for the seemingly most general Matrix type, therefore SVector and SMatrix should be covered. If not, I consider this a bug. Some are implemented for OMatrix and I'm unsure if that covers the statically allocated arrays as well.

The error message you get does seem a bit strange though. I'm not sure if this is related, but there are some inconsistencies regarding the use of f32 and f64 which may cause problems, in particular:

    type Param = Vector2<f64>;
    type Output = f64;

and

    let init_hess = Matrix2::<f32>::identity();

It would be interesting to see if changing the latter to Matrix2::<f64>::identity() has an effect on the error message.

EDIT:
Also, this should be inv_hessian(init_hess):

                .inv_hess(init_hess)

amfaber · 2023-03-15T07:34:50Z

Thanks for that, hadn't spotted those mistakes from messing around trying some different things. Unfortunately, the issue arises before the hessian is even checked for trait bounds. I can comment out the lines defining init_hess and .inv_hessian(...) and the compiler error still persists. Actually the error still shows if I just shorten the last line to

let res = Executor::new(cost, solver);

stefan-k · 2023-03-15T20:59:27Z

I did some detective work but just ended up being confused.

The type of the Hessian is inferred by whatever one passes into inv_hessian of IterState (inside the closure passed to configure). However, when Vector2 is used, the Hessian type H is for some reason inferred as f64, even without calling configure (as you stated before). I wasn't able to figure out why yet.

Interestingly, I don't even get it to work with DVector, but I think that's a different and probably unrelated problem.

I will have to do a more in-depth investigation of this, but I'm afraid that this may take a while because my time currently is pretty limited. I will do my best though. If you or someone else wants to have a go at this, feel free to do so.

Thanks for reporting this issue! :)

amfaber · 2023-03-15T21:11:14Z

It did feel like something was fundamentally off. Thanks for looking into it! My problem is actually just least squares fitting, so I'll probably end up going with a Gauss-Newton like method. Although I have an awful lot of spots to fit, so I might start looking into implementing one of the algorithms on a GPU with wgpu instead. Once again thanks for the very readable source files :))

itrumper · 2023-05-09T01:03:09Z

To possibly add to the confusion (sorry!) I am using na::Vector3<f64> as my parameter vector and everything is happy.

Typically, when I've had trait bound issues with nalgebra, its due to conflicting versions. You need to be sure to match the argmin-math dependency version with your crate's dependency version. What version of nalgebra do you have in your Cargo.toml?

For example, I have:

[dependencies]
nalgebra = { version = "0.30" }
argmin = { version = "0.8" }
argmin-math = { version = "0.3", features = ["nalgebra_v0_30-serde"] }

Hope this help!

stefan-k · 2024-01-26T11:02:18Z

I am running into the same problem with Vector3. I think I have boiled it down to this constraint in the Solver impl of BFGS:

    P: /* ... */ + ArgminDot<G, H> + ArgminDot<P, H>,

In the nalgebra case, ArgminDot is implemented for the case type(<vec1, vec2>) = scalar, here:

https://github.com/argmin-rs/argmin/blob/e9bebb21d99d2ccad1c36d7373e7f4f53eec1539/crates/argmin-math/src/nalgebra_m/dot.rs#L41C1-L53C2

and type(<vec1, vec2>) = matrix{dim1 = dim1(vec1), dim2 = dim2(vec2)}:

https://github.com/argmin-rs/argmin/blob/e9bebb21d99d2ccad1c36d7373e7f4f53eec1539/crates/argmin-math/src/nalgebra_m/dot.rs#L69C1-L86C2

The latter is the one that should be used, but if both vectors are either row- or column-vectors, it resorts to the former. In the ndarray impl, it just assumes that the first is a row- and the second a column-vector when a matrix is requested as output.

I'm surprised it works for DVector though.

I'm not that familiar with nalgebra unfortunately. Could anyone with nalgebra experience help?

troiganto · 2024-02-20T16:38:52Z

In the nalgebra case, ArgminDot is implemented for the case type(<vec1, vec2>) = scalar, here:

https://github.com/argmin-rs/argmin/blob/e9bebb21d99d2ccad1c36d7373e7f4f53eec1539/crates/argmin-math/src/nalgebra_m/dot.rs#L41C1-L53C2

is this right? The signature is:

impl<...> ArgminDot<N, OMatrix<...>> for Matrix<...>

so doesn't this implement type(vec * scalar) = vec{dim = dim(vec)}, i.e. multiplication with a scalar?

stefan-k · 2024-02-20T17:42:52Z

Oh yes that's correct, I must have meant this part:

argmin/crates/argmin-math/src/nalgebra_m/dot.rs

Lines 23 to 39 in e9bebb2

    
           impl<N, R1, R2, C1, C2, SA, SB> ArgminDot<Matrix<N, R2, C2, SB>, N> for Matrix<N, R1, C1, SA> 
        
           where 
        
               N: Scalar + Zero + ClosedAdd + ClosedMul, 
        
               R1: Dim, 
        
               R2: Dim, 
        
               C1: Dim, 
        
               C2: Dim, 
        
               SA: Storage<N, R1, C1>, 
        
               SB: Storage<N, R2, C2>, 
        
               ShapeConstraint: DimEq<R1, R2> + DimEq<C1, C2>, 
        
           { 
        
               #[inline] 
        
               #[allow(clippy::only_used_in_recursion)] 
        
               fn dot(&self, other: &Matrix<N, R2, C2, SB>) -> N { 
        
                   self.dot(other) 
        
               } 
        
           }

stefan-k added the bug Something isn't working label Mar 16, 2023

stefan-k added the help wanted Extra attention is needed label Jan 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BFGS with nalgebra Vector2 #337

BFGS with nalgebra Vector2 #337

amfaber commented Mar 12, 2023

stefan-k commented Mar 14, 2023 •

edited

amfaber commented Mar 15, 2023

stefan-k commented Mar 15, 2023

amfaber commented Mar 15, 2023

itrumper commented May 9, 2023

stefan-k commented Jan 26, 2024

troiganto commented Feb 20, 2024

stefan-k commented Feb 20, 2024 •

edited

BFGS with nalgebra Vector2 #337

BFGS with nalgebra Vector2 #337

Comments

amfaber commented Mar 12, 2023

stefan-k commented Mar 14, 2023 • edited

amfaber commented Mar 15, 2023

stefan-k commented Mar 15, 2023

amfaber commented Mar 15, 2023

itrumper commented May 9, 2023

stefan-k commented Jan 26, 2024

troiganto commented Feb 20, 2024

stefan-k commented Feb 20, 2024 • edited

stefan-k commented Mar 14, 2023 •

edited

stefan-k commented Feb 20, 2024 •

edited