[WIP] Reverse Mode For Static Matrix Multiplication #1884
Conversation
@SteveBronder Yo, my hope by merging… Any advice on what I should be doing to get these things more in sync?
Hmm, and now there are all these merge conflicts! Not sure I'm doing my gits correctly.
tbh this is kind of why I wanted to do this in pieces. There are enough dramatic changes here that when a bunch of stuff at one level changes it causes a bunch of conflicts in the larger branch. I think we should focus efforts on #1915 so then we can start the Eigen var PR and then the adj_jac_apply PR.
Well, I just know I'm not gonna understand #1915 until I know how it filters up to matrices and stuff. You think there might be a difference between rebase and merge here?
Eh, I'll just try it lol.
Lemme look at this rq
Huh, seems fine, tbh just pulled it down and…
I wonder if I was merging in my local version of your branch or something? But it seems like that wouldn't have given me conflicts twice. Oh well.
¯\_(ツ)_/¯
@bbbales2 moving this convo over from #1915. I think we could do something like the below, where T1 and T2 are the input types:

```cpp
/**
 * Deduces the return type for matrix multiplication of two types
 */
template <typename T1, typename T2, typename = void>
struct mat_mul_return_type {};

// arithmetic * arithmetic is just double
template <typename T1, typename T2>
struct mat_mul_return_type<T1, T2, require_all_arithmetic_t<T1, T2>> {
  using type = double;
};

template <typename T1, typename T2>
struct mat_mul_return_type<T1, T2, require_any_eigen_t<T1, T2>> {
  using type = decltype((std::declval<T1>() * std::declval<T2>()).eval());
};

// helper alias
template <typename T1, typename T2>
using mat_mul_return_t = typename mat_mul_return_type<T1, T2>::type;

template <typename T1, typename T2>
class multiply_vari<T1, T2, require_all_var_t<T1, T2>> final
    : public op_vari<mat_mul_return_t<value_type_t<T1>, value_type_t<T2>>,
                     vari_type_t<T1>*, vari_type_t<T2>*> {
  using lhs_type = vari_type_t<T1>;
  using rhs_type = vari_type_t<T2>;
  using return_t = mat_mul_return_t<value_type_t<T1>, value_type_t<T2>>;
  using op_vari<return_t, lhs_type*, rhs_type*>::avi;
  using op_vari<return_t, lhs_type*, rhs_type*>::bvi;

 public:
  multiply_vari(lhs_type* avi, rhs_type* bvi)
      : op_vari<return_t, lhs_type*, rhs_type*>(avi->val_ * bvi->val_, avi,
                                                bvi) {}

  template <typename TT1 = T1, typename TT2 = T2,
            require_all_var_vt<std::is_arithmetic, TT1, TT2>* = nullptr>
  inline void chain_impl() {
    avi()->adj_ += bvi()->val_ * this->adj_;
    bvi()->adj_ += avi()->val_ * this->adj_;
  }

  template <typename TT1 = T1, typename TT2 = T2,
            require_all_var_vt<is_eigen, TT1, TT2>* = nullptr>
  inline void chain_impl() {
    avi()->adj_ += this->adj_ * bvi()->val_.transpose();
    bvi()->adj_ += avi()->val_.transpose() * this->adj_;
  }

  void chain() {
    if (unlikely(is_any_nan(avi()->val_, bvi()->val_))) {
      fill(avi()->adj_, NOT_A_NUMBER);
      fill(bvi()->adj_, NOT_A_NUMBER);
    } else {
      chain_impl();
    }
  }
};

template <typename T1, typename T2, require_all_var_t<T1, T2>* = nullptr>
inline auto operator*(const T1& a, const T2& b) {
  using multiply_type = internal::multiply_vari<T1, T2>;
  // store the return type from multiply_vari
  using mat_return = typename multiply_type::return_t;
  return var_value<mat_return>(new multiply_type(a.vi_, b.vi_));
}
```

We could try to simplify this with something like a default template with a…
From @bbbales2's comment here, wrt:

```cpp
template <typename T1, typename T2, require_all_var_t<T1, T2>* = nullptr>
inline auto operator*(const T1& a, const T2& b) {
  using multiply_type = internal::multiply_vari<T1, T2>;
  // store the return type from multiply_vari
  using mat_return = typename multiply_type::return_type;
  return var_value<mat_return>(new multiply_type(a.vi_, b.vi_));
}
```
We can deduce the constructor's parameters from T1 and T2
Yeah, we need the boilerplate somewhere; imo it's simpler to have it in the function than in the class (though it is annoying). One thing we could do is put the type traits stuff into op_vari. Then multiply etc. could look like:

```cpp
template <typename T1, typename T2>
class multiply_vari<T1, T2, require_all_var_t<T1, T2>> final
    : public op_vari<mat_mul_return_t<T1, T2>, T1, T2> {
  using return_t = mat_mul_return_t<T1, T2>;
  using op_vari<return_t, T1, T2>::avi;
  using op_vari<return_t, T1, T2>::bvi;
  using lhs_vari = vari_type_t<T1>;
  using rhs_vari = vari_type_t<T2>;

 public:
  multiply_vari(lhs_vari* avi, rhs_vari* bvi)
      : op_vari<return_t, T1, T2>(avi->val_ * bvi->val_, avi, bvi) {}
  // yada yada
};
```

Then in…
Yeah, it would be good to have less; the Q is just where they go.
clicked close by accident
Yes, and how much of this will need to be repeated in other places.
I got something almost working here: https://github.com/stan-dev/math/tree/feature/eigen-vari-ops-vari-deduction I'm doing something dumb with inheritance though and getting an error that it doesn't understand it can use the…
^That works now. Some parts are a little icky but I'd clean that up when we are at this stage.
Summary
If all these WIPs are getting annoying I can close all the others since this one has everything in it.
This adds reverse mode matrix multiplication for static matrices. It uses a trick in the `chain()` method to call either the standard multiplication chain method or the matrix chain method. The `chain_impl()` function has a doc going over how this works. It also adds a `multiply_vari` specialization for Arith * eigen_var vs. eigen_var * Arith.
This leaks memory right now, as `op_vari` holds the matrix, which is not allocated on our stack. I've been having some trouble writing the code to allocate that memory with `op_vari`; if @t4c1 or @bbbales2 know some tuple magic to write that, it would be very appreciated! Another option is just to remove `op_vari` and template `dv_vari`, `vd_vari`, etc. to take in and allocate the memory for Eigen matrices. It's a code density vs. maintenance tradeoff. If we can find a nice solution then `op_vari` is fine, but if it's too confusing then it may be better to go back to the old op code. I have the start of the code for `op_vari` with stack-allocated memory for the Eigen matrices here.
Tests
I just wrote an informal test for now that checks whether static vs. dynamic matrices return the same adjoint calculations after calling `.grad()`.
Side Effects
Release notes
Checklist
Math issue #21
Copyright holder: Steve Bronder
The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
- the basic tests are passing
  - unit tests pass (`./runTests.py test/unit`)
  - header checks pass (`make test-headers`)
  - dependency checks pass (`make test-math-dependencies`)
  - docs build (`make doxygen`)
  - lint checks pass (`make cpplint`)
- the code is written in idiomatic C++ and changes are documented in the doxygen
- the new changes are tested