Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gpu stanmathcl [WIP] #655

Closed
wants to merge 112 commits into from

Commits on Dec 5, 2017

  1. initial commit

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    ba4dd04 View commit details
    Browse the repository at this point in the history
  2. random fixings

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    2eef1f2 View commit details
    Browse the repository at this point in the history
  3. lintr things

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    0c6efac View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b0f6dc4 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    ac0a834 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    8db822c View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    f3a300c View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    f96cb58 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    264d0bf View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    598fdb0 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    96a89ad View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    83916c7 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    3cf7021 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    af87eb3 View commit details
    Browse the repository at this point in the history
  15. Inital commit with the following changes:

    - replaced rows()*cols() with size() from Eigen, added size() to matrix_gpu
    - pulled the enums in the stan::math namespace
    - removed stan::math qualifiers when inside the namespace
    - suffixed the matrix_gpu class variables (rows_,cols_), added rows() and cols() getter function
    - suffixed the ocl class variables
    - const added to catch (cl::Error&)
    - changed the conditions for matrix dimension checks
    - added spaces after for and if, starting { to the condition line
    - added initializers for the constructor
    - returning context, queue & buffer by reference
    - change dynamicly allocated arrays to std::vector
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    f02225f View commit details
    Browse the repository at this point in the history
  16. - reading kernels from files

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    e91a870 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    618f145 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    e851141 View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    f4cb8eb View commit details
    Browse the repository at this point in the history
  20. - added nan value checks for the GPU

    - added zeros on diagonal checks for the GPU
    - copy of lower to upper triangular is made on the GPU, not sure if still needed
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    cfa79f0 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    3385baf View commit details
    Browse the repository at this point in the history
  22. Configuration menu
    Copy the full SHA
    66da004 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    103d92b View commit details
    Browse the repository at this point in the history
  24. passing cpplint

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    5c6ea69 View commit details
    Browse the repository at this point in the history
  25. Configuration menu
    Copy the full SHA
    4590b03 View commit details
    Browse the repository at this point in the history
  26. - changed all dimension checks to exsitent /prim/mat/err checks

    - added overloaded functions for check_square and check_matching_dims
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    8bd0b8e View commit details
    Browse the repository at this point in the history
  27. - added doxygen comments; only prim and rev cholesky_decompose_gpu.hp…

    …p are now missing comments
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    b3386c5 View commit details
    Browse the repository at this point in the history
  28. Configuration menu
    Copy the full SHA
    38b65c3 View commit details
    Browse the repository at this point in the history
  29. Configuration menu
    Copy the full SHA
    64e1ff4 View commit details
    Browse the repository at this point in the history
  30. - passes cpplint

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    fa56656 View commit details
    Browse the repository at this point in the history
  31. - implemented the blocked version of the rev/cholesky_decompose on th…

    …e GPU
    
    - added copy_submatrix
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    17434a9 View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    c1a8e5e View commit details
    Browse the repository at this point in the history
  33. Configuration menu
    Copy the full SHA
    a67548a View commit details
    Browse the repository at this point in the history
  34. Added OpenCL headers

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    2578605 View commit details
    Browse the repository at this point in the history
  35. Configuration menu
    Copy the full SHA
    80c41cf View commit details
    Browse the repository at this point in the history
  36. Configuration menu
    Copy the full SHA
    a7fe86a View commit details
    Browse the repository at this point in the history
  37. Configuration menu
    Copy the full SHA
    e7e8b3e View commit details
    Browse the repository at this point in the history
  38. Change OpenCL header

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    6686a4f View commit details
    Browse the repository at this point in the history
  39. opencl headers

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    760daaf View commit details
    Browse the repository at this point in the history
  40. Configuration menu
    Copy the full SHA
    6047f22 View commit details
    Browse the repository at this point in the history
  41. Configuration menu
    Copy the full SHA
    9684668 View commit details
    Browse the repository at this point in the history
  42. fix check_gpu_tests

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    5d7cfd6 View commit details
    Browse the repository at this point in the history
  43. Configuration menu
    Copy the full SHA
    22f3ded View commit details
    Browse the repository at this point in the history
  44. Configuration menu
    Copy the full SHA
    31d110a View commit details
    Browse the repository at this point in the history
  45. Configuration menu
    Copy the full SHA
    e0fafde View commit details
    Browse the repository at this point in the history
  46. Configuration menu
    Copy the full SHA
    3052592 View commit details
    Browse the repository at this point in the history
  47. - cpplint errors cleanup

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    af7fa93 View commit details
    Browse the repository at this point in the history
  48. Configuration menu
    Copy the full SHA
    0802f69 View commit details
    Browse the repository at this point in the history
  49. Configuration menu
    Copy the full SHA
    7ae8263 View commit details
    Browse the repository at this point in the history
  50. - transpose copies on output

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    4c7bb3a View commit details
    Browse the repository at this point in the history
  51. Configuration menu
    Copy the full SHA
    c43a944 View commit details
    Browse the repository at this point in the history
  52. Configuration menu
    Copy the full SHA
    8757bb5 View commit details
    Browse the repository at this point in the history
  53. - inverse copy on output

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    c4501b4 View commit details
    Browse the repository at this point in the history
  54. Configuration menu
    Copy the full SHA
    a1ff9ce View commit details
    Browse the repository at this point in the history
  55. Configuration menu
    Copy the full SHA
    6109dc8 View commit details
    Browse the repository at this point in the history
  56. Configuration menu
    Copy the full SHA
    13b8410 View commit details
    Browse the repository at this point in the history
  57. Configuration menu
    Copy the full SHA
    51f688d View commit details
    Browse the repository at this point in the history
  58. add AMD OpenCL install to travis.yml so that CPU version of OpenCL is…

    … used for testing on travis
    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    1751f7e View commit details
    Browse the repository at this point in the history
  59. Configuration menu
    Copy the full SHA
    db6f798 View commit details
    Browse the repository at this point in the history
  60. Configuration menu
    Copy the full SHA
    81f6249 View commit details
    Browse the repository at this point in the history
  61. Configuration menu
    Copy the full SHA
    df71123 View commit details
    Browse the repository at this point in the history
  62. Configuration menu
    Copy the full SHA
    153fd31 View commit details
    Browse the repository at this point in the history
  63. add POST_LDLIBS when compiling .o file, this avoids warnings in clang…

    … and allows g++ to compile
    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    c4e48f2 View commit details
    Browse the repository at this point in the history
  64. Update AMD SDK to 3.0

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    c8ca938 View commit details
    Browse the repository at this point in the history
  65. Configuration menu
    Copy the full SHA
    f0c4a61 View commit details
    Browse the repository at this point in the history
  66. - context passed by reference

    - changes to the cholesky_decompose 1st step kernel
    - CPU passes precision tests
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    2133c14 View commit details
    Browse the repository at this point in the history
  67. Configuration menu
    Copy the full SHA
    f35359b View commit details
    Browse the repository at this point in the history
  68. Configuration menu
    Copy the full SHA
    0ad3dc7 View commit details
    Browse the repository at this point in the history
  69. - default work-group size is 16x16 to support all CPUs

    - cpplint cleanup of .cl files
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    bac8c78 View commit details
    Browse the repository at this point in the history
  70. Configuration menu
    Copy the full SHA
    a93f5bb View commit details
    Browse the repository at this point in the history
  71. Configuration menu
    Copy the full SHA
    d1d9d0d View commit details
    Browse the repository at this point in the history
  72. Configuration menu
    Copy the full SHA
    80fd359 View commit details
    Browse the repository at this point in the history
  73. Configuration menu
    Copy the full SHA
    53a5ca6 View commit details
    Browse the repository at this point in the history
  74. - missing kernel size

    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    8cb7c76 View commit details
    Browse the repository at this point in the history
  75. Configuration menu
    Copy the full SHA
    f1afe40 View commit details
    Browse the repository at this point in the history
  76. - matrix multiply with matrix of size() 0 returns immediately

    - cpplint and doxygen cleanup
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    96ac7df View commit details
    Browse the repository at this point in the history
  77. - added multiply with self transposed

    - rewrote the cholesky 2nd and 3rd step using existing /prim GPU functions
    - speedup of choleskyy
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    ee2c22c View commit details
    Browse the repository at this point in the history
  78. - removed test for removed kernels

    - tests for multiply_with_self_transposed
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    b39b9b1 View commit details
    Browse the repository at this point in the history
  79. Configuration menu
    Copy the full SHA
    c4a5f78 View commit details
    Browse the repository at this point in the history
  80. Configuration menu
    Copy the full SHA
    c54ba4a View commit details
    Browse the repository at this point in the history
  81. - added check_symmetric_gpu, tests

    - set block sizes that give best overall times for different sizes
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    a7f0436 View commit details
    Browse the repository at this point in the history
  82. Configuration menu
    Copy the full SHA
    4a91a88 View commit details
    Browse the repository at this point in the history
  83. fix cpplint errors

    SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    599d11b View commit details
    Browse the repository at this point in the history
  84. - updated the GPU kernels for readability

    - removed the unused kernels
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    103a6f7 View commit details
    Browse the repository at this point in the history
  85. Configuration menu
    Copy the full SHA
    801a065 View commit details
    Browse the repository at this point in the history
  86. Configuration menu
    Copy the full SHA
    51861d4 View commit details
    Browse the repository at this point in the history
  87. Configuration menu
    Copy the full SHA
    1227672 View commit details
    Browse the repository at this point in the history
  88. - cleanup of check_opencl, does not throws on 0

    - changed matrix_gpu try/catch scopes
    - changed size checks to use size()
    - rows_, cols_ moved to private, const added
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    754ed8d View commit details
    Browse the repository at this point in the history
  89. Configuration menu
    Copy the full SHA
    1f9fbe6 View commit details
    Browse the repository at this point in the history
  90. - appropriate scopes for openCL try/catch for all added functions

    - removed _gpu form check functions
    rok-cesnovar authored and SteveBronder committed Dec 5, 2017
    Configuration menu
    Copy the full SHA
    8aca5b1 View commit details
    Browse the repository at this point in the history
  91. Configuration menu
    Copy the full SHA
    70deee5 View commit details
    Browse the repository at this point in the history
  92. Configuration menu
    Copy the full SHA
    ab33aef View commit details
    Browse the repository at this point in the history
  93. Configuration menu
    Copy the full SHA
    baf39a1 View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2017

  1. Configuration menu
    Copy the full SHA
    70ce68b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    242b823 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    77d331c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    9010fb6 View commit details
    Browse the repository at this point in the history

Commits on Dec 7, 2017

  1. added define for OpenCL expections in check_opencl. In a previous com…

    …mit I was auto changing all spaces to tabs and accidentally did that to boost. This is corrected here
    SteveBronder committed Dec 7, 2017
    Configuration menu
    Copy the full SHA
    0717822 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1cd8ab6 View commit details
    Browse the repository at this point in the history

Commits on Dec 9, 2017

  1. Configuration menu
    Copy the full SHA
    cde6ff7 View commit details
    Browse the repository at this point in the history
  2. ...

    SteveBronder committed Dec 9, 2017
    Configuration menu
    Copy the full SHA
    58e380b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    2c43e0e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    26f79dc View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    2b1a6a3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    e686c52 View commit details
    Browse the repository at this point in the history

Commits on Dec 11, 2017

  1. Configuration menu
    Copy the full SHA
    895ce30 View commit details
    Browse the repository at this point in the history
  2. ...

    SteveBronder committed Dec 11, 2017
    Configuration menu
    Copy the full SHA
    8077b41 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    54d9c19 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    eb174f2 View commit details
    Browse the repository at this point in the history

Commits on Dec 28, 2017

  1. Configuration menu
    Copy the full SHA
    fa0376b View commit details
    Browse the repository at this point in the history

Commits on Jan 6, 2018

  1. Configuration menu
    Copy the full SHA
    64e0317 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b2be485 View commit details
    Browse the repository at this point in the history