-
-
Notifications
You must be signed in to change notification settings - Fork 187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gpu stanmathcl [WIP] #655
gpu stanmathcl [WIP] #655
Commits on Dec 5, 2017
-
Configuration menu - View commit details
-
Copy full SHA for ba4dd04 - Browse repository at this point
Copy the full SHA ba4dd04View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2eef1f2 - Browse repository at this point
Copy the full SHA 2eef1f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c6efac - Browse repository at this point
Copy the full SHA 0c6efacView commit details -
Configuration menu - View commit details
-
Copy full SHA for b0f6dc4 - Browse repository at this point
Copy the full SHA b0f6dc4View commit details -
Configuration menu - View commit details
-
Copy full SHA for ac0a834 - Browse repository at this point
Copy the full SHA ac0a834View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8db822c - Browse repository at this point
Copy the full SHA 8db822cView commit details -
Configuration menu - View commit details
-
Copy full SHA for f3a300c - Browse repository at this point
Copy the full SHA f3a300cView commit details -
Configuration menu - View commit details
-
Copy full SHA for f96cb58 - Browse repository at this point
Copy the full SHA f96cb58View commit details -
Configuration menu - View commit details
-
Copy full SHA for 264d0bf - Browse repository at this point
Copy the full SHA 264d0bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 598fdb0 - Browse repository at this point
Copy the full SHA 598fdb0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 96a89ad - Browse repository at this point
Copy the full SHA 96a89adView commit details -
gpu_chol now gives back the original matrix when the size if zero. Te…
…sts still failing
Configuration menu - View commit details
-
Copy full SHA for 83916c7 - Browse repository at this point
Copy the full SHA 83916c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3cf7021 - Browse repository at this point
Copy the full SHA 3cf7021View commit details -
Configuration menu - View commit details
-
Copy full SHA for af87eb3 - Browse repository at this point
Copy the full SHA af87eb3View commit details -
Inital commit with the following changes:
- replaced rows()*cols() with size() from Eigen, added size() to matrix_gpu - pulled the enums in the stan::math namespace - removed stan::math qualifiers when inside the namespace - suffixed the matrix_gpu class variables (rows_,cols_), added rows() and cols() getter function - suffixed the ocl class variables - const added to catch (cl::Error&) - changed the conditions for matrix dimension checks - added spaces after for and if, starting { to the condition line - added initializers for the constructor - returning context, queue & buffer by reference - change dynamicly allocated arrays to std::vector
Configuration menu - View commit details
-
Copy full SHA for f02225f - Browse repository at this point
Copy the full SHA f02225fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e91a870 - Browse repository at this point
Copy the full SHA e91a870View commit details -
Configuration menu - View commit details
-
Copy full SHA for 618f145 - Browse repository at this point
Copy the full SHA 618f145View commit details -
Configuration menu - View commit details
-
Copy full SHA for e851141 - Browse repository at this point
Copy the full SHA e851141View commit details -
Configuration menu - View commit details
-
Copy full SHA for f4cb8eb - Browse repository at this point
Copy the full SHA f4cb8ebView commit details -
- added nan value checks for the GPU
- added zeros on diagonal checks for the GPU - copy of lower to upper triangular is made on the GPU, not sure if still needed
Configuration menu - View commit details
-
Copy full SHA for cfa79f0 - Browse repository at this point
Copy the full SHA cfa79f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3385baf - Browse repository at this point
Copy the full SHA 3385bafView commit details -
Configuration menu - View commit details
-
Copy full SHA for 66da004 - Browse repository at this point
Copy the full SHA 66da004View commit details -
Configuration menu - View commit details
-
Copy full SHA for 103d92b - Browse repository at this point
Copy the full SHA 103d92bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c6ea69 - Browse repository at this point
Copy the full SHA 5c6ea69View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4590b03 - Browse repository at this point
Copy the full SHA 4590b03View commit details -
- changed all dimension checks to exsitent /prim/mat/err checks
- added overloaded functions for check_square and check_matching_dims
Configuration menu - View commit details
-
Copy full SHA for 8bd0b8e - Browse repository at this point
Copy the full SHA 8bd0b8eView commit details -
- added doxygen comments; only prim and rev cholesky_decompose_gpu.hp…
…p are now missing comments
Configuration menu - View commit details
-
Copy full SHA for b3386c5 - Browse repository at this point
Copy the full SHA b3386c5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 38b65c3 - Browse repository at this point
Copy the full SHA 38b65c3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 64e1ff4 - Browse repository at this point
Copy the full SHA 64e1ff4View commit details -
Configuration menu - View commit details
-
Copy full SHA for fa56656 - Browse repository at this point
Copy the full SHA fa56656View commit details -
- implemented the blocked version of the rev/cholesky_decompose on th…
…e GPU - added copy_submatrix
Configuration menu - View commit details
-
Copy full SHA for 17434a9 - Browse repository at this point
Copy the full SHA 17434a9View commit details -
Configuration menu - View commit details
-
Copy full SHA for c1a8e5e - Browse repository at this point
Copy the full SHA c1a8e5eView commit details -
Configuration menu - View commit details
-
Copy full SHA for a67548a - Browse repository at this point
Copy the full SHA a67548aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2578605 - Browse repository at this point
Copy the full SHA 2578605View commit details -
Configuration menu - View commit details
-
Copy full SHA for 80c41cf - Browse repository at this point
Copy the full SHA 80c41cfView commit details -
Configuration menu - View commit details
-
Copy full SHA for a7fe86a - Browse repository at this point
Copy the full SHA a7fe86aView commit details -
Configuration menu - View commit details
-
Copy full SHA for e7e8b3e - Browse repository at this point
Copy the full SHA e7e8b3eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6686a4f - Browse repository at this point
Copy the full SHA 6686a4fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 760daaf - Browse repository at this point
Copy the full SHA 760daafView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6047f22 - Browse repository at this point
Copy the full SHA 6047f22View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9684668 - Browse repository at this point
Copy the full SHA 9684668View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d7cfd6 - Browse repository at this point
Copy the full SHA 5d7cfd6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 22f3ded - Browse repository at this point
Copy the full SHA 22f3dedView commit details -
Configuration menu - View commit details
-
Copy full SHA for 31d110a - Browse repository at this point
Copy the full SHA 31d110aView commit details -
Configuration menu - View commit details
-
Copy full SHA for e0fafde - Browse repository at this point
Copy the full SHA e0fafdeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3052592 - Browse repository at this point
Copy the full SHA 3052592View commit details -
Configuration menu - View commit details
-
Copy full SHA for af7fa93 - Browse repository at this point
Copy the full SHA af7fa93View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0802f69 - Browse repository at this point
Copy the full SHA 0802f69View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ae8263 - Browse repository at this point
Copy the full SHA 7ae8263View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4c7bb3a - Browse repository at this point
Copy the full SHA 4c7bb3aView commit details -
Configuration menu - View commit details
-
Copy full SHA for c43a944 - Browse repository at this point
Copy the full SHA c43a944View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8757bb5 - Browse repository at this point
Copy the full SHA 8757bb5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c4501b4 - Browse repository at this point
Copy the full SHA c4501b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for a1ff9ce - Browse repository at this point
Copy the full SHA a1ff9ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6109dc8 - Browse repository at this point
Copy the full SHA 6109dc8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 13b8410 - Browse repository at this point
Copy the full SHA 13b8410View commit details -
update check_nan tests. We have nothing to test infinite values which…
… we probably should.
Configuration menu - View commit details
-
Copy full SHA for 51f688d - Browse repository at this point
Copy the full SHA 51f688dView commit details -
add AMD OpenCL install to travis.yml so that CPU version of OpenCL is…
… used for testing on travis
Configuration menu - View commit details
-
Copy full SHA for 1751f7e - Browse repository at this point
Copy the full SHA 1751f7eView commit details -
Configuration menu - View commit details
-
Copy full SHA for db6f798 - Browse repository at this point
Copy the full SHA db6f798View commit details -
Configuration menu - View commit details
-
Copy full SHA for 81f6249 - Browse repository at this point
Copy the full SHA 81f6249View commit details -
Configuration menu - View commit details
-
Copy full SHA for df71123 - Browse repository at this point
Copy the full SHA df71123View commit details -
Configuration menu - View commit details
-
Copy full SHA for 153fd31 - Browse repository at this point
Copy the full SHA 153fd31View commit details -
add POST_LDLIBS when compiling .o file, this avoids warnings in clang…
… and allows g++ to compile
Configuration menu - View commit details
-
Copy full SHA for c4e48f2 - Browse repository at this point
Copy the full SHA c4e48f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for c8ca938 - Browse repository at this point
Copy the full SHA c8ca938View commit details -
Configuration menu - View commit details
-
Copy full SHA for f0c4a61 - Browse repository at this point
Copy the full SHA f0c4a61View commit details -
- changes to the cholesky_decompose 1st step kernel - CPU passes precision tests
Configuration menu - View commit details
-
Copy full SHA for 2133c14 - Browse repository at this point
Copy the full SHA 2133c14View commit details -
Configuration menu - View commit details
-
Copy full SHA for f35359b - Browse repository at this point
Copy the full SHA f35359bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0ad3dc7 - Browse repository at this point
Copy the full SHA 0ad3dc7View commit details -
- default work-group size is 16x16 to support all CPUs
- cpplint cleanup of .cl files
Configuration menu - View commit details
-
Copy full SHA for bac8c78 - Browse repository at this point
Copy the full SHA bac8c78View commit details -
Configuration menu - View commit details
-
Copy full SHA for a93f5bb - Browse repository at this point
Copy the full SHA a93f5bbView commit details -
Configuration menu - View commit details
-
Copy full SHA for d1d9d0d - Browse repository at this point
Copy the full SHA d1d9d0dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 80fd359 - Browse repository at this point
Copy the full SHA 80fd359View commit details -
Configuration menu - View commit details
-
Copy full SHA for 53a5ca6 - Browse repository at this point
Copy the full SHA 53a5ca6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8cb7c76 - Browse repository at this point
Copy the full SHA 8cb7c76View commit details -
Configuration menu - View commit details
-
Copy full SHA for f1afe40 - Browse repository at this point
Copy the full SHA f1afe40View commit details -
- matrix multiply with matrix of size() 0 returns immediately
- cpplint and doxygen cleanup
Configuration menu - View commit details
-
Copy full SHA for 96ac7df - Browse repository at this point
Copy the full SHA 96ac7dfView commit details -
- added multiply with self transposed
- rewrote the cholesky 2nd and 3rd step using existing /prim GPU functions - speedup of choleskyy
Configuration menu - View commit details
-
Copy full SHA for ee2c22c - Browse repository at this point
Copy the full SHA ee2c22cView commit details -
- removed test for removed kernels
- tests for multiply_with_self_transposed
Configuration menu - View commit details
-
Copy full SHA for b39b9b1 - Browse repository at this point
Copy the full SHA b39b9b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for c4a5f78 - Browse repository at this point
Copy the full SHA c4a5f78View commit details -
Configuration menu - View commit details
-
Copy full SHA for c54ba4a - Browse repository at this point
Copy the full SHA c54ba4aView commit details -
- added check_symmetric_gpu, tests
- set block sizes that give best overall times for different sizes
Configuration menu - View commit details
-
Copy full SHA for a7f0436 - Browse repository at this point
Copy the full SHA a7f0436View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a91a88 - Browse repository at this point
Copy the full SHA 4a91a88View commit details -
Configuration menu - View commit details
-
Copy full SHA for 599d11b - Browse repository at this point
Copy the full SHA 599d11bView commit details -
- updated the GPU kernels for readability
- removed the unused kernels
Configuration menu - View commit details
-
Copy full SHA for 103a6f7 - Browse repository at this point
Copy the full SHA 103a6f7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 801a065 - Browse repository at this point
Copy the full SHA 801a065View commit details -
Configuration menu - View commit details
-
Copy full SHA for 51861d4 - Browse repository at this point
Copy the full SHA 51861d4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1227672 - Browse repository at this point
Copy the full SHA 1227672View commit details -
- cleanup of check_opencl, does not throws on 0
- changed matrix_gpu try/catch scopes - changed size checks to use size() - rows_, cols_ moved to private, const added
Configuration menu - View commit details
-
Copy full SHA for 754ed8d - Browse repository at this point
Copy the full SHA 754ed8dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1f9fbe6 - Browse repository at this point
Copy the full SHA 1f9fbe6View commit details -
- appropriate scopes for openCL try/catch for all added functions
- removed _gpu form check functions
Configuration menu - View commit details
-
Copy full SHA for 8aca5b1 - Browse repository at this point
Copy the full SHA 8aca5b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 70deee5 - Browse repository at this point
Copy the full SHA 70deee5View commit details -
Configuration menu - View commit details
-
Copy full SHA for ab33aef - Browse repository at this point
Copy the full SHA ab33aefView commit details -
Configuration menu - View commit details
-
Copy full SHA for baf39a1 - Browse repository at this point
Copy the full SHA baf39a1View commit details
Commits on Dec 6, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 70ce68b - Browse repository at this point
Copy the full SHA 70ce68bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 242b823 - Browse repository at this point
Copy the full SHA 242b823View commit details -
Configuration menu - View commit details
-
Copy full SHA for 77d331c - Browse repository at this point
Copy the full SHA 77d331cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9010fb6 - Browse repository at this point
Copy the full SHA 9010fb6View commit details
Commits on Dec 7, 2017
-
added define for OpenCL expections in check_opencl. In a previous com…
…mit I was auto changing all spaces to tabs and accidentally did that to boost. This is corrected here
Configuration menu - View commit details
-
Copy full SHA for 0717822 - Browse repository at this point
Copy the full SHA 0717822View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1cd8ab6 - Browse repository at this point
Copy the full SHA 1cd8ab6View commit details
Commits on Dec 9, 2017
-
Configuration menu - View commit details
-
Copy full SHA for cde6ff7 - Browse repository at this point
Copy the full SHA cde6ff7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 58e380b - Browse repository at this point
Copy the full SHA 58e380bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c43e0e - Browse repository at this point
Copy the full SHA 2c43e0eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 26f79dc - Browse repository at this point
Copy the full SHA 26f79dcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b1a6a3 - Browse repository at this point
Copy the full SHA 2b1a6a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for e686c52 - Browse repository at this point
Copy the full SHA e686c52View commit details
Commits on Dec 11, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 895ce30 - Browse repository at this point
Copy the full SHA 895ce30View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8077b41 - Browse repository at this point
Copy the full SHA 8077b41View commit details -
Configuration menu - View commit details
-
Copy full SHA for 54d9c19 - Browse repository at this point
Copy the full SHA 54d9c19View commit details -
Configuration menu - View commit details
-
Copy full SHA for eb174f2 - Browse repository at this point
Copy the full SHA eb174f2View commit details
Commits on Dec 28, 2017
-
Configuration menu - View commit details
-
Copy full SHA for fa0376b - Browse repository at this point
Copy the full SHA fa0376bView commit details
Commits on Jan 6, 2018
-
Configuration menu - View commit details
-
Copy full SHA for 64e0317 - Browse repository at this point
Copy the full SHA 64e0317View commit details -
Configuration menu - View commit details
-
Copy full SHA for b2be485 - Browse repository at this point
Copy the full SHA b2be485View commit details