-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Probably a problem with single threaded dpotri #410
Comments
Hi, I am able to reproduce the error and will try to provide a fix soon. Thanks Werner On 10.07.2014 10:02, Andreas Noack Jensen wrote:
|
@andreasnoackjensen , as a fast workaround, I disabled the optimized potri functions. You can test this by using the code form my repository: https://github.com/wernsaar/OpenBLAS.git Best regards |
Just tried it and it solves the problem. Thanks. I had OpenBLAS 0.2.8 libraries and tried them as well and they don't have the problem. |
Hi, after v0.2.8, we updated Lapack to version 3.5.0, which provided a lot Best regards Werner On 10.07.2014 15:24, Andreas Noack Jensen wrote:
|
@xianyi Julia 0.3-rc1 is now released. Could we have the next openblas release candidate with this fix, and a few of the other recent fixes that all look like they are good to have? We will include this in our next release candidate. If no more bugs are found, perhaps we can release julia 0.3 and openblas 0.2.10 simultaneously. Thanks. |
@ViralBShah , we will release 0.2.10 version today. In 3 or 4 weeks, we will release 0.2.11 version. I think openblas 0.2.11 can match julia 0.3. |
Ok - that would work too, if required. |
I checked the performance of lapack native ?potri with OpenBLAS against MKL. About 85% of the cycles are spent in the gemm kernel, OpenBLAS is faster than MKL. There is no need, to provide custom ?potri functions. Werner |
At the Julia user list a user reported a problem that appears to be related to the single threaded
dpotri
. Consider the following programWith this I get
My machine is a mid 2009 macbook pro and OpenBLAS is compiled with 64 bit integer support and automatic core detection.
The text was updated successfully, but these errors were encountered: