Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Complie HPL 2.2 failed with OpenBLAS Develop #909

Closed
JohnCenteno opened this issue Jun 9, 2016 · 41 comments
Closed

Complie HPL 2.2 failed with OpenBLAS Develop #909

JohnCenteno opened this issue Jun 9, 2016 · 41 comments

Comments

@JohnCenteno
Copy link

Dear Xianyi

Do you know why I complie HPL 2.2 failed with OpenBLAS dev, but it worked with GotoBLAS2

/root/OpenBLAS/lib/libopenblas.a(memory.o): In function openblas_fork_handler': memory.c:(.text+0x300): undefined reference topthread_atfork'
collect2: ld returned 1 exit status
make[2]: *** [dexe.grd] Error 1
make[2]: Leaving directory /root/hpl-2.2/testing/ptest/Linux_PII_CBLAS' make[1]: *** [build_tst] Error 2 make[1]: Leaving directory/root/hpl-2.2'
make: *** [build] Error 2
[root@localhost hpl-2.2]#

Thanks.
John Centeno

@brada4
Copy link
Contributor

brada4 commented Jun 10, 2016

You need to add -pthread to linker lines to allow this symbol. If you may call BLAS from concurrent threads it must be USE_OPENMP=1 instead. Other option is to build dynamic OpenBLAS so that it imports -lgomp or -lpthread by itself. Please tell how it goes

@JohnCenteno
Copy link
Author

Dear Adrew,

Thanks a lot. I am not good at Linux, so I am not sure how to add -pthread to linker lines" or how to imports lgomp or -lpthread

With Best Regards.
John Centos

@JohnCenteno
Copy link
Author

Dear Andrews,

I also tried "gmake USE_OPENMP=1" & gmake PREFIX=/root/OpenBLAS
vi .bash_profile
export PATH=/root/mpich/bin:$PATH
export OPENBLAS_NUM_THREADS=1

With Best Regards.
John Centeno

@brada4
Copy link
Contributor

brada4 commented Jun 14, 2016

Please catch FULL OUTPUT of build and attach as a text file (DO NOT PASTE IN COMMENT FORM)

@karanchhabra
Copy link

Hi,

I'm facing problems in compiling HPL for testing the GPU on my system but while compiling I'm facing the following error. I've tried every possible solution available on goolge but can't resolve it.

mpicc -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops -DAdd__ -DF77_INTEGER=int -DStringSunStyle -DCUDA -I/root/hpl/hpl-2.0_FERMI_v15_latest/include -I/root/hpl/hpl-2.0_FERMI_v15_latest/include/CUDA -I/root/hpl/mpich/include -I/usr/local/cuda-9.1/include -o /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA/xhpl HPL_pddriver.o HPL_pdinfo.o HPL_pdtest.o /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a -L /root/hpl/hpl-2.0_FERMI_v15_latest/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L/root/hpl/OpenBLAS-0.2.20/libopenblas.so -lpthread /root/hpl/mpich/lib/libmpi.so
/root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_idamax.o): In function HPL_idamax': HPL_idamax.c:(.text+0x1a): undefined reference to idamax_'
/root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dgemv.o): In function HPL_dgemv': HPL_dgemv.c:(.text+0xba): undefined reference to dgemv_'
HPL_dgemv.c:(.text+0x136): undefined reference to dgemv_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dcopy.o): In function HPL_dcopy':
HPL_dcopy.c:(.text+0x1e): undefined reference to dcopy_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_daxpy.o): In function HPL_daxpy':
HPL_daxpy.c:(.text+0x2f): undefined reference to daxpy_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dscal.o): In function HPL_dscal':
HPL_dscal.c:(.text+0x22): undefined reference to dscal_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dtrsv.o): In function HPL_dtrsv':
HPL_dtrsv.c:(.text+0xc0): undefined reference to dtrsv_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dger.o): In function HPL_dger':
HPL_dger.c:(.text+0x73): undefined reference to dger_' HPL_dger.c:(.text+0xa1): undefined reference to dger_'
collect2: error: ld returned 1 exit status
make[2]: *** [dexe.grd] Error 1
make[2]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA' make[1]: *** [build_tst] Error 2 make[1]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest'
make: *** [build] Error 2

Kindly help.
Thanks,
Karan Chhabra

@martin-frbg
Copy link
Collaborator

Looks like you have an error on that long mpicc command line - where you have "-L/root/OpenBLAS-0.2.20/libopenblas.so" it should probably be "-L/root/OpenBLAS-0.2.20 -lopenblas"

@karanchhabra
Copy link

libopenblas.so is the library which has to be mentioned in there in order for hpl to compile.

Followed your suggesstion and got following error:
mpicc -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops -DAdd__ -DF77_INTEGER=int -DStringSunStyle -DCUDA -I/root/hpl/hpl-2.0_FERMI_v15_latest/include -I/root/hpl/hpl-2.0_FERMI_v15_latest/include/CUDA -I/root/hpl/mpich/include -I/usr/local/cuda-9.1/include -o /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA/xhpl HPL_pddriver.o HPL_pdinfo.o HPL_pdtest.o /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a -L /root/hpl/hpl-2.0_FERMI_v15_latest/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L/root/hpl/OpenBLAS-0.2.20/lib -lopenblas -lpthread /root/hpl/mpich/lib/libmpi.so
/bin/ld: cannot find -lopenblas
collect2: error: ld returned 1 exit status
make[2]: *** [dexe.grd] Error 1
make[2]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA' make[1]: *** [build_tst] Error 2 make[1]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest'
make: *** [build] Error 2

@martin-frbg
Copy link
Collaborator

Well, are you certain you have it in /root/hpl/OpenBLAS-0.2.20/lib ? Normally this gets built directly under OpenBLAS-0.2.20. The "-L" tells the compiler/linker where to search, the -l tells it what library - and you leave off the "lib" prefix of the name on Unix/Linux.

@karanchhabra
Copy link

karanchhabra commented Mar 9, 2018

I'm stuck on this since last 2 weeks and can't find any solution on web.

Versions installed:

  1. mpich-3.2.1
  2. OpenBLAS-0.2.20
  3. hpl-2.0_FERMI_v15_latest

It is running perfectly fine for testing CPUs but in case of benchmarking GPUs i have to use the customized version of the benchmark provided by nvidia.

I did install all the pre requisites for this:

  1. nvidia driver.
  2. cuda installation toolkit.
    now i'm not able to compile hpl using the version provided by NVIDIA.

My Make.CUDA file is as follows:


  • shell --------------------------------------------------------------

SHELL = /bin/sh

CD = cd
CP = cp
LN_S = ln -fs
MKDIR = mkdir -p
RM = /bin/rm -f
TOUCH = touch


  • Platform identifier ------------------------------------------------

ARCH = CUDA


  • HPL Directory Structure / HPL library ------------------------------

TOPdir =/root/hpl/hpl-2.0_FERMI_v15_latest
endif
INCdir = $(TOPdir)/include
BINdir = $(TOPdir)/bin/$(ARCH)
LIBdir = $(TOPdir)/lib/$(ARCH)

HPLlib = $(LIBdir)/libhpl.a


  • Message Passing library (MPI) --------------------------------------

MPinc tells the C compiler where to find the Message Passing library
header files, MPlib is defined to be the name of the library to be
used. The variable MPdir is only used for defining MPinc and MPlib.

MPdir = /root/hpl/mpich
MPinc = -I$(MPdir)/include
MPlib = $(MPdir)/lib/libmpi.so


  • Linear Algebra library (BLAS) -----------------------------

LAinc tells the C compiler where to find the Linear Algebra library
header files, LAlib is defined to be the name of the library to be
used. The variable LAdir is only used for defining LAinc and LAlib.

LAdir = /root/hpl/OpenBLAS-0.2.20
LAinc =
LAlib = -L $(TOPdir)/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L$(LAdir)/libopenblas.so -lpthread

F2CDEFS = -DAdd__ -DF77_INTEGER=int -DStringSunStyle


  • HPL includes / libraries / specifics -------------------------------

HPL_INCLUDES = -I$(INCdir) -I$(INCdir)/$(ARCH) $(LAinc) $(MPinc) -I/usr/local/cuda-9.1/include
HPL_LIBS = $(HPLlib) $(LAlib) $(MPlib)

  • Compile time options -----------------------------------------------

-DHPL_COPY_L force the copy of the panel L before bcast;
-DHPL_CALL_CBLAS call the cblas interface;
-DHPL_DETAILED_TIMING enable detailed timers;
-DASYOUGO enable timing information as you go (nonintrusive)
-DASYOUGO2 slightly intrusive timing information
-DASYOUGO2_DISPLAY display detailed DGEMM information
-DENDEARLY end the problem early
-DFASTSWAP insert to use DLASWP instead of HPL code

HPL_OPTS = -DCUDA

HPL_DEFS = $(F2CDEFS) $(HPL_OPTS) $(HPL_INCLUDES)


  • Compilers / linkers - Optimization flags ---------------------------

CC = mpicc
CCFLAGS = -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops $(HPL_DEFS)
CCNOOPT = $(HPL_DEFS) -O0 -w

LINKER = $(CC)
LINKFLAGS = $(CCFLAGS)

ARCHIVER = ar
ARFLAGS = r
RANLIB = echo


MAKE = make TOPdir=$(TOPdir)

@martin-frbg
Copy link
Collaborator

So your LAlib line says the libopenblas.so lives in /root/hpl/OpenBLAS-0.2.20/lib (it takes the LAdir and appends /lib to it). This will be correct for an installed OpenBLAS (i.e. you ran "make install" after building it) but not for the original build directory. Can you please check where the libopenblas.so is on your system ?

@karanchhabra
Copy link

Ok. I have it in the right location /root/hpl/OpenBLAS-0.2.20/libopenblas.so.

mpicc -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops -DAdd__ -DF77_INTEGER=int -DStringSunStyle -DCUDA -I/root/hpl/hpl-2.0_FERMI_v15_latest/include -I/root/hpl/hpl-2.0_FERMI_v15_latest/include/CUDA -I/usr/local/cuda-9.1/include -o /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA/xhpl HPL_pddriver.o HPL_pdinfo.o HPL_pdtest.o /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a -L /root/hpl/hpl-2.0_FERMI_v15_latest/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L/root/hpl/OpenBLAS-0.2.20/libopenblas.so -lpthread
/root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_idamax.o): In function HPL_idamax': HPL_idamax.c:(.text+0x1a): undefined reference to idamax_'
/root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dgemv.o): In function HPL_dgemv': HPL_dgemv.c:(.text+0xba): undefined reference to dgemv_'
HPL_dgemv.c:(.text+0x136): undefined reference to dgemv_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dcopy.o): In function HPL_dcopy':
HPL_dcopy.c:(.text+0x1e): undefined reference to dcopy_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_daxpy.o): In function HPL_daxpy':
HPL_daxpy.c:(.text+0x2f): undefined reference to daxpy_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dscal.o): In function HPL_dscal':
HPL_dscal.c:(.text+0x22): undefined reference to dscal_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dtrsv.o): In function HPL_dtrsv':
HPL_dtrsv.c:(.text+0xc0): undefined reference to dtrsv_' /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a(HPL_dger.o): In function HPL_dger':
HPL_dger.c:(.text+0x73): undefined reference to dger_' HPL_dger.c:(.text+0xa1): undefined reference to dger_'
collect2: error: ld returned 1 exit status
make[2]: *** [dexe.grd] Error 1
make[2]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA' make[1]: *** [build_tst] Error 2 make[1]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest'
make: *** [build] Error 2

@martin-frbg
Copy link
Collaborator

Still not quite right, can you please change the LAlib line in your file to
LAlib = -L $(TOPdir)/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L$(LAdir) -lopenblas -lpthread

@karanchhabra
Copy link

karanchhabra commented Mar 9, 2018

Martin,

Thank you very much!

It worked. Such a small issue and i wasn't able to fix it for 2 weeks. Hopefully i can run it successfully and benchmark my system's GPUs.

mpicc -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops -DAdd__ -DF77_INTEGER=int -DStringSunStyle -DCUDA -I/root/hpl/hpl-2.0_FERMI_v15_latest/include -I/root/hpl/hpl-2.0_FERMI_v15_latest/include/CUDA -I/usr/local/cuda-9.1/include -o /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA/xhpl HPL_pddriver.o HPL_pdinfo.o HPL_pdtest.o /root/hpl/hpl-2.0_FERMI_v15_latest/lib/CUDA/libhpl.a -L /root/hpl/hpl-2.0_FERMI_v15_latest/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L/root/hpl/OpenBLAS-0.2.20 -lopenblas -lpthread
make TOPdir=/root/hpl/hpl-2.0_FERMI_v15_latest /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA/HPL.dat
make[3]: Entering directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA' ( cp ../HPL.dat /root/hpl/hpl-2.0_FERMI_v15_latest/bin/CUDA ) make[3]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA'
touch dexe.grd
make[2]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest/testing/ptest/CUDA' make[1]: Leaving directory /root/hpl/hpl-2.0_FERMI_v15_latest'

Thanks,
Karan

@karanchhabra
Copy link

Hi,

Compiling HPL was successful but facing problems in running the benchmark.

When I run the code it complaints about missing libmkl_intel_lp64.so but when i do ldd for xhpl, it shows no dependency on libmkl_intel_lp64.so.

Any suggesstions.

What I run:
/root/hpl/mpich/bin/mpirun -np 1 -hostfile nodes ./run_linpack

Error:
libmkl_intel_lp64.so: cannot open shared object file: No such file or directory

Dependency tree:
ldd xhpl
linux-vdso.so.1 => (0x00007ffeb597c000)
libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fd1e10f4000)
libdgemm.so.1 => /root/hpl/hpl-2.0_FERMI_v15_latest/src/cuda/libdgemm.so.1 (0x00007fd1e0eeb000)
libcublas.so.9.1 => /usr/local/cuda/lib64/libcublas.so.9.1 (0x00007fd1dd954000)
libcuda.so.1 => /usr/lib64/nvidia/libcuda.so.1 (0x00007fd1dcdb4000)
libcudart.so.9.1 => /usr/local/cuda/lib64/libcudart.so.9.1 (0x00007fd1dcb45000)
libopenblas.so.0 => /root/hpl/openblas/lib/libopenblas.so.0 (0x00007fd1dbbb6000)
libmpi.so.12 => /root/hpl/mpich/lib/libmpi.so.12 (0x00007fd1db737000)
libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fd1db510000)
libc.so.6 => /lib64/libc.so.6 (0x00007fd1db14d000)
/lib64/ld-linux-x86-64.so.2 (0x00005633fe7d7000)
librt.so.1 => /lib64/librt.so.1 (0x00007fd1daf45000)
libdl.so.2 => /lib64/libdl.so.2 (0x00007fd1dad40000)
libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd1daa38000)
libm.so.6 => /lib64/libm.so.6 (0x00007fd1da736000)
libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd1da51f000)
libnvidia-fatbinaryloader.so.390.30 => /usr/lib64/nvidia/libnvidia-fatbinaryloader.so.390.30 (0x00007fd1da2d3000)
libgfortran.so.3 => /lib64/libgfortran.so.3 (0x00007fd1d9fb1000)
libquadmath.so.0 => /lib64/libquadmath.so.0 (0x00007fd1d9d74000)

*Can't see libmkl_intel_lp64.so mentioned anywhere there but still everytime i run HPL it complaints about missing libmkl_intel_lp64.so

Thanks,
Karan

@martin-frbg
Copy link
Collaborator

Probably one of those files was created with the Intel compiler and depends on libmkl, try doing the ldd on that libdemm.so.1 for instance.

@karanchhabra
Copy link

karanchhabra commented Mar 12, 2018

Not sure but here is the output:

--ldd /root/hpl/hpl-2.0_FERMI_v15_latest_copy/src/cuda/libdgemm.so.1

    linux-vdso.so.1 =>  (0x00007ffc73972000)
    libcudart.so.9.1 => /usr/local/cuda-9.1/targets/x86_64-linux/lib/libcudart.so.9.1 (0x00007fd6f12a9000)
    libcuda.so.1 => /usr/lib64/nvidia/libcuda.so.1 (0x00007fd6f0708000)
    libmpi.so.12 => /root/hpl/mpich/lib/libmpi.so.12 (0x00007fd6f0289000)
    libc.so.6 => /lib64/libc.so.6 (0x00007fd6efec6000)
    libdl.so.2 => /lib64/libdl.so.2 (0x00007fd6efcc1000)
    libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fd6efaa5000)
    librt.so.1 => /lib64/librt.so.1 (0x00007fd6ef89d000)
    libm.so.6 => /lib64/libm.so.6 (0x00007fd6ef59a000)
    libnvidia-fatbinaryloader.so.390.30 => /usr/lib64/nvidia/libnvidia-fatbinaryloader.so.390.30 (0x00007fd6ef34e000)
    libgfortran.so.3 => /lib64/libgfortran.so.3 (0x00007fd6ef02c000)
    libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd6eee15000)
    libquadmath.so.0 => /lib64/libquadmath.so.0 (0x00007fd6eebd9000)
    /lib64/ld-linux-x86-64.so.2 (0x000055a1bd532000)

@martin-frbg
Copy link
Collaborator

Ok, that leaves libcuda I guess ?

@karanchhabra
Copy link

karanchhabra commented Mar 12, 2018

I've the following from:
--ldd /usr/lib64/nvidia/libcuda.so.1

    linux-vdso.so.1 =>  (0x00007ffef41c7000)
    libm.so.6 => /lib64/libm.so.6 (0x00007f8a3ba04000)
    libc.so.6 => /lib64/libc.so.6 (0x00007f8a3b641000)
    libdl.so.2 => /lib64/libdl.so.2 (0x00007f8a3b43c000)
    libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f8a3b220000)
    librt.so.1 => /lib64/librt.so.1 (0x00007f8a3b018000)
    libnvidia-fatbinaryloader.so.390.30 => /usr/lib64/nvidia/libnvidia-fatbinaryloader.so.390.30 (0x00007f8a3adcb000)
    /lib64/ld-linux-x86-64.so.2 (0x00005622f7b92000)

Not sure, I could be totally wrong but is it something to do with the compiler section in Make.CUDA

  • Compilers / linkers - Optimization flags ---------------------------

next two lines for GNU Compilers:
CC = mpicc
CCFLAGS = -fopenmp -lpthread -fomit-frame-pointer -O3 -funroll-loops $(HPL_DEFS)
CCNOOPT = $(HPL_DEFS) -O0 -w

On some platforms, it is necessary to use the Fortran linker to find
the Fortran internals used in the BLAS library.

LINKER = $(CC)
LINKFLAGS = $(CCFLAGS)

ARCHIVER = ar
ARFLAGS = r
RANLIB = echo

I used mpif77 as a linker while running it for CPUs but i guess this shouldn't be a problem as it is a linker.?

@martin-frbg
Copy link
Collaborator

martin-frbg commented Mar 12, 2018

No idea, but I do not think mpif77 has anything to do with it. Are you sure that the xhpl binary you are running the ldd on is actually the one that gets called by your run_linpack script ?

@karanchhabra
Copy link

karanchhabra commented Mar 12, 2018

yes it is the one which gets called by my run_linpack script. I did check all dependencies for xhpl and there sub dependencies but can't see any relevance to lib_mkl. The most wiered thing is when i'm not using intel's MKL than why is HPL asking me for the library which is specifically provided by intel with MKL.

run_linpack:
----#!/bin/sh

----export HPL_DIR=/root/hpl/hpl-2.0_FERMI_v15_latest
----CPU_CORES_PER_GPU=128
----export CUDA_DGEMM_SPLIT=0.21
----export CUDA_DTRSM_SPLIT=0.11
----$HPL_DIR/bin/CUDA/xhpl > $HPL_DIR/HPL.out

@matveevma
Copy link

matveevma commented Jun 15, 2019

Hi all!

This problem with HPL very important for me.
@karanchhabra if you use

$ cd HplSourceDir
$ grep -rn mk

you will see many matches with src/cuda/cuda_dgemm.c. Content this source file include some logic about MKL, ACML and GOTO. I think libdgemm.so compiled espetially for MKL.

I am still try find solution for HPL without Intel implementations. Do you have new results?

@brada4
Copy link
Contributor

brada4 commented Jun 16, 2019

Probably if you remove either MKL or OpenBLAS from build machine, HPL goes with one remaining.

@matveevma
Copy link

Similar error about MKL: libmkl_intel_lp64.so: cannot open shared object file: No such file or directory. Maybe exist newest libdgemm.so for CUDA?

@brada4
Copy link
Contributor

brada4 commented Jun 16, 2019

This is not Intel support forum.
You need to look for "MKL link line advisor" to fix MKL error.

If you want OpenBLAS, do your best to move MKL out of compilers way.

@matveevma
Copy link

Ok, i will install ACML or GOTO because dgemm.c included it.

Thanks!

@martin-frbg
Copy link
Collaborator

martin-frbg commented Jun 16, 2019

Perhaps someone on the nvidia CUDA ZONE forum will know the answer (though I see now that karanchhabra did not get much help there last year) - maybe you can even get away with copying some other library to the libmkl_intel_lp64.so name (if nothing actually uses it)

@brada4
Copy link
Contributor

brada4 commented Jun 16, 2019

@matveevma I actually asked to remove surplus libraries, not to add more and more.

@matveevma
Copy link

matveevma commented Jun 17, 2019

As i say earlier, at first my system have not BLAS libraries. Then i try compile HPL with ATLAS - return MKL error, then deleted ATLAS. Next try - with Lapack from netlib - return MKL error.

libmkl_intel_lp64.so: cannot open shared object file: No such file or directory

grep -rn mkl show matches with dgemm source and i see hardcode about MKL in file `src/cuda/cuda_dgemm.c". I will asked NVIDIA (again) about newest libdgemm.so.

@martin-frbg
Copy link
Collaborator

@matveevma I guess the source for their CUDA dgemm is not available, and you have to rely on source comments in the objects ? (Could be NVIDIA relies on some environment variable being set or unset for their decision to import mkl, but I should really hope they remembered to document that somewhere. I suppose strace is no help either)

@brada4
Copy link
Contributor

brada4 commented Jun 17, 2019

There is original HPL without nvidia's customisation:
http://www.netlib.org/benchmark/hpl/
Does it work?

@matveevma
Copy link

matveevma commented Jun 17, 2019

Yes, running without problem with IntelMPI and MKL. I am checking another CPU cases with OpenMPI and none MKL

@brada4
Copy link
Contributor

brada4 commented Jun 17, 2019

Yes, you need to dig through HPL-2.3 configure script
4136: cat <<HPLEOF > hplvars.txt
I see no chance it could detect absent MKL libraries.
Certainly MKL and OpenBLAS and reference BLAS are all equally supported.

@matveevma
Copy link

matveevma commented Jun 23, 2019

Fixed by link

[user1@ac922 hpl-2.0_FERMI_v15]$ vi ./src/cuda/cuda_dgemm.c
…
// handle2 = dlopen (“libmkl_intel_lp64.so”, RTLD_LAZY);
handle2 = dlopen (“libopenblas.so”, RTLD_LAZY);
…
// dgemm_mkl = (void(*)())dlsym(handle, “dgemm”);
dgemm_mkl = (void(*)())dlsym(handle, “dgemm_”);
…
// handle = dlopen (“libmkl_intel_lp64.so”, RTLD_LAZY);
handle = dlopen (“libopenblas.so”, RTLD_LAZY);
…
// mkl_dtrsm = (void(*)())dlsym(handle2, “dtrsm”);
mkl_dtrsm = (void(*)())dlsym(handle2, “dtrsm_”);
…

Thanks all!

@brada4
Copy link
Contributor

brada4 commented Jun 23, 2019

Probably you should report it to cudahpl authors, not here, default hpl picks up cuda or openblas just fine

@matveevma
Copy link

default hpl picks up cuda or openblas just fine

What you mean? CUDA support have only HPL 2.0 from NVIDIA. It is very old version for FERMI architecture and build from source code with newerlest libraries not so easy now.

@martin-frbg
Copy link
Collaborator

Thx, added to the faq in the wiki (for want of a better place)

@brada4
Copy link
Contributor

brada4 commented Jun 24, 2019

I mean absence of newer nvidia distribution means that open-source HPL distribution already works fine with CUDA (it imports only dgemm from supposed BLAS library)

@martin-frbg
Copy link
Collaborator

@brada4 I believe you must be mistaken - I see no mention of CUDA whatsoever in either the online HPL documentation at netlib.org nor the INSTALL (etc) document included in the hpl-2.3 release of december 2018. I do not think the modifications in the NVIDIA version of HPL are as simple as just specifying libcuda as its BLAS.

@matveevma
Copy link

matveevma commented Jun 24, 2019

@martin-frbg, i think @brada4 idea very interesting. I will check integrating dgemm functionality to another blas libraries. I mean building HPL 2.3 from netlib with openblas, cuda and openmpi and i hope it will be work with new HPL version because blas libraries newest too.

The second way to use new HPL with cuda :

$ diff hpl-2.0-directory hpl-2.3-directory >> diff-hpl-versions
$ diff hpl-2.0-cuda-directory hpl-2.3-directory >> diff-cuda

Remove copyrights for easy compare diff results.

This case works fine for me.

@brada4
Copy link
Contributor

brada4 commented Jun 24, 2019

@martin-frbg

Yes, you need to dig through HPL-2.3 configure script
4136: cat < hplvars.txt
I see no chance it could detect absent MKL libraries.

@MarpleJane
Copy link

Still not quite right, can you please change the LAlib line in your file to LAlib = -L $(TOPdir)/src/cuda -ldgemm -L/usr/local/cuda/lib64 -lcublas -lcuda -lcudart -L$(LAdir) -lopenblas -lpthread

Helps a lot!

honno added a commit to honno/OpenBLAS that referenced this issue Aug 4, 2023
* Created Installation Guide (markdown)
* Updated quick installation (markdown)
* Updated Home (markdown)
* Updated Document (markdown)
* Updated Document (markdown)
* Updated Document (markdown)
* Created Installation Guide (markdown)
* Created Home (markdown)
* Init version
* Updated OpenBLAS Wiki (markdown)
* Updated OpenBLAS Wiki (markdown)
* Updated OpenBLAS Wiki (markdown)
* Updated Document (markdown)
* Updated Installation Guide (markdown)
* Updated Installation Guide (markdown)
* Created Download (markdown)
* Created Faq (markdown)
* Updated Faq (markdown)
* Updated FAQ
* Created How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Document (markdown)
* Updated Faq (markdown)
* Updated Faq (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Faq (markdown)
* Updated OpenBLAS Wiki (markdown)
* Updated Home (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Created How to generate import library for MingW (markdown)
* Updated Document (markdown)
* Updated Faq (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Build instrunctions for FreeBSD
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Installation Guide (markdown)
* Updated Faq (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* minor edits
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Faq (markdown)
* Installation instructions for Windows
* Updated Faq (markdown)
* G77 conventions no longer needed with GCC 4.7+
* Updated Home (markdown)
* Document why issue 168 occurred.
* Updated Home (markdown)
* Created Publications (markdown)
* Updated Home (markdown)
* Updated Document (markdown)
* Updated Faq (markdown)
* Updated Download (markdown)
* Updated Publications (markdown)
* Updated Faq (markdown)
* Updated Document (markdown)
* Revert 7580d38ffad37e6613e6304707aaaa681f3d78c2 ... b1bd4ff37d2106bbd5c4730a08dbb789cc44e7d4
* Created Mailing List (markdown)
* Updated Mailing List (markdown)
* Updated Mailing List (markdown)
* Updated Home (markdown)
* Updated Document (markdown)
* Updated Publications (markdown)
* Updated Download (markdown)
* Updated Faq (markdown)
* Updated Home (markdown)
* Updated Faq (markdown)
* Updated Home (markdown)
* Revert b69f1417cdf8820be046cc27a2b96b42a25bc3a3 ... 90a227c317c3572ced943461ac3a252c40790f44 on Home
* Updated Home (markdown)
* Updated Publications (markdown)
* Updated Faq (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* We already ensure the stack alignment in Makefile.system for Win32.
* Updated Faq (markdown)
* Updated Faq (markdown)
* Updated Publications (markdown)
* Created Donation (markdown)
* Updated Home (markdown)
* Updated Document (markdown)
* Updated Faq (markdown)
* Updated Publications (markdown)
* Updated Download (markdown)
* Updated Mailing List (markdown)
* Updated Donation (markdown)
* Updated Download (markdown)
* Updated Donation (markdown)
* Updated Donation (markdown)
* Updated Donation (markdown)
* Updated Donation (markdown)
* Updated Home (markdown)
* Updated Faq (markdown)
* Updated Download (markdown)
* Updated Home (markdown)
* Updated Home (markdown)
* Add new entry for static linking and pthread.
* Fix named anchors (see http://stackoverflow.com/questions/5319754/cross-reference-named-anchor-in-markdown/7335259#7335259)
* Created Related packages that use OpenBLAS (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Document (markdown)
* Created To-do List (markdown)
* Updated To do List (markdown)
* Updated Fixed optimized kernels To do List (markdown)
* Fix English idiom
* Remove trailing whitespace
* Updated Fixed optimized kernels To do List (markdown)
* Updated Fixed optimized kernels To do List (markdown)
* Updated Fixed optimized kernels To do List (markdown)
* Updated Fixed optimized kernels To do List (markdown)
* Updated Faq (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Created Machine List (markdown)
* Updated Document (markdown)
* Updated Installation Guide (markdown)
* Created User Manual (markdown)
* Updated User Manual (markdown)
* Updated Document (markdown)
* Updated User Manual (markdown)
* Updated User Manual (markdown)
* Updated User Manual (markdown)
* Updated User Manual (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated Faq (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated Machine List (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Updated Related packages that use OpenBLAS (markdown)
* Add a note about building in QEMU
* Updated Home (markdown)
* Updated Faq (markdown)
* update for allocating too many meory error.
* Updated Faq (markdown)
* Updated Faq (markdown)
* Updated Installation Guide (markdown)
* Updated Faq (markdown)
* Init function doc
* Updated Document (markdown)
* Updated User Manual (markdown)
* Updated User Manual (markdown)
* Created How to build OpenBLAS for Android (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Updated Home (markdown)
* Part of the description is really no clear, I add some more information, so it would be easier for VS user to fix the problems facing them.
* Created Developer manual (markdown)
* Updated Document (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* a typo, download ** frome -> download from
* Updated Faq (markdown)
* English (minor edit)
* Updated Developer manual (markdown)
* Updated Developer manual (markdown)
* Updated Developer manual (markdown)
* Updated Machine List (markdown)
* Updated Developer manual (markdown)
* Updated Developer manual (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* issue#842
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Added FC for building with Fortran
* Change link for the Intel MKL documentation
* Updated User Manual (markdown)
* Updated User Manual (markdown)
* Added MIPS build instructions from OpenMathLib#949
* use TARGET_CFLAGS and TARGET_LDFLAGS instead of CFLAGS and LDFLAGS for linking OpenBLAS on ARMv7
* Add Windows updates (msys2,mingw/w64 merger), Android/MIPS pointers, qemu hint
* Building libs & netlib targets to prevent errors in tests
* Recipes not targets (for make)
* Making only libs, not netlib (which also contains link/run tests...)
* Copied from instructions by Ivan Ushakov, originally posted in OpenMathLib#569
* Updated How to build OpenBLAS for iPhone iOS (markdown)
* Updated Faq (markdown)
* Created How to build OpenBLAS for iPhone iOS (markdown)
* error code (0xc000007b) was missing a character
* Updated How to build OpenBLAS for iPhone iOS (ARMv8) (markdown)
* Updated How to build OpenBLAS for iPhone iOS (ARMv8) (markdown)
* Revert 7e9dd0ebf079e002e3aa831fa671fde3e8cfad81...8d105c7be8cd447482f61e0295c0c146f5314eb5 on How to build OpenBLAS for iPhone iOS
* Add guide on how to reversibly supplant Ubuntu LTS libblas.so.3
* typo
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated User Manual (markdown)
* Updated Faq (markdown)
* Updated Download (markdown)
* Add perl to pacman package list
* Fixed formatting on general questions
* Copied from issue OpenMathLib#1136
* Added instructions for building for Windows UWP.
* To clear confusions vs super-fat-binaries that dont exist.
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Update for 0.2.20 (full builds, ARMv7 softfp support, newer NDKs using CLANG)
* Updated How to build OpenBLAS for Android (markdown)
* Fix some formatting issues
* Updated How to build OpenBLAS for Android (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to build OpenBLAS for Android (markdown)
* Created Precompiled installation packages (markdown)
* Updated Precompiled installation packages (markdown)
* Example - debian?
* Mention (and link to) distribution-specific packages
* Updated Installation Guide (markdown)
* OpenSuSE (13.2, SLE included)
* Updated Precompiled installation packages (markdown)
* Updated Precompiled installation packages (markdown)
* Make it look consistent.
* Fedora+EPEL // maybe rpmbuild is too heavy
* Updated Precompiled installation packages (markdown)
* Updated Precompiled installation packages (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Precompiled installation packages (markdown)
* fix toolchain argument in armv8 clang build as per OpenMathLib#1337
* add note about stdio.h not found error
* Add flang instructions
* Use the SVG Travis badge
* homebrew option for OSX
* Promote native MSVC builds with LLVM
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Direct people to the appropriate instructions
* Add link to the Goto paper
* Add CMAKE_BUILD_TYPE
* Add note about having to specify AR on a Mac, from issue 1435
* Mention requirement to build a standalone toolchain in the clang section as well
* added 'perl' to conda install command
* homebrew/science was deprecated. This tap is now empty as all its formulae were migrated.
* Added hint for "expected identifier" error message to mingw section following OpenMathLib#1503
* Revert 9161c3b54281131e892dec739d888f35e6c59cf3...03f879be0c9e6a55705bc7efd5ee193299e04029 on How to use OpenBLAS in Microsoft Visual Studio
* Revert to recommending mingw-w64 from sf.net and add note about issue 1503
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Update MSVC installation procedure with info from OpenMathLib#1521
* Add downgrade option for msys2 mingw compiler issue as suggested by econwang in OpenMathLib#1503
* Add note about static linking bug with NDK 16 and API>22
* Updated Precompiled installation packages (markdown)
* Updated Precompiled installation packages (markdown)
* Updated Faq (markdown)
* OBS is renamed and deep link format changed. Apparently recent SLE includes rpm by default too.
* Add links to Conda-Forge and to staticfloat's builds for Julia
* Mention _64 suffix appended to Julia builds with INTERFACE64 (issue 1617)
* Fix unwanted markdown italicization
* Add instruction to change to the generic sgemmkernel implementation from issue 1531
* Added hint about stack size requirements for running lapack-test from PR 1645; fixed markup of section headings
* Add link to RvdG's publications page as a non-paywalled source of the "Goto paper"
* Add section about non-suitability of the IBM XL compiler on POWER8
* Mention cmake version requirement in view or recent issues with link failures in utest etc.
* Replace outdated entry for Sandybridge support with more general section on AVX512, Ryzen and GPU
* Mention Apple Accelerate here as iOS build issue tickets usually die as soon as someone points out this option to the questioner.
* Add section about unexpectedly using an older pre-installed version of the shared library (issue 1822)
* fix markup of new entry
* Mention perl and C compiler as prerequisites on the build host
* Save WIP page
* Updated Notes on parallelism and OpenBLAS (markdown)
* Updated Notes on parallelism and OpenBLAS (markdown)
* Updated Notes on parallelism and OpenBLAS (markdown)
* Updated [WIP] Notes on parallelism and OpenBLAS (markdown)
* Updated [WIP] Notes on parallelism and OpenBLAS (markdown)
* Updated [WIP] Notes on parallelism and OpenBLAS (markdown)
* Destroyed [WIP] Notes on parallelism and OpenBLAS (markdown)
* Updated Faq (markdown)
* Add small note on AVX512 for CentOS/RHEL section.
* document the extension functions
* formatting
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated Download (markdown)
* Add brief general usage information from issue 1925
* Add link to Pete Warden blog article on GEMM rather than just deep-linking to a diagram from it
* Document some of the less useful parameters from param.h
* Updated Installation Guide (markdown)
* Done with OpenMathLib#2089
* Add note about changed library names for update-alternatives on Debian/Ubuntu
* Updated Home (markdown)
* Add note about using OpenBLAS with CUDA_HPL 2.3 from issue OpenMathLib#909
* Fix typos in previous commit
* Add pdb instructions fir cross-builds
* Add note about generic QEMU CPUID clashing with existing P2(MMX)
* typo
* typo
* C code syntax highlight
* Updated multithreading section to introduce option USE_LOCKING (issue 2164)
* Updated How to build OpenBLAS for iPhone iOS (ARMv8) (markdown)
* Updated How to build OpenBLAS for iPhone iOS (ARMv8) (markdown)
* Clarify Miniconda/cmake install instructions and redact outdated note about msys2
* Document cmake install step
* Updated How to build OpenBLAS for Android (markdown)
* Add solution for programs that look for libblas.so/liblapack.so
* Add entry for powersaving modes on ARM boards (from issue 2540)
* Add suggestion for speed problems on big.little systems from issue 2589
* Convert the ARMV8 big.little tidbit to a separate topic and update it with more details from the issue ticket
* Add entry about problems caused by using the raw cblas.h (issue 2593)
* complete quote symbol around CPATH environment variable
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Add note about running conda activate when working in a VS window (from issue 2637)
* Add note about (not) compiling with -fbounds-check (ticket 2657)
* Add entry about compile-time NUM_THREADS setting (issue 2678)
* Added some sketchy description of adding cpuids for autodetection, adding targets and architectures
* Markup and typo fixes
* Add openblas_set_affinity from PR 2547
* Created _Footer (markdown)
* Destroyed _Footer (markdown)
* Add LAPACK-like SHGEMM to document the "official" status of the SH prefix
* fix formatting of latest addition
* Move outdated instructions for gcc-based NDK versions to the bottom, add hint about x86 builds
* Add help for cpuid recognition failure
* Update source tree layout & mention extraneous cpu paramerts
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Explain why pure VS builds are slower, and highlight that they do not support DYNAMIC_ARCH
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Mention fortran requirement and incompatibility of ifort with msvc
* preliminary page for understanding the build system, needs a lot more work and input from more knowledgeable people than me
* Updated Build system overview (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* add information for HOSTCC, HOST_CFLAGS
* Added alternative script which was tested on OSX with latest NDK
* added link to targets list
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* added script for x86_64 architecture
* Updated WIP   Build system overview (community made) (markdown)
* Updated WIP   Build system overview (community made) (markdown)
* updated link to FLAME publications list
* Created How to use OpenBLAS on Cortex-M (markdown)
* Updated How to use OpenBLAS on Cortex M (markdown)
* Updated Precompiled installation packages (markdown)
* Updated How to use OpenBLAS on Cortex M (markdown)
* Updated How to use OpenBLAS on Cortex M (markdown)
* Updated How to use OpenBLAS on Cortex M (markdown)
* Update source layout graph and start a short section on benchmarking to collect various pointers from the issue tracker
* Add workaround for building with CMAKE on OSX
* Use actual small headings to fix... weird bullet indent shit
* Oops
* Updated Faq (markdown)
* Updated Faq (markdown)
* Updated How to generate import library for MingW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* Updated How to generate import library for MinGW (markdown)
* explicitly set CMAKE_MT to replace the new cmake default llvm-mt (failing)
* Add -Wl,-rpath,/your_path/OpenBLAS/lib option to gcc linker line in "Link shared library" section + explanation for why it is needed/can be omitted. Also make note that -lgfortran not needed if only making LAPACKE calls.
* Add note explaining that build flags passed to make should also be passed to make install
* give example of install error
* Describe how to build openblas library for win/arm64 targets
* Add Xen to the existing entry for QEMU/KVM based on issue 3445
* Updated Download (markdown)
* Updated Installation Guide (markdown)
* Updated Installation Guide (markdown)
* Revert b8da0e8523b898a2206d1e2fe99dbfb4ebb0ffa8...bc55aade759d2f925689b000828da249e1fc6a1a on Installation Guide
* Revert b0c9a2ee060b8dd0b46b4c58375ef2a743c0363a...cecf8cf67963bd77a0bb97086e3a457a4cee11ff on Download
* Revert bc55aade759d2f925689b000828da249e1fc6a1a...134894a0f09a0e92eef1b9a5c9e63f459d2db55e on Installation Guide
* Add NDK23B example
* Makes iOS build more robust
* Double -isysroot
* Bump up required devtoolset version for AVX-512 intrinsics.
* Updated Installation Guide (markdown)
* Updated How to build OpenBLAS for Windows on ARM64 (markdown)
* Revert b8da0e8523b898a2206d1e2fe99dbfb4ebb0ffa8...75bba70832f8765faee693931c4a9e3eb6c84d98 on Installation Guide
* Revert 75bba70832f8765faee693931c4a9e3eb6c84d98...d171e711a5cd8026b2eb507b249b5e51fa28b2a2 on Installation Guide
* restore Windows link after malicious edit
* Revert 1bcb03dcef85c675aace7f0a755d5aa36ec46eca...f732906434146b1a1ee82abe944a6d51d8f43b81 on Installation Guide
* restore Windows link after malicious edit
* Updated Installation Guide (markdown)
* Bump up AVX-512 devtoolset because of identified packaging issues
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* n-dash html entity instead of -
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Add the bfloat16 functions
* mention AXPBY
* Update building for Apple M1
* Updated How to build OpenBLAS for Windows on ARM64 (markdown)
* Created How to build OpenBLAS for macOS M1 / arm64 (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Add NO_AVX2 build hint for OSX Docker Desktop/xhyve (issues 2194 and 2244)
* Mention the ELF offset/address bug from binutils 2.38 ld
* moved issue 665 (sparse matrix/vector support) to a faq entry
* Update and simplify based on CI experience and 3741
* Updated Download (markdown)
* Updated How to build OpenBLAS for Windows on ARM64 (markdown)
* Revert 0dcee87d486028fbd88c603853cdcae810e025c6...bf3d15e74d42b0b01618b4beb7b9d658fb905118 on Download
* Revert a02f9e470f8e26eda1b8d8601ad2486557721ccf...c862aeb3492c29b487858d43c93676855b60a1f2 on How to build OpenBLAS for Windows on ARM64
* Updated How to use OpenBLAS in Microsoft Visual Studio (markdown)
* Revert 9db97d11d88c801e8c5e9b8d6cc85fb44e5bca61...d2eb48810f3ecc1680900581473005f79c394ca4 on How to use OpenBLAS in Microsoft Visual Studio
* start with the smallest configs, Appveyor and Cirrus
* Updated CI jobs overview (markdown)
* Add Azure CI
* Add github workflows
* Add the crossbuild parts of the dynamic_arch workflow
* remove trailing separator
* Add FreeBSD/Cirrus
* Add ILP64 jobs on Cirrus
* Add C910V and the OSUOSL Jenkins jobs (currently configured for my fork)
* Updated Installation Guide (markdown)
* Expand section on precompiled windows binaries to mention INTERFACE64=0 option
* Remove reference to buildbot (domain reregistered to someone else, issue 4148
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants