This is Akihiro@GATech.
I'm very excited to see the prototype GPU codegen feature released in 1.24! Congratulations! I'd like to run the
gpuAddNums.chpl example on one of my GPU machines, but looks like the compiler does not generate a fatbin file. Any suggestions?
For more details, please see below:
I'm trying to compile this program: chapel/gpuAddNums.chpl at master · chapel-lang/chapel · GitHub
I gave the compiler options in gpuAddNums.compopts.
The only change I made was to change the compute capability option from sm_60 -> sm_61. However, regardless of this, the compiler gives me this warning:
warning: argument unused during compilation: '--cuda-gpu-arch=sm_61'
To build chapel, I used the tar.gz file and here is the output of my printchplenv:
Looks like the NVPTX backend is enabled in the bundled LLVM:
$CHPL_HOME/third-party/llvm/install/linux64-x86_64-gnu/bin/llc --version LLVM (http://llvm.org/):
LLVM version 11.0.1
Default target: x86_64-unknown-linux-gnu
Host CPU: skylake
aarch64 - AArch64 (little endian)
aarch64_32 - AArch64 (little endian ILP32)
aarch64_be - AArch64 (big endian)
arm64 - ARM64 (little endian)
arm64_32 - ARM64 (little endian ILP32)
nvptx - NVIDIA PTX 32-bit
nvptx64 - NVIDIA PTX 64-bit
x86 - 32-bit X86: Pentium-Pro and above
x86-64 - 64-bit X86: EM64T and AMD64
Please let me know if you need more information!