Welcome to LLVM GPU News, a bi-weekly newsletter on all the GPU things under the LLVM umbrella. This issue covers the period from July 8 to July 22, 2022.

We welcome your feedback and suggestions. Let us know if we missed anything interesting, or want us to bring attention to your (sub)project, revisions under review, or proposals. Please see the bottom of the page for details on how to submit suggestions and contribute.

Industry News and Community Events

LLVM and Clang

Discussions

Commits

  • Added the PrepareFunctions pass for SPIR-V, responsible handling function signatures prior to IR translation and used during call lowering. D129730
  • The NVPTX backend learned to promote integer function parameters to power-of-two-wide registers. ‘This patch is a step towards enabling Rust compilation to ptx while retaining the target independent optimizations.’ D129291
  • A number of commits improving support for the gfx11 and gfx9a AMDGPU targets.
  • CUDA now passes aggregate kernel arguments by value when targeting SPIR-V. D130387

MLIR

Discussions

Commits

  • nvgpu.ldmatrix and nvgpu.mma.sync ops gained better verification. D129669, D129400
  • Added the amdgpu.lds_barrier op. D129522
  • The VectorToROCDL pass was removed. D129308
  • spv.GLSL.*/spv.OCL.* ops were renamed to spv.GL.*/spv.CL.* ops for better consistency. D130194, D130280

OpenMP (Target Offloading)

Discussions

Commits

  • Added support for CUDA/HIP compilation in the non-RDC mode with the new driver. D129655, D129784
  • Static library will not be linked when -nogpulib is specified. D129534

External Compilers

LLPC

  • Fixed CMake to support both static and dynamic linking of the LLVM library. LLPC#1903