Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ROCM] Fix feature flags for gfx1100 and improve flag handling (iree-…
…org#18781) For gfx1100, the feature string being passed was `+wavefrontsize32-fma-mix-insts`. This is clearly invalid as the flags aren't seperated. This patch improves the feature handling for multiple flags. This fix also found a bug with passing -fma-mix-insts on gfx1100, which causes a crash in the compiler: ``` iree-compile: /home/kunwar/Work/iree/third_party/llvm-project/llvm/lib/Target/AMDGPU/SIInsertHardClauses.cpp:118: HardClauseType (anonymous namespace)::SIInsertHardClauses::getHardClauseType(const MachineInstr &): Assertion `ST->getGeneration() >= AMDGPUSubtarget::GFX11' failed. Please report issues to https://github.com/iree-org/iree/issues and include the crash backtrace. Stack dump: 0. Running pass 'CallGraph Pass Manager' on module '_winograd_input_nchw_dispatch_0'. 1. Running pass 'SI Insert Hard Clauses' on function '@_winograd_input_nchw_dispatch_0_winograd_input_transform_8x8x1x1x1x1xf32_dispatch_tensor_store' iree-compile: /home/kunwar/Work/iree/third_party/llvm-project/llvm/lib/Target/AMDGPU/SIInsertHardClauses.cpp:118: HardClauseType (anonymous namespace)::SIInsertHardClauses::getHardClauseType(const MachineInstr &): Assertion `ST->getGeneration() >= AMDGPUSubtarget::GFX11' failed. Aborted (core dumped) ``` To fix this, the patch also restricts passing -fma-mix-insts to gfx9
- Loading branch information