Skip to content

Convert inputs from BF16 to FP32 and use FP32 vector madds. 18% faster.

3356043
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

Added ability to accumulate in FP16 for GEMM for RISC-V #5640

Convert inputs from BF16 to FP32 and use FP32 vector madds. 18% faster.
3356043
Select commit
Loading
Failed to load commit list.
build (cmake, gfortran, 1, 1)
succeeded Feb 11, 2026 in 20m 58s