Skip to content

Optimize ZROT_RVV for the non-unit-stride case#5677

Merged
martin-frbg merged 1 commit intoOpenMathLib:developfrom
CheryDan:riscv/zdrot
Mar 17, 2026
Merged

Optimize ZROT_RVV for the non-unit-stride case#5677
martin-frbg merged 1 commit intoOpenMathLib:developfrom
CheryDan:riscv/zdrot

Conversation

@CheryDan
Copy link

Optimize the RVV implementation of ZROT when inc_x and inc_y are non-unit strides (inc_x != 1, inc_y != 1).

Reorder several operations to reduce vector register pressure and avoid unnecessary vector register spill to the stack. This helps GCC keep vector values in registers and reduces redundant spill/reload instructions, improving runtime performance.

No functional change.

Optimize the RVV implementation of ZROT when inc_x and inc_y are
non-unit strides (inc_x != 1, inc_y != 1).

Reorder several operations to reduce vector register pressure and
avoid unnecessary vector register spill to the stack. This helps GCC
keep vector values in registers and reduces redundant spill/reload
instructions, improving runtime performance.

No functional change.

Signed-off-by: daichengrong <daichengrong@iscas.ac.cn>
@martin-frbg martin-frbg added this to the 0.3.32 milestone Mar 16, 2026
@martin-frbg
Copy link
Collaborator

Thank you

@martin-frbg martin-frbg merged commit a1fd7a4 into OpenMathLib:develop Mar 17, 2026
101 of 102 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants