Commit 3bf9c483 authored by Martin Storsjö

aarch64: vp9lpf: Use dup+rev16+uzp1 instead of dup+lsr+dup+trn1


This is one cycle faster in total, and uses three fewer instructions.

Before:
vp9_loop_filter_mix2_v_44_16_neon: 123.2
After:
vp9_loop_filter_mix2_v_44_16_neon: 122.2
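
The two instruction sequences named in the subject line can be modeled lane by lane in portable C to see why they are interchangeable. This is only a sketch under stated assumptions: the commit gives the mnemonics but not the operands, so the arrangements used here (`dup .16b` and `trn1 .2d` for the old sequence, `dup .8h`, `rev16 .16b` and `uzp1 .16b` for the new one), the `lsr #8` shift amount, and the reading that a 16-bit value carries two packed bytes are all assumptions. The names `vec128`, `pack_old`, `pack_new` and `check_all_inputs` are hypothetical helpers, not identifiers from the patch.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* A 128-bit NEON register modeled as 16 byte lanes, lane 0 first. */
typedef struct { uint8_t b[16]; } vec128;

/* dup Vd.16b, Wn: copy the low byte of w into all 16 byte lanes. */
static vec128 dup_16b(uint32_t w) {
    vec128 r;
    memset(r.b, (int)(w & 0xff), sizeof r.b);
    return r;
}

/* dup Vd.8h, Wn: copy the low halfword of w into all 8 halfword lanes.
 * In byte-lane terms the low byte sits at each even index. */
static vec128 dup_8h(uint32_t w) {
    vec128 r;
    for (int i = 0; i < 8; i++) {
        r.b[2 * i]     = (uint8_t)(w & 0xff);
        r.b[2 * i + 1] = (uint8_t)((w >> 8) & 0xff);
    }
    return r;
}

/* rev16 Vd.16b, Vn.16b: swap the two bytes inside every halfword. */
static vec128 rev16_16b(vec128 n) {
    vec128 r;
    for (int i = 0; i < 8; i++) {
        r.b[2 * i]     = n.b[2 * i + 1];
        r.b[2 * i + 1] = n.b[2 * i];
    }
    return r;
}

/* uzp1 Vd.16b, Vn.16b, Vm.16b: even-indexed bytes of n, then of m. */
static vec128 uzp1_16b(vec128 n, vec128 m) {
    vec128 r;
    for (int i = 0; i < 8; i++) {
        r.b[i]     = n.b[2 * i];
        r.b[8 + i] = m.b[2 * i];
    }
    return r;
}

/* trn1 Vd.2d, Vn.2d, Vm.2d: low 64 bits of n, then low 64 bits of m. */
static vec128 trn1_2d(vec128 n, vec128 m) {
    vec128 r;
    memcpy(r.b, n.b, 8);
    memcpy(r.b + 8, m.b, 8);
    return r;
}

/* Assumed old sequence: dup + lsr + dup + trn1 (four instructions). */
static vec128 pack_old(uint32_t w) {
    vec128 lo = dup_16b(w);
    w >>= 8;                      /* lsr Wn, Wn, #8 (assumed shift) */
    vec128 hi = dup_16b(w);
    return trn1_2d(lo, hi);
}

/* Assumed new sequence: dup + rev16 + uzp1 (three instructions). */
static vec128 pack_new(uint32_t w) {
    vec128 a = dup_8h(w);         /* lo, hi, lo, hi, ... */
    vec128 s = rev16_16b(a);      /* hi, lo, hi, lo, ... */
    return uzp1_16b(a, s);        /* lo x8, then hi x8   */
}

/* Compare both models for every possible 16-bit input. */
static void check_all_inputs(void) {
    for (uint32_t w = 0; w <= 0xffff; w++) {
        vec128 a = pack_old(w);
        vec128 b = pack_new(w);
        assert(memcmp(a.b, b.b, sizeof a.b) == 0);
    }
}
```

Under these assumptions, both sequences put the low byte of the input in lanes 0-7 and the high byte in lanes 8-15. The new form needs one instruction fewer each time it is applied and no scalar shift.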

Signed-off-by: Martin Storsjö <martin@martin.st>
parent c582cb85