[x265] [PATCH 0/6] AArch64: Fix SVE(2) kernels for vectors >128 bits
chen
chenm003 at 163.com
Tue Jan 7 07:42:21 UTC 2025
Hi,
Thank for the fixes.
It looks good.
However, I have not environment to verify it, please run smoke-test before push, thanks.
At 2025-01-07 01:16:04, "George Steed" <george.steed at arm.com> wrote:
>Hi,
>
>There are existing SVE and SVE2 assembly implementations of some kernels
>that inspect the SVE vector length to determine which kernel loop to
>execute, however some of these have issues when running on vector
>lengths other than 128 bits.
>
>This patch series fixes all issues that previously failed in the
>TestBench executable when running with vector lengths longer than 128
>bits.
>
>This series is based on the master branch.
>
>Thanks,
>George
>
>George Steed (6):
> mc-a-sve2.S: Fix addAvg_{16,32}xh_sve2 for longer SVE vectors
> pixel-util-sve2.S: Fix accumulators in pixel_var_*_sve2
> pixel-util-sve2.S: Fix branch target in pixel_sub_ps_64x64_sve2
> asm-primitives.cpp: Delete dequant_scaling SVE2 implementation
> blockcopy8-sve.S: Fix branch target in cpy1Dto2D_shr_32x32_sve
> pixel-util-sve2.S: Fix normFact/ssimDist64 for longer SVE vectors
>
> source/common/aarch64/asm-primitives.cpp | 3 +-
> source/common/aarch64/blockcopy8-sve.S | 2 +-
> source/common/aarch64/mc-a-sve2.S | 24 +-----
> source/common/aarch64/pixel-util-sve2.S | 94 +++---------------------
> 4 files changed, 14 insertions(+), 109 deletions(-)
>
>--
>2.34.1
>
>_______________________________________________
>x265-devel mailing list
>x265-devel at videolan.org
>https://mailman.videolan.org/listinfo/x265-devel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20250107/c953e92d/attachment.htm>
More information about the x265-devel
mailing list