[x265] [PATCH 0/2] AArch64: Fix SVE DCT implementations

Mahesh Pittala mahesh at multicorewareinc.com
Tue Jun 24 16:59:30 UTC 2025


Apologies for the delay, the review and smoke testing on our end took longer than expected. The changes have now been pushed.
________________________________
From: x265-devel <x265-devel-bounces at videolan.org> on behalf of Jonathan Wright <Jonathan.Wright at arm.com>
Sent: Tuesday, June 24, 2025 10:06 PM
To: x265-devel at videolan.org <x265-devel at videolan.org>
Cc: nd <nd at arm.com>
Subject: Re: [x265] [PATCH 0/2] AArch64: Fix SVE DCT implementations

Hi,

Could someone take a look at these patches please?

Thanks,
Jonathan
________________________________
From: x265-devel <x265-devel-bounces at videolan.org> on behalf of Jonathan Wright <jonathan.wright at arm.com>
Sent: 10 June 2025 18:40
To: x265-devel at videolan.org <x265-devel at videolan.org>
Cc: nd <nd at arm.com>
Subject: [x265] [PATCH 0/2] AArch64: Fix SVE DCT implementations

Hi,

This patch series fixes bugs in the Arm SVE 16x16 and 32x32 DCT
implementations, and also mitigates a portion of the performance
regression due to the fix. Both SVE DCT implementations are still
sgnificantly faster than the equivalent Neon paths.

Note that the DCT unit tests did not show these bugs. They were found
after differences in encoded output videos were observed on Arm and
x86 for veryslow, slower and slow encoding presets. With these patches
applied encoded output matches for all speed presets.

Thanks,
Jonathan

Jonathan Wright (2):
  AArch64: Fix SVE 16x16 and 32x32 DCT implementations
  AArch64: Specialize passes of 16x16 and 32x32 SVE DCTs

 source/common/aarch64/dct-prim-sve.cpp | 338 ++++++++++++++++++++++---
 1 file changed, 306 insertions(+), 32 deletions(-)

--
2.39.5 (Apple Git-154)

_______________________________________________
x265-devel mailing list
x265-devel at videolan.org
https://mailman.videolan.org/listinfo/x265-devel
IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20250624/3a65ff4e/attachment-0001.htm>


More information about the x265-devel mailing list