[x265] [PATCH 0/7] AArch64: Add Neon impl of transform functions

Micro Daryl Robles microdaryl.robles at arm.com
Tue Nov 26 13:23:58 UTC 2024


Hi,

This patch series adds Neon intrinsic implementations of DST4, IDST4, 
DCT4, IDCT4, IDCT8 and includes improved implementations of IDCT16 
and IDCT32.

This series is based on the master branch.

Many thanks,
Micro

Micro Daryl Robles (7):
  AArch64: Add Neon implementation of 4x4 DST
  AArch64: Add Neon implementation of 4x4 IDST
  AArch64: Add Neon implementation of 4x4 DCT
  AArch64: Add Neon implementation of 4x4 IDCT
  AArch64: Add Neon implementation of 8x8 IDCT
  AArch64: Improve the Neon implementation of 16x16 IDCT
  AArch64: Improve the Neon implementation of 32x32 IDCT

 source/common/aarch64/dct-prim.cpp | 1438 +++++++++++++++++++++-------
 1 file changed, 1117 insertions(+), 321 deletions(-)

-- 
2.34.1



More information about the x265-devel mailing list