[x265] [PATCH 0/7] AArch64: Add Neon impl of transform functions
Micro Daryl Robles
microdaryl.robles at arm.com
Tue Nov 26 13:23:58 UTC 2024
Hi,
This patch series adds Neon intrinsics implementations of DST4, IDST4,
DCT4, IDCT4 and IDCT8, and improves the existing Neon implementations
of IDCT16 and IDCT32.
This series is based on the master branch.
Many thanks,
Micro
Micro Daryl Robles (7):
  AArch64: Add Neon implementation of 4x4 DST
  AArch64: Add Neon implementation of 4x4 IDST
  AArch64: Add Neon implementation of 4x4 DCT
  AArch64: Add Neon implementation of 4x4 IDCT
  AArch64: Add Neon implementation of 8x8 IDCT
  AArch64: Improve the Neon implementation of 16x16 IDCT
  AArch64: Improve the Neon implementation of 32x32 IDCT
 source/common/aarch64/dct-prim.cpp | 1438 +++++++++++++++++++++-------
 1 file changed, 1117 insertions(+), 321 deletions(-)
--
2.34.1