[x264-devel] [PATCH 03/11] aarch64: NEON asm for missing x264_zigzag_* functions
Janne Grunau
janne-x264 at jannau.net
Mon Aug 25 11:45:46 CEST 2014
On 2014-08-22 17:26:36 +0200, Janne Grunau wrote:
> zigzag_scan_4x4_field_neon, zigzag_sub_4x4_field_neon,
> zigzag_sub_4x4ac_field_neon, zigzag_sub_4x4_frame_neon,
> igzag_sub_4x4ac_frame_neon more than 2 times faster
>
> zigzag_scan_8x8_frame_neon, zigzag_scan_8x8_field_neon,
> zigzag_sub_8x8_field_neon, zigzag_sub_8x8_frame_neon 4-5 times faster
>
> zigzag_interleave_8x8_cavlc_neon 6 times faster
> ---
> common/aarch64/dct-a.S | 327 +++++++++++++++++++++++++++++++++++++++++++++++++
> common/aarch64/dct.h | 13 ++
> common/dct.c | 21 +++-
> 3 files changed, 360 insertions(+), 1 deletion(-)
patch is broken, fixed in my git tree. It expected libav style ARCH_*
defines.
Janne
More information about the x264-devel
mailing list