[x264-devel] [PATCH 03/11] aarch64: NEON asm for missing x264_zigzag_* functions

Janne Grunau janne-x264 at jannau.net
Mon Aug 25 11:45:46 CEST 2014


On 2014-08-22 17:26:36 +0200, Janne Grunau wrote:
> zigzag_scan_4x4_field_neon, zigzag_sub_4x4_field_neon,
> zigzag_sub_4x4ac_field_neon, zigzag_sub_4x4_frame_neon,
> igzag_sub_4x4ac_frame_neon more than 2 times faster
> 
> zigzag_scan_8x8_frame_neon, zigzag_scan_8x8_field_neon,
> zigzag_sub_8x8_field_neon, zigzag_sub_8x8_frame_neon 4-5 times faster
> 
> zigzag_interleave_8x8_cavlc_neon 6 times faster
> ---
>  common/aarch64/dct-a.S | 327 +++++++++++++++++++++++++++++++++++++++++++++++++
>  common/aarch64/dct.h   |  13 ++
>  common/dct.c           |  21 +++-
>  3 files changed, 360 insertions(+), 1 deletion(-)

patch is broken, fixed in my git tree. It expected libav style ARCH_* 
defines.

Janne



More information about the x264-devel mailing list