[vlc-devel] [PATCH] arm_neon: Add an optimized routine for NV12/21 to I420

Martin Storsjö martin at martin.st
Mon Sep 30 18:14:17 CEST 2013


On Mon, 30 Sep 2013, Denis Charmet wrote:

> Hi,
>
> Le lundi 30 septembre 2013 à 11:50:16, Martin Storsjö a écrit :
>> +	vld1.u8		{q0},		[IY,:128]!
>> +	pld		[UV, #16]
>> +	vld2.u8		{d2, d3},	[UV,:128]!
>
> <Noob question> why using only q0 and q1? isn't it possible to load Y in
> {q0,q1} for example? </noob question>

I guess that could be done as well, although it would increase some of the 
requirements for the alignment of the width. But I'm removing the copying 
of the Y data here anyway and replacing it with a plain memcpy as 
suggested by Rémi.

// Martin


More information about the vlc-devel mailing list