[vlc-devel] [PATCH] arm neon i420_yuyv/i420_uyvy/s32_s16 : a bit less cycles (untested)
Rémi Denis-Courmont
remi at remlab.net
Fri Jun 24 08:51:37 CEST 2011
Le vendredi 24 juin 2011 04:10:34 Rafaël Carré, vous avez écrit :
> i420_*: Do not push lr on the stack and use bx lr
> should be the same number of cycles but with less memory usage
> (can't check as i can't find the pdf with the number of cycles per
> instruction)
The Cortex-A8 can load/store two registers per cycle. So your patch is
actually slower.
--
Rémi Denis-Courmont
http://www.remlab.net/
http://fi.linkedin.com/in/remidenis
More information about the vlc-devel
mailing list