[x264-devel] commit: add AltiVec implementation of x264_pixel_var_16x16 and x264_pixel_var_8x8 (Guillaume Poirier)
Loren Merritt
lorenm at u.washington.edu
Sat Jan 24 02:45:16 CET 2009
On Fri, 23 Jan 2009, git version control wrote:
> x264 | branch: master | Guillaume Poirier <gpoirier at mplayerhq.hu> | Fri Jan 23 13:53:06 2009 -0800| [71ac0a34bc0460bf67da68f300e4150bc50d9aae] | committer: Guillaume Poirier
> + sum_v = vec_add( sum_v, vec_sld( sum_v, sum_v, 8 ) );
> + sum_v = vec_add( sum_v, vec_sld( sum_v, sum_v, 4 ) );
vec_sums?
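
To illustrate the suggestion, here is a scalar C model (not actual AltiVec code) comparing the two reductions. The names `v4si`, `rotate_lanes`, `hsum_rotate` and `model_vec_sums` are hypothetical. The model assumes four signed 32-bit lanes in big-endian element order, and follows the documented vec_sums behaviour: the saturated sum of all four elements of the first operand plus element 3 of the second lands in element 3, and the other elements are zeroed.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical scalar stand-in for a vector of four signed 32-bit lanes. */
typedef struct { int32_t e[4]; } v4si;

/* Models vec_sld(a, a, 4 * lanes): with both operands equal, the
   byte shift is a rotation by whole lanes. */
static v4si rotate_lanes(v4si a, int lanes)
{
    assert(lanes >= 0 && lanes < 4);
    v4si r;
    for (int i = 0; i < 4; i++)
        r.e[i] = a.e[(i + lanes) & 3];
    return r;
}

/* Models vec_add on 32-bit lanes (modular, no saturation). */
static v4si add_lanes(v4si a, v4si b)
{
    v4si r;
    for (int i = 0; i < 4; i++)
        r.e[i] = (int32_t)((uint32_t)a.e[i] + (uint32_t)b.e[i]);
    return r;
}

/* The reduction from the patch: two rotate+add steps.
   Afterwards every lane holds the total. */
static v4si hsum_rotate(v4si s)
{
    s = add_lanes(s, rotate_lanes(s, 2)); /* vec_sld by 8 bytes */
    s = add_lanes(s, rotate_lanes(s, 1)); /* vec_sld by 4 bytes */
    return s;
}

/* Models vec_sums(a, b): one operation, total in element 3 only. */
static v4si model_vec_sums(v4si a, v4si b)
{
    int64_t t = b.e[3];
    for (int i = 0; i < 4; i++)
        t += a.e[i];
    if (t > INT32_MAX) t = INT32_MAX;
    if (t < INT32_MIN) t = INT32_MIN;
    v4si r = { { 0, 0, 0, (int32_t)t } };
    return r;
}
```

In this model, passing an all-zero second operand to `model_vec_sums` gives the same total as `hsum_rotate`, but only in element 3, so the code that extracts the result has to read that element.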
> + pix0_v = vec_perm(pix0_v, pix0_v, perm0);
> + pix1_v = vec_perm(pix1_v, pix1_v, perm1);
> + vec_u8_t pix_v = vec_mergeh(pix0_v, pix1_v);
This can be a single vec_perm. The map then can't be generated by
vec_lvsl, but there are only 4 possibilities (2 if you require stride to
be a multiple of 16), so LUT it.
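
A rough sketch of the LUT idea, again as a scalar C model rather than real AltiVec code. The names `v16u8`, `model_vec_ld`, `model_vec_perm`, `gather_lut` and `load_two_rows` are hypothetical. It assumes each 8-pixel row starts at an offset that is a multiple of 8, so within its aligned 16-byte load a row sits at byte 0 or byte 8. The result holds row 0 followed by row 1 rather than the interleaving vec_mergeh produces, which should not matter for a sum or a sum of squares.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar stand-in for a vector of 16 unsigned bytes. */
typedef struct { uint8_t b[16]; } v16u8;

/* Models vec_ld: the low 4 bits of the address are ignored.
   Here "address" is an offset into a buffer assumed 16-byte aligned. */
static v16u8 model_vec_ld(const uint8_t *buf, size_t pos)
{
    v16u8 r;
    for (int i = 0; i < 16; i++)
        r.b[i] = buf[(pos & ~(size_t)15) + i];
    return r;
}

/* Models vec_perm(a, b, map): the low 5 bits of each map byte
   select one of the 32 bytes of the concatenation a:b. */
static v16u8 model_vec_perm(v16u8 a, v16u8 b, v16u8 map)
{
    v16u8 r;
    for (int i = 0; i < 16; i++) {
        int idx = map.b[i] & 0x1f;
        r.b[i] = idx < 16 ? a.b[idx] : b.b[idx - 16];
    }
    return r;
}

/* gather_lut[row0 at byte 8][row1 at byte 8]: the 4 possible maps.
   With stride a multiple of 16, only [0][0] and [1][1] are used. */
static const v16u8 gather_lut[2][2] = {
    { {{ 0, 1, 2, 3, 4, 5, 6, 7, 16,17,18,19,20,21,22,23 }},
      {{ 0, 1, 2, 3, 4, 5, 6, 7, 24,25,26,27,28,29,30,31 }} },
    { {{ 8, 9,10,11,12,13,14,15, 16,17,18,19,20,21,22,23 }},
      {{ 8, 9,10,11,12,13,14,15, 24,25,26,27,28,29,30,31 }} },
};

/* Gathers two 8-pixel rows into one vector with a single permute. */
static v16u8 load_two_rows(const uint8_t *buf, size_t pos, size_t stride)
{
    assert(pos % 8 == 0 && stride % 8 == 0);
    v16u8 row0 = model_vec_ld(buf, pos);
    v16u8 row1 = model_vec_ld(buf, pos + stride);
    int i0 = (pos & 15) != 0;
    int i1 = ((pos + stride) & 15) != 0;
    return model_vec_perm(row0, row1, gather_lut[i0][i1]);
}
```

In real code the table index would presumably be computed once per call, outside the row loop, since the row offsets repeat every two rows.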
--Loren Merritt