[x264-devel] commit: add AltiVec implementation of x264_pixel_var_16x16 and x264_pixel_var_8x8 (Guillaume Poirier )

Loren Merritt lorenm at u.washington.edu
Sat Jan 24 02:45:16 CET 2009


On Fri, 23 Jan 2009, git version control wrote:

> x264 | branch: master | Guillaume Poirier <gpoirier at mplayerhq.hu> | Fri Jan 23 13:53:06 2009 -0800| [71ac0a34bc0460bf67da68f300e4150bc50d9aae] | committer: Guillaume Poirier

> + sum_v = vec_add( sum_v, vec_sld( sum_v, sum_v, 8 ) );
> + sum_v = vec_add( sum_v, vec_sld( sum_v, sum_v, 4 ) );

vec_sums?

> + pix0_v = vec_perm(pix0_v, pix0_v, perm0);
> + pix1_v = vec_perm(pix1_v, pix1_v, perm1);
> + vec_u8_t pix_v = vec_mergeh(pix0_v, pix1_v);

This can be a single vec_perm. The map then can't be generated by 
vec_lvsl, but there's only 4 possibilities (2 if you make stride mod16), 
so LUT it.

--Loren Merritt


More information about the x264-devel mailing list