[x264-devel] commit: Add AltiVec implementation of predict_8x8c_p. 2.6x faster than scalar C. (Guillaume Poirier)

Loren Merritt lorenm at u.washington.edu
Sun Jan 18 23:13:39 CET 2009


On Sun, 18 Jan 2009, git version control wrote:

> x264 | branch: master | Guillaume Poirier <gpoirier at mplayerhq.hu> | Sun Jan 18 22:44:14 2009 +0100| [09e76c903d3419619ed326a4dd114369a55bdd6e] | committer: Guillaume Poirier
>
> +    vec_s16_t induc_v  = (vec_s16_t) CV(0, 1, 2, 3, 4, 5, 6, 7);
> +    vec_s32_t mule_b_v = vec_mule(induc_v, b_v);
> +    vec_s32_t mulo_b_v = vec_mulo(induc_v, b_v);
> +    vec_s16_t mul_b_induc0_v = vec_pack(vec_mergeh(mule_b_v, mulo_b_v), vec_mergel(mule_b_v, mulo_b_v));
> +    vec_s16_t add_i0_b_0v = vec_adds(i00_v, mul_b_induc0_v);

Is there no plain 16bit multiply? vec_mladd?

--Loren Merritt
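
Editorial sketch of the point raised above, in plain scalar C so it runs anywhere. It models the quoted sequence (vec_mule and vec_mulo widen to 32-bit products, vec_mergeh and vec_mergel restore element order, vec_pack truncates back to 16 bits) next to a model of vec_mladd, which keeps the low 16 bits of a*b + c in each lane. The function names and the LANES constant are hypothetical and are not part of x264 or of the AltiVec API. Under these assumptions the two paths agree lane by lane when the addend is zero.

```c
#include <stdint.h>

#define LANES 8  /* a 128-bit AltiVec register holds eight 16-bit lanes */

/* Scalar model of the quoted sequence: form the full 32-bit product of
 * each lane pair, then keep only the low 16 bits, as the modular
 * (non-saturating) vec_pack does. */
static void mul_widen_then_pack(const int16_t *a, const int16_t *b,
                                int16_t *out)
{
    for (int i = 0; i < LANES; i++) {
        int32_t wide = (int32_t)a[i] * (int32_t)b[i];
        /* Truncate to the low 16 bits; the cast back to int16_t wraps
         * on the usual two's-complement targets. */
        out[i] = (int16_t)(uint16_t)((uint32_t)wide & 0xFFFFu);
    }
}

/* Scalar model of vec_mladd(a, b, c): low 16 bits of a*b + c per lane.
 * Unsigned 32-bit arithmetic avoids signed overflow; the low 16 bits
 * of the result are the same for signed and unsigned inputs. */
static void mladd_model(const int16_t *a, const int16_t *b,
                        const int16_t *c, int16_t *out)
{
    for (int i = 0; i < LANES; i++) {
        uint32_t sum = (uint32_t)(uint16_t)a[i] * (uint32_t)(uint16_t)b[i]
                     + (uint32_t)(uint16_t)c[i];
        out[i] = (int16_t)(uint16_t)(sum & 0xFFFFu);
    }
}
```

Passing i00_v as the third operand would also fold the following add into the same operation, but that add would then wrap instead of saturating as vec_adds does, so it is only a drop-in replacement when the sums are known to stay within 16 bits.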

