<div style="line-height:1.7;color:#000000;font-size:14px;font-family:arial"><DIV>At 2014-01-17 14:00:55,"Jason Garrett-Glaser" <jason@x264.com> wrote:<BR>>+ movu m0, [r2 + 1] ; [16 15 14 13<BR>>12 11 10 9 8 7 6 5 4 3 2 1]<BR>>+ palignr m1, m0, 1 ; [x 16 15 14<BR>>13 12 11 10 9 8 7 6 5 4 3 2]<BR>><BR>>Shouldn't this be pslrdq or similar? The dependency on uninitialized<BR>>registers is a bit weird too...<BR></DIV>
<DIV>I suggested this algorithm. psrldq shifts a register in place and cannot write</DIV>
<DIV>its result to a different register, so we would have to waste an extra instruction copying m0 first.</DIV>
<DIV>Of course, this comes with a restriction: the uninitialized value must not be used by any other instruction.</DIV>
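<DIV>To make the trade-off concrete, here is a rough byte-level sketch in Python. It only models the two instructions on lists (index 0 is the lowest byte); the function names are illustrative and not part of any real API, and it assumes the two-operand SSE form of palignr with a shift of at most 16 bytes.</DIV>

```python
# Byte-level model of the two instructions; index 0 is the lowest byte.
# These helpers are illustrative only, not real APIs.

def psrldq(dst, imm):
    """In-place byte shift right: dst is both source and destination."""
    return dst[imm:] + [0] * imm

def palignr(dst, src, imm):
    """Two-operand PALIGNR for imm in 0..16: shift the 32-byte value
    dst:src right by imm bytes and keep the low 16 bytes."""
    return (src + dst)[imm:imm + 16]

m0 = list(range(1, 17))   # pixels 1..16, as loaded by movu
m1 = ["x"] * 16           # never written, so its contents are unknown

# One instruction: m0 stays intact, only the top byte of the result is unknown.
shifted_palignr = palignr(m1, m0, 1)

# psrldq alternative: copy the register first (mova m1, m0), then shift.
copy_of_m0 = list(m0)
shifted_psrldq = psrldq(copy_of_m0, 1)
```

<DIV>Both results agree in the low 15 bytes. They differ only in the top byte, which is unknown after palignr and zero after psrldq, so the palignr form is safe as long as that byte is never consumed.</DIV>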
<DIV> </DIV></div>