<div style="line-height:1.7;color:#000000;font-size:14px;font-family:arial"><DIV>>+movu m1, [r2]<BR>>+punpcklbw m2, m1, m0<BR>Here have a hide register copy, try to avoid it by SSE4.1 "pmovzxbw m2, m1"</DIV>
<DIV> </DIV>
<DIV>>+movu [r0], m2<BR>>+punpckhbw m1, m0<BR>>+movu [r0 + 16], m1<BR></DIV></div>