<div style="line-height:1.7;color:#000000;font-size:14px;font-family:arial"><DIV>>+punpcklbw m3, m4<BR>>+punpcklbw m7, m5, m6<BR>>+punpcklbw m3, m7<BR>Do you want to transpose a 4x4 block? If so, see below; that way you don't need the constant tab_Cm.</DIV>
<DIV>punpcklbw m3, m4</DIV>
<DIV>punpcklbw m5, m6</DIV>
<DIV>punpcklwd m3, m5</DIV>
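<DIV>To sanity-check that these three unpacks really give a 4x4 byte transpose, here is a small plain-Python emulation. It is only a sketch: the helper <code>punpckl</code> and the sample pixel values are hypothetical, and it assumes each of m3..m6 holds one row of the block in its low 4 bytes.</DIV>

```python
def punpckl(a, b, width):
    """Emulate punpckl{bw,wd}: interleave the low 8 bytes of a and b
    in units of `width` bytes (1 = bytes, 2 = words)."""
    out = []
    for i in range(0, 8, width):
        out += a[i:i + width] + b[i:i + width]
    return out

# Hypothetical 4x4 block: row r sits in the low 4 bytes of a 16-byte register.
rows = [[r * 4 + c for c in range(4)] + [0] * 12 for r in range(4)]
m3, m4, m5, m6 = rows

m3 = punpckl(m3, m4, 1)  # punpcklbw m3, m4
m5 = punpckl(m5, m6, 1)  # punpcklbw m5, m6
m3 = punpckl(m3, m5, 2)  # punpcklwd m3, m5

# Each group of 4 bytes in m3 is now one column of the original block.
transposed = [m3[i:i + 4] for i in range(0, 16, 4)]
print(transposed)
```

<DIV>After the last step m3 holds the four columns back to back, so no shuffle constant is involved.</DIV>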
<DIV> </DIV>
<DIV>>+pextrw [r2], m2, 0<BR>>+pextrw [r2 + r3], m2, 2<BR>movd [r2], m2</DIV>
<DIV>pshufd m2, m2, 1</DIV>
<DIV>movd [r2+r3], m2</DIV>
<DIV> </DIV></div>