<div dir="ltr">Ok, thanks.<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Sep 11, 2014 at 11:47 AM, Praveen Tiwari <span dir="ltr"><<a href="mailto:praveen@multicorewareinc.com" target="_blank">praveen@multicorewareinc.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">You can push 16x16 and 32x32 also they are good in performance but they need a bit more improvement, I will be sending improvement patch soon.<div><br></div><div>Regards,</div><div>Praveen Tiwari</div></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Sep 11, 2014 at 11:29 AM, Deepthi Nandakumar <span dir="ltr"><<a href="mailto:deepthi@multicorewareinc.com" target="_blank">deepthi@multicorewareinc.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Would be better to combine this asm enable with the corresponding asm patch itself. I have pushed copy_cnt8, and enabled only that for now. <br></div><div class="gmail_extra"><br><div class="gmail_quote"><div><div>On Wed, Sep 10, 2014 at 3:28 PM,  <span dir="ltr"><<a href="mailto:praveen@multicorewareinc.com" target="_blank">praveen@multicorewareinc.com</a>></span> wrote:<br></div></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div># HG changeset patch<br>
# User Praveen Tiwari<br>
# Date 1410343073 -19800<br>
# Node ID 2cd4a13086740728559fde3a176953e9aa4c0782<br>
# Parent  7bc4db02ccc728f6e2ddedd036c96e3d37b90f22<br>
copy_cnt: enable avx2 version of asm code<br>
<br>
diff -r 7bc4db02ccc7 -r 2cd4a1308674 source/common/x86/asm-primitives.cpp<br>
--- a/source/common/x86/asm-primitives.cpp      Wed Sep 10 14:45:33 2014 +0530<br>
+++ b/source/common/x86/asm-primitives.cpp      Wed Sep 10 15:27:53 2014 +0530<br>
@@ -1724,14 +1724,10 @@<br>
         p.sad_x4[LUMA_16x32] = x265_pixel_sad_x4_16x32_avx2;<br>
         p.ssd_s[BLOCK_32x32] = x265_pixel_ssd_s_32_avx2;<br>
<br>
-        /* Need to update assembly code as per changed interface of the copy_cnt primitive, once<br>
-         * code is updated, avx2 version will be enabled */<br>
-/*<br>
         p.copy_cnt[BLOCK_4x4] = x265_copy_cnt_4_avx2;<br>
         p.copy_cnt[BLOCK_8x8] = x265_copy_cnt_8_avx2;<br>
         p.copy_cnt[BLOCK_16x16] = x265_copy_cnt_16_avx2;<br>
         p.copy_cnt[BLOCK_32x32] = x265_copy_cnt_32_avx2;<br>
-*/<br>
<br>
         p.cvt32to16_shl[BLOCK_4x4] = x265_cvt32to16_shl_4_avx2;<br>
         p.cvt32to16_shl[BLOCK_8x8] = x265_cvt32to16_shl_8_avx2;<br></div></div>
_______________________________________________<br>
x265-devel mailing list<br>
<a href="mailto:x265-devel@videolan.org" target="_blank">x265-devel@videolan.org</a><br>
<a href="https://mailman.videolan.org/listinfo/x265-devel" target="_blank">https://mailman.videolan.org/listinfo/x265-devel</a><br>
</blockquote></div><br></div>
<br>_______________________________________________<br>
x265-devel mailing list<br>
<a href="mailto:x265-devel@videolan.org" target="_blank">x265-devel@videolan.org</a><br>
<a href="https://mailman.videolan.org/listinfo/x265-devel" target="_blank">https://mailman.videolan.org/listinfo/x265-devel</a><br>
<br></blockquote></div><br></div>
</div></div><br>_______________________________________________<br>
x265-devel mailing list<br>
<a href="mailto:x265-devel@videolan.org">x265-devel@videolan.org</a><br>
<a href="https://mailman.videolan.org/listinfo/x265-devel" target="_blank">https://mailman.videolan.org/listinfo/x265-devel</a><br>
<br></blockquote></div><br></div>