<div dir="ltr">It might be more prudent to start from a profile showing the top functions that are limiting performance instead of a blind list of functions to optimize. You can use gprof (on linux), or vtune to extract this profile. Just make sure to compile with RelWithDebInfo to get debug symbols for the profiler to use. If you are using vtune, you can enable ENABLE_VTUNE in cmake.<div><br></div><div>Pradeep.<br><div class="gmail_extra">
<br><div class="gmail_quote">On Tue, Aug 23, 2016 at 11:22 AM, N Vijay Anand <span dir="ltr"><<a href="mailto:nvijay.anand@trispacetech.com" target="_blank">nvijay.anand@trispacetech.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">I was under impression typecasting does sign extension<div>irrespective of unsigned/signed type of data.</div><div><br></div><div>BTW, I was trying to vectorise deblock filter.</div><div>Seems not much gain in this function. It would be useful</div><div>to have a list of function yet to be vector optimised.</div><div><br></div><div><br></div></div>
<br>______________________________<wbr>_________________<br>
x265-devel mailing list<br>
<a href="mailto:x265-devel@videolan.org">x265-devel@videolan.org</a><br>
<a href="https://mailman.videolan.org/listinfo/x265-devel" rel="noreferrer" target="_blank">https://mailman.videolan.org/<wbr>listinfo/x265-devel</a><br>
<br></blockquote></div><br></div></div></div>