<html><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span class="Apple-style-span" style="font-family: Arial, Helvetica, fantasy; font-size: 12px; line-height: 18px; "><div>OK,</div><div><br></div><div>I am in the process of packing up dependencies so that I can pass the compute of x264_me_search_ref over to a PCIe x4 multi core DSP board with 2Gb global memory. <span class="Apple-style-span" style="font-size: medium; ">Assume a messaging system is already in place between the host PC and the dsp board.</span></div><div><span class="Apple-style-span" style="font-size: medium; "><br></span></div><div><span class="Apple-style-span" style="font-size: medium; ">A function replicating x264_me_search_ref is running on DSP core 0</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">w</span><span class="Apple-style-span" style="font-size: medium; ">i</span><span class="Apple-style-span" style="font-size: medium; ">t</span><span class="Apple-style-span" style="font-size: medium; ">h</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">j</span><span class="Apple-style-span" style="font-size: medium; ">o</span><span class="Apple-style-span" style="font-size: medium; ">b</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">management</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">t</span><span class="Apple-style-span" style="font-size: medium; ">h</span><span class="Apple-style-span" style="font-size: medium; ">a</span><span class="Apple-style-span" style="font-size: medium; ">t</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">s</span><span class="Apple-style-span" style="font-size: medium; ">e</span><span class="Apple-style-span" style="font-size: medium; ">n</span><span class="Apple-style-span" style="font-size: medium; ">d</span><span class="Apple-style-span" style="font-size: medium; ">s</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">a x264_me_search_ref call<span class="Apple-style-span" style="font-size: 12px; "><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">t</span><span class="Apple-style-span" style="font-size: medium; ">o</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">e</span><span class="Apple-style-span" style="font-size: medium; ">a</span><span class="Apple-style-span" style="font-size: medium; ">c</span><span class="Apple-style-span" style="font-size: medium; ">h</span><span class="Apple-style-span" style="font-size: medium; "> </span><span class="Apple-style-span" style="font-size: medium; ">c</span><span class="Apple-style-span" style="font-size: medium; ">o</span><span class="Apple-style-span" style="font-size: medium; ">r</span><span class="Apple-style-span" style="font-size: medium; ">e</span><span class="Apple-style-span" style="font-size: medium; ">.</span></span></span></div><div><span class="Apple-style-span" style="font-size: medium; "><br></span></div><div><span class="Apple-style-span" style="font-size: medium; ">I have been trying to figure out the data dependancy with Understand v2 / Netbeans and how other threads change, if at all, the current thread structures *h & *m. Would it simply a case of copying the *h *m structures for each call of x264_me_search_ref over to the dsp board memory and returning them back at the end for each thread? I assume that would be an unnecessarily large replication of data for each call per thread.</span></div><div><span class="Apple-style-span" style="font-size: medium; "><br></span></div><div><span class="Apple-style-span" style="font-size: medium; ">How about if the structures are copied once on the first call per frame? Would that be enough to satisfy the dependancy for the rest of the frame up until </span></div><div><span class="Apple-style-span" style="font-size: medium; ">h->mb.i_mb_xy = h->m.i_mb_count </span></div><div><span class="Apple-style-span" style="font-size: medium; "><br></span></div><div><span class="Apple-style-span" style="font-size: medium; ">Any thoughts or comments welcomed.</span></div><div><span class="Apple-style-span" style="font-size: medium; "><br></span></div><div><span class="Apple-style-span" style="font-size: medium; ">Thanks,</span></div></span><div><div><font class="Apple-style-span" face="Arial, Helvetica, -webkit-fantasy"><span class="Apple-style-span" style="line-height: 18px; ">David.</span></font></div></div></body></html>