[x264-devel] RE : FPGAs and x264

David Smith agentdavo at mac.com
Sun Jul 5 12:01:44 CEST 2009


 > 16x16 SATD (I have no idea what you mean by "sub blocks") takes 170
 > cycles on a Nehalem CPU.  A SAD takes something around 42, but
 > normally 4 are batched up(SAD_X4) and that takes 152 clocks total.   
On
 > Phenom it's around 110 or so.
For example, a FPGA core that in a single cycle takes a 16x16  
macroblock then computes and returns all sub-blocks in that 16x16 area  
all the way down to 4x4 (called a systolic array in some articles).   
Same for SATD.

16,16   16,8    8,16    8,8    8,4    4,8    4,4

David.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x264-devel/attachments/20090705/0a6847ea/attachment.htm>


More information about the x264-devel mailing list