[x264-devel] RE : FPGAs and x264

chen chenm003 at 163.com
Mon Jul 6 03:10:29 CEST 2009


Hi,

	Oh my god.
    You want use 16x16 SAD array & some add tree to do this?
it will be large area, I think to educate only.
    The SAD, I suggest you use the pipeline full-search.

======= 2009-07-05 18:01:44 =======

> > 16x16 SATD (I have no idea what you mean by "sub blocks") takes 170
> > cycles on a Nehalem CPU.  A SAD takes something around 42, but
> > normally 4 are batched up(SAD_X4) and that takes 152 clocks total.   
>On
> > Phenom it's around 110 or so.
>For example, a FPGA core that in a single cycle takes a 16x16  
>macroblock then computes and returns all sub-blocks in that 16x16 area  
>all the way down to 4x4 (called a systolic array in some articles).   
>Same for SATD.
>
>16,16   16,8    8,16    8,8    8,4    4,8    4,4
>
>David.
>
>_______________________________________________
>x264-devel mailing list
>x264-devel at videolan.org
>http://mailman.videolan.org/listinfo/x264-devel
>

= = = = = = = = = = = = = = = = = = = =
			
        chen
        chenm003 at 163.com
          2009-07-06



More information about the x264-devel mailing list