[x265] [PATCH] aq: bug fix.extend border of TComPic::m_origPicYuv to a multiple of 16
Aarthi Priya Thirumalai
aarthi at multicorewareinc.com
Fri Nov 8 05:50:33 CET 2013
On Nov 7, 2013 11:24 PM, "Steve Borho" <steve at borho.org> wrote:
>
>
>
>
> On Thu, Nov 7, 2013 at 5:17 AM, Aarthi Thirumalai <
aarthi at multicorewareinc.com> wrote:
>>
>> # HG changeset patch
>> # User Aarthi Thirumalai
>> # Date 1383823017 -19800
>> # Thu Nov 07 16:46:57 2013 +0530
>> # Node ID 0823d243b4e2f9eee6323a9750eeb811bccbad77
>> # Parent 85002898f5b4308547af6ce464bbdff5f360fa13
>> aq: bug fix.extend border of TComPic::m_origPicYuv to a multiple of 16
>>
>> diff -r 85002898f5b4 -r 0823d243b4e2 source/Lib/TLibCommon/TComPicYuv.cpp
>> --- a/source/Lib/TLibCommon/TComPicYuv.cpp Thu Nov 07 14:31:05 2013
+0530
>> +++ b/source/Lib/TLibCommon/TComPicYuv.cpp Thu Nov 07 16:46:57 2013
+0530
>> @@ -345,6 +345,10 @@
>> /* width and height - without padsize */
>> int width = m_picWidth - padx;
>> int height = m_picHeight - pady;
>> + uint8_t rem = m_picWidth % 16;
>> + padx += rem ? 16 - rem : 0;
>> + rem = m_picHeight % 16;
>> + pady += rem ? 16 - rem : 0;
>
>
> if the row width was 17, padx would already be 3 in order to pad out to
the nearest multiple of 4. You would be doing: padx += 15, which would
result in a pad of 18, which is more than what you bargained for. I
believe the += should just be =. Also, Gopu just added a new else {}
clause this is missed here, so I've hoisted this operation higher in the
file.
>
>>when row width is 17, padx is already 3 and m_picWidth would be 20 at
this stage. So padx += 12 not 15..
>>
>> // Manually copy pixels to up-size them
>> for (int r = 0; r < height; r++)
>> @@ -475,10 +479,12 @@
>> uint8_t *v = (uint8_t*)pic.planes[2];
>>
>> /* width and height - without padsize */
>> - int width = (m_picWidth * (pic.bitDepth > 8 ? 2 : 1)) - padx;
>> - int height = m_picHeight - pady;
>> -
>> - // copy pixels by row into encoder's buffer
>> + int width = (m_picWidth * (pic.bitDepth > 8 ? 2 : 1)) - padx;
>> + int height = m_picHeight - pady;
>> + uint8_t rem = m_picWidth % 16;
>> + padx += rem ? 16 - rem : 0;
>> + rem = m_picHeight % 16;
>> + pady += rem ? 16 - rem : 0;
>
>
> still white-space alignment problems here, but this code has been removed
>
>>
>> for (int r = 0; r < height; r++)
>> {
>> memcpy(Y, y, width);
>> diff -r 85002898f5b4 -r 0823d243b4e2 source/common/lowres.cpp
>> --- a/source/common/lowres.cpp Thu Nov 07 14:31:05 2013 +0530
>> +++ b/source/common/lowres.cpp Thu Nov 07 16:46:57 2013 +0530
>> @@ -150,31 +150,6 @@
>> intraMbs[i] = 0;
>> }
>>
>> - int y, extWidth = (orig->getWidth() + X265_LOWRES_CU_SIZE - 1);
>> - int srcStride = orig->getStride();
>> - int srcHeight = orig->getHeight();
>> - int srcWidth = orig->getWidth();
>> - Pel *src;
>> - src = orig->getLumaAddr();
>> -
>> - /* extending right margin */
>> - if (2 * width > srcWidth)
>> - {
>> - for (y = 0; y < srcHeight; y++)
>> - {
>> - ::memset(src + srcWidth, src[srcWidth - 1], sizeof(Pel) *
(X265_LOWRES_CU_SIZE - 1));
>> - src += srcStride;
>> - }
>> - }
>> -
>> - /* extending bottom margin */
>> - src = orig->getLumaAddr() + (srcHeight - 1) * srcStride;
>> -
>> - for (y = 1; y <= 2 * lines - srcHeight; y++)
>> - {
>> - ::memcpy(src + y * srcStride, src, sizeof(Pel) * (extWidth));
>> - }
>> -
>> /* downscale and generate 4 HPEL planes for lookahead */
>> primitives.frame_init_lowres_core(orig->getLumaAddr(),
>> lowresPlane[0], lowresPlane[1],
lowresPlane[2], lowresPlane[3],
>> diff -r 85002898f5b4 -r 0823d243b4e2 source/encoder/frameencoder.cpp
>> --- a/source/encoder/frameencoder.cpp Thu Nov 07 14:31:05 2013 +0530
>> +++ b/source/encoder/frameencoder.cpp Thu Nov 07 16:46:57 2013 +0530
>> @@ -1149,9 +1149,8 @@
>> {
>> /* Derive qpOffet for each CU by averaging offsets for all
16x16 blocks in the cu. */
>> double qp_offset = 0;
>> - int blockSize = g_maxCUWidth >> 2;
>> - int maxBlockCols = (pic->getPicYuvOrg()->getWidth() +
(blockSize - 1)) / blockSize;
>> - int maxBlockRows = (pic->getPicYuvOrg()->getHeight() +
(blockSize - 1)) / blockSize;
>> + int maxBlockCols = (pic->getPicYuvOrg()->getWidth() + (16 - 1))
/ 16;
>> + int maxBlockRows = (pic->getPicYuvOrg()->getHeight() + (16 -
1)) / 16;
>
>
> much better
>
>>
>> int block_y = (cuAddr / pic->getPicSym()->getFrameWidthInCU())
* 4;
>> int block_x = (cuAddr * 4) - block_y *
pic->getPicSym()->getFrameWidthInCU();
>> int cnt = 0;
>> _______________________________________________
>> x265-devel mailing list
>> x265-devel at videolan.org
>> https://mailman.videolan.org/listinfo/x265-devel
>
>
>
>
> --
> Steve Borho
>
> _______________________________________________
> x265-devel mailing list
> x265-devel at videolan.org
> https://mailman.videolan.org/listinfo/x265-devel
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20131108/32b7b57c/attachment-0001.html>
More information about the x265-devel
mailing list