September 2014 Archives by subject
Starting: Mon Sep 1 10:26:25 CEST 2014
Ending: Tue Sep 30 17:59:13 CEST 2014
Messages: 391
- [x265] 1.3+94-0e0d0309e616 - GCC error at 28%
JMK
- [x265] [PATCH 01 of 14] motion: avoid extra iterations when no subpel motion found
Steve Borho
- [x265] [PATCH 02 of 14] bitcost: use enums for special constants rather than static const ints
Steve Borho
- [x265] [PATCH 03 of 14] nits: do not check for NULL from new operations
Steve Borho
- [x265] [PATCH 04 of 14] bitstream: add paren to avoid ambiguous precedence in X265_CHECK
Steve Borho
- [x265] [PATCH 05 of 14] entropy: fix SAO enable detection (refs #80)
Steve Borho
- [x265] [PATCH 06 of 14] encoder: use %u to sprintf unsigned ints (refs #80)
Steve Borho
- [x265] [PATCH 07 of 14] TComWeightPrediction: combine duplicate inline functions (refs #80)
Steve Borho
- [x265] [PATCH 08 of 14] predict: merge TComWeightPrediction functions into Predict
Steve Borho
- [x265] [PATCH 09 of 14] predict: don't bother keeping refidx as an array
Steve Borho
- [x265] [PATCH 0 of 2 ] TComDataCU: replace with more CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH 0 of 2 ] TComDataCU: replace with more CU structure details
Steve Borho
- [x265] [PATCH 0 of 2 ] TComDataCU: replace with more CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH 10 of 14] predict: combine redundant logic paths in predInterBi()
Steve Borho
- [x265] [PATCH 11 of 14] predict: use faster unidir prediction for B frames when weighting not enabled
Steve Borho
- [x265] [PATCH 12 of 14] predict: streamline getWpScaling()
Steve Borho
- [x265] [PATCH 13 of 14] predict: remove list argument from motionCompensation(), always REF_PIC_LIST_X
Steve Borho
- [x265] [PATCH 14 of 14] predict: remove checkIdenticalMotion()
Steve Borho
- [x265] [PATCH 1 of 2] asm: fix output mistake in pixel_ssd_ss_4xN
Min Chen
- [x265] [PATCH 1 of 2] asm: reduce number of movd in dequant_normal
Min Chen
- [x265] [PATCH 1 of 2] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH 1 of 2] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH 1 of 3] asm: optimize nquant by PSIGND, improve 11k cycles -> 9.8k cycles
Min Chen
- [x265] [PATCH 1 of 3] testbench(nquant): the Round value must be less than (2 ^ qbits)
Min Chen
- [x265] [PATCH 1 of 4] add analysis data structures and param options
sagar at multicorewareinc.com
- [x265] [PATCH 1 of 4] add analysis data structures and param options
Steve Borho
- [x265] [PATCH 1 of 4] add analysis data structures and param options
Nicolas Morey-Chaisemartin
- [x265] [PATCH 1 of 4] add analysis data structures and param options
Steve Borho
- [x265] [PATCH 1 of 4] add analysis data structures and param options
sagar at multicorewareinc.com
- [x265] [PATCH 1 of 4] add analysis data structures and param options
Steve Borho
- [x265] [PATCH 1 of 4] add intra-inter data structures and param options
sagar at multicorewareinc.com
- [x265] [PATCH 1 of 4] add intra-inter data structures and param options
Steve Borho
- [x265] [PATCH 1 of 4] analysis: hoist local function into anonymous namespace (file local)
Steve Borho
- [x265] [PATCH 1 of 4] testbench(quant): the Round value must be less than (2 ^ qbits)
Min Chen
- [x265] [PATCH 1 of 5] predict: inline single call of predInterBi()
Steve Borho
- [x265] [PATCH 2 of 2] asm: AVX2 version of dequant_normal, improve 9.3k Cycles -> 4.2k Cycles
Min Chen
- [x265] [PATCH 2 of 2] asm: enable SSE2 version of pixel_ssd_ss[]
Min Chen
- [x265] [PATCH 2 of 2] TComDataCU: replace getTotalNumPart() with CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH 2 of 2] TComDataCU: replace getTotalNumPart() with CU structure details
Deepthi Nandakumar
- [x265] [PATCH 2 of 2] TComDataCU: replace getTotalNumPart() with CU structure details
Santhoshini Sekar
- [x265] [PATCH 2 of 2] TComDataCU: replace getTotalNumPart() with CU structure details
Deepthi Nandakumar
- [x265] [PATCH 2 of 2] TComDataCU: replace getTotalNumPart() with CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH 2 of 3] asm: optimize nquant by PSIGND, improve 13k cycles -> 11k cycles
Min Chen
- [x265] [PATCH 2 of 3] asm: optimize nquant by PSIGND, improve 13k cycles -> 11k cycles
Deepthi Nandakumar
- [x265] [PATCH 2 of 3] asm: optimize nquant by PSIGND, improve 13k cycles -> 11k cycles
chen
- [x265] [PATCH 2 of 3] asm: reenable IACA support, it remove by 'inappropriate instruction...' patch
Min Chen
- [x265] [PATCH 2 of 3] asm: reenable IACA support, it remove by 'inappropriate instruction...' patch
BugMaster
- [x265] [PATCH 2 of 4] analysis: remove unused LAMBDA_PARTITION_SELECT
Steve Borho
- [x265] [PATCH 2 of 4] api: add x265_alloc_inter_intra_data and x265_free_inter_intra_data methods to allocate and free analysis info
sagar at multicorewareinc.com
- [x265] [PATCH 2 of 4] api: introduce x265_alloc_analysis_data and x265_free_analysis_data methods to allocate and free analysis buffers
sagar at multicorewareinc.com
- [x265] [PATCH 2 of 4] api: introduce x265_alloc_analysis_data and x265_free_analysis_data methods to allocate and free analysis info
sagar at multicorewareinc.com
- [x265] [PATCH 2 of 4] api: introduce x265_alloc_analysis_data and x265_free_analysis_data methods to allocate and free analysis info
Steve Borho
- [x265] [PATCH 2 of 4] testbench(quant): the qBits value must be more than or equal to 8
Min Chen
- [x265] [PATCH 2 of 5] predict: inline predInterUni(), getWpScaling() and simplify motionCompensation()
Steve Borho
- [x265] [PATCH 3 of 3] asm: avx2 version of nquant(), improve 11k cycles -> 7k cycles
Min Chen
- [x265] [PATCH 3 of 3] asm: avx2 version of nquant(), improve 9.8k cycles -> 5.3k cycles
Min Chen
- [x265] [PATCH 3 of 4] analysis: coding style and comment nits
Steve Borho
- [x265] [PATCH 3 of 4] asm: improve quant by replace variant shift to fixed shift, 19k cycles -> 16.6k cycles
Min Chen
- [x265] [PATCH 3 of 4] dump intra analysis information
sagar at multicorewareinc.com
- [x265] [PATCH 3 of 4] store analysis information in buffers
sagar at multicorewareinc.com
- [x265] [PATCH 3 of 4] store intra analysis information in buffers
sagar at multicorewareinc.com
- [x265] [PATCH 3 of 4] store intra analysis information in buffers
Steve Borho
- [x265] [PATCH 3 of 5] predict: remove check for reallocations, comment nits
Steve Borho
- [x265] [PATCH 4 of 4] analysis: remove #define conditionals for control flow
Steve Borho
- [x265] [PATCH 4 of 4] asm: avx2 version of quant, improve 16.6k cycles -> 8.4k cycles
Min Chen
- [x265] [PATCH 4 of 4] cli: add cli options analysis-mode and analysis-file
sagar at multicorewareinc.com
- [x265] [PATCH 4 of 4] cli: add cli options analysis-mode and analysis-file
Steve Borho
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
sagar at multicorewareinc.com
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
Steve Borho
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
Sagar Kotecha
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
Steve Borho
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
sagar at multicorewareinc.com
- [x265] [PATCH 4 of 4] cli: add cli options analysis-out and analysis-in
Steve Borho
- [x265] [PATCH 4 of 5] predict: combine and check allocations and return failures
Steve Borho
- [x265] [PATCH 5 of 5] predict: split weighted prediction values from WeightParam
Steve Borho
- [x265] [PATCH] add avx version for chroma_copy_ss 16x4, 16x8, 16x12, 16x16, 16x24, 16x32, 16x64 based on csp, approx 1.5x-2x speedup over SSE
sagar at multicorewareinc.com
- [x265] [PATCH] add avx version for chroma_copy_ss 16x4, 16x8, 16x12, 16x16, 16x24, 16x32, 16x64 based on csp, approx 1.5x-2x speedup over SSE
chen
- [x265] [PATCH] add avx version for chroma_copy_ss 16x4, 16x8, 16x12, 16x16, 16x24, 16x32, 16x64 based on csp, approx 1.5x-2x speedup over SSE
sagar at multicorewareinc.com
- [x265] [PATCH] add avx version for chroma_copy_ss 16x4, 16x8, 16x12, 16x16, 16x24, 16x32, 16x64 based on csp, approx 1.5x-2x speedup over SSE
Sagar Kotecha
- [x265] [PATCH] add avx version for chroma_copy_ss 16x4, 16x8, 16x12, 16x16, 16x24, 16x32, 16x64 based on csp, approx 1.5x-2x speedup over SSE
chen
- [x265] [PATCH] added copy_shl primitive
praveen at multicorewareinc.com
- [x265] [PATCH] added copy_shl primitive
chen
- [x265] [PATCH] added copy_shr primitive
praveen at multicorewareinc.com
- [x265] [PATCH] added copy_shr primitive
chen
- [x265] [PATCH] add fanout validation module to check param compatibility
sagar at multicorewareinc.com
- [x265] [PATCH] add fanout validation module to check param compatibility
Steve Borho
- [x265] [PATCH] add fanout validation module to check param compatibility
sagar at multicorewareinc.com
- [x265] [PATCH] add fanout validation module to check param compatibility
Sagar Kotecha
- [x265] [PATCH] add fanout validation module to check param compatibility
sagar at multicorewareinc.com
- [x265] [PATCH]Add iteration-skip to subpel refine
shevaxu
- [x265] [PATCH]Add iteration-skip to subpel refine
Steve Borho
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
santhoshini at multicorewareinc.com
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
Deepthi Nandakumar
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
Deepthi Nandakumar
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
Santhoshini Sekar
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
Deepthi Nandakumar
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
santhoshini at multicorewareinc.com
- [x265] [PATCH] analysis: add CU specific details to encodeCU()
Steve Borho
- [x265] [PATCH] Analysis: compressIntraCU clean up
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: CU structure now holds CU-specific information,
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: CU structure now holds CU-specific information,
Steve Borho
- [x265] [PATCH] analysis: CU structure now holds CU-specific information,
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: CU structure now holds CU-specific information,
Steve Borho
- [x265] [PATCH] Analysis: fix for binary mismatch for pass 2 in compressIntraCU()
ashok at multicorewareinc.com
- [x265] [PATCH] Analysis: fix for binary mismatch for pass 2 in compressSharedIntraCTU()
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: intra picture estimation (mode and split decision)information sharing
gopu at multicorewareinc.com
- [x265] [PATCH] analysis: Intra picture estimation information sharing
gopu at multicorewareinc.com
- [x265] [PATCH] analysis: Intra picture estimation information sharing
Steve Borho
- [x265] [PATCH] analysis: Intra picture estimation information sharing
Gopu Govindaswamy
- [x265] [PATCH] analysis: Intra picture estimation information sharing
gopu at multicorewareinc.com
- [x265] [PATCH] analysis: Intra picture estimation information sharing
Steve Borho
- [x265] [PATCH] analysis: modified compressInterCU_rd0_4() with CU-specific information
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: modified compressInterCU_rd5_6() with CU-specific information
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: remove redundant arguments, clean up variable names
deepthi at multicorewareinc.com
- [x265] [PATCH] api: do not reuse the analysisData buffer for more then one picture, set it NULL
gopu at multicorewareinc.com
- [x265] [PATCH] api: do not reuse the analysisData buffer for more then one picture, set it NULL
Steve Borho
- [x265] [PATCH] api: rename SAO options and params for clarity
Steve Borho
- [x265] [PATCH] api: rename SAO options and params for clarity
Steve Borho
- [x265] [PATCH] api: use generic names for analysis api
sagar at multicorewareinc.com
- [x265] [PATCH] asm-primitives.cpp: nits
praveen at multicorewareinc.com
- [x265] [PATCH] asm: avx2 asm code for dct4
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 asm code for dct4
chen
- [x265] [PATCH] asm: avx2 asm code for dct4
chen
- [x265] [PATCH] asm: avx2 asm code for dct4
chen
- [x265] [PATCH] asm: avx2 asm code for idct32x32
murugan at multicorewareinc.com
- [x265] [PATCH] asm: avx2 asm code for idct32x32
Steve Borho
- [x265] [PATCH] asm: avx2 asm code for idct32x32
chen
- [x265] [PATCH] asm: avx2 asm code for idct32x32
Murugan Vairavel
- [x265] [PATCH] asm: avx2 assembly code for dct16
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for dct16
chen
- [x265] [PATCH] asm: avx2 assembly code for dct16
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for dct16
Steve Borho
- [x265] [PATCH] asm: avx2 assembly code for dct16
Yuvaraj Venkatesh
- [x265] [PATCH] asm: avx2 assembly code for dct16
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for dct16
chen
- [x265] [PATCH] asm: avx2 assembly code for dct16
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for dct16
chen
- [x265] [PATCH] asm: avx2 assembly code for dct32x32
murugan at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for dct32x32
chen
- [x265] [PATCH] asm: avx2 assembly code for idct16x16
murugan at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for idct16x16
chen
- [x265] [PATCH] asm: avx2 assembly code for idct32x32
murugan at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for idct4x4
murugan at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for idct8x8
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 assembly code for idct8x8
chen
- [x265] [PATCH] asm: avx2 code for dct8x8
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for dct8x8
chen
- [x265] [PATCH] asm: avx2 code for dct8x8
yuvaraj at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for dct8x8
chen
- [x265] [PATCH] asm: enable 16bpp primitives of cvt32to16 and cvt16to32 for all block sizes
murugan at multicorewareinc.com
- [x265] [PATCH] asm: fix mismatch due to dct32 avx2 assembly code
murugan at multicorewareinc.com
- [x265] [PATCH] asm: replace mova by movu to avoid AVX2 testbench crash in dct16, dct32, denoise_dct, its same speed on Haswell
Deepthi Nandakumar
- [x265] [PATCH] asm: replace mova by movu to avoid AVX2 testbench crash in dct16, dct32, denoise_dct, its same speed on Haswell
chen
- [x265] [PATCH] asm: replace mova by movu to avoid AVX2 testbench crash in dct16, dct32, denoise_dct, its same speed on Haswell
Min Chen
- [x265] [PATCH] asm: replace ssse3 instruction in pixel_ssd_ss_*_sse2
Min Chen
- [x265] [PATCH] bloccopy_pp avx asm code: 32x32, 32x48, 32x64 improved by 803.69 -> 514.90, 1126.36 -> 655.24, 1454.09 -> 835.76 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] bloccopy_pp avx asm code: 32x32, 32x48, 32x64 improved by 803.69 -> 514.90, 1126.36 -> 655.24, 1454.09 -> 835.76 cycles
chen
- [x265] [PATCH] blockcopy_pp: 32x8, 32x16, 32x24, 32x32, 32x48, 32x64 AVX version of asm code, approx double speedup comapre to SSE
praveen at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp: 32x8, 32x16, 32x24, 32x32, 32x48, 32x64 AVX version of asm code, approx double speedup comapre to SSE
chen
- [x265] [PATCH] blockcopy_pp: 64x16, 64x32, 64x48, 64x64 AVX version of asm code, approx double speedup comapre to SSE
sagar at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp: avx asm code indentation
praveen at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp: avx asm code indentation
chen
- [x265] [PATCH] blockcopy_pp_32x16: avx asm code, improved 477.74 cycles -> 309.99
praveen at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp_32x16: avx asm code, improved 477.74 cycles -> 309.99
chen
- [x265] [PATCH] blockcopy_pp_32x24: avx asm code, improved 621.84 cycles -> 371.94
praveen at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp_32x24: avx asm code, improved 621.84 cycles -> 371.94
chen
- [x265] [PATCH] blockcopy_pp_32x8: avx asm code, improved 281.20 cycles -> 165.47
praveen at multicorewareinc.com
- [x265] [PATCH] blockcopy_pp_32x8: avx asm code, improved 281.20 cycles -> 165.47
chen
- [x265] [PATCH] blockcopy_ss: 64x16, 64x32, 64x48, 64x64 AVX version of asm code, approx double speedup comapre to SSE
sagar at multicorewareinc.com
- [x265] [PATCH] blockcopy_ss: 64x16, 64x32, 64x48, 64x64 AVX version of asm code, approx double speedup comapre to SSE
chen
- [x265] [PATCH] blockfill_s_16x16 avx2 asm code, performance improved 389.21 cycles -> 204.38 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] blockfill_s_16x16 avx2 asm code, performance improved 389.21 cycles -> 204.38 cycles
chen
- [x265] [PATCH] blockfill_s_16x16 avx2 asm code: performance improved from 389.21 cycles to 204.38 cycles, over sse version of asm code
praveen at multicorewareinc.com
- [x265] [PATCH] blockfill_s_16x16 avx2 asm code: performance improved from 389.21 cycles to 204.38 cycles, over sse version of asm code
chen
- [x265] [PATCH] blockfill_s_32x32 avx2 asm code, performance improved 1354.05 cycles -> 705.81 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] blockfill_s_32x32 avx2 asm code, performance improved 1354.05 cycles -> 705.81 cycles
Steve Borho
- [x265] [PATCH] blockfill_s_32x32 avx2 asm code: performance improved from 1354.05 cycles to 705.81 cycles, over sse version of asm code
praveen at multicorewareinc.com
- [x265] [PATCH] blockfill_s_32x32 avx2 asm code: performance improved from 1354.05 cycles to 705.81 cycles, over sse version of asm code
chen
- [x265] [PATCH] Changed FrameEncoder::m_tld to a pointer and set it to one of Encoder's ThreadLocalData instances
dtyx265 at gmail.com
- [x265] [PATCH] Changed FrameEncoder::m_tld to a pointer and set it to one of Encoder's ThreadLocalData instances
Steve Borho
- [x265] [PATCH] Changed FrameEncoder::m_tld to a pointer and set it to one of Encoder's ThreadLocalData instances
dave
- [x265] [PATCH] Changed FrameEncoder::m_tld to a pointer and set it to one of Encoder's ThreadLocalData instances
Steve Borho
- [x265] [PATCH] Changed FrameEncoder::m_tld to a pointer and set it to one of Encoder's ThreadLocalData instances
dave
- [x265] [PATCH] Changes for loadCTUData
dtyx265 at gmail.com
- [x265] [PATCH] Changes for loadCTUData
Deepthi Nandakumar
- [x265] [PATCH] Changes for loadCTUData
Santhoshini Sekar
- [x265] [PATCH] Changes for loadCTUData
Ashok Kumar Mishra
- [x265] [PATCH] Cleaned up TComDataCU::getQuadtreeTULog2MinSizeInCU for clarity and a bit of performance
dtyx265 at gmail.com
- [x265] [PATCH] Cleaned up TComDataCU::getQuadtreeTULog2MinSizeInCU for clarity and a bit of performance
Steve Borho
- [x265] [PATCH] conv16to32_count C interface modification, downscaling coeff from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] conv16to32_count renamed to copy_count as per new interface
praveen at multicorewareinc.com
- [x265] [PATCH] convert c++ reference to pointer on m_scalingList
Min Chen
- [x265] [PATCH] convert c++ reference to pointer on m_scalingList
Steve Borho
- [x265] [PATCH] convert c++ reference to pointer on m_scalingList
chen
- [x265] [PATCH] copy_cnt 4x4, eliminated move instructions, +1x improvement
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt 4x4, eliminated move instructions, +1x improvement
chen
- [x265] [PATCH] copy_cnt 4x4 AVX2 asm code, as per new interface
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt 4x4 AVX2 asm code, as per new interface
Steve Borho
- [x265] [PATCH] copy_cnt 4x4 AVX2 asm code, as per new interface
chen
- [x265] [PATCH] copy_cnt: enable avx2 version of asm code
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt: enable avx2 version of asm code
Deepthi Nandakumar
- [x265] [PATCH] copy_cnt: enable avx2 version of asm code
Praveen Tiwari
- [x265] [PATCH] copy_cnt: enable avx2 version of asm code
Deepthi Nandakumar
- [x265] [PATCH] copy_cnt: nits
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt: nits
chen
- [x265] [PATCH] copy_cnt_16, AVX2 asm code as per new interface, performance improved from 14.22x to 23.57x on HASWELL-I5
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_16, AVX2 asm code as per new interface, performance improved from 14.22x to 23.57x on HASWELL-I5
chen
- [x265] [PATCH] copy_cnt_16: avx2 asm code, improved 514.32 cycles -> 313.66 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_16: avx2 asm code, improved 514.32 cycles -> 313.66 cycles
chen
- [x265] [PATCH] copy_cnt_16: avx2 asm code as per new interface, improved 514.32 cycles -> 313.66 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_16: avx2 asm code as per new interface, improved 514.32 cycles -> 313.66 cycles
chen
- [x265] [PATCH] copy_cnt_32, AVX2 asm code as per new interface, performance improved from 16.81x to 32.16x on HASWELL-I5
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_32: avx2 asm code, improved 1521.17 cycles -> 934.46 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_32: avx2 asm code, improved 1521.17 cycles -> 934.46 cycles
chen
- [x265] [PATCH] copy_cnt_32: avx2 asm code as per new interface, improved 1521.17 cycles -> 934.46 cycles
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_4: combine mova and paddb to reduce code size, same speedup
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_4: enable fast non zero coefficient count path
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_4: enable fast non zero coefficient count path
chen
- [x265] [PATCH] copy_cnt_4: faster AVX2 code
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_4: faster AVX2 code
chen
- [x265] [PATCH] copy_cnt_4 asm code, corrected register uses
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_4 avx2 asm code: nit, same speedup by sse version
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_8, AVX2 asm code as per new interface, performance improved from 5.13x to 7.59x on HASWELL-I5
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_8, AVX2 asm code as per new interface, performance improved from 5.13x to 7.59x on HASWELL-I5
chen
- [x265] [PATCH] copy_cnt_8 AVX2 asm code, as per new interface
praveen at multicorewareinc.com
- [x265] [PATCH] copy_cnt_8 AVX2 asm code, as per new interface
Steve Borho
- [x265] [PATCH] copy_cnt_8 AVX2 asm code, as per new interface
chen
- [x265] [PATCH] copy_cnt replaced align load with unaligned load to avoid code crash, we are not sure about alignment of dst buffer
praveen at multicorewareinc.com
- [x265] [PATCH] count_nonzero asm code, reduceded code size by combining mova and packsswb
praveen at multicorewareinc.com
- [x265] [PATCH] count_nonzero asm code, reduceded code size by combining mova and packsswb
chen
- [x265] [PATCH] count_nonzero primitive optimization, downscaling quantCoef from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] count_nonzero primitive optimization, downscaling quantCoef from int32_t* to int16_t*
chen
- [x265] [PATCH] cvt16to32_cnt optimization
praveen at multicorewareinc.com
- [x265] [PATCH] cvt16to32_cnt optimization
chen
- [x265] [PATCH] denoise_dct: avx2 asm code
praveen at multicorewareinc.com
- [x265] [PATCH] denoise_dct: avx2 asm code
chen
- [x265] [PATCH] denoise_dct asm code: SSE version
praveen at multicorewareinc.com
- [x265] [PATCH] denoise_dct asm code: SSE version
chen
- [x265] [PATCH] denoiseDct: align performance data while reporting speedup
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct: nit unused asm function declarations
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct: SSE version of asm code
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct: SSE version of asm code
chen
- [x265] [PATCH] denoiseDct: test bench code
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct: test bench code
Steve Borho
- [x265] [PATCH] denoiseDct: unit test code
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct: unit test code
Steve Borho
- [x265] [PATCH] denoiseDct asm code: nit faulty code, need a new SSE version
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct test code: fixed typo
praveen at multicorewareinc.com
- [x265] [PATCH] denoiseDct unit test code: fixed bound value problem
praveen at multicorewareinc.com
- [x265] [PATCH] dequant_normal asm code optimization as per new interface
praveen at multicorewareinc.com
- [x265] [PATCH] dequant_normal optimization, downscaling quantCoef from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] dequant_normal optimization, downscaling quantCoef from int32_t* to int16_t*
Steve Borho
- [x265] [PATCH] dequant_scaling optimization, downscaling quantCoef from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] dpb: select best TMVP candidate from among all of the reference frames
gopu at multicorewareinc.com
- [x265] [PATCH] dpb: select best TMVP candidate from among all of the reference frames
Steve Borho
- [x265] [PATCH] encoder: rename cuCoder to analysis for better clarity
Steve Borho
- [x265] [PATCH] Entropy: Replaced getCtxQtCbf() with table
ashok at multicorewareinc.com
- [x265] [PATCH] Entropy: Replaced getCtxQtCbf() with table
Steve Borho
- [x265] [PATCH] Entropy: Replaced getCtxQtCbf() with table
ashok at multicorewareinc.com
- [x265] [PATCH] Entropy: Replaced getCtxQtCbf() with table
Deepthi Nandakumar
- [x265] [PATCH] fix: hash/binary mismatch for new CU structure holds CU-specific info
ashok at multicorewareinc.com
- [x265] [PATCH] fix: hash/binary mismatch for new CU structure holds CU-specific info
chen
- [x265] [PATCH] fix: hash/binary mismatch for new CU structure holds CU-specific info
Deepthi Nandakumar
- [x265] [PATCH] frameencoder: remove second encodeCU() pass over CTUs when SAO is disabled
Steve Borho
- [x265] [PATCH] frameencoder: remove second encodeCU() pass over CTUs when SAO is disabled
Steve Borho
- [x265] [PATCH] frameencoder: remove second encodeCU() pass over CTUs when SAO is disabled
Deepthi Nandakumar
- [x265] [PATCH] frameencoder: remove second encodeCU() pass over CTUs when SAO is disabled
Steve Borho
- [x265] [PATCH] nquant optimization, downscaling qCoef from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] optimize cvt32to16_shl by replacing copy_shl
praveen at multicorewareinc.com
- [x265] [PATCH] param: preset tuning changes
Steve Borho
- [x265] [PATCH] psy-rd: fix bug in chroma psyEnergy for intra 4x4
deepthi at multicorewareinc.com
- [x265] [PATCH] quant.cpp, cleaned redundant code
praveen at multicorewareinc.com
- [x265] [PATCH] quant_c optimization, downscaling qCoef from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] quant path cleanup
praveen at multicorewareinc.com
- [x265] [PATCH] rc: apply maxAU size restrictions while encoding each frame
aarthi at multicorewareinc.com
- [x265] [PATCH] rc: bug fix for 2 pass when bframes = 0. fixes Issue #77
aarthi at multicorewareinc.com
- [x265] [PATCH] rc: check for changes in scenecut input between multiple passes
aarthi at multicorewareinc.com
- [x265] [PATCH] rc: fix bugs in using boundary condition for cu while encoding each frame
aarthi at multicorewareinc.com
- [x265] [PATCH] rc: fixes for 2 pass + vbv to calculate frameSizePlanned accurately
aarthi at multicorewareinc.com
- [x265] [PATCH] rc: fixes for 2 pass + vbv to calculate frameSizePlanned accurately
Deepthi Nandakumar
- [x265] [PATCH] rc: use m_frameDuration instead of rce->frameDuration to derive complexity for each frame in 2nd pass
aarthi at multicorewareinc.com
- [x265] [PATCH] rd: move lambda and analysis qp init to rdcost.h
Steve Borho
- [x265] [PATCH] rdoQuant optimization, downscaling dstCoeff fron int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] removed copy_cnt_4 avx2 asm code: SSE version is eualy faster
praveen at multicorewareinc.com
- [x265] [PATCH] removed copy_cnt_4 avx2 asm code: SSE version is eualy faster
Praveen Tiwari
- [x265] [PATCH] Removed unnecessary call to loadCTUData
dtyx265 at gmail.com
- [x265] [PATCH] remove getNumPartInCU() and replace it with constant value
santhoshini at multicorewareinc.com
- [x265] [PATCH] remove getNumPartInCU() and replace it with constant value
Deepthi Nandakumar
- [x265] [PATCH] remove getNumPartInCU() and replace it with macro
santhoshini at multicorewareinc.com
- [x265] [PATCH] remove getNumPartInCU() and replace it with macro
Santhoshini Sekar
- [x265] [PATCH] remove getNumPartInCU() and replace it with macro
santhoshini at multicorewareinc.com
- [x265] [PATCH] remove getNumPartInCU() and replace it with macro
Deepthi Nandakumar
- [x265] [PATCH] remove getNumPartInCU() and replace it with macro
santhoshini at multicorewareinc.com
- [x265] [PATCH] Resolved gcc compiler error of mismatched type
dtyx265 at gmail.com
- [x265] [PATCH] Resolved gcc compiler error of mismatched type
Deepthi Nandakumar
- [x265] [PATCH] Resolve gcc warnings
dtyx265 at gmail.com
- [x265] [PATCH] Resolve gcc warnings
Deepthi Nandakumar
- [x265] [PATCH] search.cpp: fixed type conversion warning
praveen at multicorewareinc.com
- [x265] [PATCH] search.cpp: fixed type conversion warning
Deepthi Nandakumar
- [x265] [PATCH] search: cleanup and remove redundant variable in checkintra
gopu at multicorewareinc.com
- [x265] [PATCH] search: dump best motion statistics for P and B slices into analysisdata file
gopu at multicorewareinc.com
- [x265] [PATCH] search: dump best motion statistics for P and B slices into analysisdata file
Steve Borho
- [x265] [PATCH] search: give each Search instance an Entropy encoder (no output changes)
Steve Borho
- [x265] [PATCH] search: give each Search instance an Entropy encoder (no output changes)
Steve Borho
- [x265] [PATCH] search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]
Steve Borho
- [x265] [PATCH] search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]
Steve Borho
- [x265] [PATCH] search: measure RDO of intra modes within 25% of least cost [CHANGES OUTPUTS]
dave
- [x265] [PATCH] Search: remove redundant encode coefficients in intra for performance
ashok at multicorewareinc.com
- [x265] [PATCH] Search: remove redundant encode coefficients in intra for performance
Deepthi Nandakumar
- [x265] [PATCH] Search: remove redundant encode coefficients in intra for performance
Deepthi Nandakumar
- [x265] [PATCH] search: remove redundant loacal variables in encodeResAndCalcRdSkipCU
gopu at multicorewareinc.com
- [x265] [PATCH] search: remove redundant loacal variables in encodeResAndCalcRdSkipCU
Steve Borho
- [x265] [PATCH] search: simplify and remove redundant variables in getBestIntraModeChroma
gopu at multicorewareinc.com
- [x265] [PATCH] signBitHidingHDQ optimization, downscaling coeff from int32_t* to int16_t*
praveen at multicorewareinc.com
- [x265] [PATCH] TComDataCU: Reduced repeated function call to calculate depth range
ashok at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getTotalNumPart() with CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getTotalNumPart() with CU structure details
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getTotalNumPart() with CU structure details
Steve Borho
- [x265] [PATCH] TComDataCU: replace getTotalNumPart() with CU structure details
Steve Borho
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Santhoshini Sekar
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Deepthi Nandakumar
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Deepthi Nandakumar
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Deepthi Nandakumar
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Deepthi Nandakumar
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Steve Borho
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
santhoshini at multicorewareinc.com
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
dave
- [x265] [PATCH] TComDataCU: replace getZorderIdxInCU() with encodeIdx of CU structure
Steve Borho
- [x265] [PATCH] temporarily disable avx2 version of copy_cnt primitive, need to update as per new interface
praveen at multicorewareinc.com
- [x265] [PATCH] testbench.cpp: temporary fix for testbench crash
praveen at multicorewareinc.com
- [x265] [PATCH] vps: general_frame_only_constraint_flag is true in progressive videos
deepthi at multicorewareinc.com
- [x265] [PATCH] vps: vps_timing_info is always present
deepthi at multicorewareinc.com
- [x265] [PATCH] x86asm: warn when inappropriate instruction used in function with specified cpuflags
murugan at multicorewareinc.com
- [x265] [PATCH] x86asm: warn when inappropriate instruction used in function with specified cpuflags
Steve Borho
- [x265] [PATCH] x86asm: warn when inappropriate instruction used in function with specified cpuflags
Steve Borho
- [x265] [PATCH] x86asm: warn when inappropriate instruction used in function with specified cpuflags
Murugan Vairavel
- [x265] [PATCH] x86inc.asm: fix vpbroadcastd bug on Mac platform
Min Chen
- [x265] [PATCH] x86inc.asm: fix vpbroadcastd bug on Mac platform
Min Chen
- [x265] [PATCH] x86inc.asm: fix vpbroadcastd bug on Mac platform
Satoshi Nakagawa
- [x265] [PATCH] x86inc.asm: fix vpbroadcastd bug on Mac platform
Steve Borho
- [x265] [PATCH RFC] analysis: add CU specific details to encodeCU()
santhoshini at multicorewareinc.com
- [x265] [PATCH RFC] analysis: add CU specific details to encodeCU()
Deepthi Nandakumar
- [x265] change index of m_buOffsetY[] from raster to zscan
Satoshi Nakagawa
- [x265] change index of m_buOffsetY[] from raster to zscan
Steve Borho
- [x265] divide, encode, combine?
Michael Nordberg
- [x265] divide, encode, combine?
Tom Vaughan
- [x265] fix cbf context
Satoshi Nakagawa
- [x265] fix cbf context
Steve Borho
- [x265] fix cbf context
Satoshi Nakagawa
- [x265] fix CHECKED_BUILD
Satoshi Nakagawa
- [x265] fix getQuadtreeTULog2MinSizeInCU()
Satoshi Nakagawa
- [x265] fix sao
Satoshi Nakagawa
- [x265] Fwd: [PATCH] copy_cnt_4: faster AVX2 code
Praveen Tiwari
- [x265] Fwd: [PATCH] copy_cnt_4: faster AVX2 code
chen
- [x265] Fwd: [PATCH] denoiseDct: unit test code
Praveen Tiwari
- [x265] Fwd: Fwd: [PATCH] copy_cnt_4: faster AVX2 code
Praveen Tiwari
- [x265] Fwd: Fwd: [PATCH] copy_cnt_4: faster AVX2 code
chen
- [x265] HEVC Daily Log - Murugan/Yuvaraj 9/2/22014
Murugan Vairavel
- [x265] inline simple functions
Satoshi Nakagawa
- [x265] inline simple functions
Steve Borho
- [x265] inline simple functions
Satoshi Nakagawa
- [x265] inline simple functions
Steve Borho
- [x265] note: history rewrite
Steve Borho
- [x265] primitives: intra_pred[4][35] => intra_pred[35][4] (avoid *35)
Satoshi Nakagawa
- [x265] primitives: intra_pred[4][35] => intra_pred[35][4] (avoid *35)
Deepthi Nandakumar
- [x265] Recent Frames/second benchmarks per platform and per clip?
Raul Lopez
- [x265] Recent Frames/second benchmarks per platform and per clip?
Tom Vaughan
- [x265] Recent Frames/second benchmarks per platform and per clip?
Mario Rohkrämer
- [x265] Recent Frames/second benchmarks per platform and per clip?
Tom Vaughan
- [x265] Recent Frames/second benchmarks per platform and per clip?
Mario Rohkrämer
- [x265] Recent Frames/second benchmarks per platform and per clip?
Tom Vaughan
- [x265] refine deblocking filter
Satoshi Nakagawa
- [x265] refine deblocking filter
Steve Borho
- [x265] refine deblocking filter
Satoshi Nakagawa
- [x265] refine deblocking filter
Steve Borho
- [x265] refine deblocking filter
Steve Borho
- [x265] refine deblocking filter
Satoshi Nakagawa
- [x265] Regarding changeset 8154 "vec: remove idct8, we have SSSE3 assembly for it"
James Almer
- [x265] Regarding changeset 8154 "vec: remove idct8, we have SSSE3 assembly for it"
Steve Borho
- [x265] Regarding changeset 8154 "vec: remove idct8, we have SSSE3 assembly for it"
James Almer
- [x265] sao: remove frame-based SAO
Satoshi Nakagawa
- [x265] sao: remove frame-based SAO
Steve Borho
- [x265] sao: some cleanups
Satoshi Nakagawa
- [x265] sao: some cleanups
Deepthi Nandakumar
- [x265] simplify intra filter (with fix for da61cf406f16) (Re: primitives: intra_pred[4][35] => intra_pred[35][4] (avoid *35))
Satoshi Nakagawa
- [x265] target processor detection message in CMakeLists.txt
djcj
- [x265] x265 --> YASM 1.3.0 update
Michal Powalko
Last message date:
Tue Sep 30 17:59:13 CEST 2014
Archived on: Thu Dec 11 23:20:13 CET 2014
This archive was generated by
Pipermail 0.09 (Mailman edition).