-
- Downloads
h264: Integrate clear_blocks calls with IDCT
The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
(in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
tested (cathedral), i.e. almost 30 cycles per mb faster.
Signed-off-by:
Martin Storsjö <martin@martin.st>
Showing
- libavcodec/arm/h264idct_neon.S 23 additions, 6 deletionslibavcodec/arm/h264idct_neon.S
- libavcodec/h264.c 7 additions, 7 deletionslibavcodec/h264.c
- libavcodec/h264_mb_template.c 1 addition, 10 deletionslibavcodec/h264_mb_template.c
- libavcodec/h264addpx_template.c 4 additions, 0 deletionslibavcodec/h264addpx_template.c
- libavcodec/h264dsp.c 2 additions, 2 deletionslibavcodec/h264dsp.c
- libavcodec/h264dsp.h 2 additions, 2 deletionslibavcodec/h264dsp.h
- libavcodec/h264idct_template.c 12 additions, 4 deletionslibavcodec/h264idct_template.c
- libavcodec/h264pred.h 4 additions, 4 deletionslibavcodec/h264pred.h
- libavcodec/h264pred_template.c 18 additions, 10 deletionslibavcodec/h264pred_template.c
- libavcodec/ppc/h264_altivec.c 3 additions, 0 deletionslibavcodec/ppc/h264_altivec.c
- libavcodec/svq3.c 2 additions, 2 deletionslibavcodec/svq3.c
- libavcodec/x86/h264_idct.asm 83 additions, 25 deletionslibavcodec/x86/h264_idct.asm
- libavcodec/x86/h264_idct_10bit.asm 48 additions, 5 deletionslibavcodec/x86/h264_idct_10bit.asm
Loading
Please register or sign in to comment