ref: 78136edcdc3f53bc63b58e76ec4b160a2da1a0e3
parent: b89eef8f8277d0e7142da7c4799ebf296cea8fa2
author: Jingning Han <jingning@google.com>
date: Wed Aug 7 10:45:37 EDT 2013
SSE2 high precision 32x32 forward DCT Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56