Hi, > I get 18% speedup with the attached patch on conroe x86_64 gcc-4.2.3. > It just removes some sign-extends from the inner loop. Have you tested your patch along with my patch on x86_64? However, your patch has no effect on 32bit machine. Zhou Zongyi 2009-01-12