Intel Assembler optimization
- by Søren Haagerup
I'm currently trying to optimize the code emitted from a home-made compiler, for a home-made language.
I've tried out Intel VTune to see where the bottlenecks are: http://www.imada.sdu.dk/~sorenh07/misc/vtune-assembly-optimization.png
I find it very impressive that a "subl"-instruction is responsible for over 38% of the clockticks in a program…