Additionally there is still too much performance left on the table by not properly using CPU vector units.