use new malloc interface to do a single clean malloc