sysbench oltp test, the result is not stable after warmup, why?

512 threads are very heavy workload for MySQL.
While it is possible to get stable result, it will require a special tuning.
Can you try your experiment with 40 threads to see how it works?