c0 benchmark with sv-3010 network (384x30),
default settings (minibatch-size=256)
---------------------------------------------
GPU baseline optimized perf gain (%)
---------------------------------------------
Titan RTX 17443 20084 15.1
RTX 3090 26820 29767 11.0
A100 41785 48815 16.8
minibatch-size=1024, all other settings default:
---------------------------------------------
GPU baseline optimized perf gain (%)
---------------------------------------------
Titan RTX 20211 23003 13.8
RTX 3090 33032 36924 11.8
A100 52732 59134 12.1
Powered by mwForum 2.29.3 © 1999-2014 Markus Wichitill