sub_group_size can be left to its default value if you aren't using NVMe offload, but you may want to change it if you: Run into an OOM error during the optimizer step.