This requires a recent CUDA release and works best on recent NVIDIA GPUs.