Nvidia describes the whole thing using the term 'Inference on Sample,' and the results are impressive, to say the least.
Note: You may need 80GB GPU memory to run this script with deepseek-vl2-small and even larger for deepseek-vl2.