25. Write kernels with modern GPU compilers
Use Triton-style kernel programming and compiler-driven GPU tools to express tiled parallel algorithms with less low-level boilerplate. You will compare generated kernels with CUDA kernels and see where these tools are strong or limited.