Gpu programming expert - fully remote
BerlinMercor
...for performance, efficiency, and hardware utilization. Use profiler metrics like L2 cache hit rate, L2 throughput, and occupancy to guide kernel improvements. Review GPU kernel implementations to identify bottlenecks without needing extensive algorithmic background. Write, modify, and reason about C++17, Python, and GPU [...]
Kategorie Medien / Verlag / Redaktion