Last active
September 10, 2025 14:59
-
-
Save Basten7/091df055c04edaa9c88eb0cdc7fc429d to your computer and use it in GitHub Desktop.
Prompt Processing vs Token Generation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Classic LLM-inference trace on the GPU |
Author
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
LLM-inference trace on Metal3 Build at 0.1 ms

LLM-inference trace on Vulkan Build at 0.1 ms
