Lemonade is AMD’s open, local first runtime and API layer for building and running AI applications directly on PCs. It provides a unified, OpenAI compatible API that lets developers run text, image, and speech models locally without managing hardware specific details or fragmented runtimes. Lemonade is implemented in lightweight C++ and installs quickly across Windows, Linux, macOS, and Docker. At runtime, it automatically selects and configures optimized CPU, GPU, and NPU backends for each machine, allowing multiple models to run concurrently based on available system memory. The platform sup