Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
Tracked topic
Qwen3 is an AI model family developed by Alibaba, released as a set of large language models for natural-language tasks.
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
…Thanks for sharing!! Probably unrelated question, the graph shows Qwen3 1.7B has 2B parameters. Is it correct? · Model page show that's why we made this msitake But it's because…
["qwen3"]