Search

Showing top 10 results for "Qwen3"

Qwen3

Qwen3 is an AI model family developed by Alibaba, released as a set of large language models for natural-language tasks.

40 articles indexed Last updated 2d ago See topic hub

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

Apr 30, 2025 · Igor Margulis

SmolLM3: smol, multilingual, long-context reasoner

…Thanks for sharing!! Probably unrelated question, the graph shows Qwen3 1.7B has 2B parameters. Is it correct? · Model page show that's why we made this msitake But it's because…

Sep 10, 2025 · Elie Bakouch

Get your VLM running in 3 simple steps on Intel CPUs

…Qwen3-VL 8B was the newest model I tried and it ran the fastest - only surpassed by Qwen3 3B. Benchmarking with OpenVINO https://www.linkedin.com/pulse/benchmarking-openvino-steven-leve-gcg7c…

Apr 8, 2025 · Ezequiel Lanza

SOTA OCR with Core ML and dots.ocr

…I was able to perform a rough comparison between Dots.OCR.Runner and other VLMs such as Magistral-Small-2509 and qwen3-vl-30b , using their top quantized versions that can run…

Oct 3, 2025 · Christopher Fleetwood

nanoVLM: The simplest repository to train your VLM in pure PyTorch

…python serving_bench.py \ --model /path/to/Qwen3-14B/ \ --request-rate 10 \ --num-requests 1024 \ --tensor-parallel-size 1 \ --max-num-batched-tokens 1024 \ --max-num-seqs 1024 \ --random-input-len 128…

Feb 6, 2025 · Aritra Roy Gosthipaty

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

…https://github.com/askbudi/TinyCodeAgent · that's very cool @ insightfactory ! This is my agent.json { "model": "qwen3:4b", "endpointUrl": " http://localhost:11434/ ", "provider": "auto", "servers": [ { "type": "sse", "config": { "url": " http://127.0…

Jan 12, 2025 · Célina Hanouti

Followed topics

Search

Qwen3

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

SmolLM3: smol, multilingual, long-context reasoner

Get your VLM running in 3 simple steps on Intel CPUs

SOTA OCR with Core ML and dots.ocr

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Welcome EmbeddingGemma, Google's new efficient embedding model

We Got Claude to Build CUDA Kernels and teach open models!

Vision Language Models (Better, faster, stronger)

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL