Search

Showing top 4 results for "DeepSeek" (semantic rerank: on)

Tracked topic

DeepSeek

24 articles indexed · Last updated 3d ago

Open-R1: a fully open reproduction of DeepSeek-R1

…https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/modeling_deepseek.py Is it possible to contribute to this project? · Yes, you can look at https://huggingface.co/open-r1 and https…

Mar 27, 2025 · Elie Bakouch

How to deploy and fine-tune DeepSeek models on AWS

…I have been trying to deploy deepseek-ai/DeepSeek-R1-Distill-Qwen-32B on Inferentia with a context window higher than 4096 (let's say MAX_TOTAL_TOKENS=8192), but it seems…

Mar 27, 2025 · Simon Pagezy

Open-source DeepResearch – Freeing our search agents

…I tested out the new DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Unbiased model yesterday. It was a very crude test, but I was quite impressed. I'm a newb over here…

Mar 27, 2025 · Aymeric Roucher

Open R1: Update #2

…According to DeepSeek's paper, DeepSeek-Distill-Qwen-7B's performance in MATH-500 and AIME24 is 92.8 and 55.5 respectively, which seems to be very different from the values…

Feb 6, 2025
