Running Deepseek R1 671B Versions Locally or 70B on Groq Remotely
The distilled versions of Deepseek are not as good as the full model. They are vastly inferior, and other models outperform them handily. Running the full model with a 16K or greater context window is possible for about $2000, at about 4 tokens per second. The machine specs: AMD EPYC 7702, 512GB ...
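To put the roughly 4 tokens per second figure in perspective, here is a minimal Python sketch that converts a response length into wall-clock time. The function names are hypothetical, the 4 tokens/second default is taken from the estimate above, and "16K" is assumed to mean 16,384 tokens. It only covers token generation and ignores prompt processing time, so treat the numbers as a rough lower bound.

```python
def generation_time_seconds(num_tokens: int, tokens_per_second: float = 4.0) -> float:
    """Estimate seconds needed to generate num_tokens at a steady rate.

    The 4.0 default mirrors the approximate throughput quoted in the text;
    real throughput will vary with hardware and context length.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def format_duration(seconds: float) -> str:
    """Render a duration in seconds as 'Hh MMm SSs'."""
    minutes, secs = divmod(int(round(seconds)), 60)
    hours, minutes = divmod(minutes, 60)
    return f"{hours}h {minutes:02d}m {secs:02d}s"


if __name__ == "__main__":
    # Illustrative response lengths; 16384 assumes "16K" means 16 * 1024 tokens.
    for n in (500, 2000, 16384):
        print(f"{n:>6} tokens: {format_duration(generation_time_seconds(n))}")
```

At this rate a 500-token answer takes about two minutes, and generating a full 16K tokens would take over an hour, which is the practical trade-off of the low-cost local setup.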
Link:
https://www.nextbigfuture.com/2025/02/running-deepseek-r1-671b-locally-or-70b-on-groq.html