Just copped a 6000 Blackwell Pro. Which LLM to install first?
Date: June 3rd, 2026 12:59 PM Author: Dan Bilzerian
On one server I have a 5090, a 5080, a 5070 Ti, and a 5060 Ti, but I'm only using the 5090 and 5080 to run my Hermes Agent right now, because I only need 48 GB to run Qwen3.6 27B at Q8. I have two 3090s sitting in another server and I use them to do OCR, translation, image gen and shit like that. I just have a bunch of LXC containers running Ollama on the 3090s, and my Hermes Agent connects to whatever Ollama container it needs for a given task. I used to run a separate model for coding tasks, but Qwen3.6 does that well enough now. I only paid about $700 apiece for the 3090s plus another $100 to get them re-pasted by a shop; now they go for like $1200 on eBay with 6-year-old paste lol
(http://www.autoadmit.com/thread.php?thread_id=5870420&forum_id=2/#49912852)
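
For anyone wondering what "connects to whatever Ollama container it needs" could look like in practice, here is a rough sketch, not the poster's actual setup. It assumes each LXC container exposes Ollama's standard HTTP API on the default port 11434 and uses the `/api/generate` endpoint with streaming turned off. The hostnames, model tags, the `ROUTES` table, and the `build_request` and `run_task` helpers are all made up for illustration.

```python
import json
import urllib.request

# Hypothetical task -> (Ollama base URL, model tag) table.
# Hostnames and model tags are placeholders, not from the post.
ROUTES = {
    "ocr": ("http://ollama-ocr.lan:11434", "example-vision-model"),
    "translation": ("http://ollama-translate.lan:11434", "example-translate-model"),
}


def build_request(task: str, prompt: str) -> urllib.request.Request:
    """Build a POST to /api/generate on the container mapped to this task."""
    if task not in ROUTES:
        raise KeyError(f"no Ollama container configured for task {task!r}")
    base_url, model = ROUTES[task]
    # "stream": False asks Ollama for one JSON object instead of a stream.
    body = json.dumps({"model": model, "prompt": prompt, "stream": False})
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_task(task: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the matching container and return the generated text."""
    request = build_request(task, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

Calling `run_task("translation", "Translate to English: ...")` would hit whichever container the table points at, so swapping a model or moving a container to another GPU is just a one-line change in `ROUTES`.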