Our local LLMs run here, on 16GB of total VRAM.
The model weights are stored here.
The local LLM box
This machine runs the local models used for translation and image text extraction. The model files live on the Samsung SSD inside the same machine. Today, the beta runs on two RTX 5060 GPUs with 16GB of VRAM in total (8GB each). If FormattedChinese.com gains traction, we plan to upgrade the hardware so we can run stronger local models.