News
How to Use MCP Servers with Local LLMs | Unsloth Documentation
3+ hour, 40+ min ago (1446+ words) Learn how to connect MCP Servers to open AI models with screenshots. We'll use the open-source repos Unsloth and llama.cpp as they are popular frameworks for local model inference/deployment. MCP works for local GGUF models and cloud provider…
Install Unsloth on macOS | Unsloth Documentation
6+ day, 3+ hour ago (157+ words) To install Unsloth on your Apple macOS device, follow the steps below: Use the same command to update. Every time you want to launch Unsloth again: For detailed Unsloth Studio install instructions and requirements, view our guide....
Connect API Providers & Model Servers to Unsloth | Unsloth Documentation
6+ day, 5+ hour ago (1063+ words) Guide to connect OpenAI, Anthropic, Ollama, llama.cpp, vLLM and other providers to Unsloth. Add API keys or model server URLs, load models, and use external models in chat. Learn how to run models from OpenAI, Anthropic,…
Unsloth Joins the PyTorch Ecosystem
3+ week, 3+ hour ago (20+ words) Unsloth, May 11, 2026. By Daniel & Michael. Run + train models via a UI...
How to Make LLM Training Faster with Unsloth and NVIDIA
3+ week, 5+ day ago (1542+ words) Fine-tuning is one of today's most computationally intensive workloads, and it continues to push hardware to its limits. NVIDIA GPUs are purpose-built for these workloads: they break complex problems into pieces and process them in parallel. Unsloth works across the…
How to use Unsloth as an API endpoint | Unsloth Documentation
4+ week, 14+ hour ago (887+ words) You can now use local LLMs via tools like Claude Code and Codex by connecting them to Unsloth's API endpoint. This means you'll be able to directly run local Qwen and Gemma models in those tools with Unsloth Studio with…
How to Run Local AI Models with Hermes Agent | Unsloth Documentation
4+ week, 15+ hour ago (527+ words) Guide on using open LLMs with Hermes Agent locally. This guide enables you to run open LLMs locally with Hermes Agent via Unsloth. Hermes Agent is an open-source autonomous AI agent that connects to a model endpoint, executes tasks, and…
How to Run Local AI Models with OpenClaw | Unsloth Documentation
4+ week, 15+ hour ago (420+ words) Guide to running local LLMs with OpenClaw. This guide will enable you to use open LLMs locally with OpenClaw by connecting it to Unsloth. OpenClaw is an open-source AI agent interface that connects to a model to…
How to Run Local AI Models with OpenCode | Unsloth Documentation
4+ week, 15+ hour ago (548+ words) Guide to connect open LLMs with OpenCode on your local device. After setup, OpenCode connects to Unsloth, where you can select a loaded model and use it as a coding agent. In this tutorial, we'll use Unsloth-gpt-oss-20b loaded…
IBM Granite 4.1 - How to Run Locally | Unsloth Documentation
1+ mon, 2+ day ago (661+ words) Run IBM Granite-4.1 with Unsloth GGUFs and learn how to fine-tune! IBM releases Granite-4.1 models in 3 sizes: 3B, 8B and 30B. Granite-4.1 is a long-context dense model family, built for instruction following, tool calling, chat, RAG and coding use cases. The models…