
Sidekick
Privacy-first local LLM app for macOS — 3.2K+ GitHub stars
A native macOS app that lets users chat with a local LLM that can respond with information from files, folders, and websites — all without installing any other software. Every conversation happens offline and data stays on-device. Sidekick is local-first, with a built-in llama.cpp inference engine for GGUF models like Qwen 3.5, while also accommodating OpenAI-compatible APIs for remote model options.
Key Highlights
- 3.2K+ GitHub stars, ranked #1 on Google for "mac llm app"
- Experts & RAG: scoped knowledge domains over local files, folders, and websites with GraphRAG
- Agentic function calling: sequential tool-call loops (web search, file ops, run_python, email drafting)
- Deep Research: autonomous multi-step agent that reads 50–80 webpages and synthesizes full reports
- Persistent memory across conversations for personalized, context-aware responses
- Canvas: create, edit, and live-preview websites, code, and documents with selective text editing
- Toolbox: Inline Writing Assistant (Cmd+Ctrl+I in any app), AI Detector, Diagrammer, Slide Studio
- Image generation via CoreML with automatic prompt classification — no mode switching
- Advanced rendering: LaTeX math, auto-generated charts from tables, Mermaid diagrams
- Speculative decoding on Apple Silicon for fast local inference
Screenshots

Main conversation interface with local LLM
Development
Project Info
GitHub Stars Over Time
Milestones
Local LLM chat with file/folder/website indexing and web search
Up to 40% faster local inference on Apple Silicon
On-device image generation via CoreML (macOS 15.2+)
Support for DeepSeek-R1 and other reasoning-capable models
Prompt-to-diagram extension with integrated Mermaid preview
Prompt-to-presentation with PDF/PowerPoint export
Live website/code preview with selective in-place editing
Sequential tool-call loops replace the standalone code interpreter
Persistent memory across conversations for personalised responses
Autonomous multi-step agent reading 50–80 pages and synthesising reports
Reorganised extensions, refreshed UI, expanded model library
Latest release candidate; current as of April 2026
Latest Release
1.0.0-rc.181.0.0 Release Candidate 18 — latest release