Your AI runs on your iPhone — not in the cloud. Chat, transcribe voice notes, summarize recordings, and generate images, all on-device.
Chat with powerful language models, transcribe voice notes, and generate images — all running directly on your iPhone or iPad. No cloud. No servers. No accounts. Your data never leaves your device.
onLM is a native iOS app that brings state-of-the-art AI to a fully offline workspace. Every message, recording, and image is processed locally using your device's hardware, giving you a genuinely private AI assistant that works without an internet connection and without a subscription.
PRIVATE AI CHAT
Chat with open-source LLMs that run entirely on your device. Conversations are stored only on your iPhone or iPad — no servers, no sign-up, no telemetry. Pick a model that fits your task and switch between them anytime.
VOICE NOTES, TRANSCRIPTION & SUMMARIES
Record audio, transcribe it to text, and summarize long recordings into key points — all processed on-device. Useful for meeting notes, interviews, lectures, or spoken ideas. Recordings are auto-titled based on content for easy browsing.
ON-DEVICE IMAGE GENERATION
On supported devices, generate images from text prompts without sending anything to a remote server. Write your prompt in any supported language and onLM translates it locally before generation. Your images stay in a private gallery on your device.
CHOOSE FROM LEADING OPEN-SOURCE MODELS
- Gemma 4 E2B & E4B — Google's edge models with native audio and vision support
- Gemma 3 (4B, 12B) — strong multilingual capabilities from Google
- Qwen 3.5 (2B, 4B, 9B) — excellent all-around performance
- Llama 3 (3B, 8B) — reliable general-purpose models from Meta
- Phi 4 Mini — optimized for math, logic, and code from Microsoft
- Mistral 7B — versatile European-built model
All downloadable models are 4-bit quantized and run efficiently on mobile hardware via Apple's MLX framework.
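To give a rough sense of why 4-bit quantization matters on a phone, the sketch below estimates weight storage at different bit widths. The helper name `estimatedWeightGB` and the nominal 7-billion parameter count are illustrative assumptions, not taken from onLM or MLX. The estimate ignores quantization scales, the KV cache, and activations, so real memory use will be somewhat higher.

```swift
import Foundation

/// Rough estimate of weight storage in gigabytes (1 GB = 1e9 bytes).
/// Hypothetical helper for illustration; not part of onLM or MLX.
func estimatedWeightGB(parameters: Double, bitsPerWeight: Double) -> Double {
    let bytes = parameters * bitsPerWeight / 8.0   // 8 bits per byte
    return bytes / 1_000_000_000.0
}

// Assumed nominal size for a "7B" model; real parameter counts differ slightly.
let nominal7B = 7_000_000_000.0
let fp16GB = estimatedWeightGB(parameters: nominal7B, bitsPerWeight: 16)
let q4GB = estimatedWeightGB(parameters: nominal7B, bitsPerWeight: 4)

print(String(format: "fp16: %.1f GB, 4-bit: %.1f GB", fp16GB, q4GB))
// prints "fp16: 14.0 GB, 4-bit: 3.5 GB"
```

Under these assumptions, the 4-bit weights take about a quarter of the space of 16-bit weights, which is the difference between fitting and not fitting in a phone's memory.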
APPLE INTELLIGENCE INTEGRATION
On supported devices, onLM can use Apple Intelligence as a built-in chat model with zero setup — no download, instant responses. Switch between Apple Intelligence and open-source models at any time.
BUILT FOR YOUR DEVICE
onLM detects your iPhone or iPad's capabilities and recommends the best models for your hardware. Quality ratings help you balance speed and intelligence, and features that require more memory are only surfaced on devices that can handle them — so you never hit a wall unexpectedly.
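One way such hardware-aware recommendations could work is to compare installed memory against a per-model threshold. The sketch below is only an illustration: `ModelOption`, `recommendedModels`, the catalog names, and the memory thresholds are all hypothetical, and onLM's actual logic is not described in this listing. The only platform call used is `ProcessInfo.processInfo.physicalMemory`, which reports installed RAM in bytes.

```swift
import Foundation

/// Hypothetical catalog entry; names and thresholds are illustrative only.
struct ModelOption {
    let name: String
    let minimumRAMGB: Double
}

/// Returns the options whose assumed memory requirement fits the device,
/// ordered from most to least demanding.
func recommendedModels(from catalog: [ModelOption], deviceRAMGB: Double) -> [ModelOption] {
    return catalog
        .filter { $0.minimumRAMGB <= deviceRAMGB }
        .sorted { $0.minimumRAMGB > $1.minimumRAMGB }
}

let catalog = [
    ModelOption(name: "small-2B", minimumRAMGB: 4),
    ModelOption(name: "medium-4B", minimumRAMGB: 6),
    ModelOption(name: "large-9B", minimumRAMGB: 12),
]

// Installed RAM in GiB, as reported by the operating system.
let deviceRAMGB = Double(ProcessInfo.processInfo.physicalMemory) / 1_073_741_824.0
let picks = recommendedModels(from: catalog, deviceRAMGB: deviceRAMGB)
print(picks.map { $0.name })
```

The reported memory can be slightly below the marketed figure, so thresholds in a real app would likely need some headroom.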
SEAMLESS EXPERIENCE
- Real-time streaming — watch responses appear word by word
- Background downloads — continue working while models load
- Smart memory management — stable performance on mobile hardware
- Conversation management — organize, search, and rename chats
- Stop and resume generation anytime
- Disk space checks before every download
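The disk space check in the list above can be sketched as a comparison between free space and the model size plus a safety margin. The function names and the 500 MB margin are assumptions made for illustration; the listing does not say how onLM implements this check.

```swift
import Foundation

/// Decides whether a download should start.
/// `safetyMarginBytes` is an assumed buffer so the device is never filled completely.
func canStartDownload(modelBytes: Int64,
                      availableBytes: Int64,
                      safetyMarginBytes: Int64 = 500_000_000) -> Bool {
    return availableBytes >= modelBytes + safetyMarginBytes
}

/// Reads free space on the volume that holds `directory`, or nil if it cannot be read.
func availableBytes(at directory: URL) -> Int64? {
    let attributes = try? FileManager.default.attributesOfFileSystem(forPath: directory.path)
    return (attributes?[.systemFreeSize] as? NSNumber)?.int64Value
}

// Example: check whether a hypothetical 2.5 GB model would fit.
let target = FileManager.default.temporaryDirectory
if let free = availableBytes(at: target) {
    print(canStartDownload(modelBytes: 2_500_000_000, availableBytes: free))
}
```

On Apple platforms, the URL resource value `volumeAvailableCapacityForImportantUsageKey` may give a more realistic figure, since it accounts for space the system can reclaim.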
NO SUBSCRIPTIONS. NO ADS. NO TRACKING.
Download a model once and use it as much as you want. There are no usage limits, no hidden costs, and no telemetry. onLM is a straightforward native app, not a wrapper around someone else's service.
Whether you need a private AI chatbot, an offline voice transcriber for meetings and ideas, or an on-device image generator — onLM gives you the power of modern AI without compromising your privacy.
Constantly crashes on iPhone 17 Pro and latest iOS!
El•Diablo
Buggy app. Constantly crashes on iPhone 17 Pro with basic text prompts on any LLM model shown as supported. Fix it please!
Developer Response
Thanks for the feedback! This issue has been fixed — please update to the latest version. If you still experience any crashes after updating, let me know and I'll look into it. Apologies for the inconvenience!
Great app for running local LLMs
reqw41
For me, this is the best thing I've come across for running local LLMs. The first versions crashed on my iPhone 17 Pro Max, but now it runs stably: transcription, text prompts, and even image generation. It would be interesting to add voice chat and a CarPlay app, as well as some kind of local knowledge base for solving specific tasks when there is no internet connection.
Very convenient
welforse
So far this is the best I've found. Thank you.
- 12 GB iPhone stability — capped MLX buffer pool at 20 MB; added per-model KV-cache quantization and output-token caps (Qwen 3.5 9B, Gemma 3/4, Llama 3.1, Mistral) to stop jetsam kills by turn 3.
- Image generation — fixed OOM on 12 GB iPhones (proper hw.memsize gating, quantized SDXL path) and fixed a crash on long or non-Latin prompts (CLIP token sequences now clamped to 77).
- Onboarding — Skip button on the final page once any chat-capable model is ready; downloads keep running in the background.
- Download banner — now shown across all tabs, not just Chat.
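The CLIP fix mentioned in the notes above relies on the fact that CLIP's text encoder accepts a fixed context of 77 token positions, including the start and end markers. A minimal sketch of such a clamp follows. The function name is hypothetical, the default start and end token IDs are assumptions based on the commonly used CLIP vocabulary, and padding with the end token is one common convention rather than a confirmed detail of onLM.

```swift
/// Clamps a prompt's token IDs to a fixed CLIP-style context length.
/// `bos`/`eos` defaults are assumed values; pass the IDs from the actual tokenizer.
func clampToCLIPContext(_ contentTokens: [Int],
                        bos: Int = 49406,
                        eos: Int = 49407,
                        contextLength: Int = 77) -> [Int] {
    precondition(contextLength >= 2, "context must hold the start and end markers")
    // Reserve two positions for the start and end markers.
    let kept = Array(contentTokens.prefix(contextLength - 2))
    var sequence = [bos] + kept + [eos]
    // Pad short prompts up to the fixed length (assumed convention: pad with eos).
    sequence += Array(repeating: eos, count: contextLength - sequence.count)
    return sequence
}

// A long prompt (200 placeholder token IDs) is truncated to fit.
let clamped = clampToCLIPContext(Array(0..<200))
print(clamped.count) // prints "77"
```

Long prompts, and non-Latin prompts that tokenize into many pieces, are the cases most likely to exceed the limit, which matches the crash described in the release notes.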
Version 1.4.2
The developer, Alexander Kryukov, indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy.
Data Not Collected
The developer does not collect any data from this app.
Privacy practices may vary, for example, based on the features you use or your age.
The developer indicated that this app supports the following accessibility features.
Supported Features
VoiceOver
Voice Control
Dark Interface
Information
Seller: Alexander Kryukov
Size: 45 MB
Category: Utilities
Compatibility:
- iPhone: Requires iOS 26.0 or later.
- iPad: Requires iPadOS 26.0 or later.
- Mac: Requires macOS 26.0 or later and a Mac with Apple M1 chip or later.
Infrequent Cartoon or Fantasy Violence; Realistic Violence; Profanity or Crude Humor; Mature or Suggestive Themes; Horror/Fear Themes; Alcohol, Tobacco, Drug Use or References; Guns or Other Weapons