It supports GGUF & MLX models or connect to OpenAI-compatible servers. Free Plan includes most features like local models, iCloud Sync, Markdown conversion and 25+ tools.
Privacy AI introduces **MCP Marketplace**, a one-tap hub to browse and install ready-to-use MCP servers. Powered by [MCPRouter](https://mcprouter.co/), each server is auto-configured. No need to manual setup. So you can instantly add integrations for automation, image generation, data analysis, and more.
Now with **OpenAI Responses API** support: run **stateless multi-turn conversations** without resending full history. Use **conversation objects**, **prompt templates**, and **native prompt caching** for faster, cheaper repeats. The built-in Protocol Inspector shows every request, response, and SSE event in real time for easy debugging.
**Interleaved Thinking** is supported for models like **Minimax M2**, **Claude Sonnet 4**, and **OpenAI GPT-5 (Thinking)**. When enabled, the model preserves reasoning across turns, enabling planning, verification, and self-correction—turning simple Q&A into agentic, multi-step workflows.
**Memory** lets your AI remember across models. Add, edit, and sync personal context that stays consistent for local or cloud models. Your memory belongs to you—not a provider. Memory is free for local models and included in Pro for cloud models.
A **Free Plan** covers offline GGUF/MLX models, iCloud sync, Markdown conversion, and built-in tools. Cloud models, MCP Marketplace, and custom API providers require a subscription.
Privacy AI is a full-featured AI client that works your way: run models fully offline, connect to self-hosted servers, or use cloud providers—all in one app.
### Why Privacy AI
Most AI apps lock you in. Privacy AI gives you freedom: local execution for privacy, self-hosted for compliance, or cloud APIs for performance. You choose where models run, how memories persist, and how data is handled.
### Who It’s For
• Developers and infra builders (GGUF, MLX, OpenRouter, custom APIs)
• Privacy-conscious professionals in law, healthcare, finance, research
• Power users exploring automation and model performance
• Teams needing compliance or internal-only workflows
### What It Does
**Run Any Model, Anywhere**
• Offline on iPhone, iPad, or Mac
• Load GGUF/MLX from HuggingFace or iCloud
• Connect to OpenAI, Anthropic, Gemini, DeepSeek, or your own servers
• Switch between local and remote mid-chat without losing context
• Supports Apple’s on-device Foundation Model (iOS 26)
**Use Tools Seamlessly**
• Full MCP with integrated Marketplace
• Call tools like search, stocks, arXiv, HealthKit, JavaScript—even offline
• Inspect and replay tool calls in chat
**Automate Privately**
• Siri, Shortcuts, and Share Extension
• Offline TTS + transcription
• OpenAI protocol TTS (gpt-4o-tts, gpt-4o-mini-tts)
**Files On-Device**
• Convert PDFs, Office, EPUB, HTML, YouTube, audio
• OCR + Markdown conversion
• Export to Markdown, PDF, HTML, EPUB, JSON
**Images & Editing**
• Generate locally or via OpenRouter (e.g., Gemini 2.5 Flash)
• Support for gpt-image-1 and FLUX.1-dev
• Annotate with Apple Pencil or touch
**Control & Extend**
• “Natural Talk” voice UI
• Live token/context indicator
• Custom prompts, tools, preferences
• iCloud sync for models, chats, settings
**Built for Apple**
• Optimized for M-series Macs and iPad Pro
• Requires iOS 18.6+
• Deep Siri/Shortcuts + Share Extension integration
### No Account. No Surprises.
No forced sign-ups. Choose local, self-hosted, or cloud. Your data and memories stay under your control.
It’s more than a chatbot—it’s a professional-grade AI IDE in your pocket.
Your model. Your device. Your data.
Terms of Use: https://privacyai.acmeup.com/docs/policy/tos.html
Privacy Policy: https://privacyai.acmeup.com/docs/policy/privacy.html
I've tried just about every AI client on the App Store, and this is the one. I stumbled on this app when it had zero downloads and found an absolute gem. This isn't just another chat wrapper; it's a full-on AI agent for your phone. You can plug in APIs for basically any model (Gemini, DeepSeek, etc.) and it has tools that let the AI actually do things—manage your calendar, get directions, search your contacts, the list goes on. The "Super Siri" feature that lets you use your own AI model with a voice command is a complete game-changer.Now, full disclosure, the app is BRAND new, so it's a little rough around the edges. You might hit a few quirks or a random crash here and there as the developer irons things out. But honestly, that's totally expected for something this ambitious. The potential here is just off the charts.Giving this 5 stars for the vision alone and for what it can already do.
Developer Response
Thank you for trying Privacy AI when it only had a few downloads. It means a lot that you see what we’re building, not just another chat app, but a full AI workspace and agent for mobile devices. We’re glad you tried features like broad API support and the “Super Siri” voice command. Both are designed to put you in control of your AI on your own terms. Our philosophy is simple: AI should run under users’ control, free from Big Tech lock-in, and work exactly the way users need it to. The app is still new and not perfect. But we’re shipping fast and improving every week. Thank you for the five stars and for recognizing the vision. Reviews like yours help more people discover Privacy AI and motivate us to make it even better.
Features
- Direct Video Input Support
Privacy AI now supports OpenRouter’s native video input protocol. You can send video files directly to supported multimodal models: such as Gemini 2.5 Flash, Flash Lite, Pro, and others, without any manual pre-processing. The app handles encoding and upload automatically, offering a seamless video-to-AI workflow.
- Upgraded TTS/ASR Engine
sherpa-onnx has been upgraded to 1.12.15, adding support for newer TTS models like MatchTTS and expanding available voice options for higher-quality speech synthesis.
Improvements
- MLX Engine Upgrade
Updated to the latest MLX Swift framework with improved vision-model handling and greater stability. Qwen3-VL models now process images more reliably thanks to refined sanitization. Memory usage is optimized on both iOS and macOS, and the new context API enables more flexible prompt construction for advanced RAG workflows.
- llama.cpp Engine Upgrade (b7032)
Major performance improvements for Apple Silicon, including Metal 4 Tensor API acceleration on M5-class devices, ARM64 SVE optimizations, and hybrid context shifting for better memory efficiency. This update also includes KV-cache optimizations, async buffer-retention fixes, and enhanced support for A19 devices. Expect 20–40% faster inference on supported hardware with improved GGUF stability.
- WhisperKit Audio Transcription Boost
WhisperKit now uses automatic compute-unit selection, model prewarming, improved VAD chunking, and smarter decoding heuristics. Short audio less than 15s transcribes 10–20% faster, while long audio (>3 min) uses VAD to reduce memory pressure and improve reliability.
- Whisper Model UI Refresh
The Whisper Models screen has been redesigned with more model variants and clearer descriptions to help you choose the right model for your workflows.
- Improved Whisper Model Downloading
Whisper model downloads now run in the background, support resume-on-break, and display detailed progress information for a more reliable setup experience.
- Modernized About & Acknowledgements UI
The About screen has been redesigned, with all third-party libraries updated to reflect their latest versions and descriptions.
Bug Fixes
- Partial Responses
Fixed a bug that caused truncated responses for some models in the Gemini family in OpenRouter.
- Audio Transcription
Fixed an issue where audio and video files stored in iCloud Drive could not be read correctly on iPhone or iPad during transcription.
Version 1.4.7
The developer, AcmeUp Inc., indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy .
Data Used to Track You
The following data may be used to track you across apps and websites owned by other companies:
Identifiers
Usage Data
Data Not Linked to You
The following data may be collected but it is not linked to your identity:
Identifiers
Usage Data
Privacy practices may vary, for example, based on the features you use or your age. Learn More
The developer indicated that this app supports the following accessibility features. Learn More
Supported Features
Larger Text
Dark Interface
Differentiate Without Color Alone
Sufficient Contrast
Information
Seller
AcmeUp Inc.
Size
1.8 GB
Category
Productivity
Compatibility
Requires iOS 18.6 or later.
iPhone Requires iOS 18.6 or later.
iPad Requires iPadOS 18.6 or later.
Mac Requires macOS 15.6 or later and a Mac with Apple M1 chip or later.