Privacy AI: Powerful chatbot

GGUF/MLX/Remote AI support

Free · In‑App Purchases · Designed for iPad

Privacy AI supports GGUF and MLX models and can connect to OpenAI-compatible servers. The Free Plan includes most features, such as local models, iCloud Sync, Markdown conversion, and 25+ tools.

Privacy AI introduces **MCP Marketplace**, a one-tap hub to browse and install ready-to-use MCP servers. Powered by [MCPRouter](https://mcprouter.co/), each server is auto-configured with no manual setup, so you can instantly add integrations for automation, image generation, data analysis, and more.

Now with **OpenAI Responses API** support: run **stateless multi-turn conversations** without resending full history. Use **conversation objects**, **prompt templates**, and **native prompt caching** for faster, cheaper repeats. The built-in Protocol Inspector shows every request, response, and SSE event in real time for easy debugging.

**Interleaved Thinking** is supported for models like **Minimax M2**, **Claude Sonnet 4**, and **OpenAI GPT-5 (Thinking)**. When enabled, the model preserves reasoning across turns, enabling planning, verification, and self-correction, and turning simple Q&A into agentic, multi-step workflows.

**Memory** lets your AI remember across models. Add, edit, and sync personal context that stays consistent for local or cloud models. Your memory belongs to you, not a provider. Memory is free for local models and included in Pro for cloud models.

A **Free Plan** covers offline GGUF/MLX models, iCloud sync, Markdown conversion, and built-in tools. Cloud models, MCP Marketplace, and custom API providers require a subscription.

Privacy AI is a full-featured AI client that works your way: run models fully offline, connect to self-hosted servers, or use cloud providers, all in one app.

### Why Privacy AI

Most AI apps lock you in. Privacy AI gives you freedom: local execution for privacy, self-hosted for compliance, or cloud APIs for performance. You choose where models run, how memories persist, and how data is handled.
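For readers curious how multi-turn chat can work without resending the full history, the following is a minimal sketch of how a client might chain turns, assuming the `previous_response_id` field of OpenAI's Responses API. The helper name `build_responses_request`, the model name, and the response id are illustrative assumptions; this is not a description of how Privacy AI implements the feature internally.

```python
import json


def build_responses_request(model, user_text, previous_response_id=None):
    """Build a JSON body for POST /v1/responses (hypothetical helper).

    When previous_response_id is given, the server is asked to continue
    from that stored response, so earlier turns are not sent again.
    """
    body = {"model": model, "input": user_text}
    if previous_response_id is not None:
        body["previous_response_id"] = previous_response_id
    return body


# First turn: no prior context to reference.
first = build_responses_request("gpt-5", "Summarize this contract in three bullets.")

# Follow-up turn: "resp_123" is a placeholder; a real id would come from
# the server's reply to the first request.
follow_up = build_responses_request(
    "gpt-5", "Now list the main risks.", previous_response_id="resp_123"
)

print(json.dumps(follow_up, indent=2))
```

The follow-up body carries only the new message plus a reference to the earlier response, which is what keeps repeat requests small.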
### Who It’s For

• Developers and infra builders (GGUF, MLX, OpenRouter, custom APIs)
• Privacy-conscious professionals in law, healthcare, finance, research
• Power users exploring automation and model performance
• Teams needing compliance or internal-only workflows

### What It Does

**Run Any Model, Anywhere**
• Offline on iPhone, iPad, or Mac
• Load GGUF/MLX from Hugging Face or iCloud
• Connect to OpenAI, Anthropic, Gemini, DeepSeek, or your own servers
• Switch between local and remote mid-chat without losing context
• Supports Apple’s on-device Foundation Model (iOS 26)

**Use Tools Seamlessly**
• Full MCP with integrated Marketplace
• Call tools like search, stocks, arXiv, HealthKit, and JavaScript, even offline
• Inspect and replay tool calls in chat

**Automate Privately**
• Siri, Shortcuts, and Share Extension
• Offline TTS + transcription
• OpenAI protocol TTS (gpt-4o-tts, gpt-4o-mini-tts)

**Files On-Device**
• Convert PDFs, Office, EPUB, HTML, YouTube, audio
• OCR + Markdown conversion
• Export to Markdown, PDF, HTML, EPUB, JSON

**Images & Editing**
• Generate locally or via OpenRouter (e.g., Gemini 2.5 Flash)
• Support for gpt-image-1 and FLUX.1-dev
• Annotate with Apple Pencil or touch

**Control & Extend**
• “Natural Talk” voice UI
• Live token/context indicator
• Custom prompts, tools, preferences
• iCloud sync for models, chats, settings

**Built for Apple**
• Optimized for M-series Macs and iPad Pro
• Requires iOS 18.6+
• Deep Siri/Shortcuts + Share Extension integration

### No Account. No Surprises.

No forced sign-ups. Choose local, self-hosted, or cloud. Your data and memories stay under your control.

It’s more than a chatbot: it’s a professional-grade AI IDE in your pocket. Your model. Your device. Your data.

Terms of Use: https://privacyai.acmeup.com/docs/policy/tos.html
Privacy Policy: https://privacyai.acmeup.com/docs/policy/privacy.html

  • 5.0
    out of 5
    2 Ratings

Features

- Direct Video Input Support: Privacy AI now supports OpenRouter’s native video input protocol. You can send video files directly to supported multimodal models, such as Gemini 2.5 Flash, Flash Lite, and Pro, without any manual pre-processing. The app handles encoding and upload automatically, offering a seamless video-to-AI workflow.
- Upgraded TTS/ASR Engine: sherpa-onnx has been upgraded to 1.12.15, adding support for newer TTS models like MatchTTS and expanding available voice options for higher-quality speech synthesis.

Improvements

- MLX Engine Upgrade: Updated to the latest MLX Swift framework with improved vision-model handling and greater stability. Qwen3-VL models now process images more reliably thanks to refined sanitization. Memory usage is optimized on both iOS and macOS, and the new context API enables more flexible prompt construction for advanced RAG workflows.
- llama.cpp Engine Upgrade (b7032): Major performance improvements for Apple Silicon, including Metal 4 Tensor API acceleration on M5-class devices, ARM64 SVE optimizations, and hybrid context shifting for better memory efficiency. This update also includes KV-cache optimizations, async buffer-retention fixes, and enhanced support for A19 devices. Expect 20–40% faster inference on supported hardware with improved GGUF stability.
- WhisperKit Audio Transcription Boost: WhisperKit now uses automatic compute-unit selection, model prewarming, improved VAD chunking, and smarter decoding heuristics. Short audio (under 15 s) transcribes 10–20% faster, while long audio (over 3 min) uses VAD to reduce memory pressure and improve reliability.
- Whisper Model UI Refresh: The Whisper Models screen has been redesigned with more model variants and clearer descriptions to help you choose the right model for your workflows.
- Improved Whisper Model Downloading: Whisper model downloads now run in the background, resume after an interruption, and display detailed progress information for a more reliable setup experience.
- Modernized About & Acknowledgements UI: The About screen has been redesigned, with all third-party libraries updated to reflect their latest versions and descriptions.

Bug Fixes

- Partial Responses: Fixed a bug that caused truncated responses for some models in the Gemini family in OpenRouter.
- Audio Transcription: Fixed an issue where audio and video files stored in iCloud Drive could not be read correctly on iPhone or iPad during transcription.
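The duration thresholds mentioned in the WhisperKit note can be pictured as a simple routing rule. The sketch below is only an illustration under stated assumptions: the function name, the constant names, and the three labels are hypothetical, and only the 15-second and 3-minute cutoffs come from the release notes. It does not reflect WhisperKit's actual API or the app's internal logic.

```python
# Thresholds taken from the release notes; everything else is hypothetical.
SHORT_AUDIO_MAX_S = 15.0    # "short audio" is under 15 seconds
LONG_AUDIO_MIN_S = 180.0    # "long audio" is over 3 minutes


def choose_transcription_strategy(duration_s: float) -> str:
    """Return an illustrative label for how a clip might be handled."""
    if duration_s < 0:
        raise ValueError("duration must be non-negative")
    if duration_s < SHORT_AUDIO_MAX_S:
        return "short"       # benefits from the faster short-clip path
    if duration_s > LONG_AUDIO_MIN_S:
        return "long_vad"    # split on voice activity to limit memory use
    return "standard"        # everything in between


for seconds in (8, 60, 600):
    print(seconds, choose_transcription_strategy(seconds))
```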

The developer, AcmeUp Inc., indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy.

  • Data Used to Track You

    The following data may be used to track you across apps and websites owned by other companies:

    • Identifiers
    • Usage Data
  • Data Not Linked to You

    The following data may be collected but it is not linked to your identity:

    • Identifiers
    • Usage Data

Privacy practices may vary, for example, based on the features you use or your age.

The developer indicated that this app supports the following accessibility features.

  • Supported Features

    • Larger Text

    • Dark Interface

    • Differentiate Without Color Alone

    • Sufficient Contrast

  • Seller
    • AcmeUp Inc.
  • Size
    • 1.8 GB
  • Category
    • Productivity
  • Compatibility
    • iPhone
      Requires iOS 18.6 or later.
    • iPad
      Requires iPadOS 18.6 or later.
    • Mac
      Requires macOS 15.6 or later and a Mac with Apple M1 chip or later.
    • Apple Vision
      Requires visionOS 2.6 or later.
  • Languages
    • English
  • Age Rating
    16+
  • In-App Purchases
    Yes
    • Monthly Access Plan $9.99
    • Yearly Access Plan $99.99
  • Copyright
    • © 2024 AcmeUp Inc.