
Inferencer - Private AI Studio
Advanced Local AI Assistant
Only for Mac
Free · In‑App Purchases
Mac
Inferencer lets you run, host and deeply control the latest SOTA AI models (OSS, DeepSeek, Qwen, Kimi, GLM and more) from your own computer.
No data is sent to the cloud for processing - maintaining your complete privacy.
Advanced inferencing controls give you complete control on their accuracy and outputs.
Models
Start in the models section where you can select the location of existing models or download new ones directly from Hugging Face.
Use the model streaming feature to inference larger models partially from storage - for low memory devices.
Chats
Select the model to interact with on the top menu bar and write a prompt to begin. At any point you can switch between models and continue the chat to see what else they can uncover. You can also selectively delete past messages to keep the model focused and less scatterbrain.
Chat Controls
Control the inferencing parameters including intensity of processing and model streaming to allows you to multi-task with other applications better.
Token Entropy and Inspection
Select the inspectors to peek into the inner-workings of each word outputted and see the model's confidence levels and alternative choices.
Prompt Framing
Expanding the prompt section to utilise the framing feature which allows you to control the output the model generates.
Server
If enabled, the server feature allows you to serve and connect to your own or trusted devices. No data is sent elsewhere. Also includes compatible APIs for application development.
Xcode Intelligence
Use the server feature with Compatibility APIs enabled and SSL disabled to allow Xcode to use Inferencer as a service provider.
Shortcuts
Use the Shortcuts app to automate inferencing workflows (e.g., copy text from clipboard > inference > speak result).
Settings
Includes parental controls, an automatic deletion policy and more.
Privacy
For maximum privacy, all AI processing happens offline and on your device, by default.
Subscriptions
Basic (Free): Most features unlocked for free including unlimited chats.
Professional: Upgrade for more advanced token inspection, prompt-framing and model streaming.
Terms & Support
Terms of Use: inferencer.com/terms
Privacy Policy: inferencer.com/privacy
Disclaimer
Inferenced models may not always be accurate or contextually appropriate. You are responsible for verifying the information before making important decisions.
This app hasn’t received enough ratings or reviews to display an overview.
+ Added support for MiniMax M2
+ Xcode Intelligence integration
+ Fixed issue with deleting the last conversation
+ Inferencing server improvements
+ Improved server API compatibility
+ Added support for Mixtral
+ Improved model streaming guardrails
+ Bug fixes and performance improvements
The developer, Ashraf Samy, indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy .
Data Not Collected
The developer does not collect any data from this app.
Accessibility
The developer has not yet indicated which accessibility features this app supports. Learn More
Information
- Seller
- Ashraf Samy
- Size
- 613.3 MB
- Category
- Productivity
- Compatibility
Requires macOS 15.0 or later and a Mac with Apple M1 chip or later.
- Mac
Requires macOS 15.0 or later and a Mac with Apple M1 chip or later.
- Languages
- English
- Age Rating
13+
- 13+
- This app has an age rating of 13+ with content restrictions. Some content may be rated higher, but access is managed by the developer through in-app controls.
- In-App Controls
Parental Controls
Infrequent
Cartoon or Fantasy Violence
Profanity or Crude Humor
Mature or Suggestive Themes
Horror/Fear Themes
Medical Treatment information
Alcohol, Tobacco, Drug Use or References
Guns or Other Weapons
Contains
User-Generated Content
- In-App Purchases
Yes
- Professional $9.99
- Copyright
- © 2025 Inferencer