Private LLM - Local AI Chat 12+

Numen Technologies Limited

    • 1.0 • 1 Rating
    • ₩6,600

Description

Meet Private LLM: Your Secure, Offline AI Assistant for macOS

Private LLM brings advanced AI capabilities directly to your iPhone, iPad, and Mac—all while keeping your data private and offline. With a one-time purchase and no subscriptions, you get a personal AI assistant that works entirely on your device.

Key Features:

- Local AI Functionality: Interact with a sophisticated AI chatbot without needing an internet connection. Your conversations stay on your device, ensuring complete privacy.

- Wide Range of AI Models: Choose from various open-source LLM models like Llama 3.2, Llama 3.1, Google Gemma 2, Microsoft Phi-3, Mistral 7B, and StableLM 3B. Each model is optimized for iOS and macOS hardware using advanced OmniQuant quantization, which offers superior performance compared to traditional RTN quantization methods.

- Siri and Shortcuts Integration: Create AI-driven workflows without writing code. Use Siri commands and Apple Shortcuts to enhance productivity in tasks like text parsing and generation (a minimal sketch of this integration pattern follows this list).

- No Subscriptions or Logins: Enjoy full access with a single purchase. No need for subscriptions, accounts, or API keys. Plus, with Family Sharing, up to six family members can use the app.

- AI Language Services on macOS: Utilize AI-powered tools for grammar correction, summarization, and more across various macOS applications in multiple languages.

- Superior Performance with OmniQuant: Benefit from the advanced OmniQuant quantization process, which preserves the model's weight distribution for faster and more accurate responses, outperforming apps that use standard quantization techniques.
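
To illustrate the Shortcuts integration pattern mentioned above, here is a minimal App Intents sketch in Swift. The type name, intent title, and placeholder logic are hypothetical; this is a generic example of how an app can expose on-device text generation to Shortcuts and Siri, not Private LLM's actual implementation.

    import AppIntents

    // Hypothetical intent: the type name, title, and placeholder logic are
    // illustrative, not Private LLM's actual API.
    struct GenerateTextIntent: AppIntent {
        static var title: LocalizedStringResource = "Generate Text"
        static var description = IntentDescription("Runs a prompt through an on-device LLM.")

        @Parameter(title: "Prompt")
        var prompt: String

        func perform() async throws -> some IntentResult & ReturnsValue<String> {
            // A real app would hand the prompt to its local inference engine;
            // this placeholder just echoes it back.
            return .result(value: "Echo: \(prompt)")
        }
    }

Once an app declares an intent like this, it appears as an action in the Shortcuts app and can be invoked by name through Siri, which is what makes no-code AI workflows possible.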

Supported Model Families:
- DeepSeek R1 Distill based models
- Phi-4 14B model
- Llama 3.3 70B based models
- Llama 3.2 based models
- Llama 3.1 based models
- Llama 3.0 based models
- Google Gemma 2 based models
- Qwen 2.5 based models (0.5B to 32B)
- Qwen 2.5 Coder based models (0.5B to 32B)
- Google Gemma 3 1B based models
- Solar 10.7B based models
- Yi 34B based models

For a full list of supported models, including detailed specifications, please visit privatellm.app/models.

Private LLM is a better alternative to generic llama.cpp and MLX wrapper apps such as Enchanted, Ollama, LLM Farm, LM Studio, and RecurseChat on three fronts:
1. Private LLM uses a significantly faster mlc-llm based inference engine.
2. All models in Private LLM are quantized with the state-of-the-art OmniQuant quantization algorithm, while competing apps use naive round-to-nearest (RTN) quantization.
3. Private LLM is a fully native app built with C++, Metal, and Swift, while many competing apps are bloated, non-native Electron-based apps.
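
For context, here is a minimal Swift sketch of the round-to-nearest (RTN) baseline that the second point contrasts OmniQuant against. OmniQuant additionally learns clipping and scaling parameters from calibration data, which this sketch does not reproduce; the function names are illustrative.

    // Naive round-to-nearest (RTN) 4-bit quantization: every weight is
    // snapped to the nearest of 16 evenly spaced values between the
    // tensor's min and max. Function names are illustrative.
    func rtnQuantize4Bit(_ weights: [Float]) -> (codes: [UInt8], scale: Float, zeroPoint: Float) {
        let minW = weights.min() ?? 0
        let maxW = weights.max() ?? 0
        let levels: Float = 15  // 2^4 - 1 steps between representable values
        let scale = max((maxW - minW) / levels, .leastNormalMagnitude)
        let codes = weights.map { w -> UInt8 in
            let q = ((w - minW) / scale).rounded()
            return UInt8(min(max(q, 0), levels))
        }
        return (codes, scale, minW)
    }

    // Dequantization recovers an approximation: w ≈ Float(code) * scale + zeroPoint
    func rtnDequantize(_ codes: [UInt8], scale: Float, zeroPoint: Float) -> [Float] {
        codes.map { Float($0) * scale + zeroPoint }
    }

OmniQuant's claimed advantage comes from optimizing these quantization ranges rather than taking the raw min and max, which better preserves the weight distribution the listing refers to.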

Please note that Private LLM only supports inference with text-based LLMs.

Private LLM has been specifically optimized for Apple Silicon and delivers the best performance on Macs with an Apple M1 or newer chip. Users on older Intel Macs without eGPUs may experience reduced performance. Although the app nominally works on Intel Macs, we've stopped adding support for new models on Intel Macs due to the performance limitations of Intel hardware.

What's New

Version 1.9.11

- Support for two Qwen3 4B Instruct 2507 based models: Qwen3 4B Instruct 2507 abliterated and Josiefied Qwen3 4B Instruct 2507 (on Apple Silicon Macs with 16GB or more RAM)
- Fix for a rare crash in the Settings panel on some Macs.
- Minor bug fixes and updates

Ratings and Reviews

1.0 out of 5
1 Rating

Simerotora

Would you Add llama3 q4 and q5

Developer Response

Thanks for the review! q4 and q5 are terms from the llama.cpp world. We don't use llama.cpp, and we don't use naive RTN quants like llama.cpp does. Our quantization algorithm of choice is OmniQuant. We support w4g128asym quantized models in the macOS app. We use the same quantization for smaller models on iOS, but we use w3g40sym quantization for models with 7B or more parameters due to memory constraints. Our plan is to support w4g128asym quantized models on iPads (and hopefully the next generation of Pro and Pro Max devices) with 16GB of RAM.
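
As a reading aid for the naming in this response: "w4g128asym" plausibly denotes 4-bit weights (w4), quantized in groups of 128 (g128), each group carrying its own scale and zero point (asym). The Swift sketch below decodes that naming under those assumptions; the struct and function names are hypothetical, and this is not Private LLM's actual storage format.

    // Group-wise asymmetric 4-bit quantization, matching the reading of
    // "w4g128asym" above. Struct and function names are hypothetical.
    struct QuantizedGroup {
        var codes: [UInt8]   // one 4-bit code per weight (a real format would pack two per byte)
        var scale: Float     // per-group scale
        var zeroPoint: Float // per-group offset: the "asym" part ("sym" schemes omit it)
    }

    func quantizeW4G128Asym(_ weights: [Float], groupSize: Int = 128) -> [QuantizedGroup] {
        stride(from: 0, to: weights.count, by: groupSize).map { start in
            let group = weights[start..<min(start + groupSize, weights.count)]
            let minW = group.min() ?? 0
            let maxW = group.max() ?? 0
            let scale = max((maxW - minW) / 15, Float.leastNormalMagnitude)
            let codes = group.map { w in
                UInt8(min(max(((w - minW) / scale).rounded(), 0), 15))
            }
            return QuantizedGroup(codes: codes, scale: scale, zeroPoint: minW)
        }
    }

By the same reading, w3g40sym would mean 3-bit weights in groups of 40 with no zero point, a smaller footprint suited to the iOS memory constraints the response describes.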

lil|

The fastest app for offline chat

Could you please add NAVER HyperCLOVA X?

App Privacy

The developer, Numen Technologies Limited, has indicated that the app's privacy practices may include handling of data as described below. For more information, see the developer's privacy policy.

Data Not Collected

The developer does not collect any data from this app.

Privacy practices may vary based on, for example, the features you use or your age.

Supports

  • Family Sharing

    With Family Sharing enabled, up to six family members can use this app.

You Might Also Like

MLC Chat
Productivity
Pal Chat - AI Chat Client
Productivity
PocketPal AI
Productivity
Enclave - Local AI Assistant
Productivity
Local Brain
Productivity
YourChat
Productivity