Layla
Private. Personal. All Yours.
$19.99 · In‑App Purchases · Designed for iPad. Not verified for macOS.
Offline AI. Load and run LLMs + Stable Diffusion models on your device, no need to connect to the internet.
Introducing Layla, the groundbreaking personal AI that resides directly on your phone or device. No internet connection required, no censorship, complete privacy.
No information leaves your device.
Features:
- General knowledge: ask Layla about anything under the sun, she will give you an informed response
- Just chatting: Layla has many personalities you can choose; from cute to serious to bubbly to stoic, you can talk with her however you want
- Memory: tell Layla about yourself, she'll remember you
- Personalisation: you can create your own characters if none of our presets fit your style!
- Voices: over 100 different voices to choose from for your characters
- Roleplay: engage in roleplay with multiple characters!
- Agents: Layla supports programmable agents using Python
- Custom AI: load custom GGUF models that fit your needs
- Additional features available as free plugins: roleplay, downloading different personalities, advanced settings
Note: The app downloads a 4GB Large Language Model (the LLM, or "brains" of the AI) after it starts for the first time. This is only needed once. This technology is VERY new; right now we only support the latest flagship phones (8GB+ RAM). If your phone does not meet the minimum requirements, you may experience degraded performance. We are working hard to bring Layla to more devices.
DISCLAIMER:
Layla is a generative language model. All her answers come from inputs you give her. This means her information may be inaccurate, outdated, or otherwise unsuitable for your situation. Layla is designed for purely entertainment purposes and her output should not be taken seriously.
IMPORTANT:
Local AI is an emerging technology and requires the latest flagship phones to run. BEFORE YOU BUY: please check your device compatibility at https://www.layla-network.ai/contact
Terms and conditions: https://www.layla-network.ai/terms-and-conditions
Privacy Policy: https://www.layla-network.ai/privacy
It'd be great if you could implement the ability to create shortcuts that are able to retrieve info on the available personalities and past conversations, chat with personalities by specifying their name in a block's parameter, or even chat in an existing conversation within the chat history. If that were to happen then I would give this app a solid 5 stars given its price, features, and individuality on the App Store.
I had high hopes for Layla as an AI client with TavernAI-like features, but the reality is frustrating. Most of the advertised functionality simply doesn’t work as expected. Offline features are either broken or unreliable, and the only parts that consistently function are the paid online features, which feels like a bait-and-switch for an app marketed around local AI and privacy. The UI is clunky, and many features (like reminders, personalities, and roleplay) are either buggy or nonfunctional. Performance is slow, and the app only seems usable if you pay for cloud features, completely defeating the purpose of having a local AI assistant. Unless there are major improvements, I can’t recommend Layla. There are better, more reliable AI chat clients out there that actually deliver on their promises.
I’ve been using this app for quite some time. There isn’t anything else like it: very manual setup, very inclusive. Apple changes things often, and these guys push the boundaries of what can be done on a mobile device. Very open. The latest Apple update broke a bunch of stuff; awaiting the update for Layla 6.4.0. Can’t wait to map some long-term memory, very beautiful ideas implied. Very satisfied with purchase.
Really good app. Love that it works with both remote LLMs and local ones. For those of us who travel a lot, that’s very useful. The UI could be better. The “biometric lock” doesn’t seem to work; it just prompts for password or PIN. FR: Please make an iPad version! I have an M4 and would love to see this use the whole screen real estate.
New features:
- added new Choose Your Own Adventure characters
- supports the latest Gemma-4 model family
- added German translations
Improvements:
- added ability to send progress messages during Python execution; you can view this progress in the Agent execution card to give you more insight into what the agent is doing
- Layla Python can now generate files
- new Python agents will automatically install needed dependencies when added for the first time
- you can configure long-term memory to use a saved inference engine as summariser
- added UI setting to display chats in a bubble-like interface instead (similar to Facebook Messenger, WhatsApp, etc.)
- preset characters are now organised into folders
Bug fixes:
- fixed a bug where sometimes character expressions will not be saved
- fixed a bug where some memories cannot be processed
6.8.3 6d ago
New features:
- added new Choose Your Own Adventure characters
- supports the latest Gemma-4 model family
- added German translations
Improvements:
- added ability to send progress messages during Python execution; you can view this progress in the Agent execution card to give you more insight into what the agent is doing
- Layla Python can now generate files
- new Python agents will automatically install needed dependencies when added for the first time
- you can configure long-term memory to use a saved inference engine as summariser
- added UI setting to display chats in a bubble-like interface instead (similar to Facebook Messenger, WhatsApp, etc.)
- preset characters are now organised into folders
Bug fixes:
- fixed a bug where sometimes character expressions will not be saved
- fixed a bug where some memories cannot be processed
6.8.1 Apr 15
New features:
This update fleshes out Layla's Agent Framework.
**Agent System Prompts**
Agents now have their own system prompts. These will update your character's system prompt when they are activated. For example, you don't need to update both your system prompt and the agent for status cards; the system prompt to guide the model into generating status cards will automatically be injected when you attach the corresponding agent.
**Introducing Short-term Memory**
Why do we need a short-term memory when there is the LLM context itself? The main difference lies in the fact that Layla's short-term memory is **structured**. Both the LLM context and LTM are unstructured, and lookup is done purely via heuristics (via the Attention layer in LLMs, and embeddings in LTM). The short-term memory is a structured layer that lives between the LLM's context and Layla's LTM.
Information flows from the LLM's context -> short-term memory -> LTM (as demonstrated by one of the example agents below). This can be updated by both the LLM and Layla's Agents. Because short-term memory is structured, you can use code and logic to manipulate the contents, not just via prompts. This opens up the possibility to *program* information flow. An example could be writing an agent that updates a character's Hit-Points or inventory based on triggering conditions.
This layer will also serve as a sort of *RAM* for all agents. Any agent can read/write into Layla's short-term memory, allowing a horizontal layer of communication between agents (work-in-progress). This is now possible because...
**Layla supports Python**
Layla now comes with her own embedded Python interpreter as one of the mini-apps. Install it to enable writing and executing arbitrary Python scripts directly on your phone! You can install packages via `pip`, and Layla comes with a small console and code area to run your scripts.
All agents in Layla can now execute Python scripts, which means you can program arbitrary logic within an agent. You can download stuff from the internet, scrape the web, generate images, solve math problems, even program small games within Layla's Agentic Framework.
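As a rough illustration of what a structured short-term memory makes possible, the sketch below models it as a plain Python key/value store that an agent updates with ordinary code when a trigger condition is met. The `ShortTermMemory` class, the `damage_agent` function, and the `hit_points` key are hypothetical names invented for this example; they are not Layla's actual agent API.

```python
import re
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ShortTermMemory:
    """Hypothetical stand-in for a structured key/value memory layer."""
    data: dict[str, Any] = field(default_factory=dict)

    def read(self, key: str, default: Any = None) -> Any:
        return self.data.get(key, default)

    def write(self, key: str, value: Any) -> None:
        self.data[key] = value


# Trigger condition (assumed for illustration): the message says
# someone "takes N damage".
DAMAGE_PATTERN = re.compile(r"takes? (\d+) damage", re.IGNORECASE)


def damage_agent(message: str, memory: ShortTermMemory) -> bool:
    """Lower the stored hit points when the trigger matches.

    Returns True if the agent fired, False otherwise.
    """
    match = DAMAGE_PATTERN.search(message)
    if match is None:
        return False
    current = memory.read("hit_points", 0)
    # Clamp at zero so hit points never go negative.
    memory.write("hit_points", max(0, current - int(match.group(1))))
    return True


memory = ShortTermMemory({"hit_points": 20, "inventory": ["rope"]})
fired = damage_agent("The goblin strikes and the hero takes 7 damage!", memory)
```

Because the state lives in a structured store rather than in free text, the update is plain arithmetic instead of a prompt asking the model to keep track of the number.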
Improvements:
- added setting to disable system prompt in Inference Settings -> Custom Prompts: this will disable all character and user information. This is useful if you want to chat with the "raw model" without any instructions. Note this also stops most apps in Layla from working properly.
- redesigned the model selection area, now with the ability to export your imported models
- Claude API now allows sending images in chat
- added Japanese and Vietnamese translations
- added ability to organise characters into folders
- you can tap on "memories" in your character card to immediately view their memories in the LTM app
- added small UI sounds to notify you when Layla is listening or finished speaking
- allow adding custom JSON fields to OpenAI endpoints
Bug fixes:
- fixed a bug where long prompts will crash LTM ingestion process with a custom LLM
- fixed bug where Layla as default assistant on phone does not work with multiple inference engines attached to the character
- fixed bug where setting default character was not working with using Layla as default assistant on phone
- fixed a bug where custom SD models cannot be imported via the "Already downloaded" link
6.7.1 Mar 24
Improvements:
- improved download stability of new model downloads in the welcome screen and stable diffusion model downloads
- added TTS setting for a global default voice
- added the ability to give your character/LLM a custom instruction during chatting in Chat Actions
- added UI setting to control text area expansion in voice chat
- think content is removed when reloading the chat (a compromise, since removing think content on every message would require a reload each time, which is too slow for mobile)
- added "copy" character button for user created characters
- TTS now skips speaking unpronounceable characters (such as ASCII art, tables etc.)
- added a button in Diagnostics popup that force restarts Layla
Bug fixes:
- fixed a bug where Dreams keep scheduling messages for the same character more than once
- fixed a bug where blockquotes were not being rendered correctly
- fixed a bug where Layla character cannot be duplicated
- fixed a bug where different model options in Inference Settings cannot be scrolled sometimes
- fixed a bug where "Speak Responses" chat actions were not working
- fixed a bug where searching chat history does not search through all chat histories
- fixed a bug where GPT-OSS model was not generating responses
- fixed a bug where if you cancel voice chat before it initialises the UI dark cover is gone
- fixed a bug where edited memories are still using old embeddings, causing recalled content to be different to queried content
- fixed a bug where reloading chat sometimes does not read from cached session and instead loads everything from scratch
- fixed prompt format for GPT-OSS
- fixed a bug where Layla as your phone's default assistant was not working properly
6.6.1 Mar 3
Improvements:
- improved download stability of new model downloads in the welcome screen and stable diffusion model downloads
- added TTS setting for a global default voice
- added the ability to give your character/LLM a custom instruction during chatting in Chat Actions
- added UI setting to control text area expansion in voice chat
- think content is removed when reloading the chat (a compromise, since removing think content on every message would require a reload each time, which is too slow for mobile)
- added "copy" character button for user created characters
- TTS now skips speaking unpronounceable characters (such as ASCII art, tables etc.)
- added a button in Diagnostics popup that force restarts Layla
Bug fixes:
- fixed a bug where Dreams keep scheduling messages for the same character more than once
- fixed a bug where blockquotes were not being rendered correctly
- fixed a bug where Layla character cannot be duplicated
- fixed a bug where different model options in Inference Settings cannot be scrolled sometimes
- fixed a bug where "Speak Responses" chat actions were not working
- fixed a bug where searching chat history does not search through all chat histories
- fixed a bug where GPT-OSS model was not generating responses
- fixed a bug where if you cancel voice chat before it initialises the UI dark cover is gone
- fixed a bug where edited memories are still using old embeddings, causing recalled content to be different to queried content
- fixed a bug where reloading chat sometimes does not read from cached session and instead loads everything from scratch
- fixed prompt format for GPT-OSS
- fixed a bug where Layla as your phone's default assistant was not working properly
6.6.0 Mar 1
- fixed a bug with llama.cpp fix params setting
6.5.1 Feb 6
New features:
- redesigned character selection page to give you detailed information of your character before starting the chat
- added ability to save custom sampler settings in the Advanced Settings screen
- added ability to attach saved sampler presets as part of your custom Inference Engine
Improvements:
- supports sending images to OpenAI compatible API inference settings
Bug fixes:
- fixed a bug where some app data was not deleted when removing a mini-app from Layla
- fixed a bug where LTM sometimes crashes when handling very very long messages
- fixed a bug where images generated during chat were not being backed-up
- fixed a bug where tags from TavernPNG were not being imported
- fixed a bug where backup cannot copy references with spaces or special characters in their name
- fixed a bug where automatically converting JPGs when saving as TavernPNG was giving an error
- fixed a bug where personas were not being saved as part of your Inference Engine
- fixed a bug where upscaling anime images caused a crash
6.5.0 Jan 26
- updated llama.cpp to support the latest iPhones
6.4.1 Jan 15
New features:
- visualisation of temporal relationships in Layla's long-term memory
A while ago, Layla introduced temporal memories: memories that evolve in time. This update brings a whole new way to visualise your character's memories!
Your memories are now laid out in a 3D space; past memories are "deeper" into the z-axis. A slider at the bottom allows you to navigate through your memories in time, jumping from past to present.
You will notice that memories naturally form "clusters" in time. These are memories that occur within a relatively short timeframe. Each cluster has its own central node(s), with auxiliary memories branching out around them. These clusters are segmented via a time-dilated algorithm: recent memories are finely segregated, while older memories are more clustered together. This mimics how human memories work; you often remember recent events clearly, while the longer the events are in the past, the more they combine into one big cluster.
Important nodes in each cluster are linked together in time. These are usually "common entities" between temporal clusters. Intuitively, this should be important characters, topics, etc. The graph reflects this. The "anchors" in each cluster are usually the two main participants of the conversation (you and your character). These nodes glow brighter in the memory network, and you can see faint "temporal links" between them. These temporal links are annotated by the time elapsed between each cluster, which is visible in the memory graph.
There are a lot more ways in which this marshalled data and relationships can be used during the recall process, and this is something I will continually improve on over time!
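To make the idea of time-dilated segmentation concrete, here is a minimal sketch of one way such clustering could work: neighbouring memories are merged when the gap between them is small relative to how old they are. This is not Layla's actual algorithm; the `cluster_by_time` function, the `base_gap` and `dilation` parameters, and the sample timestamps are all assumptions made for illustration.

```python
def cluster_by_time(
    timestamps: list[float],
    now: float,
    base_gap: float = 3600.0,
    dilation: float = 0.1,
) -> list[list[float]]:
    """Group timestamps (in seconds) into clusters, oldest first.

    Two neighbouring memories share a cluster when the gap between them is
    at most base_gap + dilation * age, where age is how long ago the newer
    of the two occurred. Older memories therefore tolerate larger gaps and
    merge into bigger clusters, while recent ones stay finely separated.
    """
    clusters: list[list[float]] = []
    for ts in sorted(timestamps):
        if clusters:
            previous = clusters[-1][-1]
            allowed_gap = base_gap + dilation * (now - ts)
            if ts - previous <= allowed_gap:
                clusters[-1].append(ts)
                continue
        clusters.append([ts])
    return clusters


HOUR = 3600
DAY = 24 * HOUR
now = 200 * DAY

memories = [
    0,                # about 200 days ago
    5 * DAY,          # five days after that
    now - 10 * HOUR,  # ten hours ago
    now - 5 * HOUR,   # five hours ago
]
clusters = cluster_by_time(memories, now)
```

With these sample values, the two old memories end up in one cluster even though they are five days apart, while the two recent memories stay in separate clusters despite being only five hours apart.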
Improvements:
- significant improvements to the performance of chatting, especially for very long conversations spanning hundreds/thousands of messages
- long-term memory will now recall earlier memories from the same conversation that have been cycled out through context shifting
- fixed a few conditions in triggering new search agent
- supports the new Ministral-3 family [THINK] tags
Bug fixes:
- fixed a bug where invalid filename in References mini-app causes Layla to crash
- fixed a bug where characters list will sometimes show nothing when updating a filter
- fixed a bug where character list resets to the first page after edit/saving a character
- fixed a bug where regenerate messages leaves a large amount of empty space when the length of the regenerated message varies largely
- fixed a bug where automatically searching for models in OpenAI compatible API endpoints was not working
- fixed a bug where Layla does not stop the server from generating messages even after pressing "stop"
- fixed a bug where voice chat does not speak aloud responses
- fixed a bug where audio is only played through speakers and not AirPods
6.4.0 Jan 2
New features:
- Layla supports Qwen3-VL family models: images are automatically down-sized before sending to the LLM to keep processing times reasonable on mobile, you can control how much down-sizing occurs in Advanced Settings
- Layla supports animated PNGs and GIFs for your character cards
Improvements:
- added setting in Reference documents to control the threshold of how long a document needs to be before it is summarised instead of injected wholly into the context
- prompt templates are now case insensitive
- display character full name in chat instead of just the first name
- automatically convert JPGs to PNGs when exporting your characters as a TavernPNG
- you can share text with Layla to start a chat with that text
Bug fixes:
- fixed a bug where the document name was not detected correctly when adding a document from chat
- fixed a bug where character sorting changes back to the default after exiting chat
- fixed a bug where tool calls are included in long-term memory messages
- fixed a bug where Layla crashes on importing a huge TavernPNG card
6.3.0 11/25/2025
- fixed bug where option toggles are not showing
6.2.0 10/29/2025
- improved stability in chatting
- fixed a few bugs in edit/regen
- fixed bugs related to the calendar agent
6.1.2 10/23/2025
Layla 6 supports Agents!
Agents are fully configurable, self-contained workflows that can be triggered and executed by Layla during chats. Their functionalities can range from injecting simple context, to fully automated workflows such as reading a webpage and saving it as a reference document. Each agent is also modular: you can attach them to your own characters; they will still complete their respective tasks, but under the personality of your attached character.
An agent is comprised of two parts: triggers and tools. An agent can have multiple triggers and call multiple tools. You are free to mix and match them in any combination when creating your own agents!
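The trigger-and-tool structure described above can be pictured with a small sketch. The `Agent` class, the `handle` method, and the example trigger and tool functions are hypothetical and exist only to show the shape of the idea; Layla's real agent configuration may look quite different.

```python
from dataclasses import dataclass, field
from typing import Callable

Trigger = Callable[[str], bool]  # decides whether the agent should run
Tool = Callable[[str], str]      # does some work and returns a result


@dataclass
class Agent:
    """Hypothetical agent: any number of triggers, any number of tools."""
    name: str
    triggers: list[Trigger] = field(default_factory=list)
    tools: list[Tool] = field(default_factory=list)

    def handle(self, message: str) -> list[str]:
        """Run every tool if any trigger fires; otherwise do nothing."""
        if not any(trigger(message) for trigger in self.triggers):
            return []
        return [tool(message) for tool in self.tools]


def mentions_url(message: str) -> bool:
    return "http://" in message or "https://" in message


def asks_to_save(message: str) -> bool:
    return "save this" in message.lower()


def word_count(message: str) -> str:
    return f"{len(message.split())} words"


note_agent = Agent(
    name="note-taker",
    triggers=[mentions_url, asks_to_save],
    tools=[word_count],
)
results = note_agent.handle("Please save this note")
```

Keeping triggers and tools as separate lists is what allows them to be mixed and matched: the same tool can sit behind different triggers in different agents.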
New features:
- Long-term memory has been significantly improved!
- LTM now implements a 4-stage retrieval process, to balance speed and quality:
  - “raw memories” are created immediately as you chat; they are available immediately when starting the next conversation, no processing time needed
  - “embeddings” are created in the background; this process is relatively fast, and will be available 5-10 minutes after a conversation ends
  - “summaries” are created as a background process, when your phone is idle; this process can either be done by the built in summariser, or a chosen LLM
  - “knowledge graphs” are created last, with all the information available in the previous step, and require an LLM to process
- During recall, memories are retrieved starting from “raw memories”, and re-ranked by content in each level as they become available, so you will immediately get memories, but the injections will increase in quality as new memories are ingested
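One way to picture the staged recall described in the list above is to rank each memory by the score from the most advanced stage that has finished processing for it. The sketch below does exactly that. It is a simplification under stated assumptions: the `Memory` class, the stage names, the sample scores, and the idea that scores from different stages are directly comparable are all invented for illustration and do not describe Layla's implementation.

```python
from dataclasses import dataclass

# Stage names in the order they become available (names are assumed).
STAGES = ["raw", "embedding", "summary", "knowledge_graph"]


@dataclass
class Memory:
    text: str
    # Hypothetical relevance scores, one per stage that has finished
    # processing for this memory. "raw" is always present.
    scores: dict[str, float]


def best_stage(memory: Memory) -> str:
    """Return the most advanced stage processed for this memory."""
    for stage in reversed(STAGES):
        if stage in memory.scores:
            return stage
    raise ValueError("memory has no processed stage")


def recall(memories: list[Memory], top_k: int = 2) -> list[Memory]:
    """Rank memories by the score from each one's most advanced stage."""
    ranked = sorted(
        memories,
        key=lambda m: m.scores[best_stage(m)],
        reverse=True,
    )
    return ranked[:top_k]


memories = [
    Memory("likes hiking", {"raw": 0.9}),
    Memory("has a cat named Miso", {"raw": 0.4, "embedding": 0.8}),
    Memory("works night shifts", {"raw": 0.7, "embedding": 0.3, "summary": 0.2}),
]
top = [m.text for m in recall(memories)]
```

A freshly created memory with only a raw score can still be recalled straight away, and its ranking is refined as later stages finish in the background.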
Improvements:
- support for Gemma 3n models
- improved prompt format slightly for vision models
- backup and restore data now prevents back button presses or navigation to avoid accidentally corrupting your data if you move away from the screen
- added ability to configure a character level image generation prompt prefix
- LTM no longer processes think tags from reasoning models
- performance improvements to rendering chat messages
- Layla will now automatically select the best prompt format after you've selected your model
- backups & restore will now also backup your OpenAI inference settings
Bug fixes:
- fixed a bug where favourite characters were not being saved
- fixed a bug where searching for language names in the Offline Translator app was searching for the language's English name instead of the name in the UI language
- fixed a bug in native speech-to-text where it stops listening after the first round
- fixed a bug where horoscopes were not showing
- fixed a bug where a character with an empty description stops all characters from showing up
- fixed a bug where OpenAI inference settings were not backed up
6.0.3 10/07/2025
- fixed bug where Metal GPU inference was not working
5.6.3 07/10/2025
Improvements:
- added support for Kroko ASR (speech-to-text model, this is now the default STT model in Layla)
- ability to configure KV Cache quantisation settings in Advanced Settings
- added French, Portuguese, and Spanish UI translations
Bug fixes:
- fixed bug where the correct number of characters in Roleplay was not showing
- fixed bug where chat images cannot be expanded when tapped on
- fixed bug where slightly moving your finger during hold-to-speak mode stops Layla from listening
- fixed bug where sometimes phone native voices are not being shown or not working in text-to-speech
5.6.1 06/20/2025
- fixed bug where edit and delete actions were not showing for previous messages
- removed animated backgrounds temporarily due to a bug
5.5.1 05/27/2025
New features:
- app UI has been translated to multiple languages! Currently supports: English, Chinese, Korean, Polish, Russian
- Offline Transcription: transcribe audio such as lectures, audiobooks, or videos in real time
- all preset characters now have animated backgrounds!
Improvements:
- allow configuring minimum silence length and other settings for speech-to-text
- made stop generating slightly more responsive
- improved UI performance when generating responses, should reduce speed inconsistencies where response times sometimes start slow and speeds up at the end of the message
- added TopK in advanced settings
Bug fixes:
- fixed bug where DuckDuckGo news search was not working
- fixed bug where multiple regens make the model increasingly incoherent
- fixed bug where text-to-speech sometimes pauses unnecessarily after the first sentence
5.5.0 05/24/2025
- fixed bug where the AI sometimes repeats what you said before responding
5.4.2 04/18/2025
- fixed a small bug where chat screen is not redirecting to Layla Cloud login if you are logged out
5.4.1 04/14/2025
New features:
- Gemma3 family models support!
- added new CPU stable diffusion models
Improvements:
- Layla supports displaying LaTeX formulae in chat messages
- backup data will now backup your custom prompts
- added ability to copy generated code in chat messages
- added support for MultiTTS voices
- added ability to continue generating a message from the LLM
Bug fixes:
- fixed bug where GPU inference was not working with Background Inference
5.4.0 04/08/2025
New features:
- added a new animated character
- added ability to duplicate characters (including preset characters and animated characters, which will allow you to use the animation for your own characters)
- added XTC samplers in Advanced Settings
- added support for Phi-4 family models
Improvements:
- added feature to generate image prompt using your selected LLM in SD mini-app
- improved animated character lips movement; lips will start moving when sound is played instead of on LLM output
Bug fixes:
- fixed bug where saving backup data directly to file system does not work for very large backups
- fixed bug where character info is not updated after editing until a reload
- fixed bug where deleting a character does not refresh the characters list
- fixed bug where TTS voices were not playing in phone silent mode
- fixed bug where TTS voices do not play after stopping it once
- fixed bug in image generation resulting in "workspace alignment" issues
- fixed bug where character tags are not showing in the character creator
- fixed bug where deleted characters were still sending proactive messages
5.3.0 03/05/2025
New features:
- added iniVerse Mix Layla Cloud model
- added Cohere API support
- added support for importing webpages as Lorebooks
- added support for Kokoro TTS model
Improvements:
- optimised Character Selection page: added sorting options, optimised loading a large number of characters
Bug fixes:
- fixed bug where {{instruction}} tags were being force injected for some characters
5.2.0 02/19/2025
New features:
- Layla supports reasoning models Deepseek R1 family!
Improvements:
- redesigned Lorebook UI to handle lots of documents better
- improved UI of model import
- added timestamps to Long-term Memory table view
- backup data now directly allows you to choose a folder to save to
- added a Download Manager app to give the ability to view/cancel download tasks in case they get stuck
- added Whisper Base and Whisper Base (English) models
- added ability to configure the language Whisper models listen in
- Q4_0 quants are now automatically converted on the fly to support your current architecture
- allows saving TavernPNG directly to file system in character creation
- supports sherpa-onnx TTS engine APK
- redesigned chat message quick actions (copy button is now always visible, tap & hold the message to bring up a context menu with more action)
- Create Character (AI) image generation now uses the default negative prompt configured in the SD mini-app
Bug fixes:
- fixed bug when importing chat history
- fixed bug in Layla Cloud when handling very long conversation histories
- fixed bug where an error in one memory will stop ingestion of all LTM memories
- fixed bug where too many quick actions take up all your screen in chat
- fixed bug where chat accent colour was not being applied to character responses
- fixed bug in default character image generation fallback phrase
5.1.0 01/27/2025
- fixed bug where sometimes LTM says task not found
- fixed bug where sometimes messages were not being sent properly to Layla Cloud
5.0.4 01/12/2025
Improvements:
- default characters can now generate images based on your Inference Settings
Bug fixes:
- fixed a bug where Diagnostics was not working
- fixed bug "zip headers not found" when backing up data
- fixed bug where Lorebooks overall summary injection was not working
- fixed bug where Lorebooks max injection number setting was not working
- fixed a bug where multiple same memories are being injected into the same conversation
- fixed a bug where multiple memories were not being injected
5.0.3 01/07/2025
Data Not Linked to You The following data may be collected but it is not linked to your identity: