Model Setup and Memory Planning
- Higher-memory configurations can expose 4B-parameter models quantized to 8-bit.
- These larger 4B model options are not shown in the iOS model picker.
- On lower-memory devices, use the default low-memory model profile.
- Selecting oversized models on low-memory devices can cause unpredictable responses and app instability, including possible crashes due to memory pressure.
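To make the memory concern concrete, the weights of a quantized model alone take roughly (parameter count × bits per weight ÷ 8) bytes. The sketch below applies that arithmetic to a 4B-parameter, 8-bit model. The function names, the 20% overhead allowance for caches and runtime buffers, and the example budgets are illustrative assumptions, not values taken from the app.

```python
def estimate_weight_bytes(num_params: int, bits_per_weight: int) -> int:
    """Approximate size of the model weights alone, in bytes."""
    return num_params * bits_per_weight // 8


def fits_in_budget(num_params: int, bits_per_weight: int,
                   budget_bytes: int, overhead_percent: int = 20) -> bool:
    """Rough check that weights plus an assumed overhead fit in a memory budget.

    overhead_percent is a hypothetical allowance for the KV cache, activations,
    and runtime buffers; real overhead depends on the runtime and context length.
    """
    weights = estimate_weight_bytes(num_params, bits_per_weight)
    needed = weights + weights * overhead_percent // 100
    return needed <= budget_bytes


GIB = 1024 ** 3

# A 4B-parameter model at 8 bits per weight needs about 4 GB for weights alone.
weights_4b_q8 = estimate_weight_bytes(4_000_000_000, 8)
print(f"{weights_4b_q8 / GIB:.2f} GiB")

# Hypothetical budgets: how much memory the app can actually use on a device.
print(fits_in_budget(4_000_000_000, 8, budget_bytes=6 * GIB))
print(fits_in_budget(4_000_000_000, 8, budget_bytes=4 * GIB))
```

The usable budget on a given device is typically well below its total RAM, since the operating system and other apps also need memory, so a check like this is only a first-pass filter before testing stability on real hardware.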
Output synthesis controls: Advanced settings (temperature, top-p, repetition controls, output token limits) directly affect response style and length. For practical tuning steps, see the User Guide and the Tools Guide.
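As a rough illustration of why temperature and top-p change response style, here is a minimal sketch of the commonly used definitions: temperature rescales the logits before softmax, and top-p (nucleus) sampling keeps only the most likely tokens whose cumulative probability reaches the threshold. This is a generic sketch, not the app's implementation, and all function names are hypothetical.

```python
import math
import random


def apply_temperature(logits, temperature):
    """Softmax over logits scaled by 1/temperature (lower = more peaked)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs, top_p):
    """Keep the smallest set of top tokens whose cumulative probability
    reaches top_p, then renormalize. Returns {token_id: probability}."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


def sample_token(logits, temperature=1.0, top_p=1.0, rng=None):
    """Draw one token id using temperature scaling followed by top-p filtering."""
    rng = rng or random.Random()
    nucleus = top_p_filter(apply_temperature(logits, temperature), top_p)
    ids = list(nucleus)
    return rng.choices(ids, weights=[nucleus[i] for i in ids], k=1)[0]
```

Under these definitions, lowering the temperature or top-p concentrates sampling on the most likely tokens, which tends to make output more predictable, while raising them allows more varied wording.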
Recommendation: Keep the default low-memory model profile on lower-memory devices unless you have validated stability for your workload.