Now supporting DeepSeek-V3 & Qwen2.5

Your Private AI Agents,
Always On-Device.

Run massive open-source models offline. Create custom "Pals" for coding, writing, or analysis. No data leaves your phone.

100% Private
Offline Inference
Unlimited Pals
Coding Pal
Online (Local)
Explain how transformers work in simple terms.

Think of a transformer model as a sentence translator that pays attention to every word at once, rather than reading left to right...

class Transformer:
  def __init__(self):
    self.attention = ...
Speed: 45 tokens/s
Model Size: 72B Loaded