Apple Using Google Gemini Access for On-Device AI Model Distillation

Apple is leveraging Google's Gemini AI models to create smaller, on-device versions through distillation. According to The Information, Google gave Apple "complete access" to Gemini in Google's own data centers, allowing Apple to customize the model for Siri and other AI features.
How the Distillation Process Works
Apple can prompt the full Gemini model to perform tasks and return high-quality results along with a step-by-step account of its reasoning. Apple then uses those answers and reasoning traces as training data for smaller, cheaper models. This lets the smaller models learn to approximate how Gemini arrives at its answers, producing efficient models that approach Gemini-like performance while requiring less computing power.
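The data-collection half of this process can be sketched in a few lines of Python. This is a minimal illustration, not Apple's or Google's actual pipeline: `query_teacher`, `TeacherOutput`, the canned response, and the `Reasoning:` / `Answer:` target format are all hypothetical names and choices made up for the example. It shows only how teacher answers and reasoning might be packaged into supervised examples for a smaller student model.

```python
import json
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TeacherOutput:
    reasoning: str  # step-by-step rationale returned by the teacher
    answer: str     # final answer returned by the teacher


def query_teacher(prompt: str) -> TeacherOutput:
    """Hypothetical stand-in for a call to the large teacher model.

    A real pipeline would send the prompt to the teacher and parse its
    response; the output here is canned so the sketch runs offline.
    """
    canned = {
        "What is 12 * 7?": TeacherOutput(
            reasoning="12 * 7 = (10 * 7) + (2 * 7) = 70 + 14.",
            answer="84",
        ),
    }
    return canned[prompt]


def build_training_record(prompt: str) -> Dict[str, str]:
    """Turn one teacher response into a supervised example for the student."""
    out = query_teacher(prompt)
    # The student is trained to emit the reasoning first, then the answer.
    target = f"Reasoning: {out.reasoning}\nAnswer: {out.answer}"
    return {"input": prompt, "target": target}


def build_dataset(prompts: List[str]) -> List[Dict[str, str]]:
    """Collect one training record per prompt."""
    return [build_training_record(p) for p in prompts]


if __name__ == "__main__":
    # Emit one JSON object per line, a common format for fine-tuning data.
    for record in build_dataset(["What is 12 * 7?"]):
        print(json.dumps(record))
```

The resulting records would then be fed to an ordinary supervised fine-tuning job for the smaller model; that training step is omitted here because the source does not describe how Apple performs it.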
Technical Details and Challenges
- Apple can design models built to run on Apple devices without internet connectivity
- Apple can edit Gemini as needed to ensure responses align with Apple's requirements
- Apple has run into problems because Gemini was tuned for chatbot and coding applications, a focus that does not always match Apple's needs
- The smarter, chatbot version of Siri planned for iOS 27 will rely on Google's Gemini models
Capabilities and Development
Siri will be able to perform many of the same functions as Gemini and other chatbots, including:
- Answering questions
- Summarizing information
- Scanning and understanding uploaded documents
- Telling stories
- Providing emotional support
- Completing tasks like booking travel
The Apple Foundation Models team continues to work on Apple AI models that are distinct from the Gemini models, which suggests the Gemini arrangement is a transitional approach while Apple develops its own AI capabilities.