Google’s Gemini provides language models with built-in tools for search, code execution, RAG, and URL context. In LLM mode, speech-to-text (STT) and text-to-speech (TTS) must be provided separately.
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.
Installation
Quick Start
Built-in Tools
Gemini provides built-in tools you can enable:

| Tool | Description |
|---|---|
| GoogleSearch | Ground responses with web data |
| CodeExecution | Run Python code |
| FileSearch | RAG over your documents |
| URLContext | Read specific web pages |
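
As a rough illustration of how the table above maps agent needs to tools, here is a minimal, standard-library-only sketch. The tool names and descriptions come from the table; the `BuiltinTool` enum and the `pick_tools` helper are hypothetical stand-ins for illustration and are not part of the Vision Agents or Gemini API.

```python
from enum import Enum


class BuiltinTool(Enum):
    """Hypothetical stand-in for the four built-in tools in the table above."""

    GOOGLE_SEARCH = "GoogleSearch"
    CODE_EXECUTION = "CodeExecution"
    FILE_SEARCH = "FileSearch"
    URL_CONTEXT = "URLContext"


# Descriptions copied from the table above.
DESCRIPTIONS = {
    BuiltinTool.GOOGLE_SEARCH: "Ground responses with web data",
    BuiltinTool.CODE_EXECUTION: "Run Python code",
    BuiltinTool.FILE_SEARCH: "RAG over your documents",
    BuiltinTool.URL_CONTEXT: "Read specific web pages",
}


def pick_tools(
    *,
    web_grounding: bool = False,
    run_code: bool = False,
    own_documents: bool = False,
    specific_pages: bool = False,
) -> list[BuiltinTool]:
    """Return the tools matching the capabilities an agent needs (assumed helper)."""
    wanted = [
        (web_grounding, BuiltinTool.GOOGLE_SEARCH),
        (run_code, BuiltinTool.CODE_EXECUTION),
        (own_documents, BuiltinTool.FILE_SEARCH),
        (specific_pages, BuiltinTool.URL_CONTEXT),
    ]
    # Keep only the tools whose flag is set, preserving table order.
    return [tool for needed, tool in wanted if needed]


if __name__ == "__main__":
    for tool in pick_tools(web_grounding=True, own_documents=True):
        print(f"{tool.value}: {DESCRIPTIONS[tool]}")
```

Enabling only the tools an agent actually needs keeps the request surface small; how the selected tools are passed to the Gemini plugin is not shown here and should be taken from the plugin's own reference.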
File Search (RAG)
Managed RAG with automatic chunking and retrieval.
Function Calling
Events
The Gemini plugin no longer documents provider-specific event classes for normal app code. Prefer core events and provider-agnostic response handling.
Next Steps
Gemini Realtime: Speech-to-speech with optional video
Build a Voice Agent: Get started with voice

