firebase-ai-logic-basics
firebase/agent-skills
Integrate Gemini API into web apps with Firebase AI Logic for multimodal inference and structured output.
What is firebase-ai-logic-basics?
Firebase AI Logic enables developers to add generative AI capabilities to web and mobile applications using client-side SDKs that call Gemini models directly. Use it for text generation, image/audio/video analysis, chat sessions, and structured JSON responses without managing a backend.
- Text-only and multimodal generation (images, audio, video, PDFs)
- Multi-turn chat sessions with automatic history management
- Streaming responses for real-time user feedback
- Structured JSON output with schema enforcement
- On-device hybrid inference for web apps (Gemini Nano)
- Search grounding with built-in Google Search tool
How to install firebase-ai-logic-basics
npx skills add https://github.com/firebase/agent-skills --skill firebase-ai-logic-basics- Node.js 16+ and npm installed
- Firebase project with at least one app associated
- App Check configured (reCAPTCHA Enterprise) for production security
- Run `npx firebase-tools init ailogic` to provision the service
How to use firebase-ai-logic-basics
- 1.Install Firebase SDK: `npm install -g firebase@latest`
- 2.Verify project setup: `npx firebase-tools projects:list`
- 3.Confirm app exists: `npx firebase-tools apps:list`
- 4.Initialize AI Logic: `npx firebase-tools init ailogic`
- 5.Import Firebase AI Logic in your code and initialize the Gemini model
- 6.Call `generateContent()` for single requests or `generateContentStream()` for streaming
- 7.Use `startChat()` for multi-turn conversations
- 8.For production, enable App Check and consider Remote Config for model version management
Use cases
- Image captioning and object detection in web applications
- Multi-turn conversational AI chatbots with session history
- Real-time streaming responses for improved UX
- Structured data extraction from unstructured content
- Dynamic model version management via Remote Config
- Web developers building AI-powered applications
- Mobile developers (Android, iOS, Flutter)
- Teams requiring enterprise-grade production AI (Vertex AI)
- Developers prototyping with free tier (Gemini Developer API)
firebase-ai-logic-basics FAQ
Use Gemini Developer API by default (free tier + pay-as-you-go). Switch to Vertex AI Gemini API only if you need enterprise-grade production readiness and have a Blaze plan.
No. Always check the Firebase AI Logic Models documentation for currently supported model names, as older models are shut down.
No. Firebase AI Logic allows you to call Gemini models directly from client-side SDKs without managing a dedicated backend.
Yes. App Check (with reCAPTCHA Enterprise) is critical for production to prevent unauthorized API quota usage and protect backend resources.
Store large files (>20MB) in Cloud Storage for Firebase and pass their URLs to the Gemini API instead of inline data.
Full instructions (SKILL.md)
Source of truth, from firebase/agent-skills.
name: firebase-ai-logic-basics description: Official skill for integrating Firebase AI Logic (Gemini API) into web applications. Covers setup, multimodal inference, structured output, and security. version: 1.0.1
Firebase AI Logic Basics
Overview
Firebase AI Logic is a product of Firebase that allows developers to add gen AI to their mobile and web apps using client-side SDKs. You can call Gemini models directly from your app without managing a dedicated backend. Firebase AI Logic, which was previously known as "Vertex AI for Firebase", represents the evolution of Google's AI integration platform for mobile and web developers.
It supports the two Gemini API providers:
- Gemini Developer API: It has a free tier ideal for prototyping, and pay-as-you-go for production
- Vertex AI Gemini API: Ideal for scale with enterprise-grade production readiness, requires Blaze plan
Use the Gemini Developer API as a default, and only Vertex AI Gemini API if the application requires it.
Setup & Initialization
Prerequisites
- Before starting, ensure you have Node.js 16+ and npm installed. Install them if they aren’t already available.
- Identify the platform the user is interested in building on prior to starting: Android, iOS, Flutter or Web.
- If their platform is unsupported, Direct the user to Firebase Docs to learn how to set up AI Logic for their application (share this link with the user https://firebase.google.com/docs/ai-logic/get-started)
Installation
The library is part of the standard Firebase Web SDK.
npm install -g firebase@latest
If you're in a firebase directory (with a firebase.json) the currently selected project will be marked with "current" using this command:
npx -y firebase-tools@latest projects:list
Ensure there's at least one app associated with the current project
npx -y firebase-tools@latest apps:list
Initialize AI logic SDK with the init command
npx -y firebase-tools@latest init ailogic
This will automatically enable the Gemini Developer API in the Firebase console.
More info in Firebase AI Logic Getting Started
Core Capabilities
[!WARNING] CRITICAL: Use current model names: Always check the Firebase AI Logic Models documentation for the currently supported model names. Do NOT use
gemini-2.0-proorgemini-2.0-flashor other older models that are shutdown.
Text-Only Generation
Multimodal (Text + Images/Audio/Video/PDF input)
Firebase AI Logic allows Gemini models to analyze image files directly from your app. This enables features like creating captions, answering questions about images, detecting objects, and categorizing images. Beyond images, Gemini can analyze other media types like audio, video, and PDFs by passing them as inline data with their MIME type. For files larger than 20 megabytes (which can cause HTTP 413 errors as inline data), store them in Cloud Storage for Firebase and pass their URLs to the Gemini Developer API.
Chat Session (Multi-turn)
Maintain history automatically using startChat.
Streaming Responses
To improve the user experience by showing partial results as they arrive (like a
typing effect), use generateContentStream instead of generateContent for
faster display of results.
Generate Images with Nano Banana
[!WARNING] Use current Image model names: Always check the Firebase AI Logic Models documentation for the currently supported image generation (Nano Banana) model names.
- Requires an upgraded Blaze pay-as-you-go billing plan.
Search Grounding with the built in googleSearch tool
Supported Platforms and Frameworks
Supported Platforms and Frameworks include Kotlin and Java for Android, Swift for iOS, JavaScript for web apps, Dart for Flutter, and C Sharp for Unity.
Advanced Features
Structured Output (JSON)
Enforce a specific JSON schema for the response.
On-Device AI (Hybrid)
Hybrid on-device inference for web apps, where the Firebase Javascript SDK automatically checks for Gemini Nano's availability (after installation) and switches between on-device or cloud-hosted prompt execution. This requires specific steps to enable model usage in the Chrome browser, more info in the hybrid-on-device-inference documentation.
Security & Production
App Check
[!WARNING] Critical Safety Requirement: In order to use AI Logic safely, you MUST set up App Check on your app. This prevents unauthorized clients from using your API quota and accessing your backend resources.
See App Check with reCAPTCHA Enterprise for setup instructions.
Remote Config
Consider that you do not need to hardcode model names (e.g., a specific model version string). Use Firebase Remote Config to update model versions dynamically without deploying new client code. See Changing model names remotely
[!WARNING] CRITICAL: Backend Provisioning Required For all platforms (Flutter, Android, iOS, Web), you MUST run
npx firebase-tools init ailogicto provision the service.flutterfire configureONLY handles client configuration and does NOT enable the AI service, leading toPERMISSION_DENIEDerrors.
Initialization Code References
| Language, Framework, Platform | Gemini API provider | Context URL |
|---|---|---|
| Web Modular API | Gemini Developer API (Developer API) | firebase://docs/ai-logic/get-started |
| iOS (Swift) | Gemini Developer API | ios_setup.md |
| Flutter (Dart) | Gemini Developer API | flutter_setup.md |
[!WARNING] CRITICAL: Use current model names: Always check the Firebase AI Logic Models documentation for the currently supported model names. Do NOT use
gemini-2.0-proorgemini-2.0-flashor other older models that are shutdown.
References
Web SDK code examples and usage patterns iOS SDK code examples and usage patterns Flutter SDK code examples and usage patterns
Related skills
More from firebase/agent-skills and the wider catalog.
firebase-basics
Firebase CLI setup, authentication, and project management for agent workflows
firebase-auth-basics
Set up Firebase Authentication for user sign-in, management, and secure data access.
firebase-hosting-basics
Deploy static web apps, SPAs, and microservices to Firebase Hosting with a single command.
firebase-app-hosting-basics
Deploy full-stack Next.js and Angular apps with Firebase App Hosting.
firebase-data-connect
Build and deploy PostgreSQL-backed Firebase backends with GraphQL, auto-generated queries, and type-safe SDKs.
developing-genkit-js
Develop AI-powered applications using Genkit in Node.js/TypeScript with flows, tools, and multi-provider support.