Archived 4 Dec 2024 17:11:08 UTC from cloud.google.com (archive.today webpage capture)

| Offering | Best for | Key features |
|---|---|---|
| | Quick and easy integration of basic vision features. | Prebuilt features such as image labeling, face and landmark detection, OCR, and safe search. Cost-effective, pay-per-use. |
| | Extracting insights from scanned documents and images; automating document workflows. | OCR (powered by Gen AI), NLP, and ML for document understanding: text extraction, entity identification, and document categorization. |
| | Analyzing video content, content moderation and recommendation, media archives, and contextual ads. | Object detection and tracking, scene understanding, activity recognition, face detection and analysis, text detection and recognition. |
| | Automating visual inspection tasks in manufacturing and industrial settings. | Detecting anomalies, detecting and locating defects, and checking assembly. |
| | Building and deploying custom models for specific needs. | Data preparation tools, model training and deployment, complete control over your solution. Requires technical expertise. |
| | Visual analysis and understanding, multimodal question answering. | Information seeking, object recognition, digital content understanding, structured content generation, captioning and description, and extrapolation. |
| | Automated image descriptions, image classification and search, content moderation and recommendations. | Image generation, image editing, visual captioning, and multimodal embedding. See the full list of features and their launch stages. |

How Vision AI pricing works: each vision offering has a set of features or processors, and each of these is priced differently. Check the detailed pricing pages for specifics.

| Product/Service | Free tier | Discounted pricing | Details |
|---|---|---|---|
| Vision API | First 1,000 units every month are free | 5,000,001+ units per month | |
| Document AI | N/A. Pricing varies by processor. | 5,000,001+ pages per month for Enterprise Document OCR Processor | |
| Video Intelligence API | First 1,000 minutes per month are free | 100,000+ minutes per month | |
| Vertex AI Vision | N/A. Pricing varies by feature. | | |
| Imagen (multimodal embeddings) | US $0.0001 per image input | | |
| Imagen (visual captioning) | US $0.0015 per image | | |
| Gemini Pro Vision | | | |
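
As a rough illustration of how the figures above combine, the following Python sketch estimates Imagen costs from the two per-image prices listed in the table and computes how many units fall outside a 1,000-unit monthly free tier. The function names `estimate_imagen_cost` and `billable_units` are hypothetical helpers, not part of any Google Cloud library. Per-unit prices beyond the free tier are not given here, so the sketch stops at counting billable units rather than pricing them.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the pricing table above.
IMAGEN_PRICE_PER_IMAGE = {
    "multimodal_embeddings": Decimal("0.0001"),
    "visual_captioning": Decimal("0.0015"),
}


def estimate_imagen_cost(feature: str, image_count: int) -> Decimal:
    """Estimate the USD cost of processing image_count images (hypothetical helper)."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    # Decimal avoids the rounding noise that binary floats add to small prices.
    return IMAGEN_PRICE_PER_IMAGE[feature] * image_count


def billable_units(units_used: int, free_units: int = 1000) -> int:
    """Return the units left after subtracting a monthly free tier.

    The default of 1,000 matches the free tier listed for the Vision API
    (units) and the Video Intelligence API (minutes).
    """
    return max(0, units_used - free_units)


if __name__ == "__main__":
    print(estimate_imagen_cost("visual_captioning", 10_000))
    print(estimate_imagen_cost("multimodal_embeddings", 10_000))
    print(billable_units(2_500))
```

Under these assumptions, captioning 10,000 images comes to US $15 and embedding 10,000 images comes to US $1, while 2,500 Vision API units in a month leave 1,500 billable units. Actual charges depend on the detailed pricing pages, which may differ from this snapshot.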