Multimodal Classification
Classify inputs combining multiple image modalities with optional text
Multimodal classification models classify inputs combining images from multiple modalities with optional text prompts.
Available Models
- ResNet18 Multi-Image – Classify inputs from multiple image modalities using ResNet18
- Multimodal Classification – Universal multimodal classification combining multiple image sources and feature columns