Data Modalities
Different types of data AI works with
Data comes in different types called modalities.
The most common modalities are:
- Text (words, sentences, documents)
- Images (photos, drawings, screenshots)
- Audio (speech, music, sounds)
- Video (moving images with sound)
- Tabular (rows and columns of numbers or categories)
Each modality requires different file formats to store it.
Multimodal AI
Some AI models work with only one modality. Other models are multimodal and can work with multiple modalities at the same time.
Example: A multimodal model can understand both text and images together, like answering questions about a photo.