multimodal

/ˌmʌltiˈmoʊdəl/

Generative AI

Of an AI model: able to process more than one type of input, such as text and images.

Multimodal models can describe images and answer questions about them.

Origin: From Latin multi- (many) + modal, from Latin modus (manner, mode).