Multimodal language models | Zoonk

Large Language Models

18. Multimodal language models

Work with models that combine text with images, audio, video, documents, or screen actions. This chapter covers vision-language models, OCR-like use cases, multimodal prompting, and evaluation problems unique to non-text inputs.

Chapter not available

This chapter hasn't been created yet.
