Skip to content

Multimodal

Papers on vision-language models and multimodal systems.

Overview

This section contains 1 paper covering:

  • LLaVA Scaling - Systematic scaling study from 7B to 70B parameters