Multimodal¶ Papers on vision-language models and multimodal systems. Overview¶ This section contains 1 paper covering: LLaVA Scaling - Systematic scaling study from 7B to 70B parameters