Kosmos-2 is a multimodal large language model (MLLM) designed to ground text in the visual world, which lets it understand and reason about visual elements in images.
Microsoft Research
February 6, 2024
Microsoft Research Terms of Use: By using this model, you are agreeing to the terms and conditions of the license, acceptable use policy and Microsoft Research privacy policy.