Description
Kosmos-2 is a multimodal large language model (MLLM) designed to ground text to the visual world, which enables it to understand and reason about visual elements in images.
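To make "grounding text to the visual world" concrete, the sketch below shows one way a grounded caption could be represented: each phrase is tied to a character span in the caption and to a bounding box in the image. This is an illustration only. The `GroundedPhrase` record, the `to_pixels` helper, the caption, and the box values are all hypothetical and are not Kosmos-2's API or its actual output format.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class GroundedPhrase:
    """Hypothetical record linking a text span to an image region."""
    text: str
    span: Tuple[int, int]                    # character offsets into the caption
    box: Tuple[float, float, float, float]   # normalized (x0, y0, x1, y1) in [0, 1]


def to_pixels(box, width, height):
    """Scale a normalized box to integer pixel coordinates."""
    x0, y0, x1, y1 = box
    return (round(x0 * width), round(y0 * height),
            round(x1 * width), round(y1 * height))


# Made-up caption and boxes, used only to illustrate the data shape.
caption = "A snowman warming himself by a fire"
phrases = [
    GroundedPhrase("A snowman", (0, 9), (0.25, 0.125, 0.75, 0.875)),
    GroundedPhrase("a fire", (29, 35), (0.5, 0.75, 1.0, 1.0)),
]

for p in phrases:
    start, end = p.span
    # Each span should recover exactly the phrase it grounds.
    assert caption[start:end] == p.text
    print(p.text, "->", to_pixels(p.box, width=640, height=480))
```

Keeping the boxes normalized means the same record stays valid if the image is resized; pixel coordinates are computed only when a region needs to be drawn or cropped.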
Publisher
Microsoft Research
Modified
February 6, 2024
Microsoft Research Terms of Use: By using this model, you are agreeing to the terms and conditions of the license, acceptable use policy and Microsoft Research privacy policy.