Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, has launched two open-source large vision language models (LVLM), Qwen-VL and its conversationally fine-tuned Qwen-VL-Chat. The models can comprehend images, texts and bounding boxes in prompts and facilitate multi-round question answering in both English and Chinese. Qwen-VL is the multimodal version of Qwen-7B, Alibaba Cloud’s […]
The post Alibaba Cloud Launches Open-Source Large Vision Language Model with Image Comprehension Capability appeared first on UK Tech News.