On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7-billion-parameter foundational language model, Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios, including image- and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]