Share this article
Latest news
With KB5043178 to Release Preview Channel, Microsoft advises Windows 11 users to plug in when the battery is low
Copilot in Outlook will generate personalized themes for you to customize the app
Microsoft will raise the price of its 365 Suite to include AI capabilities
Death Stranding Director’s Cut is now Xbox X|S at a huge discount
Outlook will let users create custom account icons so they can tell their accounts apart easier
xAI unveils Grok-1.5 Vision with the capability of ‘understanding’ images
The new model outperforms its market rivals in the RealWorldQA benchmark
2 min. read
Published onApril 14, 2024
published onApril 14, 2024
Share this article
Read our disclosure page to find out how can you help Windows Report sustain the editorial teamRead more
Elon Musk’s xAI has recently announced its first multimodal model Grok-1.5 Vision, aka Grok 1.5V. This comes after the company’s last month’sannouncement of Grok-1 AIto take on ChatGPT.
The company’s first multimodal model Grok 1.5V not only understands text but is also capable of image processing. It can process everything it sees in documents, images, screenshots, charts, as well as diagrams. In arecent blog post, talking of Grok-1.5 Vision’s capabilities, the company mentioned:
Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs.
Grok-1.5 Vision outperforms its rival in the RealWorldQA benchmark
The company also detailed the advanced capabilities of the Grok-1.5 Vision with seven different samples which are as follows:
Musk-led AI company also shared a comparison chart to compare its first multimodal model with its rivals. Testing results show that Grok-1.5 Vision stands tall against its competitors like GPT-4 with Vision, Claud 3 Sonnet/Opus, and Gemini Pro 1.5.
While the results look promising, xAI’s Grok-1.5V outshines all its competitors in the RealWorldQA benchmark. According to the company, RealWorldQA is a newbenchmark designed to evaluate basic real-world spatial understanding capabilities of multimodal models.
Well, it is pretty clear that Musk’s AI company is in no mood to take the backseat and is aggressively making moves to keep up with its rival. However, we can’t deny the fact that its AI models have received a fair amount of criticism in the past. More recently,Grok AI was criticized for misinformationand more.
Lastly, Grok-1.5V will soon be available to the existing Grok users and early testers out there. So, if you are among the early testers of Grok-1.5 Vision, please share your experience of using it with our readers in the comments.
More about the topics:AI,twitter
Vlad Turiceanu
Windows Editor
Passionate about technology,Windows, and everything that has a power button, he spent most of his time developing new skills and learning more about the tech world.
Coming from a solid background in PC building and software development, with a complete expertise in touch-based devices, he is constantly keeping an eye out for the latest and greatest!
User forum
0 messages
Sort by:LatestOldestMost Votes
Comment*
Name*
Email*
Commenting as.Not you?
Save information for future comments
Comment
Δ
Vlad Turiceanu
Windows Editor
Coming from a solid background in PC building and software development, he’s a Windows 11 Privacy & Security expert.