Generative AI | Talking to Chatbots

WildVision Arena and the Battle of Multimodal AI: We Are Not the Same

Black and white line drawing generated by the ControlNet Canny model, depicting a woman in a wetsuit holding a surfboard on the beach. A text from CLIP Interrogator, describing an image, is superimposed over the drawing and reads: "a woman in a wet suit holding a surfboard on the beach with waves in the background and a blue sky, promotional image, a colorized photo, precisionism."

The intense competition in the chatbot space is reflected in the ever-increasing number of contenders in the LMSYS Chatbot Arena leaderboard, or in my modest contribution with the SCBN Chatbot Battles I’ve introduced in this blog and complete as time allows. Today we’re exploring WildVision Arena, a new project in Hugging Face Spaces that brings vision-language models to contend. The mechanics of WildVision Arena are similar to those of LMSYS Chatbot Arena. It is a crowd-sourced ranking based on people’s votes: you enter any image (plus an optional text prompt) and are presented with two responses from two different models, whose names stay hidden until you vote for the answer that looks better to you. I’m sharing a few examples of what I’ve tested so far, and we’ll end this post with a traditional ‘SCBN’ battle where I will evaluate the vision-language models based on my use cases.
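To make the idea of a vote-based ranking concrete, here is a minimal Python sketch of how pairwise votes could be turned into a leaderboard. Nothing above says how WildVision Arena actually computes its ranking, so the Elo-style update is an assumption chosen for illustration, as are the constants `K` and `INITIAL_RATING`, the function names, and the model names in the usage example.

```python
from collections import defaultdict

# Assumed constants for illustration; not taken from WildVision Arena.
K = 32                   # how far a single vote can move a rating
INITIAL_RATING = 1000.0  # rating given to a model on its first battle


def expected_score(rating_a, rating_b):
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def rank_from_votes(votes):
    """Build a leaderboard from pairwise votes.

    votes: iterable of (model_a, model_b, winner), where winner is
    'a', 'b', or 'tie'. Returns (model, rating) pairs, best first.
    """
    ratings = defaultdict(lambda: INITIAL_RATING)
    for model_a, model_b, winner in votes:
        score_a = {"a": 1.0, "b": 0.0, "tie": 0.5}[winner]
        exp_a = expected_score(ratings[model_a], ratings[model_b])
        delta = K * (score_a - exp_a)
        # Zero-sum update: what one model gains, the other loses.
        ratings[model_a] += delta
        ratings[model_b] -= delta
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical model names and votes.
    sample_votes = [
        ("model-x", "model-y", "a"),
        ("model-y", "model-z", "a"),
        ("model-x", "model-z", "tie"),
    ]
    for name, rating in rank_from_votes(sample_votes):
        print(f"{name}: {rating:.1f}")
```

Because each vote only compares two anonymous answers, the ranking emerges from many small updates rather than from any single judgment, which is why the model names can stay hidden until after the vote.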

Microsoft Copilot Pro: Chatbots’ Take on the Latest Tech News

Two computer monitors on a desk, each displaying a facial expression of annoyance or frustration. The monitor on the left displays a start menu with the label "Microsoft 365 Copilot", while the monitor on the right has a browser window open with tabs, one of which is labeled "ChatGPT Plus" and features the OpenAI logo. The setting suggests an office environment, and the background reveals a blue tiled wall, implying an indoor setting without any visible Windows. [Image produced with DALL-E and Gimp] [Caption assisted by ALT Text Artist GPT]

Microsoft has just announced the launch of its own ‘GPT Builder’ for customizing chatbots, similar to OpenAI’s ‘GPT Store’. This was part of a broader announcement of Copilot Pro, a premium AI-powered service for Microsoft 365 users to enhance productivity, code, and text writing. According to Satya Nadella’s announcement today on Threads, Microsoft and OpenAI appear to be competing entities, yet they are working on the same technology (GPT), augmented by Microsoft’s investment in OpenAI. It certainly seems like a strange business strategy for Microsoft. Please provide some insight into the move’s rationale and strategic motivations.
