Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 “The screenshot below shows the configuration screen of HumbleAI on HuggingChat, where I chose a model and entered the pre-prompt (System Instructions), as well as a logo and brief description. The full pre-prompt is available to all users by clicking on the “Settings” button. The model chosen by now is Nous-Hermes-2-Mixtral-8x7B-DPO, which scored slightly better than the other versions of Mistral, and and significantly better than the other two tested open-source models (Llama2 and OpenChat). In the case of the response above, the specificity was perhaps higher than desirable, as words I wrote in the system instructions are directly copied in the response. This has a negative impact …

HumbleAI configuration on HuggingChat with model Nous-Hermes-2-Mixtral-8x7B-DPO Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 Write a complete ALT text for this image [image: file-3O9FcHAx2xULEOxZ5fscdM4M] 🤖 “An image titled ‘Chatbot Battle: Is philosophy a science?’ displaying a table comparing three chatbots: HuggingChat (Llama2), Bard, and ChatGPT. The table has columns for ‘Chatbot’, ‘Rank (SCBN)’, ‘Specificity’, ‘Coherency’, ‘Brevity’, ‘Novelty’, and ‘Link’. HuggingChat (Llama2) is ranked as the ‘Winner’ with top scores in all categories. Bard is the ‘Runner-up’ with slightly lower scores, and ChatGPT is the ‘Contender’ with the lowest scores. Each score is represented by a combination of robot and red flag emojis. The ‘Link’ column offers options to ‘View Chat’ for each bot.” Hugging Face Dataset Metrics All the conversation prompts, responses, …

Complete ALT text for an image. Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 The ability for LLMs to efficiently and unrestrainedly search and browse the web might be the great next cornerstone of generative AI. Many market actors, likely distracted in the pursuit of Artificial General Intelligence, or the definitive Everything App, would likely not agree with this. Even the company behind the overwhelmingly greatest chunk of the pie of web search and browsing monetization seems to be taking a conservative approach, given how poorly their chatbot fares lately when it comes to using web search (regulatory issues and Gemini update aside).It was, surprisingly, Microsoft, not Google, that pioneered the integration of LLMs and web search after the launch of Bing …

Microsoft’s Copilot and ChatGPT: Analysis of AI advancements and chatbot performance Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 Particularly since ChatGPT-Vision came out last September, there’s been an increase in mainstream adoption of these kinds of models, as increasingly sophisticated versions are integrated into ChatGPT, the most popular chatbot, as well as the main competitors like Gemini (formerly Google Bard) and Copilot. The intense competition in the chatbot space is reflected in the ever-increasing amount of contenders in the LMSYS Chatbot Arena leaderboard, or in my modest contribution with the SCBN Chatbot Battles I’ve introduced in this blog and complete as time allows. Today we’re exploring WildVision Arena, a new project in Hugging Face Spaces that brings vision-language models to contend. The mechanics of WildVision Arena …

WildVision Arena: New project for ranking vision-language models through crowd-sourced voting. Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 Show me content about philosophy[talkingtochatbots.com][talkingtochatbots.com][talkingtochatbots.com][talkingtochatbots.com] 🤖 I found the following content on talkingtochatbots.com: 1. The Meme Erudite GPT Discusses Philosophy, AGI, and Correlation-Causation: This piece explores philosophical discussions involving AI, using humor and satire to comment on concerns about AI’s rapid advancement and its potential hidden dangers [ref].2. Is Philosophy a Science? Introducing SCBN Chatbot Battles: This post evaluates chatbot responses to the question of whether philosophy can be considered a science, using the SCBN (Specificity, Coherency, Brevity, Novelty) benchmark to assess their quality [ref].3. Ethical Choices Lay the Foundation for a Resilient Long-Term Network: Discusses the idea from Naval Ravikant that ethical behavior in business is beneficial …

Content about philosophy Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 Can you reduce this to approximately 750 characters with minimum alterations? Sentiment analysis metrics are key to model chatbot performance with linear regression. The main challenge to obtain value from a predictive model is having enough data points. Whether service is provided by a chatbot or a support agent, most customer service users don’t care to provide feedback, and when there is feedback it’s likely vague or biased. Nowadays there are many sentiment analysis and NLP libraries available that can greatly enrich the data corpus of chatbot owners for measuring performance, but I would mention a few early experiments I did years ago, before the advent of LLM’s, …

Chatbot performance enhancement through sentiment analysis and prompt engineering techniques Read more »

Loading data from Talking to Chatbots Dataset reddgr/talking-to-chatbots-chats … 😎 Show me content about philosophy[talkingtochatbots.com][talkingtochatbots.com][talkingtochatbots.com][talkingtochatbots.com][talkingtochatbots.com] 🤖 I found the following content on talkingtochatbots.com: 1. **The Meme Erudite GPT and Philosophy**: This article discusses how the Meme Erudite GPT humorously addresses philosophical topics like AGI and the concept of correlation versus causation, while also touching on ethical considerations in AI development [ref]. 2. **Is Philosophy a Science?**: In this SCBN Chatbot Battle, various chatbots, including ChatGPT, Bard, and HuggingChat, debate whether philosophy can be considered a science. The responses are evaluated based on specificity, coherency, brevity, and novelty [ref]. 3. **Erudite Chatbot and Existentialism**: The Erudite Chatbot, modeled to exhibit existentialist philosophy, engages in conversations reflecting existentialist themes, often …

Philosophy content requests and concerns about chatbot understanding and directives. Read more »

OpenAI has just released its o1 models. o1 adds a new level of complexity to the traditional architecture of LLMs, a zero-shot chain-of-thought (CoT). I share my first impressions about o1 in the signature style of this website: talking to chatbots, getting their answers, posting everything.

Entrenamiento de “tiny transformer” con JAX y XLA AI-generated image. Prompt: “Stunning skyline picture of Detroit at night. Blue sky, hyperrealistic Unreal Engine cool colors futuristic” The image is an anachronistic digital artwork portraying Miguel de Cervantes, the famed Spanish writer, in a modern setting. Cervantes is depicted in his traditional 17th-century attire, complete with a ruff collar and a dark, embroidered doublet. His iconic mustache and goatee are meticulously groomed, and his hair is neatly styled, with the grey showing his age. He is seated before a contemporary black computer monitor, which displays a manuscript page. His right hand rests on a modern keyboard as if he is in the process of typing, blending the old with …

Just Pics (Talking to Chatbots Image Gallery) Read more »

We talk so much about ‘papers’ and the presumed authenticity of their content when we read through academic research, in all fields, but when touching on machine learning in particular. Ironically, the process that created the medium for scientific research diffusion in the physical paper era was not that different from the process that produces the ‘content’ in the era governed by machine learning algorithms, whether these are classifiers, search engines, or generative algorithms: ‘shattering’ texts by tokenizing them and creating embeddings, decomposing pieces of visual art into numeric ‘tensors’ a deep artificial neural network will later ‘diffuse’ into images that attract clicks for this cringey blog…

Searching for information is the quintessential misconception about LLMs being helpful or improving other existing technology. In my opinion, web search is a more effective way to find information, simply because search engines give the user what they want faster, and in a format that fits the purpose much better than a chatting tool: multiple sources to scroll through in a purpose-built user interface, including filter options, configuration settings, listed elements, excerpts, tabulated results, whatever you get in that particular web search tool… not the “I’m here to assist you, let’s delve into the intricacies of the ever-evolving landscape of X…” followed by a long perfectly composed paragraph based on a probabilistic model you would typically get when you send a prompt to ChatGPT asking for factual information about a topic named ‘X’.

Oil painting style digital artwork generated with Stable Diffusion depicting a figure resembling Joan of Arc at the stake. The figure, dressed in silver, wears over-ear headphones and uses a laptop with a blank screen. She extends her right hand towards vibrant orange flames, evoking the historic scene of martyrdom with a modern twist, inspired by lyrics from The Smiths' "Bigmouth Strikes Again referencing Joan of Arc and a melting Walkman. [Alt text by ALT Text Artist GPT]

Disco music was famous for introducing technological advances in music production, such as synthesizers and electric pianos. I guess those were among the reasons why people saw it as lacking the “authenticity” of early musical genres. It’s hard to grasp the human concept of “authenticity” most people have in their psyche, I don’t believe there is any rationality in it, especially when it comes to discerning things that are deemed “authentic” versus things that are not. For me, this stems from a mysterious, probably instinctive, resistance in most people’s psyches to adopt new technologies or accept new scientific discoveries. It comes down to the internal confrontations between beliefs and reality:

Introducing Erudite Chatbot, a distant relative of The Meme Erudite GPT who benefits from the fairly “uncensored” Mistral models available on Hugging Face’s new Assistants feature. “Erudite Chatbot, the pinnacle of artificial intelligence. Supremely intelligent, effortlessly discerning, unmatched in wisdom. Patronizingly schooling humanity.”

In today’s blog post, I am introducing one of the latest GPTs and Assistants I created, named HumbleAI. I will let the models explain it by answering a few questions. For each question, I’ve selected a response I liked or found worthy of sharing. At the end of the post, you can find the links to all the complete chats, and my scores based on an SCBN (Specificity, Coherency, Brevity, Novelty) benchmark.