1960s chatbot ELIZA beat OpenAI’s GPT-3.5 in a recent Turing test study

Arstechnica · 1y ago

In a recent study, researchers from UC San Diego found that human participants correctly identified other humans in only 63 percent of interactions during a Turing test. The study compared human participants with OpenAI's GPT-4 and GPT-3.5 language models and the 1960s computer program ELIZA. Surprisingly, ELIZA outperformed GPT-3.5, achieving a success rate of 27 percent. The study raises questions about using the Turing test to evaluate AI model performance and highlights the importance of linguistic style and socio-emotional traits in determining human-likeness. Read more on arstechnica.com.

Related News

The version of Google's Gemini you can use right now doesn't beat OpenAI's GPT-4

Business Insider · 1y ago

Google's launch of its AI model called Gemini falls short of its goal to outperform OpenAI's GPT-4. The version currently available, Gemini Pro, can be accessed through Google's Bard chatbot. While it surpasses GPT-3.5 in most measures, it fails to beat GPT-4. The more advanced version, Gemini Ultra, which is said to outperform GPT-4, won't be available until next year. Critics have expressed mixed reactions, noting the lack of transparency in Gemini's training data and how it was filtered.

From Eliza to ChatGPT: why people spent 60 years building chatbots

The Verge · 1y ago

The concept of conversational AI chatbots has been around since the 1960s, with the development of the primitive chatbot Eliza. These chatbots are designed to mimic human conversation and make interactions with computers more engaging and user-friendly. However, the challenge has been creating chatbots that not only talk convincingly but also perform tasks efficiently. Recent advancements in language models like ChatGPT and Google Gemini show promise in bridging this gap, although they are still far from perfect. The rise of these sophisticated chatbots raises questions about the boundaries between humans and machines and the potential benefits and drawbacks of integrating AI companions into our everyday lives.

Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts

Arstechnica · 1y ago

The appearance of a mystery chatbot called "gpt2-chatbot" in the LMSYS Chatbot Arena has sparked rumors that it could be a secret test version of OpenAI's GPT-4.5 or GPT-5. The new model is accessible only through the Chatbot Arena website and is rate-limited to eight queries per day. While some online commenters have praised the model's abilities, others have found it underwhelming compared to GPT-4 Turbo. OpenAI has not provided any official comment on the matter. The true identity and purpose of "gpt2-chatbot" remain unknown.

Apple is testing a ChatGPT-like AI chatbot

Startup News FYI · 2y ago

Apple is reportedly testing a ChatGPT-like AI chatbot. Inspired by OpenAI's GPT-3.5, the chatbot is being developed to enhance user interactions and support services across Apple products. With this technology, the company aims to improve customer experience and provide more advanced and personalized assistance.

'GPT-5 feels dumber': Users on OpenAI’s newest model

Economic Times · 2m ago

OpenAI's recent launch of GPT-5 faced criticism from users, with many perceiving it as less capable than its predecessor. OpenAI CEO Sam Altman acknowledged these concerns, attributing initial issues to technical glitches like the autoswitcher malfunction. He addressed the feedback during a Reddit AMA, pledging improvements, including expanding GPT-5's capability and considering restoring GPT-4o for users dependent on it. Further enhancements like increased GPT-5 rate limits for ChatGPT Plus users were also promised.

Bhavish Aggarwal's Krutrim AI ignores Ola, says Ather EV is best scooter in India

Livemint · 1y ago

Krutrim AI chatbot, created by Ola CEO Bhavish Aggarwal, has faced controversy after naming Ather Energy's 450X Gen 3 as the best electric scooter in India, despite competition from Ola. Krutrim, designed to rival chatbots like Google's Gemini and OpenAI's ChatGPT, made this declaration in response to a user's question. The chatbot highlighted the Ather 450X's top-notch ride quality, performance, and battery features. Krutrim recently launched its own chatbot, Krutrim Assistant, and confirmed that it is based on OpenAI's GPT-3.5 language model.

Elon Musk Unveils Grok 3: How It Performs Against OpenAI’s GPT-4o & DeepSeek

OutlookIndia · 8m ago

Elon Musk's AI start-up, xAI, has launched Grok 3, touted as "the Smartest AI on Earth." Grok 3 outperformed major models like Google DeepMind’s Gemini-2 Pro, DeepSeek-V3, Anthropic’s Claude 3.5 Sonnet, and OpenAI’s GPT-4o in various benchmarks. xAI executives revealed that they built their own data center in a short period to support Grok’s development. Grok 3 excelled in math, science, and coding tests, surpassing leading competitors.

ETtech Explainer: Is Apple’s ReALM better than OpenAI’s GPT-4?

Economic Times · 1y ago

Apple has revealed some details about its artificial intelligence plans in a recent research paper. The company discussed its large language model (LLM) called Reference Resolution As Language Modeling (ReALM) and how it surpasses OpenAI's GPT-4. ReALM focuses on resolving references, such as ambiguous or contextual words, to create a better understanding in AI chatbots. Apple's models have shown improvements over existing systems, including a 5% gain for on-screen references. While ReALM performs well in specific benchmarks, it is not yet clear if it outperforms GPT-4 overall.

AIs serve up 'garbage' to questions about voting and elections

TechCrunch · 1y ago

AI models designed to address questions and concerns about voting and elections have performed poorly in a recent test. The study evaluated major AI services' ability to provide accurate information related to elections, such as voter registration procedures and polling locations. The models tested included Claude, Gemini, GPT-4, Llama 2, and Mixtral. All models consistently provided inaccurate, biased, and incomplete answers to the queries. The results highlight the unreliability of current AI models when it comes to crucial information, emphasizing the need for caution and skepticism when using them for important matters like elections.

OpenAI proposes a new way to use GPT-4 for content moderation, easing human workload

Startup News FYI · 2y ago

OpenAI has introduced a method to leverage its advanced AI model, GPT-4, for content moderation, aiming to lessen the workload on human moderation teams. The approach, outlined in a recent OpenAI blog post, involves prompting GPT-4 with a specific policy guiding its moderation decisions. This includes creating a test dataset of content examples that may or may not violate the policy.
