🚀 Medial Secures Investment on Shark Tank India - Fueling the Future of Professional Social Networking. 🔥
✕
Login
Home
News
Messages
Startup Showcase
Trackers
Premium
Premium Content
Jobs
Notifications
Settings
Try our Valuation Calculator →
Log In
News on Medial
1960s chatbot ELIZA beat OpenAI’s GPT-3.5 in a recent Turing test study
Arstechnica
·
1y ago
Medial
In a recent study conducted by researchers from UC San Diego, it was found that human participants correctly identified other humans in only 63 percent of interactions during a Turing test. The test compared OpenAI's GPT-4 AI language model with human participants, GPT-3.5, and the 1960s computer program ELIZA. Surprisingly, ELIZA outperformed the AI models, achieving a success rate of 27 percent. The study raises questions about using the Turing test to evaluate AI model performance and highlights the importance of linguistic style and socio-emotional traits in determining human-likeness. Read more on arstechnica.com.
View Source
Related News
The version of Google's Gemini you can use right now doesn't beat OpenAI's GPT-4
Business Insider
·
1y ago
Medial
Google's launch of its AI model called Gemini falls short of its goal to outperform OpenAI's GPT-4. The version currently available, Gemini Pro, can be accessed through Google's Bard chatbot. While it surpasses GPT-3.5 in most measures, it fails to beat GPT-4. The more advanced version, Gemini Ultra, which is said to outperform GPT-4, won't be available until next year. Critics have expressed mixed reactions, noting the lack of transparency in Gemini's training data and how it was filtered.
View Source
From Eliza to ChatGPT: why people spent 60 years building chatbots
The Verge
·
1y ago
Medial
The concept of conversational AI chatbots has been around since the 1960s, with the development of the primitive chatbot Eliza. These chatbots are designed to mimic human conversation and make interactions with computers more engaging and user-friendly. However, the challenge has been creating chatbots that not only talk convincingly but also perform tasks efficiently. Recent advancements in language models like ChatGPT and Google Gemini show promise in bridging this gap, although they are still far from perfect. The rise of these sophisticated chatbots raises questions about the boundaries between humans and machines and the potential benefits and drawbacks of integrating AI companions into our everyday lives.
View Source
Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts
Arstechnica
·
1y ago
Medial
The appearance of a mystery chatbot called "gpt2-chatbot" in the LMSYS Chatbot Arena has sparked rumors that it could be a secret test version of OpenAI's GPT-4.5 or GPT-5. The new model is only accessible through the Chatbot Arena website and has a limited rate of eight queries per day. While some online comments have praised the model's abilities, others have found it to be underwhelming compared to GPT-4 Turbo. OpenAI has not provided any official comment on the matter. The true identity and purpose of "gpt2-chatbot" remain unknown.
View Source
Apple is testing a ChatGPT-like AI chatbot
Startup News FYI
·
2y ago
Medial
Apple is reportedly testing a ChatGPT-like AI chatbot. Inspired by OpenAI's GPT-3.5, the chatbot is being developed to enhance user interactions and support services across Apple products. With this technology, the company aims to improve customer experience and provide more advanced and personalized assistance.
View Source
Bhavish Aggarwal's Krutrim AI ignores Ola, says Ather EV is best scooter in India
Livemint
·
1y ago
Medial
Krutrim AI chatbot, created by Ola CEO Bhavish Aggarwal, has faced controversy after naming Ather Energy's 450X Gen 3 as the best electric scooter in India, despite competition from Ola. Krutrim, designed to rival chatbots like Google's Gemini and OpenAI's ChatGPT, made this declaration in response to a user's question. The chatbot highlighted the Ather 450X's top-notch ride quality, performance, and battery features. Krutrim recently launched its own chatbot, Krutrim Assistant, and confirmed that it is based on OpenAI's GPT-3.5 language model.
View Source
Elon Musk Unveils Grok 3: How It Performs Against OpenAI’s GPT-4o & DeepSeek
OutlookIndia
·
5m ago
Medial
Elon Musk's AI start-up, xAI, has launched Grok 3, touted as "the Smartest AI on Earth." Grok 3 outperformed major models like Google Deepmind’s Gemini-2 Pro, DeepSeek-V3, Anthropic’s Claude 3.5 Sonnet, and OpenAI’s GPT-4o in various benchmarks. xAI executives revealed that they built their own data center in a short period to support Grok’s development, and Grok 3 excelled in math, science, and coding tests, surpassing leading competitors in performance.
View Source
ETtech Explainer: Is Apple’s ReALM better than OpenAI’s GPT-4?
Economic Times
·
1y ago
Medial
Apple has revealed some details about its artificial intelligence plans in a recent research paper. The company discussed its large language model (LLM) called Reference Resolution As Language Modeling (ReALM) and how it surpasses OpenAI's GPT-4. ReALM focuses on resolving references, such as ambiguous or contextual words, to create a better understanding in AI chatbots. Apple's models have shown improvements over existing systems, including a 5% gain for on-screen references. While ReALM performs well in specific benchmarks, it is not yet clear if it outperforms GPT-4 overall.
View Source
AIs serve up 'garbage' to questions about voting and elections
TechCrunch
·
1y ago
Medial
AI models designed to address questions and concerns about voting and elections have performed poorly in a recent test. The study evaluated major AI services' ability to provide accurate information related to elections, such as voter registration procedures and polling locations. The models tested included Claude, Gemini, GPT-4, Llama 2, and Mixtral. All models consistently provided inaccurate, biased, and incomplete answers to the queries. The results highlight the unreliability of current AI models when it comes to crucial information, emphasizing the need for caution and skepticism when using them for important matters like elections.
View Source
OpenAI proposes a new way to use GPT-4 for content moderation, easing human workload
Startup News FYI
·
1y ago
Medial
OpenAI has introduced a method to leverage its advanced AI model, GPT-4, for content moderation, aiming to lessen the workload on human moderation teams. The approach, outlined in a recent OpenAI blog post, involves prompting GPT-4 with a specific policy guiding its moderation decisions. This includes creating a test dataset of content examples that may or may not violate the policy.
View Source
Before launching, GPT-4o broke records on chatbot leaderboard under a secret name
Arstechnica
·
1y ago
Medial
OpenAI's newly announced GPT-4o AI model, previously disguised as "gpt-chatbot," has topped the leaderboard on the Chatbot Arena website. The leaderboard score for GPT-4o is the highest ever documented. The AI model underwent testing under various names, frustrating AI experts who criticized the lack of transparency. Ultimately, GPT-4o surpassed the previous models, Claude 3 Opus and GPT-4 Turbo, by a considerable margin. The success of GPT-4o on the Arena highlights the model's capabilities and its strong performance.
View Source
Trackers
Active Indian VC’s
OG Capital
Email
With a hands-on approach, OG Capital aims to invest in over 20 promising...
Accel Partners
Email
Early and growth-stage investments in disruptive technology companies with...
Blume
Email
Early-stage venture capital firm investing in technology startups in India. Focus on...
Access All Trackers
Startup Showcase Winners
June 2025
Buddy
Helping your parents when you are miles away
BiteStop
The Pit Stop Your Cravings Deserve
Bloomer
The next generation E-commerce platform
Enter Ongoing Startup Showcase
Top Users
Trending News on Medial
Download the medial app to read full posts, comements and news.
Go to Medial App
Not Now
Know everything that’s happening in the startup ecosystem, first.
Enable Notifications?
No, thanks
Count me in