
Do You Use ChatGPT For Work? This Study Says It Gets Things Wrong 30% Of The Time

A new study ranks popular AI chatbots by reliability, and ChatGPT, despite its massive popularity, didn't even make the top five. The winner might shock you.

Key points generated by AI, verified by newsroom
  • Perplexity AI leads AI chatbot reliability, scoring 85/100.
  • It boasts the lowest hallucination rate, at 13%, and 100% uptime.
  • Grok and DeepSeek follow, outperforming popular ChatGPT and Gemini.

With AI chatbots becoming a regular part of work life, a new study has taken a closer look at which ones actually hold up when it matters. The findings, from US-based digital marketing agency Legal Guardian Digital, rank popular AI assistants based on how reliable they are for everyday tasks. The results may surprise many users: Perplexity AI came out on top, ahead of more widely recognised names like ChatGPT, Google Gemini, and Claude. 

The study assessed chatbots on hallucination rates, customer satisfaction, response consistency, and uptime reliability.

How Did Each Chatbot Perform In The Reliability Rankings?

According to the study, Perplexity AI recorded the lowest hallucination rate among major chatbots at 13%, well below the industry average of 22%. The platform also maintained a 100% uptime rate during the testing period, meaning it did not go down once. 


Elon Musk's Grok ranked second, with a hallucination rate of 15% and a perfect uptime score as well. Chinese AI chatbot DeepSeek came in third, with a 14% hallucination rate and a 99.52% uptime rate.

Perplexity AI achieved the highest overall reliability score of 85 out of 100. Grok followed at 79, and DeepSeek came in at 76.

What About ChatGPT, Gemini, And The Rest?

Despite being the most widely used AI chatbot globally, ChatGPT ranked sixth in the reliability index. The report found that ChatGPT generated incorrect responses in around 30% of cases, roughly double the error rate recorded by DeepSeek. 
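The "roughly double" comparison can be checked with simple arithmetic. The sketch below uses only the hallucination rates and the industry average quoted in this article; the variable and function names are illustrative and not part of the study, and ChatGPT's figure is the article's approximate "around 30%".

```python
# Hallucination rates (percent) as quoted in this article.
# ChatGPT's value is the approximate "around 30%" figure.
rates = {"Perplexity AI": 13, "DeepSeek": 14, "Grok": 15, "ChatGPT": 30}
INDUSTRY_AVERAGE = 22  # percent, as quoted in the article


def ratio_to(name, baseline):
    """Return how many times larger one chatbot's rate is than another's."""
    return rates[name] / rates[baseline]


for name, rate in rates.items():
    diff = rate - INDUSTRY_AVERAGE
    print(f"{name}: {rate}% ({diff:+d} points vs. industry average)")

print(f"ChatGPT vs. DeepSeek: {ratio_to('ChatGPT', 'DeepSeek'):.2f}x")
```

On these figures, ChatGPT's error rate is a little more than twice DeepSeek's and sits eight percentage points above the quoted industry average.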

Even so, ChatGPT scored 4.7 out of 5 in customer ratings. Claude, developed by Anthropic, ranked seventh, with comparatively more outages than its competitors, while Google Gemini ranked eighth and Meta AI came in ninth. ChatGPT scored 50 out of 100 overall, while Gemini received 41.


On pricing, Perplexity AI reportedly costs $40 per month, and Grok costs $30, while DeepSeek is free to use despite its strong reliability ranking.

Austin Hunt, CEO of Legal Guardian Digital, said users often assume ChatGPT is the most reliable AI assistant because of its popularity and early market lead. However, he argued that when it came to reliability and uptime, smaller platforms such as Perplexity and Grok performed better.

Frequently Asked Questions

Which AI chatbot ranked highest in reliability for everyday tasks?

Perplexity AI ranked highest in reliability according to a study by Legal Guardian Digital. It topped the list ahead of other popular AI assistants like ChatGPT and Google Gemini.

What factors were considered in the AI chatbot reliability study?

The study assessed chatbots on hallucination rates, customer satisfaction, response consistency, and uptime reliability to determine their overall performance.

How did ChatGPT perform in the reliability rankings compared to Perplexity AI?

ChatGPT ranked sixth in the study, generating incorrect responses in around 30% of cases. Perplexity AI, by comparison, had a hallucination rate of 13%, the lowest among the major chatbots tested.

What is the cost of Perplexity AI, the top-ranked chatbot?

Perplexity AI reportedly costs $40 per month. DeepSeek, by contrast, is free to use despite its strong reliability ranking.

About the author Annie Sharma

Annie Sharma is a technology journalist at ABP Live English, focused on breaking down complex tech stories into clear, reader-friendly narratives. Having gained hands-on experience in digital storytelling and news writing with leading publications, Annie believes technology should feel accessible rather than overwhelming, and follows a clear, reader-first approach in her work.

For tips and queries, you can reach out to her at annies@abpnetwork.com.
