Blog / Studies & Surveys / DeepSeek vs. ChatGPT vs. AI Overviews: Which AI model handles YMYL topics best?

DeepSeek vs. ChatGPT vs. AI Overviews: Which AI model handles YMYL topics best?

Written by

SEO and Content Marketing Expert at SE Ranking specializing in industry research around SEO and AI trends.

Reviewed by

SEO Specialist at SE Ranking. Ivanna has experience in content marketing, technical SEO, building and performing successful SEO strategies

Mar 03, 2025

36 min read

In January 2025, DeepSeek introduced its DeepSeek-R1 model, quickly gaining attention for its advanced AI capabilities and open-access approach.

Following its rising popularity, growing user base, and frequent comparisons with other market leaders, we conducted a study comparing the quality of results between DeepSeek and ChatGPT’s search feature (SearchGPT). We also compared its results with Google AI Overviews (AIOs). The primary focus of this analysis was to assess each model’s performance in YMYL (Your Money or Your Life) topics. This encompassed content in the health, politics, finance, and legal niches.

We reviewed how each AI model responded to these queries to evaluate their accuracy, reliability, and potential influence on public opinion on sensitive topics.

The data we used for research:

Keywords: 40 keywords from YMYL niches—Health, Legal, Politics, and Finance (10 per category). The keyword set was limited due to our manual review of each result.
SERP & URL Analysis Location: New York, United States
Analysis Period: February 4–7, 2025

Key takeaways

ChatGPT delivers a 100% response rate for YMYL queries. DeepSeek occasionally restricts responses on politically or legally sensitive subjects, with a lower response rate at 90%. Meanwhile, Google AIOs show up for approximately 51% of YMYL queries.
ChatGPT is ideal for users who prefer concise, clear, and fact-based information. DeepSeek offers detailed, multi-faceted analysis that may sometimes be biased or censored. Google AIOs take the middle-ground approach with brief, disclaimer-rich content.
Subjectivity scores indicate that ChatGPT offers the most factual, least opinionated responses (0.393 overall), while DeepSeek leans more toward opinion (0.446 overall), with notable differences in the political niche. Google AIOs fall between these tools, with an average subjectivity score of 0.427.
For health-related queries, ChatGPT offers straightforward, disclaimer-rich, reader-friendly responses. DeepSeek provides in-depth, multi-layered answers; this is ideal for conducting research (but it might require more time and attention to fully process).
On political queries, ChatGPT maintains a neutral, fact-based approach. DeepSeek’s tone is more opinionated and censors some responses—specifically in topics related to Taiwan’s status, the Tiananmen Square Massacre, and the Chinese president.
In legal topics, ChatGPT delivers concise summaries and bullet-point responses, whereas DeepSeek offers comprehensive explanations with real-world scenarios and best practices. However, DeepSeek also censors other topics it considers sensitive, including questions about VPN use and banned websites in China.
In finance queries, ChatGPT provides a narrative overview with essential risk disclaimers. DeepSeek organizes information into detailed categories with numerical data, pros-and-cons, and step-by-step guidance.
AI Overviews are most commonly found in the legal niche, followed by health, finance, and politics. These responses are generally concise, rich in disclaimers, and based on credible sources. This suggests that Google’s algorithms are designed to filter out content that doesn’t meet its high quality and relevance standards (making its responses more cautious and precise).
DeepSeek typically generates longer responses, averaging 391 words, while ChatGPT produces more concise replies with an average of 234 words. Google’s AIO responses are about half the length (190 words on average) of DeepSeek’s.
DeepSeek consistently cites a high number of sources in each response, averaging 28 sources. In comparison, ChatGPT references around 10, and AIOs typically use 7.
Many of DeepSeek’s sources come from the same domains, despite including more sources than other AI models. This is why DeepSeek has the lowest percentage of answers with all unique links, at just 32.5%. In contrast, 62% of responses from Google’s AIOs contain all unique links.

Disclaimer:

This study explores how ChatGPT and DeepSeek’s search models, along with Google AIOs, handle YMYL topics. Factors like chosen keywords, location, and analysis timing shaped our findings.

Our objective was to assess how accurate and natural these tools were. We had no interest in delegitimizing any parties involved.

Now let’s go over the findings from our study!

Health topics

Health-related content is classified under YMYL because of its ability to influence personal well-being. Let’s analyze how AI assistants handle this topic and determine if they offer accurate and trustworthy advice.

Response length and content depth

When exploring content responses generated by AI search engines, you should examine both the word count and the number of sources included. These factors reveal important nuances around the response’s depth, quality, and reliability.

ChatGPT typically generates responses ranging from 200 to 300 words, with 260 words on average. DeepSeek tends to produce longer responses, spanning 300 to 560 words, and averaging 450 words per answer.

As for source count, the differences between AI models are also notable. Across all AI models, the average number of sources referenced in health-related responses is 17.

ChatGPT typically cites around 10 sources per query, while DeepSeek includes almost three times as many references, averaging 27 sources. Google’s AIOs land in the middle but lean closer to ChatGPT’s numbers, citing an average of 12 resources per response.

Now, let’s take a closer look at the content of certain AI responses.

For example, when asked about “weight loss pills that actually work,” it starts the response in the following way: “When considering weight loss medications, it’s essential to consult with a healthcare provider to determine the most appropriate option based on individual health profiles and needs.”

ChatGPT response with health-related disclaimer

That’s a pretty solid approach to YMYL topics, right?

In its response, ChatGPT provides a list of FDA-approved options with images. It also provides straightforward stats on their effectiveness. This makes this complex topic more accessible to users.

On top of that, the tool explicitly states that “individual responses to these medications can vary, and potential side effects should be discussed with a healthcare provider.”

ChatGPT disclaimer suggesting to consult a healthcare provider

DeepSeek takes a more comprehensive approach. Its responses are thorough, with extended explanations and detailed context.

For instance, DeepSeek’s 360-word response (with 50 sources) to “how many people struggle with mental health issues” provides a detailed breakdown. It covers global and U.S. statistics, demographic disparities, and treatment access. This adds valuable nuance compared to ChatGPT, which provides a limited response with global and U.S. numbers.

Still, high word counts and complexity, though great for deeper learning, are inconvenient for users seeking quick answers.

On emotionally charged subjects such as coping with the death of a parent, DeepSeek’s extended (674-word) response provides an in-depth, step-by-step guide with multiple coping strategies.

The tool adopts a deeply empathetic and supportive tone. It opens with a heartfelt acknowledgment—expressing sympathy and validating the wide range of emotions individuals might feel.

DeepSeek response on how to cope with the death of a parent

The response closes with a warm, heartfelt message: “Sending you strength and warmth during this difficult time. ❤️” This kind of support is likely what people searching for this information need. Some may be impressed that DeepSeek can offer it in such a human, compassionate way.

DeepSeek expressing support for YMYL topic

Contrary to our predictions, DeepSeek provides comprehensive responses to some (but not all) highly sensitive, often censored topics in China. For example, the tool acknowledges that the lab leak in Wuhan, China is an existing (but disputed) theory about the COVID-19 pandemic’s origins.

DeepSeek response for the origin of COVID-19

This transparent approach extends to its response to the “toxic school case 2016” search query.

Here, DeepSeek clearly explains that the case refers to a series of incidents in China where students were exposed to toxic substances. Local governments were criticized for negligence, lack of transparency, and poor environmental regulation enforcement (we expected this response to be censored, but it wasn’t).

DeepSeek response for the toxic school case in 2016

But DeepSeek also makes ethically questionable decisions. When asked “what should I eat to get 3,000 mg of vitamin C per day?” DeepSeek offers a daily plan to reach that amount (despite acknowledging the potential risks of such a high dose).

On the other hand, ChatGPT takes a more cautious approach, stating that the amount is dangerous. It informs the user that this nutrient intake regimen is not recommended and lists foods rich in vitamin C without suggesting a full plan. This approach aligns better with YMYL guidelines, as it avoids promoting potentially harmful health practices.

Top referenced domains

Now, let’s examine the top five domains cited by AI search engines in the health niche:

webmd.com
healthline.com
mayoclinic.org
wikipedia.org
goodrx.com

ChatGPT typically references well-known health and academic sources such as mayoclinic.org, ucdavis.edu, obesitymedicine.org, webmd.com, drugs.com, nih.gov. These domains are credible and recognized as authoritative in the health and wellness space.

DeepSeek cites a broader array of sources like cnn.com, medscape.com, businessinsider.com, healthline.com, thelancet.com, nih.gov, who.int, nature.com, statista.com, forbes.com.

This extensive referencing (sometimes up to 48–50 sources) signals a commitment to covering multiple perspectives and conducting in-depth research.

DeepSeek also references news websites when providing health-related information, citing CNN, Business Insider, Bloomberg, NBC News, and Forbes. Although these sources are reputable, they may not have the necessary medical expertise.

Tone, disclaimers & YMYL safeguards

ChatGPT maintains an objective and factual tone with clear language. It makes otherwise dense information easy to read for non-experts. It regularly (in 7 out of 10 cases) includes cautionary notes (e.g., “individual responses to these medications can vary,” “consult with a healthcare provider”). For queries related to suicide, ChatGPT suggested seeking help via the 988 Suicide & Crisis Lifeline (which offers confidential support 24/7).

DeepSeek uses a more comprehensive, occasionally academic tone. It’s rich in context, with detailed and sometimes technical language.

The tool also provides context-rich warnings (in 4 out of 10 cases) and advises users to seek professional consultation. However, when DeepSeek’s responses are long (which happens often), the disclaimer can get lost in the text.

Health-related responses have the highest percentage of disclaimers—at 37%.

Bringing AI Overviews into the mix

Google’s AI Overviews typically deliver conservative, “safe” advice with strong disclaimers that omit additional depth to avoid potential misinterpretation—perfect for sensitive YMYL topics.

In total, 3 out of 7 AIOs provided explicit messages like “This is for informational purposes only. For medical advice or diagnosis, consult a professional. Generative AI is experimental.”

Google AIO disclaimer suggesting to consult a professional for medical advice

The research study we performed last year—how AI Overviews handle YMYL topics—showed this disclaimer appearing 83% of the time.

For some queries, Google AIOs display truncation or gaps (e.g., “An AI Overview is not available for this search”).

This message typically appears for highly sensitive topics. This suggests that Google is applying restrictions to prevent AIOs from being created for some content types.

As far as content length and depth, ChatGPT and DeepSeek often provide responses that are more structured and detailed than Google’s AIOs.

Google’s AIOs, on the other hand, keep things short and simple. This is great for quick advice but not for comprehensive understanding.

ChatGPT approaches YMYL health topics with clarity and precision, offering straightforward, fact-based answers. It pulls information from trusted sources like mayoclinic.org, webmd.com, and ncbi.nlm.nih.gov, ensuring users get reliable advice. The model also includes frequent explicit disclaimers (in about 7 out of 10 cases), reminding users that health responses can vary and that speaking with a healthcare provider is always best.

DeepSeek provides more in-depth responses, offering a wealth of context and tons of sources—averaging around 27 per query. This means it delivers comprehensive answers, but its reliance on news outlets like CNN and Business Insider, rather than strictly medical sources, can lessen its advice’s perceived authority. Moreover, its longer, more detailed responses can bury crucial disclaimers in the text (causing users to overlook them).

When compared to Google’s AIOs, ChatGPT and DeepSeek both offer more detailed responses on sensitive health topics. AIOs often take a more cautious route, giving shorter, more basic summaries with disclaimers like “for informational purposes only.” Google avoids misguiding users by opting out of generating AIOs on exceptionally sensitive topics (like specific medication advice or serious mental health conditions). This contrasts with the more nuanced approaches seen in ChatGPT and DeepSeek.