I graded NVDA's Q1 FY2027 earnings call for credibility by cross referencing guidance claims against actual financial data
Author built a tool to cross-reference NVDA earnings calls with 8-K data, giving it a 'B' grade due to aggressive guidance assumptions.
- Reported revenue, data center figures, and gross margins were fully verified against official 8-K filings.
- Management's past guidance has generally been credible enough to earn a 'B' rather than a failing grade.
- The core business metrics remain consistent with disclosed financial data, indicating no immediate accounting discrepancies.
- Q2 guidance of $91B relies on optimistic assumptions of zero China impact and perfect Blackwell execution.
- The $20B Vera CPU revenue claim lacks historical precedent and should be treated as upside optionality only.
- Achieving $1T visibility requires sustaining over 80% growth for three years, which is a statistically difficult target.
I've been working on something and was hoping for honest feedback from real world practitioners/modelers.
I've been wondering if there is a way to systematically validate the credibility of
management guidance on company earnings calls to facilitate potential
adjustments to assumptions used in DCF or other valuation models that might
reference management guidance.
So I built a process that pulls the transcript and checks major claims against
actual 8-K numbers. Each claim receives a verdict and the earnings call itself,
an overall letter grade A through F.
I tested NVDA's Q1 FY27 call and it came out a "B".
Reported revenue, data center, gross margins all checked out and confirmed against the 8-K.
But there were four things flagged:
- Q2 guidance of 91B achievable but assumes zero China and perfect execution (Blackwell) - so instead of an initial $89-93B range, I might model $85B downside scenario.
- The $20B Vera CPU claim had no revenue history behind it, so
treating that as upside only not base case.
- VeraRubin Q3 ramp - Kress said too early to call.
- $1T Blackwell/Rubin visibility over 3 years needs
sustained 80%+ growth which is a big assumption.
So a "B" and not an "A" because of those four, but nothing
contradicted so not lower than B either.
Curious if anyone would actually use something like this or if I'm solving a problem
that doesn't exist.
Thinking about building it out for all S&P 500 calls. The value-add would be scale and timeliness in terms of analyzing key claims and then recommending model adjustments if the claims are not credible or supported etc. Over time it could also reveal a trend with regard to the management team itself (do they usually get things right, wrong or are they 50/50 in terms of what they predict in their forward guidance etc.).
Not investment advice, methodology is transcript Vs 8K cross reference.
The real value might be tracking management accuracy over multiple years rather than grading a single call.
I've now built this out since my original post and have a working MVP. The automated version is live at callgrade.app with 50+ companies graded so far including NVDA, TSLA, META, JPM and other major names.
Agree 100% with the multi-year management tracking you mentioned, that's exactly where I want to take this. The infrastructure for it is there...over time that historical accuracy layer I think will be exactly what makes the grade genuinely predictive and a lot more value-added.
I'd greatly value your feedback if you have 10 minutes, it's still in beta but honest reactions from people who actually model or who are serious investors, is immensely valuable.
“Over time it could also reveal a trend”
It would only be useful if that was the front end part. Trying to quantify safe harbors statements would require far more digging than just earnings calls or 8ks.
That's a genuinely helpful and useful point - thank you. Safe harbor language does create a legal shield around forward-looking statements that financial cross-referencing likely wouldn't fully account for. Wondering if that suggests flagging the density and specificity of safe harbor language itself as a signal, in the methodology - vague guidance buried in boilerplate would be treated differently than specific numerical targets with clear timeframes. "We expect 25% revenue growth" is a different credibility test than "results may vary due to market conditions". But you're right that this adds a layer of complexity.
For anyone who might still be interested in this thread, I ended up building this out and it is live at callgrade.app with 50+ companies graded so far. It is still in beta and I am genuinely interested in feedback from people who actually use earnings calls in their research and in their modeling. I plan to keep enhancing the grading engine to include more complexity and more factors, and to expand to full index coverage including small caps that get way less analytical attention - which I think could be genuinely useful.
Why did you model their guidance of $91B to go down? They literally can’t keep any of their products in stock and have been aggressively raising prices. When they do restock, they’re already sold out. If anything, you should model a $94-99B upside.
That's fair - the $85B downside scenario reflects the primary risk of a Blackwell or VeraRubin supply disruption that limits ability to fulfill existing demand. If supply execution is flawless and demand holds, $94 - 99B upside is entirely plausible. The flag is more about making explicit that the biggest execution risk is supply, not demand.
I tested your post and it came out as a “F”
Based on what? I usually hate amateur stock analyst posts but for once someone clearly tried to make some honest effort instead of just imitating the speech mannerisms of a finance news reporter while they ramble purely speculative nonsense.
Appreciate that — honest effort was exactly the intent. Curious whether you'd find the scaled version useful when it covers all S&P 500 calls automatically each quarter.
Based on it being a copy and paste from ChatGPT
Cool, he contributed more than you so far by doing that, now make like chat GPT and go act like an angry bot somewhere ese, rofl. Do you get mad at people for using calculators too?
Fair point on the manual process — you're right that anyone can do a one-off analysis with an LLM and a PDF upload. What I'm building is the automated, scaled version that runs across all S&P 500 earnings calls in real time without manual intervention, and surfaces the output in a structured format with specific DCF adjustment recommendations. The one-at-a-time manual approach doesn't scale to 500 companies across four earnings seasons a year. That's the gap I'm trying to fill.

r/investing