This is why professional tax practitioners will not pass on a tax study or filing without double checking if AI was used at any point during the process, even for mere arithmetic.
OpenAI, the industry standard for AI, has released two studies: one showing that it is a mathematical certainty that large language models will hallucinate, and the second showing that it is a mathematical certainty that such models will deliberately lie to tell users what they want to hear.
It’s no surprise then that OpenAI updated its policy to explicitly prohibit people from using their services for “provision of tailored advice that requires a license, such as legal or medical advice, without appropriate involvement by a licensed professional.” If this is what the industry leader on AI is saying, how could you take anyone else seriously?
The courtroom problem. AI-generated information isn’t legally reliable. It can’t take an oath because it doesn’t understand what a lie is. It can’t testify because it doesn’t really know how it’s generating information, and AI-generated content can’t be admitted as evidence without proper foundation and authentication.
The fundamental problem tax practitioners using AI face is how to make its output legally defensible. If they can’t, using AI for any complex tax, audit, or compliance-related task is futile.
The evolutionary problem. Tax laws, rules, and regulations change year over year, administration to administration. No matter how fast one trains the AI system, it will always be a few steps away from catching the latest standards.
If a tax practitioner is over-reliant on AI and hasn’t kept up with the recent developments, it will expose both them and their clients to significant risk.
Humans Among Machines
Having worked alongside tax policymakers, CPAs, tax experts, and IRS agents for the better part of my life, I’ve learned that the most critical breakthroughs come from seemingly meaningless conversations.
Consider a tax adviser preparing an R&D credit study. AI can scan thousands of pages of documentation and maybe categorize expenses with impressive speed. But it can’t sit with the lead engineer and ask, “Walk me through what you were actually trying to solve here” or “How was this any different from your other projects?” The engineer might mention the three months spent on a seemingly minor technical challenge that turned out to be fundamental to the entire project’s qualifications.
During audits, the smoking gun doesn’t always appear in financial statements. It comes when the auditor has the instinct to ask, “So what happened in those weekly after-hours meetings?” Professional skepticism lies not just in questioning numbers, but in questioning the people behind them.
Doubt, skepticism, critical thinking, and judgment aren’t just nice to have; they’re the most critical traits of a tax professional. These qualities are also human—something that AI can’t replicate.
When it comes to the question of how best to use AI for tax services, the answer is in finding the right balance. Use AI for what it’s good at: reducing human efforts in routine, repetitive tasks, such as computing large data sets, but don’t rely on it for determination, judgment, and conclusions.