As companies continue to invest billions in AI, employees and shareholders are more frequently demanding to see tangible results. Yet, at the end of 2025, only 15% of executives reported that AI integrations increased profits. To understand how businesses will close the gap between AI enthusiasm and returns, look to a surprising place beyond the private sector: the Internal Revenue Service.
A government agency might not seem like a typical business, but it faces this same urgency to show progress with AI and challenges in achieving it. I felt this pressure every day when I led the IRS through one of its most ambitious modernization efforts in decades. In 2023, we began deploying AI in targeted ways to improve taxpayer service, compliance, and operational efficiency. From the beginning, accountability was absolute. Each dollar we spent came from taxpayers so every investment had to produce measurable improvements.
Our blueprint became to identify urgent pain points, practically apply AI, measure impact, and build from there. Private businesses can follow the same methodology, but only if they understand how to identify and compound wins from different forms of AI.
The Three Paths to Value
The most successful organizations will derive returns from AI across three lanes:
- General Purpose AI For Daily Productivity
Widely available tools like LLMs and agentic workflows can help employees conduct initial research and coordinate simple tasks to free up their personal bandwidth. The biggest returns from this lane, however, will come from training workers to apply the technology in the best way for their roles.
But much of tax administration operates with a vastly lower tolerance for error; one in which even a low hallucination rate can create unacceptable risk. The IRS needed better AI tools. As companies lean on AI for more sensitive processes and trust it with proprietary information, we’ll see them make the same critical shift from broad applications to purpose-built systems.
- Domain-Specific Systems For Precision
In areas where factual completeness matters such as legal research, tax analysis, or medical documentation, domain-specific AI tools will give companies a distinct advantage. These systems are engineered around authoritative data sources and contain embedded safeguards that significantly reduce hallucinations and improve reliability. They also deliver faster ROI because they are modeled around well-defined workflows and nuanced regulatory and security constraints.
Our first foray with AI as part of the IRS modernization push was fixing the taxpayer service hotline, which had persistent backlogs, long wait times, and inconsistent answers. We deployed domain-specific AI for response handling to instantly resolve common questions and route complex issues to specialists. Within the first year, response times dropped from an average of 28 minutes to three, and millions more calls were answered live.
From employing legal contract tools to cut review time to using financial and operational AI to improve planning and supply chain decisions, successful businesses will no longer retrofit standard AI models to solve persisting problems but instead apply specialized tools to tackle more complex challenges.
- Custom AI For Solving Unique Problems
At the IRS, we only invested in bespoke AI when general or domain-specific tools couldn’t match the required level of data sophistication or compliance standards. We deployed custom case management AI, for example, to analyze patterns across millions of transactions and prioritize high-risk cases, ultimately preventing and recovering billions in fraud and improper payments in fiscal year 2024.
The mistake most organizations make is that they jump to this lane before exploring the first two. A custom AI application not only requires a large investment but also poses a greater implementation challenge, which makes proving ROI difficult and slow.
But winners in the AI race won’t always be the companies with the largest budgets. Instead, they will be companies that determine the best AI use cases through general-purpose and domain specific applications which generate the ROI and insights needed to justify custom builds.