Key Takeaways
- But McDonald's did not abandon AI drive-thru.
- AI drive-thru systems combine several distinct technology layers, each presenting its own set of challenges.
- Order accuracy is the single most important metric for AI drive-thru systems, and it's the metric that will determine whether the technology succeeds or fails at scale.
- The pitch for AI drive-thru has always included a labor component: if the AI takes orders, you need fewer people on headsets, and those crew members can be redeployed to food preparation, order assembly, and other tasks.
- AI drive-thru technology is not cheap to deploy or maintain.
The drive-thru speaker box has been functionally unchanged for decades. A tinny microphone, a distorted speaker, a human on the other end trying to hear "no pickles, extra sauce" over the sound of a diesel truck idling in the next lane. The technology was rudimentary in 1975, and it was still rudimentary in 2023.
That era is ending. Over the past 18 months, a wave of AI-powered voice ordering systems has moved from limited pilots to large-scale deployment across major QSR chains. These systems use natural language processing, speech recognition, and conversational AI to take orders directly from customers at the drive-thru speaker, without a human crew member on the headset. The potential benefits are significant: faster order times, higher accuracy, consistent upselling, and labor reallocation. The challenges are equally real: technical limitations, customer resistance, and a cost structure that demands careful ROI analysis.
As of early 2026, the AI drive-thru arms race involves virtually every major player in quick-service restaurants, a roster of AI technology vendors competing for market share, and billions of dollars in franchise investment decisions. Here's where things stand.
The Current Deployment Landscape
McDonald's and its pivot. McDonald's made headlines in June 2024 when it ended its partnership with IBM on automated order-taking (AOT) technology after a multi-year pilot at roughly 100 locations. The tests, which ran from 2021 through mid-2024, produced mixed results. Order accuracy hovered around 85%, below the 90% to 95% accuracy threshold that McDonald's considered acceptable. Viral social media videos of the system misinterpreting orders did not help.
But McDonald's did not abandon AI drive-thru. Instead, it pivoted. In December 2024, McDonald's announced a new partnership with Google Cloud to develop a next-generation voice ordering system built on Google's Gemini large language model. The new system began testing at select locations in early 2025. By Q4 2025, McDonald's had expanded testing to over 200 U.S. locations, per its third-quarter earnings call. CEO Chris Kempczinski described early results as "encouraging," noting order accuracy rates above 90% at pilot locations.
The Google Cloud partnership represents a broader bet: McDonald's is integrating AI not just at the speaker but across its technology stack, connecting voice ordering with loyalty data, dynamic menu pricing, and predictive inventory systems.
Wendy's FreshAI. Wendy's has been the most aggressive QSR brand in deploying AI drive-thru technology. Its FreshAI system, developed in partnership with Google Cloud, launched its first pilot location in Columbus, Ohio, in May 2023. By late 2025, Wendy's had expanded FreshAI to over 500 company-owned and franchisee locations, making it the largest AI voice ordering deployment in the industry.
Wendy's reported at its 2025 Investor Day that FreshAI locations showed a 22-second reduction in average order time compared to human-staffed drive-thrus, along with a 15% increase in upsell attachment rates. Order accuracy at FreshAI locations was reported at 86% initially, improving to approximately 92% after iterative model training.
Taco Bell's approach. Yum! Brands has deployed AI drive-thru technology at over 300 Taco Bell locations through a partnership with Nvidia and a proprietary system developed by its in-house technology team. Taco Bell's system faces a unique challenge: its highly customizable menu, with dozens of possible modifications per item, creates a combinatorial complexity that simpler concepts don't face. A customer ordering a "Crunchwrap Supreme, no sour cream, add jalapenos, make it a combo with a Baja Blast, no ice" is testing the system's ability to parse multiple modifications in a single utterance.
Yum! CEO David Gibbs noted on the Q4 2025 earnings call that Taco Bell's AI order accuracy for "standard" orders (no modifications) exceeded 93%, but accuracy for heavily customized orders sat closer to 82%.
Hardee's and Carl's Jr. CKE Restaurants, parent of Hardee's and Carl's Jr., rolled out AI ordering through a partnership with SoundHound AI at over 200 locations by end of 2025. SoundHound's platform, which also serves Applebee's and White Castle, uses its own proprietary speech recognition engine rather than relying on a hyperscaler like Google or Amazon.
Smaller chains and regional players. The AI drive-thru trend extends well beyond the top ten QSR brands. Del Taco, Checkers/Rally's, Sonic Drive-In (a Yum! subsidiary), and Wingstop have all announced AI ordering pilots or deployments. The technology has become accessible enough that even mid-size franchise systems can explore it through vendor partnerships.
The Technology Under the Hood
AI drive-thru systems combine several distinct technology layers, each presenting its own set of challenges.
Automatic speech recognition (ASR). Converting spoken words into text is the foundational layer. Modern ASR systems, including those from Google, OpenAI/Whisper, and SoundHound, perform well in controlled environments. A drive-thru is not a controlled environment. Background noise from engines, wind, rain, other passengers in the car, and the restaurant's own kitchen creates an acoustic challenge that indoor voice assistants rarely face.
Wind noise alone can reduce ASR accuracy by 10 to 15 percentage points, according to research published by the Association for Computational Linguistics in 2024. Vendors have responded with multi-microphone arrays, noise cancellation algorithms, and beamforming technology (which focuses the microphone's pickup pattern on the driver's seat position). These improvements have closed much of the gap, but edge cases remain.
Accent and dialect handling is another persistent challenge. A drive-thru in Miami serves customers speaking English, Spanish, Haitian Creole, and various accented combinations. A location in rural Alabama encounters dialects that differ significantly from the standard American English on which most ASR models are trained. Vendors have invested heavily in accent-adaptive models, but performance still varies by region and demographic.
Natural language understanding (NLU). Once speech is converted to text, the system must interpret intent. This is where the latest generation of large language models (LLMs) has made the biggest impact. Older NLU systems relied on rigid grammar rules and keyword matching: the system understood "I want a Big Mac" but struggled with "let me get, uh, one of those Big Macs, no wait, make it two, and can you do one without onions?"
LLM-based systems handle conversational ambiguity far better. They can process disfluencies (the "ums" and "uhs" that characterize natural speech), mid-order corrections, split orders (when multiple passengers each order separately), and colloquial phrasing. Google's Gemini model, which powers the McDonald's and Wendy's systems, was specifically fine-tuned on millions of drive-thru audio transcripts to handle QSR-specific language patterns.
Dialog management. A drive-thru order is a conversation, not a single utterance. The system needs to manage multi-turn interactions: greeting, initial order, modifications, upsell suggestion, order confirmation, and total. Each step requires the system to maintain context about what has already been ordered, what options are available, and what the customer likely wants next.
This is also where upselling happens. AI systems can be tuned to suggest specific add-ons based on the current order composition, time of day, and promotional priorities. When an AI system asks "would you like to add a medium fry for $1.99?" with consistent, natural delivery on every single order, the incremental revenue adds up. Wendy's reported that FreshAI upsell suggestions were accepted roughly 25% of the time, compared to an estimated 15% to 18% acceptance rate when human crew members (who often skip the upsell during busy periods) handled the interaction.
Integration layer. The AI ordering system must connect in real time to the restaurant's point-of-sale (POS) system, menu database, and kitchen display system. This integration is more complex than it appears. Menu items, prices, and availability change frequently. A limited-time offer might be active at some locations but not others. Certain items might be unavailable due to an ingredient outage. The AI system must reflect these realities in real time or risk taking orders the kitchen cannot fulfill.
The Order Accuracy Question
Order accuracy is the single most important metric for AI drive-thru systems, and it's the metric that will determine whether the technology succeeds or fails at scale.
The QSR industry's standard for human-staffed drive-thru accuracy, measured by organizations like Intouch Insight and SeeLevel HX in their annual drive-thru studies, typically falls between 85% and 90%. McDonald's historically scores near the top, Taco Bell near the bottom (due to menu complexity). These figures count an order as "accurate" only if every item, every modification, and every condiment is correct.
AI systems are approaching but have not consistently exceeded human accuracy levels. The key variable is what type of order is being placed:
- Simple orders (one to three items, no modifications): AI accuracy rates of 93% to 97%, generally matching or exceeding human performance.
- Moderate orders (three to five items, some modifications): AI accuracy of 88% to 93%, roughly comparable to human performance.
- Complex orders (five-plus items, heavy modifications, multiple speakers): AI accuracy of 78% to 85%, still trailing human performance in many implementations.
The gap on complex orders is significant because dissatisfied customers who receive incorrect orders are costly to the brand. An incorrect order that requires a remake costs the restaurant $3 to $7 in food waste plus the labor time to prepare the replacement, not counting the intangible cost to customer satisfaction and repeat visit likelihood.
The Labor Question
The pitch for AI drive-thru has always included a labor component: if the AI takes orders, you need fewer people on headsets, and those crew members can be redeployed to food preparation, order assembly, and other tasks.
The reality is more nuanced than the marketing materials suggest. Most operators deploying AI drive-thru have not eliminated headset positions. Instead, they've repositioned those crew members as "monitors" who listen to the AI interaction and intervene when the system struggles or when a customer becomes frustrated. At Wendy's FreshAI locations, a crew member typically monitors the AI system's interactions and can seamlessly take over the conversation if needed.
The labor savings, at least in the current deployment phase, come not from headcount reduction but from improved consistency. Human order-takers have variable performance: a strong crew member during a calm afternoon handles orders differently than a new hire during the Friday lunch rush. AI systems perform at a constant level regardless of time of day, staffing levels, or how many hours into a shift the "employee" is.
Where labor savings may become more meaningful is in the overnight and late-night dayparts. Staffing a drive-thru with a human order-taker from midnight to 5 AM is expensive relative to the transaction volume. AI ordering could allow locations to operate overnight with a skeleton crew focused entirely on food preparation, potentially making 24-hour operation profitable at locations where it currently isn't.
According to the BLS, the average hourly wage for fast food and counter workers in the U.S. reached $14.98 in January 2026. In states with higher minimums (California at $20.00, Washington at $16.66, New York at $16.00 for fast food workers), the labor cost of a dedicated order-taker is substantial. A single headset position, staffed across all dayparts, represents $35,000 to $55,000 in annual labor cost. If AI can reliably cover 70% to 80% of order-taking interactions, the reallocation of even a fraction of that labor cost funds the technology investment.
The Cost Structure
AI drive-thru technology is not cheap to deploy or maintain. The cost structure includes several components:
- Hardware: New speaker posts, microphone arrays, and in-store computing hardware typically cost $5,000 to $15,000 per location.
- Software licensing: Vendors charge monthly or annual SaaS fees ranging from $500 to $2,000 per location per month, depending on the vendor, the volume of transactions, and the scope of the integration.
- Integration: Connecting the AI system to existing POS infrastructure, menu management systems, and kitchen display systems requires custom development work, typically $2,000 to $10,000 per location for initial setup.
- Ongoing model training: AI systems require continuous data collection and model updates to maintain and improve accuracy. This cost is usually embedded in the SaaS fee but represents a significant ongoing investment for the vendor.
For a single franchise location, the total first-year cost of deploying AI drive-thru runs between $15,000 and $50,000, with ongoing costs of $6,000 to $24,000 annually. The ROI equation depends on transaction volume, labor cost savings, upsell revenue uplift, and the speed improvement that allows more cars to pass through the drive-thru during peak periods.
Wendy's has estimated that FreshAI delivers a payback period of 18 to 24 months at locations with strong drive-thru volume (above 150 cars per hour at peak). For lower-volume locations, the payback stretches to three years or more.
Customer Reception
Consumer attitudes toward AI drive-thru ordering are mixed. A 2025 survey by Technomic found that 42% of QSR customers said they were "comfortable" or "very comfortable" interacting with an AI system at the drive-thru, while 31% said they were "uncomfortable" or "very uncomfortable." The remaining 27% were neutral.
Comfort levels correlate strongly with age: 58% of consumers aged 18 to 34 expressed comfort, compared to 29% of consumers aged 55 and older. There was also a geographic split, with urban and suburban consumers more accepting than rural consumers.
The most common complaint, cited by 47% of dissatisfied respondents, was "the system didn't understand me." The second most common (31%) was "it felt impersonal." The third (22%) was "I didn't trust it got my order right."
These attitudes will likely shift as the technology improves and exposure increases. The same pattern played out with self-checkout in retail, ATMs in banking, and self-service kiosks in QSR dining rooms. Early resistance gives way to acceptance and eventually preference as the technology reaches a reliability threshold.
What Comes Next
The AI drive-thru is a transitional technology. Current deployments focus narrowly on order-taking, but the roadmap extends much further.
Visual AI integration. Several vendors are developing systems that combine voice ordering with computer vision. A camera at the menu board could detect vehicle type (to estimate party size), read license plates (to connect with loyalty accounts for personalized greetings and order suggestions), or identify repeat customers who always order the same thing. These capabilities raise significant privacy questions, but the technology exists.
Predictive ordering. As AI systems accumulate order history linked to loyalty profiles, the possibility of predictive ordering emerges. Instead of asking what you want, the system could say: "Welcome back. Your usual two McChickens and a large Diet Coke?" This reduces order time to a simple confirmation.
Multi-language support. Current AI drive-thru systems operate primarily in English, with some offering Spanish as a second option. The next generation will need to handle code-switching (customers who alternate between languages mid-sentence), a common behavior in multilingual markets like South Florida, the Texas border region, and Southern California.
The drive-thru generates roughly 70% of total QSR revenue in the United States, according to the NRA. Any technology that can improve the speed, accuracy, and revenue per transaction of that channel by even a few percentage points has enormous financial implications at scale. The AI upgrade to the drive-thru speaker is not a gimmick. It's the biggest operational technology shift the industry has seen since the introduction of the POS system.
QSR Pro Staff
The QSR Pro editorial team covers the quick service restaurant industry with in-depth analysis, data-driven reporting, and operator-first perspective.
More from QSR