Original broadcast 6/3/25
Presented by Microsoft
The first segment of AI & Automation: In Depth, hosted by Francis Rose, features an insightful conversation with Nelli Babayan, FedCIV AI Director at Microsoft. Babayan discusses how artificial intelligence is actively transforming operations across federal agencies—streamlining repetitive tasks, enhancing employee experience, and improving citizen services. She emphasizes that AI’s impact goes beyond novelty; it's about aligning tools with real mission outcomes.
Babayan urges federal leaders to begin by deeply understanding their operational problems and goals before selecting or implementing AI tools. According to her, success starts with identifying clear use cases and defining measurable success criteria. Rather than adopting AI for its own sake, agencies should focus on whether the technology improves employee workflows, enhances decision-making, or elevates service delivery to the public.
One of the most significant shifts she notes is the accessibility of AI. With today’s low-code and no-code platforms, federal employees can integrate AI into their work with minimal technical expertise. This democratization of AI opens doors for innovation at all levels of government. However, Babayan cautions that AI also brings challenges—especially in data governance. Understanding what data exists, who has access to it, and how it’s being used is foundational before deploying AI solutions.
Ultimately, Babayan advocates for a thoughtful, mission-aligned approach to AI adoption. Agencies must weigh feasibility, mission impact, and risk—not just implementation cost, but also the risk of inaction in an increasingly AI-driven world. With proper evaluation, pilot planning, and clear goals, federal agencies can transition from experimentation to operational success.
Key Takeaways:
Start with clearly defined problems and success metrics before selecting AI tools.
Democratized access through low-code/no-code platforms is accelerating AI use in government.
Data governance and human oversight are essential for responsible AI deployment.
The second segment brings together a distinguished panel of leaders: Matt Turek, Deputy Director of the Information Innovation Office (I2O) at DARPA; Dr. Mike Horton, Chief AI Officer (Acting) at the Department of Transportation (DOT); and Nelli Babayan from Microsoft. The conversation, moderated by Francis Rose, explores two divergent but complementary perspectives on AI—DARPA’s role in national security innovation and DOT’s focus on scaling AI across agency operations.
Dr. Mike Horton offers a contrasting perspective, focusing on operational scalability within DOT. Horton emphasizes the distinction between “vertical” innovation—high-impact, narrow applications like those DARPA develops—and “horizontal” scalability, which aims to embed AI across large, complex agencies. Horton’s goal is to improve productivity and efficiency at scale, steering his agency away from flashy pilots and toward practical solutions that deliver real return on investment. He warns against “proof of
Babayan reinforces the need to avoid endless piloting and emphasizes that success starts with identifying clear business goals. She lauds Horton’s “personification” framework as a smart way to bridge the technical and operational realities of AI integration. AI, she stresses, is not a magic fix—it must be one of many tools that align with mission needs.
Both Horton and Turek elaborate on how their agencies measure success. At DARPA, Turek describes customized technical benchmarks and operational transition metrics that vary by program. The goal is not just to prove the technology works, but to demonstrate its viability in real-world contexts. Horton adds that DOT evaluates success based on return on investment—not just in mission outcomes, but also in long-term sustainability and taxpayer value. He underscores that AI isn’t free; its ongoing use often increases cost due to continuous training, feedback, and maintenance. Horton uses this to advocate for smart investment strategies—only scaling tools that solve meaningful, high-impact problems.
Finally, the segment explores broader infrastructure and planning concerns. Babayan points out that cloud infrastructure, particularly high-performance computing from providers like Microsoft, is alleviating the burden on agencies to build or manage their own data centers. This platform-based approach allows agencies to focus on solving problems rather than managing infrastructure.
In closing, the segment paints a dynamic picture of how AI is evolving across government. From DARPA’s cutting-edge battlefield resilience programs to DOT’s operational scale-out of AI as workforce augmentation, the message is clear: AI must be planned, evaluated, secured, and maintained like any critical organizational asset. When done right, it becomes a strategic multiplier—not just of efficiency, but of mission impact.
Key Takeaways:
DARPA’s mission is to create transformational AI in high-stakes environments and ensure successful transition to end-users.
DOT emphasizes operational scalability and ROI, advocating for use cases that bring measurable mission and taxpayer value.
Viewing AI systems as “employees” helps agencies plan for governance, feedback, and lifecycle costs.