All work

01 · Enterprise AI · 2025 to Now

Microsoft Learning Agent

Designing how people learn and reskill with AI inside Microsoft 365, and building the research system that proves the design is working.

RoleSenior UX Lead
ScopeStrategy, Interaction, System, Research
SurfaceAI agent across M365
Year2025 to Now
Learning Agent product interface: a conversational learning surface with recommendation chips, citation and confidence signals, a skill-progress card, and a personalized recommendation rail.

A look at the Learning Agent experience: conversational guidance, grounded answers with confidence, and progress that compounds.

Context

Reskilling is the real product, AI is the medium.

I lead UX for Microsoft's AI-powered Learning and Skilling experience, the Learning Agent. The job is bigger than a chatbot. Millions of people now have powerful AI sitting next to their work, and most of them have no idea what to ask it, when to trust it, or how to fold it into the way they already work. My role is to design the experience that closes that gap, so people do not just use AI, they get measurably better at their jobs because of it.

I own the arc end to end: strategy, interaction models, the design system, and the research that tells us whether any of it is landing. I partner daily with product, engineering, and data science to co-create the roadmap and ship AI-driven features that scale across Microsoft 365 surfaces like Teams, Outlook, Word, and Excel.

M365Surfaces the experience spans
4Strategic pain-point pillars
End to endStrategy through validation
01 · Frame

Four pain points, four behavior changes.

Rather than design features in a vacuum, I framed the work around four pillars, each one a specific place where people stall with AI and a specific behavior we want the Learning Agent to drive. This became the shared language for the whole team and the structure I prototyped against for stakeholders.

01

Discovering opportunities

People do not know where AI can help. Contextual entry prompts surface the right moment to use it inside real work.

02

Selecting the right tool

Too many surfaces, too little guidance. The agent points to the right tool for the task instead of leaving people guessing.

03

Validating AI outputs

Trust breaks when answers feel like a black box. Inline citation chips and confidence signals make outputs checkable.

04

Redesigning the workflow

The real win is not one task, it is a rebuilt workflow. A human and AI task split shows what changes, before and after.

The goal was never to add another AI chat box. It was to redesign the work itself, so the human keeps the judgment and the AI takes the toil.
02 · Patterns

Interaction models that teach while people work.

I defined the core interaction models for the agent: conversational UI, personalized recommendation surfaces, and in-flow guidance that meets people inside the task instead of pulling them out to a separate course. Three patterns did most of the heavy lifting.

  • Contextual entry prompts. Small, well-timed cards that suggest an AI move at the exact moment it is useful, so discovery happens in the flow of real work.
  • Citation chips with confidence. Inline chips that show what an answer is grounded in and how sure the system is, turning a black box into something a person can verify.
  • Human and AI task split. A before and after view of a workflow that makes the redesign legible: here is what you still own, here is what the agent now handles.

To keep these honest, I built high-fidelity, interactive prototypes that lived inside the real product's design system, so stakeholders reacted to something that felt shipped rather than to a slide.

Three interaction patterns: contextual entry prompts, citation chips with a confidence signal, and a human and AI task split shown before and after.
The three patterns that did most of the work: contextual entry, citation with confidence, and the human and AI split.
03 · Motivation

Engagement mechanics, grounded in behavior.

Learning only compounds if people come back. I designed personalization and gamification mechanics, streaks, XP, and weekly challenges, driven by real behavioral and usage signals rather than vanity badges. The point was momentum: small, visible progress that makes the next session feel worth it, tuned by what the data showed people actually responded to.

04 · Evidence

I built the instrument that measures whether it works.

Design at this scale is only as credible as the evidence behind it. I built and maintain a pilot dashboard that brings every signal into one place: survey scores, in-product thumbs feedback, focus group findings, usability results, and enrollment across pilot cohorts. It uses top-two-box scoring and clear status thresholds so leadership can see, at a glance, what is healthy and what needs attention.

That dashboard is how design recommendations get made and defended. It turns scattered feedback into a single, honest read on the program, and it is what moved the work from a pilot into scaling planning.

Learning Agent pilot dashboard: KPI tiles for satisfaction, task completion lift, weekly active learners, and recommendation acceptance, an eight-sprint satisfaction trend, adoption by cohort, a satisfaction ring, and program status indicators.

The pilot dashboard I built and maintain. Cohort labels are anonymized and figures shown here are illustrative, the live program data is internal to Microsoft.

Impact

From scattered feedback to a program that scales.

Beyond the screens, I established a UX design feedback process across product teams, running design reviews, stakeholder workshops, and shared roadmap planning with PM, engineering, and research. I also translate dense system constraints into clear strategic narratives through prototypes, playbooks, and executive presentations, the artifacts that get leadership to a decision faster.

The result is a design practice that does not just produce interfaces, it produces evidence. The Learning Agent has moved past early piloting into scaling planning, with new organizations coming online, because the work can show its value, not just assert it.