2025 Schedule
The schedule is subject to change.
Monday, June 9th, 2025
Conference Introduction and Keynote:
Humans in the Loop: Engineering Leadership in a Chaotic Industry
Systems Thinking for Scaling Responsible Multi-Agent Architectures
As enterprises increasingly adopt multi-agent systems (MAS) for complex AI solutions, engineers face the dual challenge of scaling these systems while maintaining ethical responsibility.
The Ideal Micro-Frontends Platform
Micro-frontends can help organizations scale frontend development, but without the right foundations, they often lead to unnecessary complexity and performance issues. How can you implement them effectively while avoiding common pitfalls?
Vector Sync Patterns: Keeping AI Features Fresh When Your Data Changes
"Just rebuild your vectors nightly" is a seemingly reasonable approach that quickly breaks down when your application code changes, leading to stale AI features, ballooning computing costs, and frustrated users.
Break - Located in Ziskind Lounge
DevOps Is for Product Engineers, Too.
In this talk, we'll be diving into the intersection between product engineering, DevOps, and Site Reliability Engineering (SRE). We'll explore how they're combined to create a culture of technical excellence and psychological safety, both within a team and across an entire organization.
AI-Driven Software Delivery: Leveraging Lean, ChOP & LLMs to Create More Effective Learning Experiences at QCon
What happens when experienced software leaders pair with AI and take a lean thinking approach to building a new product? AI turns lean thinking experimentation into a superpower.
Java Concurrency from the Trenches: Lessons Learned in the Wild
What happens when your batch job goes from “it works on my laptop” to triggering out-of-memory errors and DDoS-ing your own dependencies?
Building an Internal Developer Portal that Empowers Developers
Internal Developer Portals are revolutionizing how teams streamline workflows, enhance developer experience, and boost productivity. But for medium-sized organizations with limited resources, the path to success is unclear and fraught with challenges.
Theme Systems at Scale: How To Build Highly Customizable Software
Efficient theming systems require an architecture that supports different users at various stages. Shopify has been pioneering that with Liquid themes for years. In this talk, I'll explore the critical components that make it possible and how to build them, from authoring to rendering.
Design Patterns for Multi-Region Mission Critical Apps
Cloud service providers offer the world’s most expansive infrastructure, enabling the development of multi-region applications that deliver low-latency performance, withstand region-level outages, and meet strict data residency requirements.
Thinking Like a Detective: Solving Cloud Infrastructure Mysteries
When requests mysteriously fail in cloud environments, even experienced engineers can struggle to identify the culprit. With components spread across multiple layers and services, tracking down these elusive bugs requires both methodical investigation and specialized knowledge.
You Are Asking the Wrong Questions (About Reliability and SRE)
Grappling with reliability is trickier than it looks, even if (and perhaps especially if) you are coming at it from a site reliability engineering (SRE) perspective.
Shipping Software with Speed and Confidence in an AI-Enabled World
From building, testing, and deploying traditional deterministic applications to more modern unpredictable ones enabled by generative AI, software delivery has evolved dramatically in recent years.
Your Roadmap to a Fulfilling Career: The Pillars of Staff+ Growth
What does a truly fulfilling career in software engineering look like—and how do you build one beyond the Senior level?
Powering Enterprise AI Applications with Data and Open Source Software
Managing and serving data efficiently is critical to deploying successful AI applications at scale. Feature stores have emerged as an essential tool for organizations, centralizing and streamlining the management of data for AI/ML, from raw data ingestion to real-time model inference.
Rethinking Database Architecture to Reduce Costs at Enterprise Scale for Critical Workloads
As enterprise systems scale, so do their infrastructure costs—often exponentially. Traditional database architectures struggle to keep up with the growing demands of modern workloads that span transactional processing, real-time analytics, and AI-driven operations.
‘Dev Dialogs’ - After Hours Unconference & Peer Discussions
Join us at the end of the day to socialize with other participants and expert speakers while discussing topics you bring to the table.
Metcalf Small Ballroom
Tuesday, June 10th, 2025
Conference Introduction and Keynote:
Key Lessons from Shipping AI Products Beyond the Hype
Architecting Planet Scale, Modern Apps in the Cloud
Everything fails, all the time! Engineers should build architectures and applications with this concept in mind.
Why Observability Matters (More!) with AI Applications
As large language models (LLMs) move into production, observability is essential for ensuring reliability, performance, and responsible AI.
Scaling to 100+ as a Director: Lessons From Growing Engineering Organizations
When I took on the Director of Engineering role at Nubank, I had no idea just how many times I’d need to reinvent my leadership style.
Break - Located in Ziskind Lounge
Scaling an Embedded Database for the Cloud – Challenges and Trade-Offs
Most databases designed for the cloud follow a client-server model, but what happens when you need to scale an in-process, embeddable database like DuckDB? This talk dives into the engineering challenges of building a cloud-native analytics platform from an embedded database.
Fix SLO Breaches Before They Repeat: An SRE AI Agent for Application Workloads
In production, detecting an SLO breach is only the beginning. What if your system could automatically guide you to the root cause and suggest code-level fixes? In this talk, we introduce a new SRE agent built at Microsoft for JVM workloads running in Kubernetes.
Are You Missing a Data Frame? The Power of Data Frames in Java
There is a missing tool in your Java data structure toolkit – Data Frames!
Test Smarter, Not Harder: Achieving Confidence in Complex Distributed Systems
Testing distributed systems is hard, but it doesn't have to be fragile. In this session, we explore how to test services that depend on multiple databases, external providers, and asynchronous events.
No QA Environment? No Problem: How Classpass Enables Testing on a Single Environment in ECS
Anyone who works in a distributed system knows the pain of trying to test it. Standing up the entire universe on a local machine isn’t sustainable, and different engineers (backend! frontend! mobile! QA!) have different testing wants and needs.
GenAI at Scale: What It Enables, What It Costs, and How To Reduce the Pain
Generative AI is reshaping what’s possible in software, from augmenting human work with code and content generation to unlocking entirely new interfaces, such as retrieval-augmented generation, chatbots, and automated processing. But with great power comes shocking costs.
Empathy Driven Platforms: You Build It, Let’s Run It Together
The DevOps movement empowered developers to overcome traditional operational bottlenecks by embracing a "you build it, you run it" approach.
From Grassroots to Enterprise: Vanguard's Journey in SRE Transformation
Inspired by the pioneering Google SRE book released in 2016, Vanguard's Site Reliability Engineering (SRE) practices began as a grassroots initiative in 2018.
Orchestrating AI Services with the Spring AI Framework
Modern AI applications demand more than just prompt engineering—they require orchestration of models, search, and tools into a cohesive system. In this talk, we’ll show how the Spring AI Framework empowers you to do exactly that, using the full power of the Spring ecosystem.
Chatting with Your Knowledge Graph
GenAI can ingest an unstructured document and answer your natural language questions about it, but it isn’t yet as good at “talking” to structured data in a database, especially if the database models complex real-world relationships.
Growing and Cultivating Strong Machine Learning Engineers
One manager's view on how to develop successful machine learning engineers, from early-career to senior. Vivek will talk about the skills that their engineers have found useful, the continuous learning that occurs, and how to guide them towards success.