04 June 2026 13:30 - 14:30
Cheaper, faster agents with inference engineering
With the rise of open models, every company has the opportunity to own their intelligence and optimize models around their production workloads. However, this makes inference strategy essential.
This roundtable discussion covers how to navigate technical tradeoffs around latency, throughput, and cost and is a forum to hear from other practitioners in the industry on how their team is making AI work in production.