

ROUNDTABLE: Running AI Inference at Scale
Only 1 seat left
Thursday, December 11, 2025 11:00 AM to 12:00 PM · 1 hr. (America/New_York)
VIP Roundtables Room (1B04)
Information
In this roundtable, Nebius will share practical insights and real-world lessons from operating a high-performance, cost-efficient inference platform used by leading AI-native companies.
Attendees will gain a clear understanding of how to optimize inference at scale through techniques such as speculative decoding, prefill acceleration, quantization, and model-level optimizations that dramatically improve latency and throughput.
Nebius will walk participants through proven approaches for reducing cost per token, handling variable workloads, and extracting maximum performance from modern open-source models.
Stream
VisionAIres VIP Program
Pass Type
VIP All Access
