-
Wednesday, 10 Sep, 202509:00 AM
Technical Deep Dive into the UALink 200G Specification, Scale-Up, and Use Cases
As AI workload demands continue to accelerate, Cloud Service Providers, System OEMs, and IP/Silicon vendors require a scalable, high-performance solution to support advanced workloads. By enhancing performance, optimizing power and cost efficiency, and promoting interoperability and supply chain diversity, the UALink 200G 1.0 Specification delivers a low-latency, high-bandwidth interconnect designed for efficient communication between accelerators and switches within AI computing pods.
Location: Room 201
Duration: 40 minutes
Sponsor(s):UALink ConsortiumSession Type:WorkshopBuilding a Data Foundation to Make AI Anywhere a Reality
A staggering number of AI projects fail before reaching production, and the primary culprit is data. As enterprises re-architect their infrastructure to support generative, agentic, and physical AI across distributed infrastructures, the data layer is emerging as the lynch pin for success.
Learn how to architect a strategic data foundation that increases efficiency using infrastructure you already have, to simplify AI data pipelines, enable AI anywhere across any on-premises and cloud storage, and prepare for a future at a much larger scale.
Location: Room 206
Duration: 40 minutes
Sponsor(s):HammerspaceSpeaker(s):Jeff Echols
VP of Strategic PartnersHammerspaceWith a dynamic 20-year career in enterprise software, Jeff Echols has established himself as a leader in building differentiated solutions, strategic relationships, and driving joint partner success. He is currently the VP of Strategic Partners at Hammerspace, focusing on the company's AI Solutions and go-to-market strategy. His extensive experience spans enterprise and startup environments, including two IPOs and international assignments. Throughout his career at companies like WEKA, Nutanix, and CommVault, Jeff has led sales, product marketing, and corporate strategy teams, contributing to significant revenue growth and market differentiation.
Session Type:Workshop1:30 PMScaling LLM Inference with vLLM and AWS Tranium
Join us in this hands-on workshop to learn how to deploy and optimize large language models (LLMs) for scalable inference at enterprise scale. Participants will learn to orchestrate distributed LLM serving with vLLM on Amazon EKS, enabling robust, flexible, and highly available deployments. The session demonstrates how to utilize AWS Trainium hardware within EKS to maximize throughput and cost efficiency, leveraging Kubernetes-native features for automated scaling, resource management, and seamless integration with AWS services.
Location: Room 206
Duration: 1 hour
Sponsor(s):AWSSpeaker(s):Asheesh Goja
Principal GenAI Solutions ArchitectAWSPinak Panigrahi
Sr. Machine Learning Architect - Annapurna MLAWSSession Type:General Session (Presentation)2:45 PMFrom Rack to Response: Build & Deploy Generative AI in 30 Minutes with NeuReality
Experience the future of GenAI inference architecture with NeuReality’s fully integrated, enterprise-ready NR1® Inference Appliance. In this hands-on workshop, you'll go from cold start to live GenAI applications in under 30 minutes using our AI-CPU-powered system. The NR1® Chip – the world’s first AI-CPU purpose built for interference – pairs with any GPU or AI accelerator and optimizes any AI data workload. We’ll walk you through setup, deployment, and real-time inference using models like LLaMA, Mistral, and DeepSeek on our disaggregated architecture—built for smooth scalability, superior price/performance and near 100% GPU utilization (vs <50% with traditional CPU/NIC architecture). Join us to see how NeuReality eliminates infrastructure complexity and delivers enterprise-ready performance and ROI today.
Location: Room 201
Duration: 1 hour
Sponsor(s):NeuRealitySpeaker(s):Paul Piezzo
Enterprise Sales DirectorNeuRealityGaurav Shah
VP of Business DevelopmentNeuRealityNaveh Grofi
Customer Success EngineerNeuRealitySession Type:WorkshopDatabank Workshop - Title TBC
Location: Room 206
Duration: 1 hour
Sponsor(s):DataBankSession Type:WorkshopImprove Price Performance for LLM Serving with vLLM on TPU & GKE
Dive into a hands-on workshop designed exclusively for AI developers. Learn to leverage the power of Google Cloud TPUs, the custom accelerators behind Google Gemini, for highly efficient LLM inference using vLLM. In this trial run for Google Developer Experts (GDEs), you'll build and deploy Gemma 3 27B on Trillium TPUs with vLLM and Google Kubernetes Engine (GKE). Explore advanced tooling like Dynamic Workload Scheduler (DWS) for TPU provisioning, Google Cloud Storage (GCS) for model checkpoints, and essential observability and monitoring solutions. Your live feedback will directly shape the future of this workshop, and we encourage you to share your experience with the vLLM/TPU integration on your social channels.
Location: Room 207
Duration: 1 hour
Sponsor(s):GoogleSpeaker(s):Niranjan Hira
Senior Product ManagerGoogle CloudAs a Product Manager in our AI Infrastructure team, Hira looks out for how Google Cloud offerings can help customers and partners build more helpful AI experiences for users. With over 30 years of experience building applications and products across multiple industries, he likes to hog the whiteboard and tell developer tales.
Session Type:Workshop4:00 PMRuntime Attested HPC Cluster Reference Architecture for Confidential Computing
The rapid evolution of high-performance computing (HPC) clusters has been instrumental in driving transformative advancements in AI research and applications. These sophisticated systems enable the processing of complex datasets and support groundbreaking innovation. However, as their adoption grows, so do the critical security challenges they face, particularly when handling sensitive data in multi-tenant environments where diverse users and workloads coexist. Organizations are increasingly turning to Confidential Computing as a framework to protect AI workloads, emphasizing the need for robust HPC architectures that incorporate runtime attestation capabilities to ensure trust and integrity.
In this session, we present an advanced HPC cluster architecture designed to address these challenges, focusing on how runtime attestation of critical components – such as the kernel, Trusted Execution Environments (TEEs), and eBPF layers – can effectively fortify HPC clusters for AI applications operating across disjoint tenants. This architecture leverages cutting-edge security practices, enabling real-time verification and anomaly detection without compromising the performance essential to HPC systems.
Through use cases and examples, we will illustrate how runtime attestation integrates seamlessly into HPC environments, offering a scalable and efficient solution for securing AI workloads. Participants will leave this session equipped with a deeper understanding of how to leverage runtime attestation and Confidential Computing principles to build secure, reliable, and high-performing HPC clusters tailored for AI innovations.
Location: Room 201
Duration: 1 hour
Sponsor(s):Confidential Computing ConsortiumSpeaker(s):Jason Rogers
CEOInvaryJason Rogers is the Chief Executive Officer of Invary, a cybersecurity company that ensures the security and confidentiality of critical systems by verifying their Runtime Integrity. Leveraging NSA-licensed technology, Invary detects hidden threats and reinforces confidence in an existing security posture. Previously, Jason served as the Vice President of Platform at Matterport, successfully launched a consumer-facing IoT platform for Lowe's, and developed numerous IoT and network security software products for Motorola.
Ayal Yogev
CEO & Co-founderAnjunaSession Type:WorkshopTrain and Deploy High-Performing AI Model Development at Scale
In this session, we will explore the end-to-end workflow of managing foundation model (FM) development on Amazon SageMaker HyperPod. Our discussion will cover both distributed model training and inference using frameworks like PyTorch and KubeRay. Additionally, we will dive into operational aspects, including system observability and resiliency features for scale and cost-performance using Amazon EKS on SageMaker HyperPod. By the end of this hands-on session, you will gain a robust understanding of training and deploying FMs efficiently on AWS. You will learn to leverage cutting-edge techniques and tools to ensure high performance, reliable, and scalable FM development.
Location: Room 206
Duration: 1 hour
Sponsor(s):AWSSpeaker(s):Mark Vinciguerra
Assoc. WW Solution ArchitectAWS GenAIAravind Neelakantan
WW Solution ArchitectAWS GenAIAman Shanbhag
WW Solution ArchitectAWS GenAIAman Shanbhag is a Specialist Solutions Architect on the ML Frameworks team at Amazon Web Services (AWS), where he helps customers and partners with deploying ML training and inference solutions at scale. Before joining AWS, Aman graduated from Rice University with degrees in computer science, mathematics, and entrepreneurship.
Session Type:WorkshopNew technology modalities for the AI fabric chipset: when advanced ASICs and photonics ICs come together with 3D packaging
In the new era of AI infrastructure, CMOS scaling remains the workhorse for heavy computational workloads. But the need for an energy-efficient solution imposes a paradigm shift at the interconnect level, requiring an intimate 3D co-integration of advanced ASICs and optical connectivity.
As the architectural complexity of new products increases, relying on state-of-the-art platforms, with a short path to manufacturing. In this workshop, we will highlight how you can access following technologies for your future products:
- Advanced-node ASIC down to TSMC N2
- Imec’s integrated photonics platforms from 200G up to co-packaged optics
- Imec’s advanced 3D packaging technique from interposer to hybrid bonding
Location: Room 207
Duration: 1 hour
Sponsor(s):IMECSpeaker(s):Philippe Soussan
Technology Portfolio DirectorIMECPhilippe Soussan is Technology Portfolio Director at imec. For 20 years, he has held different positions in R&D management at imec in the field of sensors, photonics, and 3D packaging. Addressing these technologies from R&D up to manufacturing levels.
His expertise lies in wafer-scale technologies, and he has authored over 100 publications and holds more than 20 patents in these fields.
Since 2024, Philippe has been in charge of strategy definition within the “IC-link by imec” sector. This imec business line provides access to design and manufacturing services in the most advanced ASIC and specialty technologies.
Session Type:Workshop -
Thursday, 11 Sep, 20259:00am-9:40am
Autonomous Platform Security with Hardware-Embedded AI Agents
As cyber threats grow more sophisticated and infrastructure complexity increases, traditional approaches fall short. In this briefing Axiado CEO Gopi Sirineni introduces a hardware-anchored, AI-driven approach to platform security and system management, powered by intelligent agents embedded directly in trusted silicon and independent of the host CPU and OS. Running on our Trusted Control/Compute Unit (TCU), these agents autonomously detect, respond to, and recover from threats while optimizing system performance in real time. Learn how Axiado’s architecture transforms passive infrastructure into resilient, self-defending systems—delivering proactive protection and operational intelligence from the ground up.
Location: Room 201
Duration: 40 minutes
Session Topics:Agentic AIHardwareSponsor(s):AxiadoSpeaker(s):Gopi Sirineni
Founder, President & CEOAxiadoGopi Sirineni is a Silicon Valley veteran with four startups and almost thirty years of experience in the semiconductor, software and systems industries. As a senior executive, he has demonstrated exceptional skill at building highly efficient,
cost-effective organizations, managing them in rapidly changing environments, and bringing industry-changing technologies like Wi-Fi mesh, network-attached storage, and core network backplanes to market.Time:9:00amSession Type:General Session (Presentation)
Jump to: Wednesday, 10 Sep | Thursday, 11 Sep