-
Tuesday, 9 Sep, 2025
1:00 PM
Following the MLCommons Q3 MLPerf Inference results announcement on the keynote stage on the morning of Tuesday, 9 September, MLCommons Founder & Executive Director David Kanter will deliver a detailed analysis of the results, followed by a Q&A session with the audience.
David Kanter
Founder & Executive Director
MLCommons
David co-founded and is the Head of MLPerf for MLCommons, the world leader in building benchmarks for AI. MLCommons is an open engineering consortium with a mission to make AI better for everyone through benchmarks and data. The foundation for MLCommons began with the MLPerf benchmarks in 2018, which rapidly scaled as a set of industry metrics to measure machine learning performance and promote transparency of machine learning techniques. In collaboration with its 125+ members, global technology providers, academics, and researchers, MLCommons is focused on collaborative engineering work that builds tools for the entire AI industry through benchmarks and metrics, public datasets, and measurements for AI Safety. Our software projects are generally available under the Apache 2.0 license and our datasets generally use CC-BY 4.0.
1:30 PM
Full Stack Infra
Rebellions
Website: https://rebellions.ai/
Rebellions builds energy-efficient AI accelerators optimized for Generative AI workloads. The company's first mass-produced AI chip, ATOM, launched in 2023 for data center deployment. Its next-generation chip, REBEL, features a scalable chiplet architecture with UCIe connectivity and 144GB HBM3e memory, designed to handle Large Language Models and Multi-Modal Models. Founded in 2020, Rebellions has secured backing from strategic investors including Wa’ed Ventures, the venture capital arm of Aramco, and KT (Korea Telecom). Following its merger with SAPEON Korea in 2024, the company welcomed SK Telecom and SK hynix as additional strategic investors, further strengthening its position in the global AI chip market.
2:00 PM
Full Stack Infra
Beyond Benchmarks to Real Products:
Automatically convert trained AI models into optimized mobile apps, ready to deploy within hours, with no app development required.
End-to-End Automated Pipeline:
Seamlessly perform AI model conversion, device-specific optimization (NPU/GPU/CPU), and automatic SDK and app template generation (Android, iOS, Flutter).
Real-World Impact & Use Cases:
Proven deployments across industries including healthcare and enterprise, delivering immediate value through reduced costs, enhanced data security, and superior real-time performance.
Yeonseok Kim
CEO
Zetic
ZETIC.ai
Website: https://zetic.ai/
ZETIC.ai is building the on-device AI ecosystem—empowering AI companies to deploy, optimize, and run their models directly on mobile devices without relying on the cloud. Our core product, ZETIC.MLange, automates the entire on-device AI deployment process: converting, optimizing, global device benchmarking, and packaging models in just 6 hours.
The platform supports diverse mobile hardware environments including NPUs, GPUs, and CPUs, delivering peak runtime performance tailored to each device. Unlike conventional solutions that require months of device-specific engineering or compromise model accuracy, ZETIC.MLange ensures full model fidelity, ultra-low latency, and efficient power usage—ideal for real-time, privacy-sensitive applications.
ZETIC.ai enables AI companies to scale globally without the cost, complexity, or regulatory limitations of cloud infrastructure. By running AI directly on end-user devices, companies can bypass cross-border data concerns and accelerate secure global deployment, making AI truly accessible anytime, anywhere.
2:30 PM
Data Movement & Management Track
Go!Foton
Website: https://www.gofoton.com/
Based in the USA with teams around the world, Go!Foton is at the forefront of advanced optical and photonics innovation. We engineer solutions to enhance user experience by offering customers unique approaches to solve real-world problems in connectivity, imaging, and beyond.
Go!Foton technology stands apart with feature-rich and performance-optimized solutions. We keep our customers satisfied and businesses performing, ensuring new and improved experiences for all.
Inspired by nature and physics, the sky is the limit when imagining what’s possible and creating what’s next.
Discover new dimensions at gofoton.com.
3:15 PM
Data Movement & Management Track
Matthew Burns
Global Director, Technical Marketing
Samtec
Matthew Burns develops go-to-market strategies for Samtec’s Silicon-to-Silicon solutions. Over the course of 25 years, he has been a leader in design, applications engineering, technical sales and marketing in the telecommunications, medical and electronic components industries. He currently serves as Secretary at PICMG. Mr. Burns holds a B.S. in Electrical Engineering from Penn State University.
3:30 PM
Data Movement & Management Track
Innodisk
Website: https://www.innodisk.com/index
Innodisk is a global leader in industrial-grade memory, storage, and AIoT solutions. Headquartered in Taiwan with a strong international footprint, Innodisk has held the largest market share in industrial-grade storage since its founding in 2005 and ranks among the world’s top providers of industrial memory modules.
As AI technology continues to evolve, Innodisk leverages its deep expertise, innovative engineering, and integrated hardware-software approach to deliver customized solutions that power the future of AIoT. Through close collaboration with industry partners, we are accelerating the adoption of intelligent applications across sectors—paving the way for a smarter, more connected world.
Explore our solutions and success stories at www.innodisk.com.
4:00 PM
Data Movement & Management Track
Ultra Ethernet Consortium
Website: https://ultraethernet.org/
Please visit our website for more information.
4:30 PM
Data Movement & Management Track
Generative AI is fundamentally changing how datacenters are built, putting three types of silicon center-stage: GPUs, custom AI ASICs, and advanced networking processors. Driven by these technologies, the datacenter processor market soared to $147 billion in 2024 and is expected to double by 2030, largely thanks to explosive growth in GPUs and specialized AI ASICs.
While GPUs remain the reference for AI training and inference, hyperscale providers, eager to reduce their dependence on Nvidia, are increasingly co-designing specialized AI ASICs with chipmakers like Broadcom, Marvell, and Alchip. These ASICs sacrifice some versatility to achieve superior performance and energy efficiency, creating opportunities for a thriving startup scene featuring companies like Groq, Cerebras, and Tenstorrent, and spurring major waves of venture investment and mergers. Crucially, chiplet architectures, which combine multiple smaller chip components into a single, optimized package, are now key to driving GPU and ASIC performance upward, beyond what traditional single-chip designs can deliver.
As AI models become ever larger and require responses within milliseconds, networking silicon has become just as critical as processors themselves. DPUs, smart network cards, and advanced switches now coordinate massive arrays of accelerators, making both scale-up and scale-out networks a pivotal part of datacenter performance.
Adrien Sanchez
Senior Technology & Market Analyst, Computing & Software
Yole Intelligence
Adrien Sanchez is Senior Technology & Market Analyst, Computing at Yole Group.
Adrien produces technology & market analyses covering computing hardware and software, AI, machine learning and neural networks.
Prior to Yole Group, he worked at AW Europe (Belgium), where he focused on image recognition & comprehension for ADAS. He also worked at ACOEM (France) on real-time sound classification using deep learning and edge computing.
Adrien graduated with a double degree from Grenoble Institute of Technology PHELMA (Grenoble INP Phelma, France) and Grenoble Ecole de Management (GEM, France), and he earned an MSc in AI at Heriot-Watt University (Edinburgh, UK).
Hugo Antoine
Technology & Market Analyst, Computing & Software
Yole Group
Hugo Antoine is a Technology & Market Analyst, Computing and Software at Yole Group.
Hugo develops technology & market analyses covering computing hardware, software, and Artificial Intelligence (AI).
He holds a master's degree from Ecole des Mines de Saint-Etienne (France), with a focus on microelectronics and computing at the Centre of Microelectronics in Provence (France). In addition, he pursued an AI specialization at Ecole Polytechnique de Montreal (Canada). Furthermore, he completed a dual-degree program in innovation management at emlyon business school, highlighting his expertise at the intersection of technology and business.
5:00 PM
AI Hardware Track
Alex Fleetwood
Chief Solutions Architect
Vaire
Alex Fleetwood is Chief Solutions Architect at Vaire Computing, a deep-tech startup pioneering near-zero energy AI chips based on reversible computing. A serial founder with multiple successful startups under his belt, Alex has built and scaled interdisciplinary teams at the cutting edge of AI, hardware, and systems design. At Vaire, he is responsible for strategic partnerships, go-to-market strategy, and leading engagements with hyperscalers, government agencies, and foundries.
5:30 PM
Women in AI Session
Nalini Garg
California State Ambassador
Women In AI
Nalini is a recognized leader in Deloitte's AI & Data practice, standing at the forefront of AI innovation and strategy. With a distinct path of achievements, she has delivered over 30 talks on generative AI, emphasizing its safe usage and business efficiencies for life sciences and tech companies. Named among the top 100 women in AI for 2023 and 2024, she currently serves as the California state ambassador for Women in AI. Beyond her professional prowess, Nalini is a wellness instructor, artist, and podcast host.
Ipsita Mohanty
Vice Chair
IEEE Women in Engineering
Ipsita Mohanty is an accomplished AI Leader and Vice Chair of IEEE Women in Engineering, Santa Clara Valley, Region 6. With over 15 years of experience spanning Salesforce, Goldman Sachs, Walmart Labs, and Amdocs, she has led several impactful AI-driven initiatives, including architecting Salesforce-Tableau Agents and enhancing recommendation systems at Walmart. Ipsita holds a Master’s degree in Information Technology from Carnegie Mellon University and a Bachelor’s in Computer Science and Engineering from KIIT University. She is also the co-founder of QApp, an AI/ML automation platform empowering freelancers. Winner of several AI Awards and recognized among the Top 100 Women in AI Leaders globally, she actively contributes to AI research and community leadership while championing diversity in technology.
Shweta Behere
Senior Engineering Manager
Workday
Shweta Behere is a Senior Engineering Manager at Workday, where she leads cloud platform engineering teams dedicated to building resilient and scalable systems. She is at the forefront of delivering AI solutions to enhance cloud platform infrastructure and observability. She currently serves as the IEEE Chair for the Women in Engineering Affinity Group for the Santa Clara Valley Section.
With over 18 years of experience, Shweta has made significant contributions to advancing public cloud, storage, and virtualization technologies, resulting in multiple patents and publications highlighting her innovative impact. Her leadership has earned her industry recognition, including being named among the “Top 50 Women of Impact in 2025” by Women Impact Tech and one of the “30 Outstanding Women in Engineering Leadership” by the Women Tech Network for her substantial contributions to the field.
Shweta is passionate about promoting women in engineering fields. As a thought leader, she addresses the challenges of balancing career and parenthood in tech, sharing her insights through articles and panel discussions. Through her mentorship, advocacy, and leadership, Shweta Behere inspires the next generation of technologists, especially women, and actively cultivates a more inclusive and supportive environment.
IEEE Women in Engineering
Website: https://wie.ieee.org/
Please visit our website for more information.
Women in AI
Website: https://www.womeninai.co/
Women in AI (WAI) is a nonprofit do-tank working towards inclusive AI that benefits global society. We are a community-driven initiative bringing empowerment, knowledge and active collaboration via education, research, events, and blogging.
At Women in AI we empower women and minorities to become AI & Data experts, innovators and leaders. We encourage ethical applications and responsible use of artificial intelligence.
Mission: Women in AI is a community working towards shaping inclusive AI for our common future.
-
Wednesday, 10 Sep, 2025
1:00 PM
Judges
Rashmi Gopinath
General Partner
BAM Corner Point
Rashmi Gopinath is a General Partner at B Capital Group where she leads the fund’s enterprise software practice in cloud infrastructure, cybersecurity, devops, and AI/ML sectors. She brings over two decades of experience investing and operating in cutting-edge enterprise technologies. She led B Capital’s investments in over 24 companies such as DataRobot, FalconX, Clari, Phenom People, Synack, Innovaccer, Labelbox, Fabric, 6Sense, Highspot, Pendo, Starburst, OwnBackup, Figment, Perimeter81, Zesty, among others.
Rashmi was previously a Managing Director at M12, Microsoft’s venture fund, where she led investments globally in enterprise software and sat on several boards including Synack, Innovaccer, Contrast Security, Frame, UnravelData, Incorta, among others.
Prior to M12, Rashmi was an Investment Director with Intel Capital where she was involved in the firm’s investments in startups including MongoDB (Nasdaq: MDB), ForeScout (Nasdaq: FSCT), Maginatics (acq. by EMC), BlueData (acq. by HPE), among others. Rashmi held operating roles at high-growth startups such as BlueData (acq. by HPE) and Couchbase (Nasdaq: BASE) where she led global business development, product and marketing roles. She began her career in engineering and product roles at Oracle and GE Healthcare. She earned an M.B.A. from Northwestern University, and a B.S. in Electrical Engineering from University of Mumbai in India.
2:00 PM
Inferencing Innovations
Moshe Tanach
Co-Founder & CEO
NeuReality
Moshe Tanach is Founder and CEO at NeuReality.
Before founding NeuReality, he served as Director of Engineering at Marvell and Intel, leading complex wireless and networking products to mass production.
He also served as Appointed Vice President of R&D at DesignArt-Networks (later acquired by Qualcomm) developing 4G base station products.
He holds a Bachelor of Science in Electrical Engineering (BSEE), cum laude, from the Technion, Israel.
NeuReality
Website: https://www.neureality.ai/
Founded in 2019, NeuReality Ltd. is an AI technology innovation company creating purpose-built AI-platforms for ultra-scalability of real-life AI applications. As artificial intelligence becomes part of the fabric of daily life, deploying AI at scale is one of the biggest technical challenges of the computing industry. NeuReality is the first complete, system-level solution specifically designed to address the challenges of optimizing, deploying, managing, and scaling AI workloads. NeuReality’s revolutionary AI-centric solution – built around the NR1 Network Attached Processing Unit (NAPU) and inference serving building blocks in software – is designed specifically for AI workloads, enabling the scale of real-life AI Applications.
With the NR1 NAPU, our solution reduces the dependencies on CPUs, NICs and PCI-switches, moving simple but critical data-path functions from software to hardware.
With the automatic software tools and SDK, we provide a toolchain, runtime, orchestration, MLOps integration, and a simple UX for data scientists and DevOps to remove barriers in deploying AI.
The result is an innovative AI-optimized system that reduces energy consumption, shrinks the data center footprint, and lowers TCO by cutting capital costs.
It’s built from the ground up to enable ultra-scalability.
Download the latest case study here: https://www.neureality.ai/casestudies/ai-linear-scalability/
2:15 PM
Inferencing Innovations
Axelera
Website: https://www.axelera.ai/
Axelera AI is delivering the world’s most powerful and advanced solutions for AI at the Edge. Its industry-defining Metis™ AI platform – a complete hardware and software solution for AI inference at the edge – makes computer vision applications more accessible, powerful and user friendly than ever before. Based in the AI Innovation Center of the High Tech Campus in Eindhoven, The Netherlands, Axelera AI has R&D offices in Belgium, Switzerland, Italy and the UK, with over 170 employees in 18 countries. Its team of experts in AI software and hardware come from top AI firms and Fortune 500 companies.
For more information on Axelera AI, see: www.axelera.ai
3:00 PM
IP & EDA
Arm
Website: https://www.arm.com/markets/artificial-intelligence
Arm is the industry’s highest-performing and most power-efficient compute platform with unmatched scale that touches 100 percent of the connected global population. To meet the insatiable demand for compute, Arm is delivering advanced solutions that allow the world’s leading technology companies to unleash the unprecedented experiences and capabilities of AI. Together with the world’s largest computing ecosystem and 20 million software developers, we are building the future of AI on Arm.
3:15 PM
IP & EDA
Semiconductor development faces increasing complexity, faster timelines, and fierce competition, exposing the limitations of traditional EDA tools. In response, AI Agents, powered by LLMs and advanced algorithms, are emerging as next-gen solutions. This session explores how these agents surpass conventional automation by independently managing tasks like hardware modeling, constraint solving, debugging, testbench creation, and design optimization. We'll cover real-world use cases showing how AI Agents deliver improved productivity, design quality, and time-to-market, including their ability to autonomously detect bugs and optimize RTL designs.
Mehir Arora
Founding Engineer
ChipAgents
Mehir Arora is a founding engineer at ChipAgents, a company at the forefront of integrating agentic AI into Electronic Design Automation (EDA) workflows. A graduate of UC Santa Barbara, Mehir has contributed to advancing the state of the art in AI methodologies, including a paper presented at ICML 2024. At ChipAgents, he focuses on developing agentic AI tools that enhance chip design and verification processes, aiming to significantly improve efficiency and productivity in semiconductor engineering.
3:30 PM
IP & EDA
Siemens
Website: https://eda.sw.siemens.com/en-US/
Siemens is rapidly expanding its number one position in digital industrial software to enable customers to swiftly advance their digital transformation. With the unmatched breadth and depth of the Siemens Xcelerator portfolio, Siemens is enabling customers to leverage AI to break down the barriers between their electrical, mechanical, and software design silos, and between their design, manufacturing, and ERP silos, across all markets. It enables customers to bridge the virtual and physical design and manufacturing worlds, empowering them to rapidly evolve from systems companies into market-leading ecosystems companies so they can leverage AI to engineer a smarter future faster and deliver smart products, factories, infrastructure, and cities for us all.
3:45 PM
IP & EDA
Moderator
Richard Curtin
Managing Partner
Silicon Catalyst
Silicon Catalyst
Website: https://siliconcatalyst.com/
Please visit our website for more information.
4:15 PM
AI Data Center
Flexnode
Website: https://flexnode.io/
Flexnode designs, builds, and operates high-density, rapidly deployed, scalable modular data centers purpose-built for today’s and tomorrow’s most complex IT projects. Supporting 150kW+ racks, Flexnode’s data centers are AI-ready, specifically designed for today’s leading chipsets (e.g., NVIDIA GB200/B300, AMD MI355X) and future developments. Flexnode works closely with its clients to configure modular solutions to meet their exact needs. In addition, Flexnode works closely with all major hardware, cooling, and networking providers, giving its clients optionality while also ensuring solutions are precisely configured to optimize their IT initiatives. Flexnode’s modular design enables deployments from 1MW to 300MW+ and reduces design-build time and construction material waste, while delivering industry-leading rack densities, PUE and resiliency. With densities increasing significantly, Flexnode’s modular design is a fast, reliable, and cost-effective way to deploy both today’s and tomorrow’s chipsets without sacrificing capex, power, space, or operability. In collaboration with best-in-class data center design, engineering, and construction partners, Flexnode delivers on projects ranging from 50kW to 1MW racks, to underpin vital AI initiatives globally.
4:50 PM
AI Data Center
Cadence
Website: https://www.cadence.com/en_US/home.html
Cadence’s goal is to empower engineers at semiconductor and systems companies to create innovative, intelligent, and highly differentiated electronic products that transform the way people live, work, and play. The company’s Intelligent System Design strategy helps customers develop differentiated products—from chips to boards to systems—in AI, IoT, mobile, 5G, consumer, cloud, data center, automotive, aerospace, and other market segments. Cadence offers specialized IP with industry-leading performance, power efficiency, and interconnects, as well as AI-specific verification and implementation solutions. The company also employs machine learning within its tools and solutions to enable the best power, performance, and area (PPA), quality of results (QoR), and time-to-market (TTM) benefits.
5:30 PM
Agentic AI Innovations
Large language models can now power capable software agents, yet real-world success comes from disciplined engineering rather than flashy frameworks. Most reliable agents are built from simple, composable patterns instead of heavy abstractions.
The talk will introduce patterns for adding complexity and autonomy only when it pays off. Attendees should leave with a practical decision framework for escalating from a single prompt to multi-step agents, along with guardrails for shipping trustworthy, cost-effective agents at scale.
Sushant Mehta
Research Engineer
Google DeepMind