Inference and AI Infrastructure (Special Event for #BosTechWeek)
This event is a part of #BosTechWeek—a week of events hosted by VCs and startups to bring together the tech ecosystem. Learn more at www.tech-week.com.
Abstract
The market focus has shifted from building LLMs and training to how to serve these models and efficient inference. In this meet up we are discussing the AI inference stack and how optimizing it is a multi dimensional problem, the role of hardware and software co-design and how applications are the only deliverable. We also deep dive into MLCommons Chakra, a framework for capturing AI application execution graphs.
Presenters
Venkat Pullela, CTO, AI Infrastructure, Keysight.
Tushar Krishna, Associate Professor, School of ECE, Georgia Tech.
Host: Minlan Yu, Harvard University.
RSVP
If you are interested in attending this talk, please RSVP here.