Tejas Chopra is a Sr. Engineer at Netflix working on Machine Learning Platform for Netflix Studios and a Founder at GoEB1 which is the world’s first and only thought leadership platform for immigrants.
Tejas is a recipient of the prestigious EB1A (Einstein) visa in US. Tejas is a Tech 40 under 40 Award winner, a 2xTEDx speaker, a BCS Fellow, a Senior IEEE Member, an ACM member, and has spoken at conferences and panels on Cloud Computing, Blockchain, Software Development and Engineering Leadership.
Tejas has been awarded the ‘International Achievers Award, 2023’ by the Indian Achievers’ Forum. He is an Adjunct Professor for Software Development at University of Advancing Technology, Arizona, an Angel investor and a Startup Advisor to startups like Nillion. He is also a member of the Advisory Board for Flash Memory Summit.
Tejas’ experience has been in companies like Box, Apple, Samsung, Cadence, and Datrium. Tejas holds a Masters Degree in ECE from Carnegie Mellon University, Pittsburgh.
Day 1: Jun 4, 2025
4:30 pm
4:30 pm
TRACK 1: PRACTICALWORKSHOP: MACHINE LEARNING
Enhance memory for Large Language Models (LLMs), and adopt advanced strategies to minimize memory usage effectively. Gain practical techniques to:
- Uncover the typical memory footprint of core Machine Learning data structures and algorithms, and understand memory allocation and deallocation during model training.
- Apply memory-saving techniques such as data quantization, model pruning, and efficient mini-batch selection to conserve memory without compromising model performance.
- Address unique challenges of LLM memory demands during inference, exploring model architecture, input sequence length, and vocabulary size impacts on memory.
- Implement strategies like model distillation, dynamic memory allocation, and efficient caching to optimize memory usage during LLM inferencing.
Empower your organization to deploy ML models effectively while navigating the complexities of memory consumption in large-scale ML deployments
Day 2: Jun 5, 2025