
AI Systems Performance Engineering
Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch
$215.54
- Paperback: 954 pages
- Release Date: 23 December 2025
Summary
Elevate your AI system performance capabilities with this definitive guide to maximizing efficiency across every layer of your AI infrastructure. In today’s era of ever-growing generative models, AI Systems Performance Engineering provides engineers, researchers, and developers with a hands-on set of actionable optimization strategies. Learn to co-optimize hardware, software, and algorithms to build resilient, scalable, and cost-effective AI systems that excel in both training and in…
Book Details
| ISBN-13: | 9798341627789 |
|---|---|
| Author: | Chris Fregly |
| Publisher: | O'Reilly Media |
| Imprint: | O'Reilly Media |
| Format: | Paperback |
| Number of Pages: | 954 |
| Release Date: | 23 December 2025 |
| Weight: | 1.79kg |
| Dimensions: | 177mm x 233mm |
About The Author
Chris Fregly
Chris Fregly is a performance engineer and AI product leader who has driven innovations at Netflix, Databricks, Amazon Web Services (AWS), and multiple startups. He has led performance-focused engineering teams that built AI/ML products, scaled go-to-market initiatives, and reduced costs for large-scale generative-AI and analytics workloads. Chris is co-author of the O’Reilly books Data Science on AWS and Generative AI on AWS, and creator of the O’Reilly course “High-Performance AI in Production with NVIDIA GPUs.” His work spans kernel-level tuning, compiler-driven acceleration, distributed training, and high-throughput inference.
Returns
This item is eligible for free returns within 30 days of delivery. See our returns policy for further details.




