Build a Text-to-Image Generator (from Scratch), 9781633435421
Hardcover
Build your own AI image generator, from code to creativity.

Build a Text-to-Image Generator (from Scratch)

With Transformers and Diffusions

$124.98

  • Hardcover

    360 pages

  • Release Date

    23 January 2026

Summary

AI images flood feeds, yet the models behind them feel mysterious. Relying on black boxes risks bias, errors, and costly creative dead ends. You deserve hands-on skills to build, audit, and improve these generators yourself. This book starts from a blank notebook, guiding every line of Python code. Learn transformers for vision, then craft diffusion models that sharpen noise into art. Finish with a custom system generating high-resolution images from any text prompt.

  • Vis…

Book Details

ISBN-13: 9781633435421
ISBN-10: 1633435423
Author: Mark Liu
Publisher: Manning Publications
Imprint: Manning Publications
Format: Hardcover
Number of Pages: 360
Release Date: 23 January 2026
Weight: 639 g
Dimensions: 235 mm x 190 mm x 20 mm

What They're Saying

Critics Review

  • This book stands out for its hands-on, no-fluff approach to text-to-image generation, perfect for practitioners who want to build rather than just theorize. The clear PyTorch implementations, Colab-friendly examples, and practical exercises make even advanced concepts like Diffusion Models feel achievable.

    Simeon Leyzerzon, President, Excelsior Software Ltd.

  • This book is a great hands-on intro to how text-to-image models like Stable Diffusion actually work under the hood. It explains the roles of transformers, VAEs, and denoising U-Nets in a super approachable way, with lots of code you can run yourself. If you’re curious about generative AI and want to build or tweak your own models, this is a solid place to start.

    Ravikumar Sanapala, Product Manager, Reality Labs, Meta

About The Author

Mark Liu

Mark Liu is a professor and program director known for translating cutting-edge AI into practical curricula. With years mentoring graduate students and professionals, Mark brings clarity, rigor, and enthusiasm to every page. He distills deep generative-model expertise into step-by-step guidance that empowers readers to build powerful visual AI systems.

Returns

This item is eligible for free returns within 30 days of delivery. See our returns policy for further details.