Dwith Chenna, MTS Product Engineer for AI Inference at AMD, presents the “Quantization Techniques for Efficient Deployment of Large Language Models: A Comprehensive Review” tutorial at the May 2025 Embedded Vision Summit. The deployment of large language models (LLMs) in resource-constrained environments is challenging due to the significant computational and…