“Evolving Inference Processor Software Stacks to Support LLMs,” a Presentation from Expedera

Ramteja Tadishetti, Principal Software Engineer at Expedera, presents the “Evolving Inference Processor Software Stacks to Support LLMs” tutorial at the May 2025 Embedded Vision Summit.

As large language models (LLMs) and vision-language models (VLMs) have quickly become important for edge applications ranging from smartphones to automobiles, chipmakers and IP providers have struggled to adapt their processor software stacks. In this talk, Tadishetti examines how edge processor software stacks have evolved from their original focus on CNNs to today’s support for a rapidly expanding range of diverse networks, including LLMs and VLMs.

He covers the difficulties that LLMs and VLMs present to a processor software stack, along with the challenges posed by the rapid introduction of new models with novel features, and explains the methods Expedera has implemented to mitigate these challenges. He also discusses potential future software evolutions that will further streamline the implementation of new models.

See here for a PDF of the slides.
