Expedera's Origin Evolution NPU IP Brings Generative AI to Edge Devices
Highlights
- Expedera launches its Origin Evolution™ NPU IP, bringing hardware acceleration to meet the computational demands of running LLMs on resource-constrained edge devices.
- New purpose-built hardware and software architecture runs LLMs and traditional neural networks with ultra-efficient PPA, providing fully scalable NPU IP solutions.
- Origin Evolution NPU IP solutions are suitable for applications ranging from smartphones to automotive to data centers and are available now.
Running LLM inference in edge hardware is crucial because it reduces latency and eliminates security concerns associated with cloud-based implementations. However, deploying LLMs in resource-constrained systems poses challenges due to their large model sizes and significant computational requirements. Consequently, edge designs require specialized hardware that can effectively address their unique resource constraints, including power, performance, area (PPA), latency, and memory requirements. Moreover, innovative software optimizations are essential, including model compression, hardware optimization, attention optimization, and the creation of dedicated frameworks to manage computational and energy constraints at the edge.
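To put the model-size challenge in concrete terms, the short Python sketch below estimates the memory needed just to hold a model's weights at several precisions. The round 1-billion parameter count and the bytes-per-parameter table are illustrative assumptions, not figures from Expedera, and the estimate ignores activations, KV cache, and runtime overhead.

```python
# Rough weight-memory estimate for an LLM at different precisions.
# Assumption: a round 1e9 parameters, chosen for illustration only.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_footprint_gib(num_params: int, precision: str) -> float:
    """Return the weight storage in GiB for the given parameter count."""
    return num_params * BYTES_PER_PARAM[precision] / 2**30


NUM_PARAMS = 1_000_000_000  # hypothetical 1B-parameter model

footprints = {p: weight_footprint_gib(NUM_PARAMS, p) for p in BYTES_PER_PARAM}

for precision, gib in footprints.items():
    print(f"{precision}: {gib:.2f} GiB")
```

Halving the precision halves the weight storage, which is why quantization and model compression feature so prominently in edge deployments.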
"Origin Evolution is a radical advancement providing an AI inference engine with out-of-the-box compatibility with popular LLM and CNN networks, that produces ideal results in applications as varied as smartphones, automobiles, and data centers," said
Scalable to 128 TFLOPS in a single core and to PetaFLOPS and beyond with multiple cores, Origin Evolution can be configured to produce optimal PPA results in a wide range of applications. Origin Evolution significantly reduces memory and system power needs while increasing processor utilization. Compared to alternative solutions, its packet-based processing reduces external memory moves by more than 75% for Llama 3.2 1B and Qwen2 1.5
Origin Evolution can support custom and 'black box' layers and networks, while offering out-of-the-box support for today's most popular networks, including Llama3, ChatGLM, DeepSeek, Qwen, MobileNet,
Origin Evolution allows users to implement existing trained models with no reduction in accuracy and no retraining requirements, with confidence in achieving ideal PPA. It uses Expedera's unique packet-based architecture to achieve unprecedented NPU efficiency. Packets, which are contiguous fragments of neural networks, overcome the hurdles of large memory movements and differing network layer sizes, both of which LLMs exacerbate. The architecture routes the packets through discrete processing blocks, including Feed Forward, Attention, and Vector, which accommodate the varying operations, data types, and precisions required when running LLM and CNN networks simultaneously or separately.
Origin Evolution includes a high-speed external memory streaming interface compatible with the latest DRAM and HBM standards. Complementing the hardware stack is an advanced software stack, featuring support for network representations from HuggingFace, Llama.cpp, TVM, and others. The software stack supports full integer and floating-point precisions (including mixed modes), layer fusion and fission, and centralized control of multiple cores within a chip, chiplet, or system.
For more information on Origin Evolution, visit https://www.expedera.com/blog/2025/05/20/expederas-origin-evolution-npu-ip-brings-generative-ai-to-edge-devices/.
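As a purely conceptual illustration of routing packets to discrete processing blocks, the Python sketch below sorts a few hypothetical packets into per-block queues. Only the block names (Feed Forward, Attention, Vector) come from the description above; the `Packet` fields, the operation-to-block mapping, and the `route` function are invented for illustration and do not represent Expedera's hardware or software interfaces.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical mapping from operation type to processing block.
# Block names follow the release; the operation names are assumptions.
BLOCK_FOR_OP = {
    "matmul": "feed_forward",
    "attention": "attention",
    "softmax": "vector",
    "layernorm": "vector",
}


@dataclass
class Packet:
    """A toy stand-in for a contiguous fragment of a network."""
    layer: str  # which part of the network the fragment came from
    op: str     # dominant operation in the fragment
    dtype: str  # precision the fragment runs at


def route(packets):
    """Group packets into one queue per processing block."""
    queues = defaultdict(list)
    for packet in packets:
        queues[BLOCK_FOR_OP[packet.op]].append(packet)
    return dict(queues)


packets = [
    Packet("block0.attn", "attention", "fp16"),
    Packet("block0.norm", "layernorm", "fp16"),
    Packet("block0.mlp", "matmul", "int8"),
]

queues = route(packets)
for block, items in queues.items():
    print(block, [p.layer for p in items])
```

The point of the sketch is only that fragments with different operations and precisions can be dispatched independently rather than processed as whole layers.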
About Expedera
Expedera provides customizable neural engine semiconductor IP that dramatically improves performance, power, and latency while reducing cost and complexity in edge and data center AI inference applications. Successfully deployed in tens of millions of devices, Expedera's Neural Processing Unit (NPU) solutions are scalable and produce superior results in applications ranging from edge nodes and smartphones to automotive and data center inference. The platform includes an easy-to-use software stack that allows the importing of trained networks, provides various quantization options, automatic completion, compilation, estimator, and profiling tools, and supports multi-job APIs. Headquartered in
Media Contact:
Paul Karazuba
+1 650-887-0815
[email protected]
View original content to download multimedia: https://www.prnewswire.com/news-releases/expederas-origin-evolution-npu-ip-brings-generative-ai-to-edge-devices-302459052.html
SOURCE Expedera, Inc.