Introducing Cortex-A320: Ultra-efficient Armv9 CPU Optimized for IoT – Arm Newsroom

In today’s evolving IoT landscape, where software complexity continues to increase, edge devices require more performance, efficiency, and security than ever. The Arm Cortex-A portfolio meets this demand by bringing advanced computing capabilities to power-constrained devices, delivering enhanced AI processing, robust security and optimized efficiency across diverse markets. The Cortex-A3xx series specifically delivers ultra-efficient solutions and optimized performance across various market segments, including consumer devices and cloud services. More importantly, it provides a powerful and scalable solution for the rapidly growing and highly diverse IoT market, making it particularly ideal for edge AI applications.
Edge AI requires increasingly higher compute performance, stronger security, and greater software flexibility. As software complexity grows, the Armv9 architecture has been introduced to provide advanced machine learning (ML) and AI capabilities, along with enhanced security features. This leading-edge architecture is now deployed in the ultra-efficient Cortex-A3xx tier, providing a robust foundation for next-generation edge AI applications.
Today, Arm introduces the Cortex-A320, the first ultra-efficient Cortex-A processor implementing the Armv9 architecture. Cortex-A320 is an AArch64 CPU, based on the Armv9.2-A version of the architecture. Its microarchitecture is derived from Cortex-A520 but has been significantly optimized to improve area and power.
Efficiency improvements of over 50% compared to the Cortex-A520 are achieved through multiple microarchitecture updates. These include a narrow fetch and decode datapath, densely banked L1 caches, a reduced-port integer register file, and other optimizations.
Significant microarchitecture innovations, such as efficient branch predictors and prefetchers, together with memory system improvements, have also boosted Cortex-A320’s scalar performance by more than 30% in SPECINT2K6 compared to its predecessor, Cortex-A35.
Most importantly, by integrating the Armv9 enhancements in the NEON and Scalable Vector Extension 2 (SVE2) vector processing technologies, Cortex-A320 delivers a 10x ML processing uplift compared to Cortex-A35, as measured in int8 General Matrix Multiplication (GEMM). With support for new data types, like BF16, and new dot product and matrix multiplication instructions, Cortex-A320 achieves up to 6x higher ML performance than Cortex-A53, the world’s most popular Armv8-A CPU.
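To make the int8 GEMM measurement more concrete, here is a minimal Python sketch of the computation itself. It models the general idea behind a 4-way dot-product step, where four int8 products are summed into a wider accumulator. The function names `dot4_accumulate` and `gemm_int8` are illustrative assumptions, not Arm APIs, and the sketch says nothing about how Cortex-A320 actually schedules or executes these operations.

```python
def dot4_accumulate(acc, a4, b4):
    """Model one 4-way int8 dot-product step: acc += sum(a4[i] * b4[i])."""
    assert len(a4) == len(b4) == 4
    for value in (*a4, *b4):
        assert -128 <= value <= 127, "operands must fit in int8"
    return acc + sum(x * y for x, y in zip(a4, b4))


def gemm_int8(a, b):
    """Compute C = A @ B for int8 inputs with a wide accumulator.

    a is M x K and b is K x N (lists of lists).
    For simplicity this sketch requires K to be a multiple of 4.
    """
    m, k, n = len(a), len(b), len(b[0])
    assert all(len(row) == k for row in a) and k % 4 == 0
    c = [[0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0
            # Walk the shared dimension four elements at a time.
            for p in range(0, k, 4):
                a4 = a[i][p:p + 4]
                b4 = [b[p + q][j] for q in range(4)]
                acc = dot4_accumulate(acc, a4, b4)
            c[i][j] = acc
    return c


a = [[1, 2, 3, 4], [5, 6, 7, 8]]
b = [[1, 0], [0, 1], [1, 0], [0, 1]]
print(gemm_int8(a, b))  # [[4, 6], [12, 14]]
```

The point of the sketch is that the inner loop collapses four multiply-accumulates into one step, which is the kind of work that dedicated dot product instructions are designed to speed up.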
The significant improvements in ML capabilities, combined with the high area and energy efficiency, qualify Cortex-A320 as the most efficient core in ML applications across all Arm Cortex-A CPUs.
Cortex-A320 also brings a multifold ML performance increase over the Arm Cortex-M processors: for example, up to 8x higher GEMM performance compared to Cortex-M85, the highest performing Cortex-M CPU. This performance boost isn’t just due to the Armv9 enhancements in AI processing; it also stems from significantly improved memory access performance and increased frequencies in the Cortex-A320.
Additionally, its A-profile architecture, multi-core execution, and flexible memory management make Cortex-A320 a suitable candidate for designs that need to scale performance beyond high-performance Cortex-M microcontrollers.
Cortex-A320 is a single-issue, in-order CPU with a 32-bit instruction fetch, implementing an optimized 8-stage pipeline with a compact forwarding network, to achieve higher frequency points than Cortex-A520.
Cortex-A320 offers scalability within a cluster by supporting single-core to quad-core configurations. It features DSU-120T, a streamlined DynamIQ Shared Unit (DSU), which enables Cortex-A320-only clusters. DSU-120T is a minimal DSU implementation, significantly reducing complexity, area, and power consumption, thereby maximizing efficiency for low-end Cortex-A based designs. 
Cortex-A320 supports up to 64KB L1 caches and up to 512KB of L2 cache, and it has a 256-bit AMBA5 AXI interface to external memory. The L2 cache and the L2 TLB can be shared between the Cortex-A320 CPUs. The vector processing unit, which implements the NEON and SVE2 SIMD (Single Instruction, Multiple Data) technologies, can be either private in a single-core complex or shared between two cores in a dual-core or quad-core implementation.
Cortex-A320 ensures compatibility with edge and infrastructure devices, while delivering efficiency and scalability. It benefits from the extensive open-source Linux support, a robust security ecosystem, and – more importantly – key Armv9 architecture advancements.
Apart from the ML improvements through the updates in the NEON and SVE2 vector processing technologies, the Armv9 architecture brings significant enhancements to security, which is key to any IoT and embedded system. Cortex-A320 brings important security features to the ultra-efficient Cortex-A tier, like the Memory Tagging Extension (MTE), which provides enhanced memory safety, as well as Pointer Authentication (PAC) and Branch Target Identification (BTI), which mitigate jump-oriented and return-oriented programming attacks.
One of the key Armv9 features adopted by the Cortex-A320 is Secure EL2 (Exception Level 2). Secure EL2 enhances software isolation in TrustZone, facilitating the secure execution of software containers on edge devices. For more details, visit the Secure Virtualization page.
The Cortex-A320 leverages all these benefits across a wide range of applications, from low-end general purpose MPUs, smart speakers, and software-defined smart cameras, to factory floor autonomous vehicles, automated edge AI assistants, AI-enabled human-machine interfaces, and utility robot controllers. Apart from edge AI applications, other key market segments are also benefitting from Cortex-A320, like smartwatches and smart wearables, as well as infrastructure devices, such as Baseboard Management Controllers (BMC) for servers.
Cortex-A320 can also be an ideal fit for applications where a high-performance Cortex-M is traditionally used, such as battery-operated MCU use-cases, or for applications that run a real-time operating system (RTOS) but need to scale up performance through symmetric multiprocessing (SMP), which is supported out of the box in the A-profile architecture.
It can also be a suitable candidate for RTOS applications that require Cortex-A memory management or address translation features, for enhanced software flexibility. For example, Cortex-A320 can be appropriate for use-cases that require downloading apps onto an MCU device, where a memory management unit (MMU) is needed to relocate code across the memory map.
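The role of address translation in code relocation can be illustrated with a toy model. The Python sketch below assumes a single-level page table with 4 KiB pages. That is a deliberate simplification, not the multi-level translation scheme of the A-profile architecture, and the names `map_region` and `translate`, as well as all addresses, are hypothetical. It shows two apps linked at the same virtual address but loaded at different physical addresses.

```python
PAGE_SIZE = 4096  # assumed page size for this toy model


def map_region(page_table, virt_base, phys_base, size):
    """Map `size` bytes starting at virt_base onto physical memory at phys_base."""
    assert virt_base % PAGE_SIZE == 0 and phys_base % PAGE_SIZE == 0
    pages = (size + PAGE_SIZE - 1) // PAGE_SIZE  # round up to whole pages
    for i in range(pages):
        page_table[virt_base // PAGE_SIZE + i] = phys_base // PAGE_SIZE + i


def translate(page_table, virt_addr):
    """Translate a virtual address to a physical address, or raise if unmapped."""
    vpn, offset = divmod(virt_addr, PAGE_SIZE)
    if vpn not in page_table:
        raise LookupError(f"no mapping for virtual address {virt_addr:#x}")
    return page_table[vpn] * PAGE_SIZE + offset


# Two downloaded apps, both linked to run at the same virtual address,
# but placed wherever free physical memory happened to be available.
app_a, app_b = {}, {}
map_region(app_a, 0x0040_0000, 0x8010_0000, 8192)
map_region(app_b, 0x0040_0000, 0x8020_0000, 8192)

print(hex(translate(app_a, 0x0040_0010)))  # 0x80100010
print(hex(translate(app_b, 0x0040_0010)))  # 0x80200010
```

Because each app sees its own mapping, the binary does not need to be rewritten for the physical location it ends up in.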
At the same time, due to the wider addressing space, Cortex-A320 can be an efficient solution for heterogeneous multicore use cases which combine a big Cortex-A with a microcontroller class core. Cortex-A320 enables Arm’s partners to use a small architecturally compatible core alongside the bigger Cortex-A processor, so that the memory architecture is simplified.
On the other hand, thanks to its A-profile characteristics, Cortex-A320 can provide out of the box Linux support and enable software portability for Android or any existing rich operating system. Cortex-A320 brings unprecedented levels of flexibility, to target multiple market segments, applications and operating systems.

Our latest Ethos-U85 NPU is designed to tolerate the higher latency memories generally found in Cortex-A based systems and works well with the Cortex-A320.
The Ethos-U85 driver has now been updated so that Ethos-U85 can be driven directly by a Cortex-A320, without the need for a Cortex-M based ML island. This update improves latency and allows Arm partners to remove the cost and complexity of using a Cortex-M to drive the NPU.
Moreover, the memory access performance and enhanced memory system of Cortex-A320 allow the execution of larger ML models, such as large language models (LLMs) with more than 1 billion parameters, which cannot run effectively on Cortex-M based systems due to their limited addressable memory space.
Ethos-U NPUs work with quantized datatypes to meet the cost and energy requirements of the most constrained edge AI use-cases. Any ML operators and datatypes that are not supported by the Ethos-U85 will fall back automatically to the Cortex-A320, exploiting the Neon/SVE2 engine for acceleration.
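As a rough illustration of why model size matters here, the sketch below estimates the storage needed for the weights of a 1-billion-parameter model at a few bit widths. The bit widths and the comparison against a full 32-bit (4 GiB) address space are assumptions chosen for illustration. Activations, runtime buffers, code, and peripherals would need space on top of this, and the helper name `weight_footprint_bytes` is hypothetical.

```python
def weight_footprint_bytes(num_params: int, bits_per_weight: int) -> int:
    """Bytes needed to store the weights alone, rounded up to whole bytes."""
    return (num_params * bits_per_weight + 7) // 8


ADDRESS_SPACE_32BIT = 2 ** 32  # 4 GiB, assumed stand-in for a 32-bit memory map
NUM_PARAMS = 1_000_000_000     # the "more than 1 billion parameters" case

for bits in (4, 8, 16):
    size = weight_footprint_bytes(NUM_PARAMS, bits)
    share = size / ADDRESS_SPACE_32BIT
    print(f"{bits:>2}-bit weights: {size / 2**30:.2f} GiB "
          f"({share:.0%} of a 32-bit address space)")
```

Even before anything else is placed in memory, the weights alone take up a large share of what a 32-bit memory map can address.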
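The fallback behavior can be pictured as a simple placement decision per operator. The Python sketch below is only a conceptual model: the `NPU_SUPPORTED` set is invented for illustration and is not the real Ethos-U85 operator list, and `assign_backends` is a hypothetical helper, not part of the Ethos-U driver or any Arm tool.

```python
# Hypothetical (operator, datatype) pairs the NPU can run.
# This is an invented example, NOT the real Ethos-U85 support matrix.
NPU_SUPPORTED = {
    ("conv2d", "int8"),
    ("depthwise_conv2d", "int8"),
    ("fully_connected", "int8"),
    ("add", "int8"),
}


def assign_backends(graph):
    """Place each operator on the NPU if supported, otherwise on the CPU."""
    placement = []
    for name, kind, dtype in graph:
        backend = "npu" if (kind, dtype) in NPU_SUPPORTED else "cpu"
        placement.append((name, backend))
    return placement


# A made-up model: (operator name, operator kind, datatype).
model = [
    ("conv1", "conv2d", "int8"),
    ("act1", "gelu", "fp16"),
    ("fc1", "fully_connected", "int8"),
    ("norm", "layer_norm", "bf16"),
]

for name, backend in assign_backends(model):
    print(f"{name}: {backend}")
```

In this toy model, the quantized convolution and fully connected layers go to the NPU, while the operators with unsupported kinds or datatypes are left to the CPU's vector engine.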
Due to the significant ML improvements in the Armv9 architecture, a quad-core Cortex-A320 running at 2GHz can deliver up to 256 GOPS on 8-bit multiply-accumulate (MAC) operations. As a result, Cortex-A320 can run advanced ML and AI use-cases directly on the CPU, without the need for an external accelerator. This can save system area, power, and complexity for devices targeting a wide range of ML and AI applications that need up to 0.25 TOPS.
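The quoted throughput figure can be broken down into a per-core, per-cycle number with some simple arithmetic. The sketch below uses the three values from the text (256 GOPS, four cores, 2GHz). The one added assumption, which the text does not state, is the common convention that one MAC counts as two operations (a multiply and an add).

```python
def ops_per_cycle_per_core(total_gops: float, cores: int, freq_ghz: float) -> float:
    """Operations per cycle per core implied by a cluster-level GOPS figure."""
    return total_gops / (cores * freq_ghz)


# Figures quoted in the text.
TOTAL_GOPS = 256.0
CORES = 4
FREQ_GHZ = 2.0

# Assumption (not stated in the text): one MAC counts as two operations.
OPS_PER_MAC = 2

ops = ops_per_cycle_per_core(TOTAL_GOPS, CORES, FREQ_GHZ)
macs = ops / OPS_PER_MAC
tops = TOTAL_GOPS / 1000.0

print(f"{ops:g} ops/cycle/core, {macs:g} MACs/cycle/core, {tops:g} TOPS")
```

Under that convention, the cluster figure works out to 32 operations, or 16 8-bit MACs, per cycle per core, and 256 GOPS corresponds to roughly 0.25 TOPS.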
Bringing Armv9 security and unprecedented AI performance levels into the ultra-efficient Cortex-A tier, Cortex-A320 offers new possibilities to software developers to develop and deploy ever more demanding use-cases, opening a new era for edge AI devices. By combining the A-profile architecture and the software ecosystem around it, with efficiency and flexibility, Cortex-A320 brings scalability and versatility to target multiple markets in IoT and beyond.

Any re-use permitted for informational and non-commercial or personal use only.