Designing Energy Efficient Accelerators Through Neural Architecture Search

Ziyu Yan

doi:10.54097/89k4tv16

Authors

Ziyu Yan

DOI:

https://doi.org/10.54097/89k4tv16

Keywords:

Neural architecture search, Energy-efficient accelerators, Hardware-aware co-design, Processing element array, Dataflow optimization, Deep neural networks

Abstract

The exponential growth of deep learning workloads in embedded and edge computing environments has placed extraordinary demands on hardware efficiency, compelling researchers to seek principled methods for designing accelerators that simultaneously maximize computational throughput and minimize energy consumption. Neural architecture search (NAS) has emerged as a transformative paradigm for automating the discovery of model architectures, and its extension into hardware-aware co-design domains opens a compelling pathway toward accelerators that are not only functionally accurate but also energy-optimal. This paper presents a comprehensive framework for designing energy-efficient accelerators through hardware-aware NAS, integrating multi-objective optimization, dataflow-level energy estimation, and differentiable search strategies to navigate a joint hardware-software design space. We propose a co-exploration methodology that simultaneously optimizes the network topology, processing element (PE) array configuration, and memory hierarchy by incorporating energy and latency proxies directly into the search reward function. Experiments conducted on standard image classification benchmarks demonstrate that the proposed framework achieves accuracy competitive with manually designed architectures while reducing system-level energy consumption by up to 43% compared to baseline accelerator configurations. Our results further validate that dataflow-aware hardware parameterization yields substantially more energy-efficient accelerators than platform-agnostic search strategies, and that the use of analytical energy models as differentiable objectives enables scalable search without costly hardware prototyping.

Downloads

Download data is not yet available.

References

[1] Elsken, T., Metzen, J. H., & Hutter, F. (2019). Neural architecture search: A survey. Journal of Machine Learning Research, 20(55), 1-21.

[2] Strubell, E., Ganesh, A., & McCallum, A. (2019, July). Energy and policy considerations for deep learning in NLP. In Proceedings of the 57th annual meeting of the association for computational linguistics (pp. 3645-3650).

[3] Sze, V., Chen, Y. H., Yang, T. J., & Emer, J. S. (2020). Efficient processing of deep neural networks (Vol. 51). San Rafael: Morgan & Claypool Publishers.

[4] Li, P., Liu, J., & Qiu, L. (2026). Deep Learning Methods for Demand Forecasting and Inventory Optimization in Modern Supply Chains. Asian Business Research Journal, 11(3), 21-29.

[5] Qiu, L. (2025). Reinforcement Learning Approaches for Intelligent Control of Smart Building Energy Systems with Real-Time Adaptation to Occupant Behavior and Weather Conditions. Journal of Computing and Electronic Information Management, 18(2), 32-37.

[6] Zhang, H. (2025). Reinforcement Learning Approaches for Layout Optimization in Electronic Design Automation with Electromagnetic Compatibility Constraints. Frontiers in Robotics and Automation, 2(2), 77-93.

[7] Shen, Z., Zhao, W., Wang, B., Wang, Z., & Shang, W. (2026). CAGR: A Cross-Accelerator Graph Optimization Framework for Efficient Recommender System Inference. IEEE Access.

[8] Sun, T., Wang, M., & Han, X. (2025). Deep Learning in Insurance Fraud Detection: Techniques, Datasets, and Emerging Trends. Journal of Banking and Financial Dynamics, 9(8), 1-11.

[9] Liu, J., Li, P., & Wang, Y. (2026). Graph Neural Networks for Modeling Complex Dependencies in Global Supply Chain Networks. Journal of Computing and Electronic Information Management, 20(3), 9-20.

[10] Zhang, F., & Wu, B. (2025). Large Language Models as General Purpose Intelligence Systems for Reasoning, Planning and Decision Making. American Journal of Artificial Intelligence and Neural Networks, 6(4), 45-72.

[11] Li, P., Ren, S., Zhang, Q., Wang, X., & Liu, Y. (2024). Think4SCND: Reinforcement learning with thinking model for dynamic supply chain network design. IEEE Access, 12, 195974-195985.

[12] Zhang, F., & Yang, J. S. (2025). Learning Driven Decision Intelligence for Autonomous Driving Through Multimodal Understanding World Modeling and Policy Optimization. Frontiers in Artificial Intelligence Research, 2(3), 616-634.

[13] Wang, B., Wang, Z., Zhao, W., & Liu, Y. (2025). Network Fabric Simulation and Validation for Data Center Routing Convergence Under Large-Scale Failure Scenarios. Computer Science Bulletin, 8(01), 310-326.

[14] Chu, X., Zhou, T., Zhang, B., & Li, J. (2020, August). Fair darts: Eliminating unfair advantages in differentiable architecture search. In European conference on computer vision (pp. 465-480). Cham: Springer International Publishing.

[15] Cai, H., Wang, T., Wu, Z., Wang, K., Lin, J., & Han, S. (2019). On-device image classification with proxyless neural architecture search and quantization-aware fine-tuning. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (pp. 0-0).

[16] Parashar, A., Raina, P., Shao, Y. S., Chen, Y. H., Ying, V. A., Mukkara, A., ... & Emer, J. (2019, March). Timeloop: A systematic approach to dnn accelerator evaluation. In 2019 IEEE international symposium on performance analysis of systems and software (ISPASS) (pp. 304-315). IEEE.

[17] Lu, Z., Whalen, I., Boddeti, V., Dhebar, Y., Deb, K., Goodman, E., & Banzhaf, W. (2019, July). Nsga-net: neural architecture search using multi-objective genetic algorithm. In Proceedings of the genetic and evolutionary computation conference (pp. 419-427).

[18] Liu, J., Wang, J., Chen, H., Guinness, J., Martin, R., & Kulkarni, C. S. (2019). Optimal Level Crossing Predictions for Electronic Prognostics. In AIAA Scitech 2019 Forum (p. 1962).

[19] Chen, J., Cui, Y., Zhang, X., Yang, J., & Zhou, M. (2024). Temporal convolutional network for carbon tax projection: A data-driven approach. Applied Sciences, 14(20), 9213.

[20] Wei, Z., Sun, T., & Zhou, M. (2024). LIRL: Latent Imagination-Based Reinforcement Learning for Efficient Coverage Path Planning. Symmetry, 16(11), 1537.

[21] Zhang, S., Qiu, L., & Zeng, Z. (2026). Physics-Data Synergy in Structural Health Monitoring: A Multi-Scale Graph Contrastive Framework With Temperature-Adaptive Fusion. IEEE Access.

[22] Zeng, Z., Lin, H., Zhang, S., & Wang, B. (2026). Adaptive Robust Watermarking for Large Language Models via Dynamic Token Embedding Perturbation. IEEE Access, 14, 9319-9339.

[23] Qiu, L. (2025). Multi-Agent Reinforcement Learning for Coordinated Smart Grid and Building Energy Management Across Urban Communities. Computer Life, 13(3), 8-15.

[24] Zhao, W., Chen, T., Yang, J. S., & Qiu, L. (2026). AutoML-Pipeline: A RAG-enhanced code generation framework with pre-validation for cloud-native machine learning workflows. IEEE Access.

[25] Yang, Y., & Yang, J. (2026). Synthetic Data Meets Finance: Generative Models for Privacy Preserving Analytics. Journal of Banking and Financial Dynamics, 10(4), 1-8.

[26] Wang, Z., Shen, Z., Wang, B., & Shang, W. (2025). Modernizing Enterprise Analytics through Low-Code Automation and Cloud-Native Data Architectures. Asian Business Research Journal, 10(12), 20-33.

[27] Zhao, X., Sun, T., Ren, S., Yang, J., & Liu, Y. (2025). RAG-Based AI Agents for Enterprise Software Development: Implementation Patterns and Production Deployment. Frontiers in Artificial Intelligence Research, 2(3), 501-520.

[28] Dong, X., & Yang, Y. (2020). Nas-bench-201: Extending the scope of reproducible neural architecture search. arXiv preprint arXiv:2001.00326.

[29] Dong, X., Liu, L., Musial, K., & Gabrys, B. (2021). Nats-bench: Benchmarking nas algorithms for architecture topology and size. IEEE transactions on pattern analysis and machine intelligence, 44(7), 3634-3646.

[30] Chitty-Venkata, K. T., & Somani, A. K. (2022). Neural architecture search survey: A hardware perspective. ACM Computing Surveys, 55(4), 1-36.

[31] Svoboda, K., & Adegbija, T. (2025). Spiking Neural Network Architecture Search: A Survey. arXiv preprint arXiv:2510.14235.

[32] Wen, W., Liu, H., Chen, Y., Li, H., Bender, G., & Kindermans, P. J. (2020, August). Neural predictor for neural architecture search. In European conference on computer vision (pp. 660-676). Cham: Springer International Publishing.

Designing Energy Efficient Accelerators Through Neural Architecture Search

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

Cover

Indexing & Abstracting