Hardware software co-design of machine learning accelerators using univariate functions

Loading...
Thumbnail Image

Persistent link to this item

Statistics
View Statistics

Journal Title

Journal ISSN

Volume Title

Published Date

Publisher

Abstract

As neural networks grow in complexity, efficient hardware-software co-design is crucial for balancing performance and resource constraints, especially for edge devices and FPGA accelerators. This thesis investigates optimization techniques for hardware-awaretraining and model compression using univariate functions. First, we optimize hardware using a simple constant coefficient multiplier on Hybrid Binary-Unary (HBUNN) architecture, which offers variable hardware costs for constant coefficients. By applying regularization-based training, we reduce the full model hard- ware area cost by 59.3% while improving accuracy by 0.42% over L2 regularization. Using this optimized model, the fully parallel-pipeline HBUNN-based ResNet-18 has a reduced area cost by 29.6% compared to conventional binary architectures. Next, we introduce Personal Self-Attention (PSA), a novel method for learning non- linear univariate functions. Using PSA with linear transformations, we demonstrate a 2×compression in hidden size of Multi-Layer Perceptrons (MLP), while matching accuracy. Applying this to an MLP-based vision model on CIFAR-10, we cut the number of operations by 45%–28%, boosting hardware efficiency. We validate PSA with two hardware accelerators while maintaining the same reference accuracy. First, an unrolled streaming design that reduces LUT + DSP usage by 25% while doubling throughput to 32kFPS. Second, a fixed-size SIMD accelerator that improves throughput by 62.1% while using only 3.5% additional LUTs.

Description

University of Minnesota Ph.D. dissertation. May 2025. Major: Electrical Engineering. Advisor: Kia Bazargan. 1 computer file (PDF); xiii, 109 pages.

Related to

Replaces

License

Collections

Series/Report Number

Funding information

Isbn identifier

Doi identifier

Previously Published Citation

Other identifiers

Suggested citation

Singh, Gaurav. (2025). Hardware software co-design of machine learning accelerators using univariate functions. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/275925.

Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.