Hardware software co-design of machine learning accelerators using univariate functions
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Authors
Published Date
Publisher
Abstract
As neural networks grow in complexity, efficient hardware-software co-design is crucial for balancing performance and resource constraints, especially for edge devices and FPGA accelerators. This thesis investigates optimization techniques for hardware-awaretraining and model compression using univariate functions. First, we optimize hardware using a simple constant coefficient multiplier on Hybrid Binary-Unary (HBUNN) architecture, which offers variable hardware costs for constant coefficients. By applying regularization-based training, we reduce the full model hard- ware area cost by 59.3% while improving accuracy by 0.42% over L2 regularization. Using this optimized model, the fully parallel-pipeline HBUNN-based ResNet-18 has a reduced area cost by 29.6% compared to conventional binary architectures. Next, we introduce Personal Self-Attention (PSA), a novel method for learning non- linear univariate functions. Using PSA with linear transformations, we demonstrate a 2×compression in hidden size of Multi-Layer Perceptrons (MLP), while matching accuracy. Applying this to an MLP-based vision model on CIFAR-10, we cut the number of operations by 45%–28%, boosting hardware efficiency. We validate PSA with two hardware accelerators while maintaining the same reference accuracy. First, an unrolled streaming design that reduces LUT + DSP usage by 25% while doubling throughput to 32kFPS. Second, a fixed-size SIMD accelerator that improves throughput by 62.1% while using only 3.5% additional LUTs.
Description
University of Minnesota Ph.D. dissertation. May 2025. Major: Electrical Engineering. Advisor: Kia Bazargan. 1 computer file (PDF); xiii, 109 pages.
Related to
Replaces
License
Collections
Series/Report Number
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Singh, Gaurav. (2025). Hardware software co-design of machine learning accelerators using univariate functions. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/275925.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.