TY - GEN
T1 - A hardware acceleration technique for gradient descent and conjugate gradient
AU - Kesler, David
AU - Deka, Biplab
AU - Kumar, Rakesh
PY - 2011
Y1 - 2011
N2 - Application Robustification, a promising approach for reducing processor power, converts applications into numerical optimization problems and solves them using gradient descent and conjugate gradient algorithms [1]. The improvement in robustness, however, comes at the expense of performance when compared to the baseline non-iterative versions of these applications. To mitigate the performance loss from robustification, we present the design of a hardware accelerator and corresponding software support that accelerate gradient descent and conjugate gradient based iterative implementation of applications. Unlike traditional accelerators, our design accelerates different types of linear algebra operations found in many algorithms and is capable of efficiently handling sparse matrices that arise in applications such as graph matching. We show that the proposed accelerator can provide significant speedups for iterative versions of several applications and that for some applications such as least squares, it can substantially improve the computation time as compared to the baseline non-iterative implementation.
AB - Application Robustification, a promising approach for reducing processor power, converts applications into numerical optimization problems and solves them using gradient descent and conjugate gradient algorithms [1]. The improvement in robustness, however, comes at the expense of performance when compared to the baseline non-iterative versions of these applications. To mitigate the performance loss from robustification, we present the design of a hardware accelerator and corresponding software support that accelerate gradient descent and conjugate gradient based iterative implementation of applications. Unlike traditional accelerators, our design accelerates different types of linear algebra operations found in many algorithms and is capable of efficiently handling sparse matrices that arise in applications such as graph matching. We show that the proposed accelerator can provide significant speedups for iterative versions of several applications and that for some applications such as least squares, it can substantially improve the computation time as compared to the baseline non-iterative implementation.
UR - http://www.scopus.com/inward/record.url?scp=79961187689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79961187689&partnerID=8YFLogxK
U2 - 10.1109/SASP.2011.5941086
DO - 10.1109/SASP.2011.5941086
M3 - Conference contribution
AN - SCOPUS:79961187689
SN - 9781457712111
T3 - Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011
SP - 94
EP - 101
BT - Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011
T2 - 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011
Y2 - 5 June 2011 through 6 June 2011
ER -