Analytical cost metrics: days of future past
dc.contributor.author | Prajapati, Nirmal, author | |
dc.contributor.author | Rajopadhye, Sanjay, advisor | |
dc.contributor.author | Böhm, Wim, committee member | |
dc.contributor.author | Chong, Edwin, committee member | |
dc.contributor.author | Pouchet, Louis-Noël, committee member | |
dc.date.accessioned | 2019-09-10T14:36:54Z | |
dc.date.available | 2019-09-10T14:36:54Z | |
dc.date.issued | 2019 | |
dc.description.abstract | Future exascale high-performance computing (HPC) systems are expected to be increasingly heterogeneous, consisting of several multi-core CPUs and a large number of accelerators, special-purpose hardware that will increase the computing power of the system in a very energy-efficient way. Specialized, energy-efficient accelerators are also an important component in many diverse systems beyond HPC: gaming machines, general purpose workstations, tablets, phones and other media devices. With Moore's law driving the evolution of hardware platforms towards exascale, the dominant performance metric (time efficiency) has now expanded to also incorporate power/energy efficiency. This work builds analytical cost models for cost metrics such as time, energy, memory access, and silicon area. These models are used to predict the performance of applications, for performance tuning, and chip design. The idea is to work with domain specific accelerators where analytical cost models can be accurately used for performance optimization. The performance optimization problems are formulated as mathematical optimization problems. This work explores the analytical cost modeling and mathematical optimization approach in a few ways. For stencil applications and GPU architectures, the analytical cost models are developed for execution time as well as energy. The models are used for performance tuning over existing architectures, and are coupled with silicon area models of GPU architectures to generate highly efficient architecture configurations. For matrix chain products, analytical closed form solutions for off-chip data movement are built and used to minimize the total data movement cost of a minimum op count tree. | |
dc.format.medium | born digital | |
dc.format.medium | doctoral dissertations | |
dc.identifier | Prajapati_colostate_0053A_15688.pdf | |
dc.identifier.uri | https://hdl.handle.net/10217/197455 | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Colorado State University. Libraries | |
dc.relation.ispartof | 2000-2019 | |
dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
dc.subject | execution time model | |
dc.subject | performance tuning | |
dc.subject | stencils | |
dc.subject | analytical cost models | |
dc.subject | software/hardware codesign | |
dc.subject | mathematical optimization | |
dc.subject | matrix chain products | |
dc.title | Analytical cost metrics: days of future past | |
dc.type | Text | |
dcterms.rights.dpla | This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | Colorado State University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy (Ph.D.) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Prajapati_colostate_0053A_15688.pdf
- Size:
- 5.42 MB
- Format:
- Adobe Portable Document Format