Local linearity of ReLU neural networks

Sattelberg, Ben, author; Draper, Bruce, advisor; Davies, Ewan, committee member; Kirby, Michael, committee member; Peterson, Chris, committee member

Local linearity of ReLU neural networks

dc.contributor.author	Sattelberg, Ben, author
dc.contributor.author	Draper, Bruce, advisor
dc.contributor.author	Davies, Ewan, committee member
dc.contributor.author	Kirby, Michael, committee member
dc.contributor.author	Peterson, Chris, committee member
dc.date.accessioned	2026-06-08T10:33:09Z
dc.date.issued	2026
dc.description.abstract	Despite their impressive practical results and broad usage, ReLU neural networks are still widely considered to be black boxes. Modern networks are complex, high-dimensional, nonlinear functions frequently applied to problems where other methods perform poorly. Building an understanding of their behavior is, therefore, both difficult and necessary. Significant progress has been made towards this, but results remain limited in comparison to empirical success. This research proposes two primary methodologies to add to current understanding: formal analysis of network behavior on simple problems and investigation of the piecewise linear behavior induced by the ReLU activation function. Network behavior on simple problems, such as approximation of Boolean functions or minimal n-dimensional classification, has reduced complexity that makes answering questions about optimal or minimal network size feasible. The behavior of networks on these restricted domains provides an interpretable method for determining the effects of network size on representational capacity and training success rate in a way that is still applicable to more complex problems of interest. The piecewise linear nature of the ReLU function also allows for simplified local evaluation of network behavior. ReLU neural networks form convex polytopes in the input space with the network behaving as a simple linear mapping from input to output within each of these polytopes. By exploiting this structure, it is possible to examine locally linear behavior in aggregate to construct metrics for network complexity and similarity. This analysis is able to avoid traditional difficulties stemming from symmetries and architectural differences by largely maintaining the interior of the network as a black box.
dc.format.medium	born digital
dc.format.medium	doctoral dissertations
dc.identifier	Sattelberg_colostate_0053A_19513.pdf
dc.identifier.uri	https://hdl.handle.net/10217/244886
dc.identifier.uri	https://doi.org/10.25675/3.027246
dc.language	English
dc.language.iso	eng
dc.publisher	Colorado State University. Libraries
dc.relation.ispartof	2020-
dc.rights	Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subject	geometry
dc.subject	neural networks
dc.subject	linearization
dc.subject	Boolean functions
dc.title	Local linearity of ReLU neural networks
dc.type	Text
dcterms.rights.dpla	This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Colorado State University
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy (Ph.D.)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Sattelberg_colostate_0053A_19513.pdf
Size:: 26.52 MB
Format:: Adobe Portable Document Format

Download

Collections

2020-
Theses and Dissertations