Local Linearity of ReLU Neural Networks

dc.contributor.author: Sattelberg, Ben, author
dc.contributor.author: Draper, Bruce, advisor
dc.contributor.author: Davies, Ewan, committee member
dc.contributor.author: Kirby, Michael, committee member
dc.contributor.author: Peterson, Chris, committee member
dc.date.accessioned: 2026-06-08T10:33:09Z
dc.date.issued: 2026
dc.description.abstract: Despite their impressive practical results and broad usage, ReLU neural networks are still widely considered to be black boxes. Modern networks are complex, high-dimensional, nonlinear functions frequently applied to problems where other methods perform poorly. Building an understanding of their behavior is, therefore, both difficult and necessary. Significant progress has been made towards this, but results remain limited in comparison to empirical success. This research proposes two primary methodologies to add to current understanding: formal analysis of network behavior on simple problems and investigation of the piecewise linear behavior induced by the ReLU activation function. Network behavior on simple problems, such as approximation of Boolean functions or minimal n-dimensional classification, has reduced complexity that makes answering questions about optimal or minimal network size feasible. The behavior of networks on these restricted domains provides an interpretable method for determining the effects of network size on representational capacity and training success rate in a way that is still applicable to more complex problems of interest. The piecewise linear nature of the ReLU function also allows for simplified local evaluation of network behavior. ReLU neural networks form convex polytopes in the input space with the network behaving as a simple linear mapping from input to output within each of these polytopes. By exploiting this structure, it is possible to examine locally linear behavior in aggregate to construct metrics for network complexity and similarity. This analysis is able to avoid traditional difficulties stemming from symmetries and architectural differences by largely maintaining the interior of the network as a black box.
dc.format.medium: born digital
dc.format.medium: doctoral dissertations
dc.identifier: Sattelberg_colostate_0053A_19513.pdf
dc.identifier.uri: https://hdl.handle.net/10217/244886
dc.identifier.uri: https://doi.org/10.25675/3.027246
dc.language: English
dc.language.iso: eng
dc.publisher: Colorado State University. Libraries
dc.relation.ispartof: 2020-
dc.rights: Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright.
dc.subject: Geometry
dc.subject: Neural networks
dc.subject: Linearization
dc.subject: Boolean functions
dc.title: Local Linearity of ReLU Neural Networks
dc.type: Text
dcterms.rights.dpla: This Item is protected by copyright and/or related rights (https://rightsstatements.org/vocab/InC/1.0/). You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
thesis.degree.discipline: Computer Science
thesis.degree.grantor: Colorado State University
thesis.degree.level: Doctoral
thesis.degree.name: Doctor of Philosophy (Ph.D.)

Files

Original bundle

Name: Sattelberg_colostate_0053A_19513.pdf
Size: 26.52 MB
Format: Adobe Portable Document Format
