Local linearity of ReLU neural networks

Sattelberg, Ben, author; Draper, Bruce, advisor; Davies, Ewan, committee member; Kirby, Michael, committee member; Peterson, Chris, committee member

Local linearity of ReLU neural networks

Files

Sattelberg_colostate_0053A_19513.pdf (26.52 MB)

Date

2026

Authors

Sattelberg, Ben, author

Draper, Bruce, advisor

Davies, Ewan, committee member

Kirby, Michael, committee member

Peterson, Chris, committee member

Abstract

Despite their impressive practical results and broad usage, ReLU neural networks are still widely considered to be black boxes. Modern networks are complex, high-dimensional, nonlinear functions frequently applied to problems where other methods perform poorly. Building an understanding of their behavior is, therefore, both difficult and necessary. Significant progress has been made towards this, but results remain limited in comparison to empirical success. This research proposes two primary methodologies to add to current understanding: formal analysis of network behavior on simple problems and investigation of the piecewise linear behavior induced by the ReLU activation function. Network behavior on simple problems, such as approximation of Boolean functions or minimal n-dimensional classification, has reduced complexity that makes answering questions about optimal or minimal network size feasible. The behavior of networks on these restricted domains provides an interpretable method for determining the effects of network size on representational capacity and training success rate in a way that is still applicable to more complex problems of interest. The piecewise linear nature of the ReLU function also allows for simplified local evaluation of network behavior. ReLU neural networks form convex polytopes in the input space with the network behaving as a simple linear mapping from input to output within each of these polytopes. By exploiting this structure, it is possible to examine locally linear behavior in aggregate to construct metrics for network complexity and similarity. This analysis is able to avoid traditional difficulties stemming from symmetries and architectural differences by largely maintaining the interior of the network as a black box.

Subject

geometry

neural networks

linearization

Boolean functions

URI

https://hdl.handle.net/10217/244886
https://doi.org/10.25675/3.027246

Collections

2020-
Theses and Dissertations

Full item page

Local linearity of ReLU neural networks

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Abstract

Description

Rights Access

Subject

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By