Towards using neural networks for geoscientific discovery
Date
2020
Authors
Toms, Benjamin A., author
Barnes, Elizabeth A., advisor
Ebert-Uphoff, Imme, committee member
Hurrell, James W., committee member
Thompson, David W. J., committee member
Journal Title
Journal ISSN
Volume Title
Abstract
How can we use computational methods to extract physically meaningful patterns from geoscientific data? This question has been asked in some form for decades within the geoscientific community, with many landmark discoveries resulting from the novel application of computational methods to a geoscientific dataset. For example, the Madden-Julian Oscillation was discovered through Fourier transforms of tropical time-series, while the defining structures of the Northern Hemispheric annular modes were first captured using principal component analysis. These discoveries rooted in computational methods have since driven decades of geoscientific research and innovation, and are only two of among many similar examples. It is therefore clear that computational science and geoscience are inextricably intertwined, and so the continued advancement of both fields in tandem is beneficial to future geoscientific discovery. Many methods exist to discover patterns within geoscientific data, although each is limited by its own set of assumptions. The most common assumption is that of linearity, which oftentimes conflicts with our understanding that the earth system can be both dynamically and statistically nonlinear. However, a recently popularized subset of methods within the computer science community known as neural networks can identify nonlinear patterns and are therefore potentially powerful tools for geoscientific discovery. Neural networks learn how to map one dataset to another using a combination of nonlinear relationships, and are generalizable to a broad range of tasks including forecasting and identifying patterns within images. Regardless of the application, a common limitation of neural networks has been the difficulty to understand how and why they make their decisions. Therefore, while they have been used in geoscience for more than two decades, they have mostly been applied when accuracy is valued more than understanding, such as for making forecasts. Within this dissertation, we first propose a framework for how neural networks can be used for geoscientific discovery by applying recently invented methods from the computer science community. We focus on methods that explain which aspects of the input dataset are useful for the neural network when making connections to the output dataset. This framework enables physical interpretations of how and why neural networks make decisions, since the geoscientist that designs the neural network is likely familiar with the physical meaning of each input. In the first study of the dissertation, we outline the framework and apply it to two simple tasks to ensure the neural network interpretations abide by our current understanding of the earth system. The interpretable neural networks successfully identify the pattern of the El NiƱo Southern Oscillation and oceanic patterns that lend seasonal predictability, which lends confidence that the framework is reliable. In the second study, we then further test the methods by applying them to a more spatially and temporally complex oscillation called the Madden-Julian Oscillation (MJO). The interpretable neural networks correctly identify the known spatial structures and seasonality of the MJO, and also suggest that the MJO is nonlinear and expresses its nonlinearity through the uniqueness of each event. The final study assesses whether the proposed framework can be used to identify predictable patterns of earth-system variability within climate models through its application to decadal predictability. We find that the interpretable neural networks identify known modes of oceanic decadal variability that contribute to predictability of continental surface temperatures. The interpretations can also be used to identify distinct regimes of predictability, wherein spatially and temporally unique oceanic modes contribute predictability for the same location at different times. From a broader perspective, these studies suggest that neural networks are a viable tool for geoscientific discovery and are particularly useful given their ability to capture nonlinear, time-evolving patterns. It is likely that new neural network algorithms and methods for their interpretation will continue to be developed by the computer science community, and so this research provides a guideline for how such methods can be gainfully applied within the geosciences.