The Unreasonable Effectiveness of the Forget Gate with Jos van der Westhuizen
EPISODE 240
|
MARCH
18,
2019
Watch
Follow
Share
About this Episode
Today we're joined by Jos Van Der Westhuizen, PhD student in Engineering at Cambridge University.
Jos' research focuses on applying LSTMs, or Long Short-Term Memory neural networks, to biological data for various tasks. In our conversation, we discuss his paper "The unreasonable effectiveness of the forget gate," in which he explores the various "gates" that make up an LSTM module and the general impact of getting rid of gates on the computational intensity of training the networks. Jos eventually determines that leaving only the forget-gate results in an unreasonably effective network, and we discuss why. Jos also gives us some great LSTM related resources, including references to Jurgen Schmidhuber, whose research group invented the LSTM, and who I spoke to back in Talk #44.
About the Guest
Jos van der Westhuizen
Cambridge University
Connect with Jos
Resources
- Paper: The unreasonable effectiveness of the forget gate
- Blog: The unreasonable effectiveness of recurrent neural networks
- The unreasonable effectiveness of mathematics in the natural sciences
- SuperSeamless
- Paper:Long Short-Term Memory
- Paper: Learning to forget: continual prediction with LSTM
- Paper: LSTM: A Search Space Odyssey
- Paper: An Empirical Exploration of Recurrent Network Architectures
- Paper: Can recurrent neural networks warp time?
- Paper: WaveNet: A Generative Model for Raw Audio