

-> continue to remember or forget it -> a switch




more complex, but the crux is the same: gate looks at input and decide to remember or forget











hidden state is not exactly the memory itself, it's the function of the memory.






straightforward











GRU is even better than LSTM in some cases, and more follow the rationale of the original idea

网友评论