Minimal implementations of deep learning papers

Minimal implementations and tutorials of deep learning papers with side-by-side notes, including:

  • transformers: original, XL, Switch, Feedback
  • optimizers: Adam, RAdam, AdaBelief
  • GANs: DCGAN, CycleGAN
  • reinforcement learning: PPO, DQL
  • Capsule network
  • sketch-RNN