Advantages of depth in feed forward network?
Pattern recognition and machine learning - Christopher M. Bishop
Information theory, Inference and learning algorithms - David J. C. Mackay
Naked statistics - charles wheelan
Quantum Physics
Journal - (Nature , eLife)
Deep learning in alternate reality
The forgetting machine - Rodrigo Quian
The Emperor’s new mind - Robert Penrose
Episodic memory - LSD trip in KGP
Transformers Explained Visually (Part 3): Multi-head Attention, deep dive