Advantages of depth in feed forward network?

Pattern recognition and machine learning - Christopher M. Bishop

Information theory, Inference and learning algorithms - David J. C. Mackay

Naked statistics - charles wheelan

Quantum Physics

Journal - (Nature , eLife)

Deep learning in alternate reality

The forgetting machine - Rodrigo Quian

The Emperor’s new mind - Robert Penrose

Episodic memory - LSD trip in KGP

Transformers Explained Visually (Part 3): Multi-head Attention, deep dive