Captured June 8, 2023 6:47 pm

Title
The Scaling Hypothesis · Gwern.net
Description
On GPT-3: meta-learning, scaling, implications, and deep theory. The scaling hypothesis: neural nets absorb data & compute, generalizing and becoming more Bayesian as problems get harder, manifesting new abilities even at trivial-by-global-standards-scale. The deep learning revolution has begun as f