Interpretability Collection by hezo Oct 16, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - XAI - Attention - LayerNorm Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Phase Change - Delay - Classes and Labels Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - Training - ICL - Induction Circuit Evolution Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Head - Num Labels vs Classes - Loss Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Residual Head Hypothesis Collection by matlok May 6, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Phase Change Delay - Large Vocabulary Size Larger vocab is better compression, but may result in longer training ICL phase change delays due to the slower Induction Head Copy Subcircuit (C) Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Head - Copy vs QK Match See figure 6: Classes vs labels in columns B and C. Subcircuit B delays phase change on number classes vs C delays on number of labels (dramatically) Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Circuit - Data Dependent Learning Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - XAI - Induction Head - Phase Change - Components Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3 pyvene: A Library for Understanding and Improving PyTorch Models via Interventions Paper • 2403.07809 • Published Mar 12, 2024 • 1
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions Paper • 2403.07809 • Published Mar 12, 2024 • 1
Interpretability Collection by hezo Oct 16, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Residual Head Hypothesis Collection by matlok May 6, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - XAI - Attention - LayerNorm Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Phase Change Delay - Large Vocabulary Size Larger vocab is better compression, but may result in longer training ICL phase change delays due to the slower Induction Head Copy Subcircuit (C) Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Phase Change - Delay - Classes and Labels Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Head - Copy vs QK Match See figure 6: Classes vs labels in columns B and C. Subcircuit B delays phase change on number classes vs C delays on number of labels (dramatically) Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - Training - ICL - Induction Circuit Evolution Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Circuit - Data Dependent Learning Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - ICL - Induction Head - Num Labels vs Classes - Loss Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Papers - XAI - Induction Head - Phase Change - Components Collection by matlok May 5, 2024 - What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3 pyvene: A Library for Understanding and Improving PyTorch Models via Interventions Paper • 2403.07809 • Published Mar 12, 2024 • 1
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions Paper • 2403.07809 • Published Mar 12, 2024 • 1