Pretrained models from scratch used in "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining".
Rosie Zhao
rosieyzh
·
AI & ML interests
theory of machine learning, deep learning
Recent Activity
updated
a dataset
about 17 hours ago
rosieyzh/sonnet3.5_slimorca_500tok
published
a dataset
about 17 hours ago
rosieyzh/sonnet3.5_slimorca_500tok
updated
a dataset
4 days ago
rosieyzh/litbench_500tok