Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
updated
a model
about 10 hours ago
allenai/Olmo-3-7B-Think
updated
a model
about 10 hours ago
allenai/Olmo-3-7B-Think
updated
a model
about 10 hours ago
allenai/Olmo-3-7B-Think
Organizations
models
39
mnoukhov/test
Updated
mnoukhov/SmolLM2-135M-tldr-sft
Text Generation
•
0.1B
•
Updated
•
17
mnoukhov/SmolLM2-360M-tldr-sft
Text Generation
•
0.4B
•
Updated
•
8
mnoukhov/SmolLM2-135M-Instruct_tldr-sft
Text Generation
•
0.1B
•
Updated
•
23
mnoukhov/SmolLM2-135M-Instruct_tldr-rm
Text Classification
•
0.1B
•
Updated
•
22
mnoukhov/pythia2.8b-rm-tldr6.9b
Text Classification
•
3B
•
Updated
•
22
mnoukhov/pythia2.8b-sft-tldr
Text Generation
•
3B
•
Updated
•
16
mnoukhov/pythia160m-sft-tldr
Text Generation
•
0.2B
•
Updated
•
14
mnoukhov/pythia160m-rm-tldr6.9b
Text Classification
•
0.1B
•
Updated
•
17
mnoukhov/pythia1b-rm-tldr6.9b
Text Classification
•
0.9B
•
Updated
•
14
datasets
53
mnoukhov/MATH_3000_final_filter
Viewer
•
Updated
•
2.3k
•
21
mnoukhov/deepscaler_20k_medhard_nolatex_rlvr
Viewer
•
Updated
•
19.5k
•
18
mnoukhov/DAPO-Math-14k-Processed-RLVR
Viewer
•
Updated
•
14.1k
•
19
mnoukhov/rlvr_countdown
Viewer
•
Updated
•
490k
•
12
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b
Viewer
•
Updated
•
177k
•
34
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel2_llama8b
Viewer
•
Updated
•
92.1k
•
16
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_llama8b
Viewer
•
Updated
•
176k
•
20
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr_relabel_pythia1b
Viewer
•
Updated
•
107k
•
20
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr
Viewer
•
Updated
•
107k
•
36
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia1b
Viewer
•
Updated
•
177k
•
20