Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
1
Zeliang Zhang
zeliang0426
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
updated
a model
about 1 month ago
zeliang0426/DS_Qwen25-7-full-lora-3k
updated
a model
about 1 month ago
zeliang0426/DS_Qwen25-7-cache-lora-3k
View all activity
Organizations
None yet
zeliang0426
's models
68
Sort: Recently updated
zeliang0426/DS_Qwen25-7-full-lora-3k
Updated
Nov 22
zeliang0426/DS_Qwen25-7-cache-lora-3k
Updated
Nov 22
zeliang0426/DS-Qwen25-7-Think
Updated
Nov 20
zeliang0426/QKV_Qwen25-7-full-param-3k
Text Generation
•
8B
•
Updated
Nov 19
•
4
zeliang0426/QKV_Qwen25-3-full-param-3k
Text Generation
•
3B
•
Updated
Nov 18
•
66
zeliang0426/DS-LLama-vanilla-6K
Updated
Sep 12
zeliang0426/Distill-LLama-8B-4k
Text Generation
•
8B
•
Updated
Aug 28
•
5
zeliang0426/Distill-LLama-8B-5k-try_2
Text Generation
•
8B
•
Updated
Aug 28
•
5
zeliang0426/Distill-LLama-8B-5k-smallLR
Text Generation
•
8B
•
Updated
Aug 28
•
8
zeliang0426/Distill-LLama-8B-6k
Text Generation
•
8B
•
Updated
Aug 28
•
5
zeliang0426/Distill-LLama-8B-7k
Text Generation
•
8B
•
Updated
Aug 24
•
6
zeliang0426/Distill-LLama-8B-5k
Text Generation
•
8B
•
Updated
Aug 22
•
5
zeliang0426/Distill-LLama-8B-5k-largeLR
Updated
Aug 22
zeliang0426/Qwen25-3-Think-nglobal_16
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-nglobal_32
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-no_global
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-nglobal_48
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Cache-Sink
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Distill_Llama_Darpo-full-lora-3k
Updated
Aug 18
zeliang0426/SFT-Full
Updated
Aug 17
zeliang0426/SFT-Think
Text Generation
•
3B
•
Updated
Aug 16
zeliang0426/SFT-Cache
Updated
Aug 16
zeliang0426/RM-Think
Text Generation
•
3B
•
Updated
Aug 16
•
6
zeliang0426/RM-Cache
Updated
Aug 16
zeliang0426/Long_Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15
•
4
zeliang0426/Distill_Llama_SFT-full
Updated
Aug 15
zeliang0426/DS_Darpo-full-lora-3k
Updated
Aug 15
zeliang0426/DS_Llama_Darpo-cache-lora-3k
Updated
Aug 15
zeliang0426/Short_DS_Llama_Darpo-cache-adapter-3k
Text Generation
•
7B
•
Updated
Aug 15
•
4
zeliang0426/DS_Llama_Darpo-cache-adapter-3k
Text Generation
•
7B
•
Updated
Aug 15
•
4
Previous
1
2
3
Next