Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yamatazen 's Collections
Genshin Impact
Optimizers
Autoregressive image generation
GGUF tools
AGI
Model merging
Multilingual LLMs
Japanese LLMs
AI censorship
LLM leaderboards
Grokking

Optimizers

updated 16 days ago
Upvote
-

  • CAME: Confidence-guided Adaptive Memory Efficient Optimization

    Paper • 2307.02047 • Published Jul 5, 2023 • 2

  • Practical Efficiency of Muon for Pretraining

    Paper • 2505.02222 • Published May 4 • 40

  • AdaMuon: Adaptive Muon Optimizer

    Paper • 2507.11005 • Published Jul 15 • 2

  • Muon is Scalable for LLM Training

    Paper • 2502.16982 • Published Feb 24 • 8
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs