Wei-Ning Hsu's picture

1

Wei-Ning Hsu

wnhsu

·

wnhsu

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago

perception-encoder-audio-visual

authored a paper about 1 year ago

Movie Gen: A Cast of Media Foundation Models

authored a paper over 1 year ago

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

View all activity

Organizations

None yet

upvoted a collection 7 days ago

perception-encoder-audio-visual

9 items • Updated 9 days ago • 21

authored a paper about 1 year ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 99

authored a paper over 1 year ago

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Paper • 2404.09956 • Published Apr 15, 2024 • 12

authored 2 papers about 2 years ago

Attention or Convolution: Transformer Encoders in Audio Language Models for Inference Efficiency

Paper • 2311.02772 • Published Nov 5, 2023 • 8

Toward Joint Language Modeling for Speech Units and Text

Paper • 2310.08715 • Published Oct 12, 2023 • 9