Patrik Bartak's picture

Patrik Bartak

ogg-vorbis

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

liked a model 3 months ago

JetBrains/Mellum-4b-base

upvoted a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

View all activity

Organizations

upvoted a paper 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27, 2025 • 20

liked a model 3 months ago

JetBrains/Mellum-4b-base

Text Generation • 4B • Updated May 7, 2025 • 3.16k • 433

upvoted a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 37

upvoted a paper 6 months ago

KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11, 2025 • 40

liked 4 datasets 9 months ago

openai/graphwalks

Viewer • Updated Apr 14, 2025 • 1.15k • 526 • 90

JetBrains/KStack

Viewer • Updated Apr 22, 2025 • 5.52M • 418 • 8

nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 3.92k • 519

ByteDance-Seed/Multi-SWE-RL

Updated Jul 23, 2025 • 3.01k • 31

liked a model 11 months ago

DeepPavlov/bert-base-bg-cs-pl-ru-cased

Feature Extraction • Updated Nov 8, 2021 • 118 • • 4

updated a dataset over 1 year ago

ogg-vorbis/KExercises_compilable

Viewer • Updated Aug 28, 2024 • 11.7k • 10

liked 3 models about 2 years ago

EleutherAI/pythia-70m

95.6M • Updated Nov 21, 2023 • 90.1k • 77

EleutherAI/pythia-12b-deduped

Text Generation • Updated Jun 8, 2023 • 3.23k • 52

EleutherAI/pythia-2.8b

Text Generation • 3B • Updated Jun 9, 2023 • 23.3k • 32

liked 2 models over 2 years ago

meta-llama/Llama-2-7b-hf

Text Generation • 7B • Updated Apr 17, 2024 • 602k • 2.24k

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 357 • 4.44k