Roman Nekrasov

Rob1234567

romannekrasovaillm

AI & ML interests

Areas of interest: agentic mid-training, reinforcement learning with reward verification (RLVR), scaling agent environments, interleaved agent reasoning with tools

Recent Activity

liked a dataset 2 days ago

Fujitsu-FRE/MAPS

upvoted a paper 3 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

liked a model 22 days ago

yujiepan/kimi-k2.5-tiny-random

View all activity

Organizations

None yet

liked a dataset 2 days ago

Fujitsu-FRE/MAPS

Viewer • Updated 16 days ago • 8.86k • 40 • 9

upvoted a paper 3 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 15 days ago • 183

liked 2 models 22 days ago

yujiepan/kimi-k2.5-tiny-random

Feature Extraction • 3.24M • Updated 6 days ago • 283 • 1

moonshotai/Kimi-K2.5

Image-Text-to-Text • 171B • Updated 21 days ago • 1.48M • • 2.17k

liked 2 models about 1 month ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • Updated Dec 3, 2025 • 747k • • 953

google/medasr

Automatic Speech Recognition • Updated about 1 month ago • 36.4k • 287

upvoted a collection about 1 month ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated Jan 14 • 441

New activity in nvidia/Nemotron-Agentic-v1 about 2 months ago

Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

#4 opened about 2 months ago by

Rob1234567

liked a model 3 months ago

allenai/Olmo-3-32B-Think

Text Generation • 1.05M • Updated Jan 5 • 2.91k • • 169

upvoted a collection 3 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 9 items • Updated Dec 23, 2025 • 164

upvoted a paper 3 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 91

upvoted 2 articles 3 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

upvoted a collection 4 months ago

Gemma 3 Release

Collection

28 items • Updated Aug 11, 2025 • 614

upvoted a collection 5 months ago

Qwen3Guard

Collection

7 items • Updated Dec 31, 2025 • 64

liked a model 7 months ago

openai/gpt-oss-120b

Text Generation • Updated Aug 26, 2025 • 3.64M • • 4.53k

liked a model 9 months ago

ai-sage/GigaChat-20B-A3B-instruct

Text Generation • 21B • Updated Jun 25, 2025 • 1.18k • 49

upvoted a paper 9 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

liked a dataset 10 months ago

logicreasoning/logi_glue

Viewer • Updated Oct 31, 2023 • 356k • 483 • 4

liked a model 10 months ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated Dec 17, 2025 • 15.1k • 1.31k

Roman Nekrasov

AI & ML interests

Recent Activity

Organizations

Rob1234567's activity

Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

What makes good reasoning data

Aligning to What? Rethinking Agent Generalization in MiniMax M2