Vadim Smolyakov

vsmolyakov

https://vsmolyakov.github.io/

AI & ML interests

Machine Learning Engineer @ Microsoft

Recent Activity

upvoted a paper 5 days ago

Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models

upvoted a paper 9 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 10 days ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Organizations

None yet

liked a dataset 11 days ago

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 3.66k • 192

liked 2 models 3 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated 2 days ago • 316k • 1.65k

MiniMaxAI/MiniMax-M2

Text Generation • Updated Dec 23, 2025 • 156k • 1.46k

liked 2 models 4 months ago

zeroentropy/zerank-1

Text Ranking • 4B • Updated Nov 19, 2025 • 634 • 73

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 26.9k • 1.39k

liked a dataset 4 months ago

openai/gdpval

Viewer • Updated Sep 25, 2025 • 220 • 27.7k • 453

liked 2 models 6 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 2.8M • 4.42k

gghfez/Mistral-Small-3.2-24B-Instruct-hf-AWQ

Text Generation • 24B • Updated Jun 25, 2025 • 306 • 4

liked a dataset 6 months ago

Salesforce/CRMArenaPro

Viewer • Updated Jul 9, 2025 • 8.61k • 559 • 15

liked a dataset 7 months ago

Salesforce/CRMArena

Viewer • Updated Jun 18, 2025 • 1.19k • 276 • 8

liked a model 9 months ago

microsoft/Phi-4-mini-instruct

Text Generation • 4B • Updated Dec 10, 2025 • 150k • 671

liked 2 models 10 months ago

nvidia/Llama-3.1-Nemotron-70B-Reward

Updated Apr 13, 2025 • 21 • 78

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • 7B • Updated May 23, 2024 • 1.04k • 4

liked a dataset 10 months ago

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 5.02k • 104

liked 2 models 10 months ago

weqweasdas/RM-Mistral-7B

Text Classification • 7B • Updated Mar 31, 2024 • 3.01k • 24

RLHFlow/ArmoRM-Llama3-8B-v0.1

Text Classification • 8B • Updated Sep 23, 2024 • 9.85k • 183

liked 2 models 11 months ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated Dec 22, 2025 • 92.1k • 1.34k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 47.9k • 2.88k

liked a Space 12 months ago

The Ultra-Scale Playbook

🌌 • 3.67k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

mistralai/Mistral-Small-24B-Instruct-2501

24B • Updated Jul 28, 2025 • 755k • 950
