2 27

Wang

VincentWang

VincentWong1

AI & ML interests

None yet

Recent Activity

liked a dataset 14 days ago

peteromallet/dataclaw-peteromallet

liked a model about 1 month ago

meituan-longcat/LongCat-Flash-Thinking-2601

liked a dataset 2 months ago

kensho/DocFinQA

View all activity

Organizations

None yet

liked a dataset 14 days ago

peteromallet/dataclaw-peteromallet

Viewer • Updated 15 days ago • 549 • 9.74k • 284

liked a model about 1 month ago

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated Jan 23 • 97 • 103

liked a dataset 2 months ago

kensho/DocFinQA

Viewer • Updated Nov 19, 2024 • 7.44k • 2.47k • 14

liked a model 2 months ago

YOYO-AI/Qwen3-30B-A3B-YOYO-Thinking-Chimera

Text Generation • 31B • Updated Jan 5 • 7 • 5

liked a model 3 months ago

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 11.4k • • 244

liked 4 datasets 3 months ago

liked a model 6 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • Updated Aug 26, 2025 • 29k • 488

liked a dataset 7 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13, 2025 • 246 • 26

liked a model 8 months ago

infly/inf-retriever-v1

liked a dataset 8 months ago

FreedomIntelligence/Evol-Instruct-Chinese-GPT4

Viewer • Updated Dec 6, 2023 • 70k • 35 • 47

liked a model 9 months ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

71B • Updated Apr 13, 2025 • 1.42k • 92

liked 3 datasets 10 months ago

EricLu/SCP-116K

Viewer • Updated Mar 17, 2025 • 182k • 235 • 123

a-m-team/AM-DeepSeek-Distilled-40M

Viewer • Updated May 10, 2025 • 11.5M • 2.03k • 56

allenai/tulu-3-sft-personas-instruction-following

Viewer • Updated Nov 21, 2024 • 30k • 825 • 62

liked a model 10 months ago

TIGER-Lab/general-verifier

Question Answering • 2B • Updated Apr 15, 2025 • 6.36k • • 21

upvoted an article 10 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

110

liked a dataset 10 months ago

ucinlp/drop

Viewer • Updated Jan 17, 2024 • 86.9k • 3.4k • 66

Wang

AI & ML interests

Recent Activity

Organizations

VincentWang's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment