Анна Соколова (Anna Sokolova)

anna-96

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

liked a dataset 16 days ago

openai/gsm8k

liked a dataset 16 days ago

JudSacr/squad_v2_french_translated

Organizations

None yet

upvoted a paper 14 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 26 days ago • 195

liked 2 datasets 16 days ago

openai/gsm8k

Benchmark • Updated Mar 23 • 17.6k • 931k • 1.37k

JudSacr/squad_v2_french_translated

Viewer • Updated 16 days ago • 1 • 35 • 1

upvoted 2 papers 19 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 25 days ago • 59

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 25 days ago • 219

upvoted a paper 22 days ago

EvolveMem: Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published 25 days ago • 24

upvoted a paper 23 days ago

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published 28 days ago • 52

liked a model 26 days ago

Cloth-splatters/folding-state-est-gps

Updated 26 days ago • 1

upvoted 2 papers about 1 month ago

Lightning Unified Video Editing via In-Context Sparse Attention

Paper • 2605.04569 • Published May 6 • 18

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 63

liked a model about 1 month ago

mistralai/Mixtral-8x7B-Instruct-v0.1

47B • Updated Jul 24, 2025 • 881k • 4.69k

upvoted 2 papers about 2 months ago

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation

Paper • 2511.16428 • Published Apr 8 • 2

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 506

liked a dataset about 2 months ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 506k • 1.13k

upvoted 2 papers about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 632

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published Apr 6 • 47

liked a dataset 2 months ago

ropedia-ai/xperience-10m

Updated Apr 21 • 108k • 202

liked 2 models 2 months ago

Aizures/qwen25-coder-1.5b-finetuned

Updated Apr 3 • 1

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27, 2025 • 1.06M • 4.09k

upvoted a paper 3 months ago

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95
