Building on HF

Prithiv Sakthi PRO

prithivMLmods

hugging-science · https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality - HuggingFace Fellow ML 🤗

Recent Activity

upvoted an article about 2 hours ago

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

upvoted a collection about 3 hours ago

MTP Qwen 3.5/3.6 {MoE} Stable

updated a collection about 3 hours ago

MTP Qwen 3.5/3.6 {MoE} Stable

upvoted an article about 2 hours ago

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 3 days ago • 44

upvoted a collection about 3 hours ago

MTP Qwen 3.5/3.6 {MoE} Stable

Collection of Qwen 3.5/3.6 MoE Featuring GGUF • 4 items • Updated 37 minutes ago • 1

upvoted 3 papers 3 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 5 days ago • 68

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Paper • 2605.28003 • Published 5 days ago • 46

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 5 days ago • 80

upvoted a collection 3 days ago

[Trimming] Qwen3 Embedding 0.6B

Collection of trimmed Qwen's Qwen3-Embedding-0.6B models. The models are sorted alphabetically. • 166 items • Updated 3 days ago • 1

upvoted an article 3 days ago

Article

Introduction to Trimming ✂

lbourdois • 3 days ago • 25

upvoted a changelog 3 days ago

Hugging Face Changelog

Filter Models page by Base Models only

3 days ago • 64

upvoted a collection 3 days ago

MTP Qwen 3.5/3.6 Stable

Collection of Qwen 3.5/3.6 MTP Featuring GGUF • 4 items • Updated 1 day ago • 1

upvoted a collection 4 days ago

RFDetr

RF-DETR checkpoints converted to be used with 🤗 Transformers • 15 items • Updated 4 days ago • 14

upvoted 4 papers 4 days ago

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Paper • 2605.23902 • Published 10 days ago • 44

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 6 days ago • 127

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

Paper • 2605.26230 • Published 7 days ago • 39

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published 6 days ago • 68

upvoted a collection 5 days ago

QIE/FRIE1.1 — Test LoRAs [May2026]

Collection of Qwen Image Editing LoRAs • 4 items • Updated 1 day ago • 1

upvoted 2 papers 5 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 7 days ago • 132

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published 7 days ago • 99

upvoted 2 papers 8 days ago

SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers

Paper • 2605.22668 • Published 11 days ago • 40

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Paper • 2605.21572 • Published 12 days ago • 51

upvoted a collection 8 days ago

Stable Heretic / Multimodal Models

Revised Weights • 4 items • Updated 8 days ago • 1