蓋瑞王's picture

Open to Work

蓋瑞王

gary109

·

AI & ML interests

GAN,Music,LLM

Recent Activity

liked a model 2 days ago

Zichen1024/CoVe-4B

liked a dataset 2 days ago

TianHongZXY/CHIMERA

liked a model about 1 month ago

Qwen/Qwen3-TTS-12Hz-1.7B-Base

View all activity

Organizations

None yet

upvoted a collection 10 months ago

🧠 Traditional Chinese Reasoning Datasets

A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated Oct 13, 2025 • 9

upvoted 8 articles about 1 year ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

+1

Jan 23, 2025

•

192

Article

The AI tools for Art Newsletter - Issue 1

Jan 31, 2025

•

84

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Feb 17, 2025

•

29

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

264

Article

From Files to Chunks: Improving HF Storage Efficiency

Nov 20, 2024

•

70

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.32k

Article

Hugging Face x LangChain : A new partner package

+1

May 14, 2024

•

160

Article

Open R1: Update #2

Feb 10, 2025

•

218

upvoted a collection about 1 year ago

Breeze 2 Family

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26, 2025 • 19

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28, 2025

•

888

upvoted a collection about 1 year ago

high-quality Chinese training datasets

a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 13 items • Updated May 22, 2025 • 24

upvoted a paper over 1 year ago

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published Nov 27, 2024 • 20

upvoted a collection over 1 year ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated 4 days ago • 66

upvoted 6 papers over 1 year ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 14

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

Seeing Faces in Things: A Model and Dataset for Pareidolia

Paper • 2409.16143 • Published Sep 24, 2024 • 17

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23, 2024 • 42

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

Scaling Up Diffusion and Flow-based XGBoost Models

Paper • 2408.16046 • Published Aug 28, 2024 • 10