-
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Paper • 2410.05265 • Published • 33 -
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
Paper • 2410.03450 • Published • 36 -
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Paper • 2410.08196 • Published • 48 -
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Paper • 2410.07303 • Published • 18
XXSg559
XXSg559
·
AI & ML interests
None yet
Organizations
essay
-
PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Paper • 2410.05265 • Published • 33 -
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
Paper • 2410.03450 • Published • 36 -
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Paper • 2410.08196 • Published • 48 -
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Paper • 2410.07303 • Published • 18
models
8
XXSg559/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
4
XXSg559/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
XXSg559/Qwen2.5-1.5B-Instruct-thinking-function_calling-V0
Updated
XXSg559/q-Taxi-v3
Reinforcement Learning
•
Updated
XXSg559/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
XXSg559/ppo-Huggy
Reinforcement Learning
•
Updated
XXSg559/sft_output
Updated
XXSg559/SmolLM2-FT-MyDataset
Text Generation
•
Updated