Hugging Face
TenAI
PRO
honey90
2
10
100
17 followers · 50 following
AI & ML interests
None yet
Recent Activity
Upvoted a collection about 7 hours ago: VKAE Accelerated
Upvoted an article about 20 hours ago: Adding a GPU Without Building One
Reacted to SeaWolf-AI's post with ❤️ about 20 hours ago:
Adding a GPU without building one

AI is usually framed as "how smart is the model / how many GPUs did you buy." The real bottleneck is elsewhere: how efficiently you use the GPUs you already have. Training happens once; inference runs the entire time users use your product. So a service's economics come down to cost per token.

Inference acceleration uses software to pull several times more out of the same GPU, the effect of plugging in one more "virtual GPU."

VIDRAFT's VKAE, measured (B200, same-harness, no quality loss):
Qwen3.5-35B-A3B (MoE): 25.7 → 601 tok/s (23.4×)
Darwin-36B-Opus (in-house MoE): 25.0 → 280.8 tok/s (11.2×)
10,000+ tok/s peak aggregate under concurrency

The key: it's reproducible, with model + serving shipped as one container.

docker pull vidraft/qwen35-vkae:601

Don't take our word for it; run it yourself. The mechanism will be released as a paper.

Leaderboard & demo: https://huggingface.co/spaces/VIDraft/vkae
Articles: https://huggingface.co/blog/FINAL-Bench/vkae-leaderboard
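The speedup ratios quoted in the post can be checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the cost model is an assumption (a fixed GPU price per hour, so cost per token scales inversely with throughput) that the post implies but does not spell out.

```python
# Throughput figures in tokens/second, copied from the post:
# (baseline, accelerated) per model.
measurements = {
    "Qwen3.5-35B-A3B": (25.7, 601.0),
    "Darwin-36B-Opus": (25.0, 280.8),
}


def speedup(baseline_tps: float, accelerated_tps: float) -> float:
    """Ratio of accelerated to baseline throughput."""
    return accelerated_tps / baseline_tps


def relative_cost_per_token(baseline_tps: float, accelerated_tps: float) -> float:
    """Cost per token relative to baseline.

    Assumption: the GPU costs the same per hour either way, so cost per
    token is proportional to 1 / throughput.
    """
    return baseline_tps / accelerated_tps


for name, (base, fast) in measurements.items():
    ratio = speedup(base, fast)
    cost = relative_cost_per_token(base, fast)
    print(f"{name}: {ratio:.1f}x throughput, "
          f"cost per token at {cost:.1%} of baseline")
```

Under that assumption, the two ratios from the post round to the quoted 23.4× and 11.2×. The 10,000+ tok/s aggregate figure is not covered here because the post gives no baseline for it.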
Organizations
None yet
Spaces (27)
Remove Video Background • pinned • Agents • Sleeping • Easily remove your video's background!
DALLE 3 XL v2 • pinned • Agents • Runtime error
Smart Building HVAC Energy Optimization • Sleeping • Launch a Streamlit web app interface
WAN 2.1 Fast & security • Agents • Sleeping
tenspce • Running
FLUX LOGO Generator • Agents • Runtime error
Models (1)
honey90/TenOS-Ko-28B • Text Generation • 27B • Updated 26 days ago • 40
datasets
0
None public yet