AI & ML interests

None defined yet.

Recent Activity

HProg  updated a model about 2 hours ago
embedl/Cosmos-Reason2-2B-W4A16-Edge2
JonnaMat  updated a Space about 4 hours ago
embedl/Edge-Inference-Benchmarks
View all activity

Add accuracy data

#4 opened about 3 hours ago by
JonnaMat

Add subgroups to sidebar

#3 opened about 5 hours ago by
JonnaMat
JonnaMat 
posted an update about 20 hours ago
view post
Post
139
Qwen3.5 on-device benchmarks on the Nvidia Jetson lineup are now live 🚀

We've added the latest Qwen3.5 models (0
8B - 9B) to our on-device inference benchmarks (Nvidia Jetson Orin Nano Super, AGX Orin, AGX Thor).

👉 Explore TPS, TTFT, E2E latency, and TPOT. Measured on real hardware: embedl/Edge-Inference-Benchmarks

🌟 Stay tuned for additional benchmarks and Embedl-optimized models: Enabling models run faster and on less expensive hardware.

If you're working on edge LLM deployment, we'd love to discuss your use case.
  • 1 reply
·