Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
109.7
TFLOPS
4
15
7
NJX-njx
NJX-njx
Follow
John6666's profile picture
evalstate's profile picture
tegridydev's profile picture
8 followers
·
18 following
https://github.com/NJX-njx
NJX_njx_ai
NJX-njx
AI & ML interests
AI infra, large model architecture, intelligent agent,evaluation
Recent Activity
upvoted
an
article
2 days ago
Microgpt
published
an
article
2 days ago
Microgpt
replied
to
FreshmanD
's
post
5 days ago
LoongFlow Big News!!! @all We’ve put AI Agents into a production GPU cluster to handle GPU failure prediction. Not as a demo. Not as AutoML. But as an evolving system that designs and improves its own models. On two GPU types: – IT21HMDB01-B2: +30% prediction accuracy – H800: +25% prediction accuracy The resulting models already meet production standards and are being wired into the ops pipeline. How it works: • An ML agent designs the full ML pipeline from scratch • A Math agent performs targeted evolutionary optimization • The agents explore, discard, and iterate toward better modelsHumans don’t hand-tune parameters. This is not offline analysis. GPU failure prediction means: • heavy assets • real incidents • real operational risk The agents now trigger maintenance before failures happen. This feels like an early signal: AI agents are starting to take responsibility for infrastructure-level engineering decisions in production systems. For ML Agent, you can check: https://github.com/baidu-baige/LoongFlow
View all activity
Organizations
NJX-njx
's models
None public yet