Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Building on HF
71
4
35
userm
usermma
Follow
phuket2's profile picture
ThingsAI's profile picture
JACKYNIKING's profile picture
13 followers
·
6 following
AI & ML interests
while i don't have a pc, i contribute from android phone, i BELIEVE in Digital Intelligence, it would change the World FOREVER
Recent Activity
replied
to
AxionLab-official
's
post
about 1 hour ago
# An Open Letter from SupraLabs. Over the past few days, SupraLabs has been mentioned in a public discussion regarding small language models, scaling laws, and training methodology. We'd like to clarify our position. Before anything else, we want to make one thing absolutely clear: we have great respect for Lane and the work being done at Glint Research. At no point was our intention to disrespect Lane, Glint Research, or their research. What began as a technical discussion about model scaling and training methodology unfortunately became much more personal than we ever intended. From our perspective, it was simply an exchange of technical opinions, and we sincerely hope it remains that way. We'd also like to acknowledge that one of our own comments during the discussion was poorly worded. Referring to a benchmark as "fake" was imprecise. What we intended to criticize was the comparison methodology, not the integrity of the evaluation itself. Comparing a merged checkpoint against a single checkpoint is, in our view, not an apples-to-apples comparison. That said, this was never the core of the discussion. Our disagreement was not about SLERP, model merging, or whether training a small model on massive amounts of data is an interesting research direction. We support experimentation and unconventional ideas. The actual point of disagreement was much simpler. The statement that a 1M parameter model trained on 1 trillion tokens will become a "100M killer" is, today, a prediction, not an experimental result. Could it happen? Perhaps. Would it be exciting if it did? Absolutely. But until benchmark results, reproducible evaluations, and independent validation exist, we believe such statements should be presented as hypotheses rather than established conclusions. Research advances by testing ideas, not by assuming their outcomes. We sincerely wish Lane and everyone at Glint Research success in their experiments. Thank you to everyone who read it.
new
activity
about 14 hours ago
mlx-community/GLM-5.2-DQ4plus-q8:
How did you run this on 512GB mac studio?
new
activity
1 day ago
ThingAI/Quark-72M:
Good one...
View all activity
Organizations
None yet
usermma
's buckets
5
Sort: Recently updated
usermma/GLM-Buckets
3.01 TB
usermma/Quark-72M-Instruct-bucket
287 MB
usermma/llama.cpp-quark
61 MB
usermma/test
0 Bytes
usermma/linux-commands-bucket
360 kB