userm's picture
Building on HF

userm

usermma

AI & ML interests

while i don't have a pc, i contribute from android phone, i BELIEVE in Digital Intelligence, it would change the World FOREVER

Recent Activity

repliedto AxionLab-official's post about 3 hours ago
# An Open Letter from SupraLabs. Over the past few days, SupraLabs has been mentioned in a public discussion regarding small language models, scaling laws, and training methodology. We'd like to clarify our position. Before anything else, we want to make one thing absolutely clear: we have great respect for Lane and the work being done at Glint Research. At no point was our intention to disrespect Lane, Glint Research, or their research. What began as a technical discussion about model scaling and training methodology unfortunately became much more personal than we ever intended. From our perspective, it was simply an exchange of technical opinions, and we sincerely hope it remains that way. We'd also like to acknowledge that one of our own comments during the discussion was poorly worded. Referring to a benchmark as "fake" was imprecise. What we intended to criticize was the comparison methodology, not the integrity of the evaluation itself. Comparing a merged checkpoint against a single checkpoint is, in our view, not an apples-to-apples comparison. That said, this was never the core of the discussion. Our disagreement was not about SLERP, model merging, or whether training a small model on massive amounts of data is an interesting research direction. We support experimentation and unconventional ideas. The actual point of disagreement was much simpler. The statement that a 1M parameter model trained on 1 trillion tokens will become a "100M killer" is, today, a prediction, not an experimental result. Could it happen? Perhaps. Would it be exciting if it did? Absolutely. But until benchmark results, reproducible evaluations, and independent validation exist, we believe such statements should be presented as hypotheses rather than established conclusions. Research advances by testing ideas, not by assuming their outcomes. We sincerely wish Lane and everyone at Glint Research success in their experiments. Thank you to everyone who read it.
new activity 1 day ago
ThingAI/Quark-72M:Good one...
View all activity

Organizations

None yet