Building on HF
18.1 TFLOPS 3 followers ·
12 following
AI & ML interests guy with low end hardware learning about llm's and learning about finetuning
my hardware:
intel xeon E5v4
32gb ddr4 ram
amd rx 5700 8gb vram (Hf allows to set from 6600 lol)
512gb ssd
contact:
dc: simonko_11015
github: Simonko912
Recent Activity reacted
to
ajibawa-2023 's
post with 🔥 about 17 hours ago Cpp-Code-Large
Dataset: https://huggingface.co/datasets/ajibawa-2023/Cpp-Code-Large
Cpp-Code-Large is a large-scale corpus of C++ source code comprising more than 5 million lines of C++ code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the C++ ecosystem.
By providing a high-volume, language-specific corpus, Cpp-Code-Large enables systematic experimentation in C++-focused model training, domain adaptation, and downstream code understanding tasks.
Cpp-Code-Large addresses the need for a dedicated C++-only dataset at substantial scale, enabling focused research across systems programming, performance-critical applications, embedded systems, game engines, and large-scale native software projects. View all activity
Organizations None yet
simonko912 's activity All Models Datasets Spaces Papers Collections Community Posts Upvotes Likes Articles