Cofos Logo v2 β€” Personalized Python Code Assistant

Cofos Logo v2 is a 522M-parameter personalized code assistant, fine-tuned from AMFORGE/cofos_v2 on a curated corpus of personal coding conversations and project artifacts. It is the first personalization-tier model in the Cofos family by AMEFORGE, built on the proprietary SparseMind architecture.

The "Logo" suffix denotes that this variant has been fine-tuned on a specific developer's personal style and patterns. The base model remains general-purpose; this variant is a directional bias toward a particular coding voice.


Model Summary

Field Value
Developer AMEFORGE
Architecture SparseMind v15 (proprietary)
Parameters 522M
Context length 2048 tokens
Vocabulary 16,384 (NexusBPE, FR-aware)
Languages French, English
Base model AMFORGE/cofos_v2
Specialization Personal coding style + Python
License Apache 2.0
Status Experimental β€” first personalization tier

Intended Use

Primary use cases

  • Personalized Python code generation that reflects a specific developer's style and preferred patterns
  • Bilingual coding assistance (French/English) where consistency with the developer's authored prompts matters
  • Experimental baseline for studying personalization of small code models through targeted fine-tuning

Out-of-scope

This model is not designed for:

  • General-purpose code assistance for other developers (the personal bias may produce surprising stylistic outputs)
  • Tasks requiring broad knowledge or long-context reasoning
  • Languages other than French and English
  • Languages other than Python (other syntaxes appear only as artifacts of the base)
  • Production deployment without human review

Cofos Logo v2 is a research artifact, not a general assistant. It explores whether targeted fine-tuning of a small specialized base on personal data produces a measurably "voiced" model.


Performance

Evaluated on the same held-out set of 100 real Python instruction prompts used during base training.

Metric Cofos v2 base Cofos Logo v2 (this)
Real-syntax-valid (n=100) 67.0% 66.0%
Validation loss 3.07 2.98

The fine-tune produces a small drop in raw Python syntactic validity (-1pp) and a meaningful improvement in validation loss on personal+code mixed data (-0.09). This is the expected trade-off of personalization: the model has shifted some capacity toward modeling personal style at a marginal cost to generic Python correctness.

A separate external benchmark (50 prompts, semantic execution tests) is available in the Cofos benchmark suite. Refer to that for comparison against open-source small code models.


Training

Fine-tune configuration

  • Base: AMFORGE/cofos_v2 (522M, step 11500, 67% real_syntax_valid)
  • Method: Continued training (not LoRA β€” full parameter fine-tune)
  • Steps: 1500
  • Learning rate: 1e-5 (conservative β€” base is heavily trained)
  • Schedule: 100-step warmup, cosine decay to 0
  • Best checkpoint: step 600 (saved as cofos_model.pt)

Data mix

  • 65% personal data β€” curated conversations and code artifacts from the model author
  • 25% distilled instruction-response data β€” kept as anti-catastrophic-forgetting signal so generic skills are preserved
  • 10% real Python instructions β€” production patterns from public datasets

All personal data was redacted for secrets (API keys, tokens, PEM keys) before training. Redaction tokens use stable placeholders so the model learns to emit the placeholder rather than the secret.

Tokenizer

AMFORGE/cofos_tok_v2 β€” NexusBPE-based custom tokenizer (vocab 16384, FR-aware, with atomic structural tags <instruction> <thought> <python> <code> <test>). Inherited from the base model unchanged.


Lineage

cofos_tok_v2 (tokenizer, NexusBPE)
       ↓
cofos_v2 (base code model, from scratch, 67% syntax)
       ↓
cofos_logo_v2 (this model β€” personal fine-tune, 66% syntax)

A parallel branch (cofos_logo_general) is planned, fine-tuning the upcoming general-purpose base instead of cofos_v2. The two variants will be compared empirically.


Usage

Loading

from huggingface_hub import hf_hub_download

checkpoint_path = hf_hub_download(repo_id="AMFORGE/cofos_logo_v2",
                                  filename="cofos_model.pt")
tokenizer_path  = hf_hub_download(repo_id="AMFORGE/cofos_logo_v2",
                                  filename="cofos_tokenizer.model")

Loading and inference require the AMEFORGE SparseMind runtime. The model architecture is proprietary; contact AMEFORGE for access to the runtime.

Prompt format

<instruction>Write a Python function that ...</instruction>

For chain-of-thought generation, prime with <thought>:

<instruction>Write a Python function that ...</instruction>
<thought>

For code generation primed with the wrapper:

<instruction>Write a Python function that ...</instruction>
<python><code>

Limitations & Risks

Personalization-specific

  • Stylistic bias: The model has been fine-tuned on one developer's style. Output will reflect that style and may feel unnatural to other developers.
  • Voice persistence: The model has learned recurring patterns (variable naming, function structure, comment style) from the personal corpus. These will appear even when not requested.
  • Topical narrowness: The personal corpus has its own topical distribution (heavy on ML, light on web frontend, etc.). Generations on topics outside that distribution may degrade in quality.

Inherited from base

  • Capacity: 522M parameters is small. Complex multi-step reasoning is limited.
  • Hallucination: As with all autoregressive language models, generated code can look plausible but be subtly wrong (invented variable names, swapped function names, etc.). Always test before use.
  • No safety alignment: No RLHF, no refusal training. Should not be deployed in user-facing products without safety layers.

Privacy

The training data was redacted for technical secrets (API keys, tokens, PEM keys) before training. Personal information beyond such secrets (style, opinions, project context) is intentionally retained because the purpose of personalization is to learn that style. Users should be aware that interacting with this model exposes them to the trained style of the original author.


Environmental Considerations

This fine-tune adds approximately 5 GPU-hours (A100) on top of the base model's training cost. The personalization approach is designed to be lightweight, making personalized models accessible without retraining from scratch.


License

Released under the Apache 2.0 license. You are free to use, modify, and redistribute it, subject to the terms of the license.


Citation

@misc{cofos_logo_v2_2026,
  title  = {Cofos Logo v2: A Personalized Python Code Assistant},
  author = {{AMEFORGE}},
  year   = {2026},
  url    = {https://huggingface.co/AMFORGE/cofos_logo_v2}
}

Contact

For questions, collaborations, or access to the AMEFORGE SparseMind runtime:

  • Organization: AMEFORGE
  • HuggingFace: @AMFORGE

Cofos Logo v2 is part of the broader Cofos model family by AMEFORGE. See the AMFORGE organization page for the base model, sibling variants, and ongoing work on the general-purpose foundation.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for AMFORGE/cofos_logo_v2

Base model

AMFORGE/cofos_v2
Finetuned
(1)
this model

Evaluation results