papers
updated
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published
• 160
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Paper
• 2403.13372
• Published
• 179
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper
• 2509.03405
• Published
• 24
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications
Paper
• 2503.17247
• Published
• 1
swiss-ai/Apertus-70B-2509
Text Generation
• 71B • Updated
• 1.15k
• 142
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
Paper
• 2509.04011
• Published
• 29
Why Language Models Hallucinate
Paper
• 2509.04664
• Published
• 196
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Paper
• 2205.15575
• Published