Fine-tuning and adapting small language models under compute constraints. Efficient training methods — LoRA, full fine-tuning, sequence packing. Deep Reinforcement Learning for autonomous decision-making. Evaluation methodology for domain-specific LLMs. Building and curating large-scale datasets for AI model training.