Perf: skip regex on every token unless think tag present b76b9b2 verified bilalnaveed commited on Feb 19
Perf: short system prompt 530->50 tokens, n_threads=2, ctx=512, max_tokens=80 2a442d4 verified bilalnaveed commited on Feb 19
Fix Gradio 6: remove type param from Chatbot (removed in v6, always messages format) cabb89e verified bilalnaveed commited on Feb 19
Fix Gradio 6: move theme/css to launch(), remove invalid Chatbot kwargs c7de55e verified bilalnaveed commited on Feb 19
Enable true real-time streaming by default and disable chat DB logging latency a1ec972 verified bilalnaveed commited on Feb 18
Ultra-fast preset: shorter context/output, reduced history, lower thinking overhead bb2212d verified bilalnaveed commited on Feb 18
Speed tuning: smooth chunked output mode + faster CPU defaults 5bdf790 verified bilalnaveed commited on Feb 18
Improve streaming smoothness and reduce latency on low hardware 97c7b53 verified bilalnaveed commited on Feb 18
Fix Gradio chat history format compatibility (tuple vs messages) 2577504 verified bilalnaveed commited on Feb 18
Fix chat streaming duplication, robust message normalization, and cleaner response style f09fa63 verified bilalnaveed commited on Feb 18
Add local models: faster-whisper (STT) + BLIP (vision) - all features now 100% local except image generation 6a81697 verified bilalnaveed commited on Feb 18
Fix all features: Update API URLs to router.huggingface.co, fix chat_fn signature bae7342 verified bilalnaveed commited on Feb 18
Fix: Use tuple format for Gradio 4.19.2 compatibility 7686e51 verified bilalnaveed commited on Feb 18
Fix: Downgrade to Gradio 4.19.2 to avoid API schema bug 147a1ad verified bilalnaveed commited on Feb 18
Fix version compatibility: pin huggingface_hub==0.20.3 gradio==4.19.2 aa76d7e verified bilalnaveed commited on Feb 18