Commit History

Perf: skip regex on every token unless think tag present
b76b9b2
verified

bilalnaveed commited on

Perf: short system prompt 530->50 tokens, n_threads=2, ctx=512, max_tokens=80
2a442d4
verified

bilalnaveed commited on

Fix Gradio 6: remove type param from Chatbot (removed in v6, always messages format)
cabb89e
verified

bilalnaveed commited on

Fix Gradio 6: remove show_copy_button from Chatbot
cc7ec53
verified

bilalnaveed commited on

Fix Gradio 6: move theme/css to launch(), remove invalid Chatbot kwargs
c7de55e
verified

bilalnaveed commited on

Upload README.md with huggingface_hub
6163e0e
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
087c95a
verified

bilalnaveed commited on

Upload Dockerfile with huggingface_hub
420a1ed
verified

bilalnaveed commited on

Upload requirements-hf.txt with huggingface_hub
5240ebb
verified

bilalnaveed commited on

Upload config.py with huggingface_hub
97421ca
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
97fe68d
verified

bilalnaveed commited on

Enable true real-time streaming by default and disable chat DB logging latency
a1ec972
verified

bilalnaveed commited on

Ultra-fast preset: shorter context/output, reduced history, lower thinking overhead
bb2212d
verified

bilalnaveed commited on

Upload config.py with huggingface_hub
b81cc00
verified

bilalnaveed commited on

Speed tuning: smooth chunked output mode + faster CPU defaults
5bdf790
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
e5b444c
verified

bilalnaveed commited on

Improve streaming smoothness and reduce latency on low hardware
97c7b53
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
9bd8260
verified

bilalnaveed commited on

Fix Gradio chat history format compatibility (tuple vs messages)
2577504
verified

bilalnaveed commited on

Fix chat streaming duplication, robust message normalization, and cleaner response style
f09fa63
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
3b3c93e
verified

bilalnaveed commited on

Complete backend upload
2fab3ca
verified

bilalnaveed commited on

Fix: Add error handling for vision model loading
3e182f1
verified

bilalnaveed commited on

Add local models: faster-whisper (STT) + BLIP (vision) - all features now 100% local except image generation
6a81697
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
fdeca0d
verified

bilalnaveed commited on

Fix all features: Update API URLs to router.huggingface.co, fix chat_fn signature
bae7342
verified

bilalnaveed commited on

Fix: Use tuple format for Gradio 4.19.2 compatibility
7686e51
verified

bilalnaveed commited on

Fix: Downgrade to Gradio 4.19.2 to avoid API schema bug
147a1ad
verified

bilalnaveed commited on

Fix Gradio API schema bug - manual chat interface
e8e3d37
verified

bilalnaveed commited on

Compile from source for Debian (glibc compatibility)
4bb2db7
verified

bilalnaveed commited on

Add build-essential + pin llama-cpp v0.2.90
6920f48
verified

bilalnaveed commited on

Fix llama-cpp install order
f1eba7b
verified

bilalnaveed commited on

Switch to Docker SDK
a7e9890
verified

bilalnaveed commited on

Remove gradio (auto-added by HF Spaces)
2922ee9
verified

bilalnaveed commited on

Fix version compatibility: pin huggingface_hub==0.20.3 gradio==4.19.2
aa76d7e
verified

bilalnaveed commited on

Remove llama-cpp from requirements
bdb30fb
verified

bilalnaveed commited on

Add Dockerfile for prebuilt wheel
cc70c4b
verified

bilalnaveed commited on

Use Python 3.11 + prebuilt wheel
6643b8b
verified

bilalnaveed commited on

Trigger rebuild
b00fb70
verified

bilalnaveed commited on

Upload requirements.txt with huggingface_hub
4e34abd
verified

bilalnaveed commited on

Upload database.py with huggingface_hub
09258b6
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
caffb70
verified

bilalnaveed commited on

Upload README.md with huggingface_hub
97abed9
verified

bilalnaveed commited on

Upload config.py with huggingface_hub
7585f42
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
5e3bbd7
verified

bilalnaveed commited on

Upload inference.py with huggingface_hub
3be6421
verified

bilalnaveed commited on

Upload conversation.py with huggingface_hub
e5aa07d
verified

bilalnaveed commited on

Upload config.py with huggingface_hub
45c5ee6
verified

bilalnaveed commited on

Upload app.py with huggingface_hub
5291dfc
verified

bilalnaveed commited on

Upload requirements.txt with huggingface_hub
b78d77c
verified

bilalnaveed commited on