Mahmud ElHuseyni 🇵🇸
MElHuseyni
AI & ML interests
Computer Vision
NLP
Machine Learning
Recent Activity
liked
a dataset
about 8 hours ago
turkerberkdonmez/TUSGPT-TR-Medical-Dataset-v1
upvoted
an
article
about 14 hours ago
GGML and llama.cpp join HF to ensure the long-term progress of Local AI
liked
a model
1 day ago
Dexmal/DM0-base
Organizations
Image Segmentation Models 🍪
-
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 79.2k • • 39 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 629k • • 179 -
facebook/maskformer-swin-base-ade
Image Segmentation • Updated • 741 • • 13 -
facebook/maskformer-swin-base-coco
Image Segmentation • 0.1B • Updated • 3.4k • • 26
Object Detection Models 🍉
VLM Leaderboards 📈
-
Running45
OCRBenchv2 Leaderboard
🏆45Display OCRBench leaderboard for text recognition models
-
Running196
Vidore Leaderboard
🥇196Compare and rank visual document retrieval models across different benchmarks
-
Running on CPU Upgrade993
Open VLM Leaderboard
🌎993VLMEvalKit Evaluation Results Collection
-
RunningFeatured560
Vision Arena (Testing VLMs side-by-side)
🖼560Analyze images with multiple vision models for labels and boxes
SmolVLM 🚐
OCR Models 👀️📃
Visual Embedding Models 🖼️
-
jinaai/jina-embeddings-v4
Visual Document Retrieval • 4B • Updated • 178k • 477 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 26.5k • 96 -
nomic-ai/colnomic-embed-multimodal-7b
Visual Document Retrieval • Updated • 5.55k • 101 -
nvidia/llama-nemoretriever-colembed-3b-v1
Visual Document Retrieval • Updated • 532 • 74
Speech Models 🎧
Arabic Models (LLM, VLM, Multimodel)
SmolVLM 🚐
Image Segmentation Models 🍪
-
nvidia/segformer-b5-finetuned-cityscapes-1024-1024
Image Segmentation • Updated • 79.2k • • 39 -
nvidia/segformer-b0-finetuned-ade-512-512
Image Segmentation • 3.75M • Updated • 629k • • 179 -
facebook/maskformer-swin-base-ade
Image Segmentation • Updated • 741 • • 13 -
facebook/maskformer-swin-base-coco
Image Segmentation • 0.1B • Updated • 3.4k • • 26
OCR Models 👀️📃
Object Detection Models 🍉
Visual Embedding Models 🖼️
-
jinaai/jina-embeddings-v4
Visual Document Retrieval • 4B • Updated • 178k • 477 -
vidore/colqwen2.5-v0.2
Visual Document Retrieval • Updated • 26.5k • 96 -
nomic-ai/colnomic-embed-multimodal-7b
Visual Document Retrieval • Updated • 5.55k • 101 -
nvidia/llama-nemoretriever-colembed-3b-v1
Visual Document Retrieval • Updated • 532 • 74
VLM Leaderboards 📈
-
Running45
OCRBenchv2 Leaderboard
🏆45Display OCRBench leaderboard for text recognition models
-
Running196
Vidore Leaderboard
🥇196Compare and rank visual document retrieval models across different benchmarks
-
Running on CPU Upgrade993
Open VLM Leaderboard
🌎993VLMEvalKit Evaluation Results Collection
-
RunningFeatured560
Vision Arena (Testing VLMs side-by-side)
🖼560Analyze images with multiple vision models for labels and boxes
Speech Models 🎧