view post Post 1659 ###### CVPR2025 Workshop Challenge Alert ######š« Between deadlines, rebuttals, and existential crises??? "We got you!!!!"š¢ Our new CVPR25 multi-modal challenge is online !!!š½ļø Dishcovery: VLM MetaFood Challenge!!!! š½ļøšš§« Can your groundbreaking VLM understand the difference between sushi styles, pasta types, or cooking methods from just image + caption pairs?š Our Task: Match fine-grained images to food descriptionsChallenge Highlights:š¦ 400K food image-caption pairs, a little taste to get you started !!!š¬ Got a SoTA VLM? Come test it on our challenging test sets !!!šÆ Challenge for everyone! Easy to use SigLIP baseline is provided !!!š Real, synthetic, noisy data ā just like real life - Will your VLM redefine how people track their diets??? ( š£ļø We believe so!!! )š Join the challenge: https://www.kaggle.com/competitions/dishcovery-vlm-mtf-cvpr-2025šļø Deadline: Phase I: 4th of May, 2025 - Phase II: 10th of May, 2025š Workshop website: https://sites.google.com/view/cvpr-metafood-2025#CVPR25 #ComputerVision #CV #Deeplearning #DL #VisionLanguage #VLM #multimodal #FoundationModels See translation š„ 4 4 + Reply
Running 9 Nerfies: Deformable Neural Radiance Fields š§ 9 Turn casual videos into 3D portraits from any angle
Running Nerfies: Deformable Neural Radiance Fields š§ Turn casual videos into 3D portraits from any angle