4 18

Shravan Nayak

BAJUKA

https://bajuka.github.io/

BAJUKA

AI & ML interests

NLP

Recent Activity

upvoted a paper 3 days ago

How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning

upvoted a paper 6 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

upvoted a paper 8 days ago

Forecasting Downstream Performance of LLMs With Proxy Metrics

View all activity

Organizations

upvoted a paper 3 days ago

How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning

Paper • 2605.27310 • Published 5 days ago • 18

upvoted a paper 6 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 11 days ago • 105

upvoted a paper 8 days ago

Forecasting Downstream Performance of LLMs With Proxy Metrics

Paper • 2605.18607 • Published 13 days ago • 14

upvoted a paper 9 days ago

RiT: Vanilla Diffusion Transformers Suffice in Representation Space

Paper • 2605.21981 • Published 10 days ago • 10

updated a dataset 14 days ago

BAJUKA/data

Preview • Updated 14 days ago • 389

published a dataset 14 days ago

BAJUKA/data

Preview • Updated 14 days ago • 389

upvoted a paper 15 days ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published 19 days ago • 61

upvoted a paper about 1 month ago

Sema Code: Decoupling AI Coding Agents into Programmable, Embeddable Infrastructure

Paper • 2604.11045 • Published Apr 13 • 26

upvoted a paper about 2 months ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 95

updated a model about 2 months ago

BAJUKA/llavanext-qwen25-3b-siglip-train1p5m-ovvideo

3B • Updated Apr 10 • 1

published a model about 2 months ago

BAJUKA/llavanext-qwen25-3b-siglip-train1p5m-ovvideo

3B • Updated Apr 10 • 1

upvoted 2 papers about 2 months ago

Communicating about Space: Language-Mediated Spatial Integration Across Partial Views

Paper • 2603.27183 • Published Mar 28 • 20

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

New activity in ServiceNow/VideoCUA 2 months ago

Add video-text-to-text task category and usage instructions

#3 opened 2 months ago by

nielsr

authored a paper 2 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

commented a paper 2 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98 •

upvoted a paper 2 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

New activity in ServiceNow/VideoCUA 2 months ago

Update README.md

#2 opened 2 months ago by

HideOnBush

Upload cua-suite-teaser.png

#1 opened 2 months ago by

HideOnBush

updated a dataset 2 months ago

ServiceNow/VideoCUA

Updated Mar 30 • 960 • 32

Shravan Nayak

AI & ML interests

Recent Activity

Organizations

BAJUKA's activity

Add video-text-to-text task category and usage instructions

Update README.md

Upload cua-suite-teaser.png