metaphi-ai committed on
Commit db02725 · verified · 1 Parent(s): b2d9272

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -15,9 +15,9 @@ pinned: false
 
  | Agent | Occupation | Complexity | Scale | What It Tests | Verifiers |
  |-------|--------|-------|-------|------------|--------|
- | **[Fin Agent](link)** | Credit analyst | 32+ expert hours | 2,610 tasks, 26K+ PDFs | Multiple document reasoning → taxonomy aware transaction categorization → Business P&L construction | Binary pass/fail |
- | **[Enterprise Knowledge Agent](link)** | Senior business analyst | 16+ expert hours| 1,220 pitch-deck tasks, 45 video tasks, 279 preference pairs | Source faitfhulness → narrative arc based story-telling --> design coherenece| Precision, Recall on citation. Subjective on video preference |
- | **[Front-end Agent](link)** | Senior Frontend engineer | 60-100 expert hours | 37 tasks, 147 expert preferences | Figma environment navigation → design system creation → build verification | Subjective on output preference |
+ | **[Fin Agent](link)** | Credit analyst | 32+ expert hours | 2,610 tasks, 26K+ PDFs | Multiple document reasoning → taxonomy aware transaction categorization → Business P&L construction | Programmatic: Binary pass/fail |
+ | **[Enterprise Knowledge Agent](link)** | Senior business analyst | 16+ expert hours| 1,220 pitch-deck tasks, 45 video tasks, 279 preference pairs | Source faitfhulness → narrative arc based story-telling --> design coherenece| Skill-based rubrics and Preference-pairs |
+ | **[Front-end Agent](link)** | Senior Frontend engineer | 60-100 expert hours | 37 tasks, 147 expert preferences | Figma environment navigation → design system creation → build verification | Skill-based rubrics and Preference-pairs |
 
  ## Leaderboard
 