2 5

bogeumkim

nsbg

AI & ML interests

NLP

Recent Activity

updated a collection about 2 months ago

eval-papers-collection

updated a collection about 2 months ago

eval-papers-collection

updated a collection about 2 months ago

eval-papers-collection

View all activity

Organizations

Collections 1

spaces 1

First Agent Template

⚡

models 3

datasets 3

bogeumkim/cipher-sft-dataset-ver0.2

Viewer • Updated May 14, 2024 • 5.1k • 22

bogeumkim/emotion_cls

Viewer • Updated Aug 24, 2023 • 19.7k • 14

bogeumkim/sw_hackathon_dataset

Viewer • Updated Aug 20, 2023 • 103k • 33

bogeumkim

AI & ML interests

Recent Activity

Organizations

Collections 1

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

spaces 1

First Agent Template

models 3

bogeumkim/pentest-llama-3-ko-sft

bogeumkim/mistral-7b-ko-example

bogeumkim/polyglot-1.3b-qlora-emotion-classification

datasets 3

bogeumkim/cipher-sft-dataset-ver0.2

bogeumkim/emotion_cls

bogeumkim/sw_hackathon_dataset

bogeumkim

AI & ML interests

Recent Activity

Organizations

Collections 1

spaces 1

First Agent Template

models 3 Sort: Recently updated

datasets 3 Sort: Recently updated

models 3

datasets 3