LLM-jp

Team

university

https://llm-jp.nii.ac.jp/en/

llm_jp

llm-jp

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

e-mon updated a dataset 4 days ago

llm-jp/leaderboard-requests-v2

Silviase updated a dataset 5 days ago

llm-jp/jawildtext

speed new activity 5 days ago

llm-jp/Jagle-VL-2.2B-Jagle-FineVision:failed to load model

View all activity

Papers

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models

JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation

View all Papers

llm-jp 's collections 17

LLM-jp-4 Models

llm-jp/llm-jp-4-8b-base

Text Generation • 9B • Updated Apr 24 • 4.03k • 6
llm-jp/llm-jp-4-8b-instruct

Text Generation • 9B • Updated Apr 24 • 15.1k • 7
llm-jp/llm-jp-4-8b-thinking

Text Generation • 9B • Updated Apr 24 • 27k • 38
llm-jp/llm-jp-4-32b-a3b-base

Text Generation • 32B • Updated Apr 24 • 538 • 5

WAON

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

Paper • 2510.22276 • Published Oct 25, 2025 • 3
llm-jp/WAON-Bench

Viewer • Updated Apr 13 • 1.87k • 740 • 2
llm-jp/waon-siglip2-base-patch16-256

Zero-Shot Image Classification • 0.4B • Updated Nov 2, 2025 • 720 • 1
llm-jp/WAON

Updated Nov 6, 2025 • 104 • 8

Optimal Sparsity Code

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

llm-jp/optimal-sparsity-code-d512-E8-k2-320M-A170M

Text Generation • 0.3B • Updated Feb 19 • 7
llm-jp/optimal-sparsity-code-d512-E16-k2-520M-A170M

Text Generation • 0.5B • Updated Feb 19 • 5
llm-jp/optimal-sparsity-code-d512-E32-k2-920M-A170M

Text Generation • 0.9B • Updated Feb 19 • 9
llm-jp/optimal-sparsity-code-d512-E64-k2-1.7B-A170M

Text Generation • 2B • Updated Feb 19 • 6

LLM-jp-3.1 Fine-tuned Models

Fine-tuned models in the LLM-jp-3 model series

llm-jp/llm-jp-3.1-8x13b-instruct4

Text Generation • 73B • Updated May 30, 2025 • 284 • 4
llm-jp/llm-jp-3.1-8x13b-32K-instruct4

Text Generation • 73B • Updated Feb 25 • 521 • 2
llm-jp/llm-jp-3.1-13b-instruct4

Text Generation • 14B • Updated May 30, 2025 • 1.59k • 19
llm-jp/llm-jp-3.1-1.8b-instruct4

Text Generation • 2B • Updated May 30, 2025 • 3.18k • 20

Open Japanese LLM leaderboard

Runtime error

Agents

108

Open Japanese LLM Leaderboard

🌸

108

Explore and compare LLM models with interactive filters and visualizations
llm-jp/leaderboard-requests

Viewer • Updated Oct 23, 2025 • 3 • 430 • 2
llm-jp/leaderboard-contents

Viewer • Updated Oct 23, 2025 • 862 • 103 • 1
llm-jp/leaderboard-results

Updated Oct 23, 2025 • 17.1k • 1

Drop-Upcycling

llm-jp/FS-8x1.5B

9B • Updated Feb 27, 2025 • 4
llm-jp/BTX-8x1.5B

9B • Updated Feb 27, 2025 • 5
llm-jp/FS-8x3.7B

19B • Updated Feb 27, 2025 • 4
llm-jp/NU-8x1.5B

9B • Updated Feb 27, 2025 • 6

LLM-jp-3.1 Pre-trained Models

Pre-trained models in the LLM-jp-3.1 model series

llm-jp/llm-jp-3.1-8x13b

Text Generation • 73B • Updated May 30, 2025 • 10
llm-jp/llm-jp-3.1-8x13b-32K

Text Generation • 73B • Updated Feb 25 • 32 • 1
llm-jp/llm-jp-3.1-13b

Text Generation • 14B • Updated May 30, 2025 • 189 • 2
llm-jp/llm-jp-3.1-1.8b

Text Generation • 2B • Updated May 30, 2025 • 774 • 13

LLM-jp ver2.0 Models

Models in the LLM-jp ver2.0 model series

llm-jp/llm-jp-13b-v2.0

Text Generation • Updated Apr 30, 2024 • 809 • 15
llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 5
llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 8 • 1
llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 348 • 3

LLM-jp ver1.0 Models

Models in the LLM-jp ver1.0 model series

llm-jp/llm-jp-13b-v1.0

Text Generation • Updated Oct 20, 2023 • 1.3k • 41
llm-jp/llm-jp-13b-instruct-full-jaster-v1.0

Text Generation • Updated Oct 20, 2023 • 944 • 15
llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0

Text Generation • Updated Oct 20, 2023 • 953 • 8
llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0

Text Generation • Updated Oct 20, 2023 • 946 • 4

Jagle

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models

llm-jp/Jagle

Updated 18 days ago • 1.13k • 15
llm-jp/Jagle-VL-2.2B-Jagle-FineVision

Feature Extraction • 2B • Updated 5 days ago • 48 • 2
llm-jp/Jagle-VL-2.2B-FineVision

Feature Extraction • 2B • Updated 20 days ago • 11 • 1
llm-jp/Jagle-VL-2.2B-Jagle

Feature Extraction • 2B • Updated Apr 13 • 44 • 4

Llama-Mimi

Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens

llm-jp/Llama-Mimi-1.3B

Audio-to-Audio • 1B • Updated Oct 2, 2025 • 1.58k • 11
llm-jp/Llama-Mimi-8B

Audio-to-Audio • 8B • Updated Sep 19, 2025 • 9.86k • 12
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens

Paper • 2509.14882 • Published Sep 18, 2025 • 2

Optimal Sparsity Math

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M

Text Generation • 0.3B • Updated Feb 19 • 7
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M

Text Generation • 0.5B • Updated Feb 19 • 11
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M

Text Generation • 0.9B • Updated Feb 19 • 9
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M

Text Generation • 2B • Updated Feb 19 • 5

LLM-jp-3 Fine-tuned Models

Fine-tuned models in the LLM-jp-3 model series

llm-jp/llm-jp-3-8x13b-instruct3

Text Generation • 73B • Updated Apr 1, 2025 • 309 • 8
llm-jp/llm-jp-3-172b-instruct3

Text Generation • 172B • Updated Jan 20, 2025 • 282 • 11
llm-jp/llm-jp-3-13b-instruct3

Text Generation • 14B • Updated Feb 4, 2025 • 366 • 8
llm-jp/llm-jp-3-8x1.8b-instruct3

Text Generation • 9B • Updated Apr 1, 2025 • 218 • 4

Multi Modal Models

llm-jp/llm-jp-4-vl-9b-beta

Feature Extraction • 9B • Updated 18 days ago • 2.32k • 12
llm-jp/JAMMEval

Viewer • Updated Apr 8 • 1.59k • 823 • 5
llm-jp/llm-jp-3-vila-14b

Image-Text-to-Text • Updated Nov 18, 2024 • 16 • 11
llm-jp/llm-jp-clip-vit-base-patch16

Zero-Shot Image Classification • Updated Apr 30, 2025 • 84 • 1

Sparse Autoencoders

llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c988240

0.1B • Updated Mar 12, 2025 • 10
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c100000

Updated Mar 18, 2025
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c10000

Updated Mar 18, 2025
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c1000

Updated Mar 18, 2025

LLM-jp-3 Pre-trained Models

Pre-trained models in the LLM-jp-3 model series

llm-jp/llm-jp-3-8x13b

Text Generation • 73B • Updated Mar 27, 2025 • 49
llm-jp/llm-jp-3-172b

Text Generation • 172B • Updated Dec 23, 2024 • 3 • 4
llm-jp/llm-jp-3-8x1.8b

Text Generation • 9B • Updated Mar 27, 2025 • 493
llm-jp/llm-jp-3-13b

Text Generation • 14B • Updated Sep 26, 2024 • 158 • 13

LLM-jp ver1.1 Models

Models in the LLM-jp ver1.1 model series

llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1

Text Generation • Updated Mar 12, 2024 • 1
llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1

Text Generation • 13B • Updated Feb 7, 2024 • 181 • 2
llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1

Text Generation • Updated Mar 12, 2024 • 1

LLM-jp-4 Models

llm-jp/llm-jp-4-8b-base

Text Generation • 9B • Updated Apr 24 • 4.03k • 6
llm-jp/llm-jp-4-8b-instruct

Text Generation • 9B • Updated Apr 24 • 15.1k • 7
llm-jp/llm-jp-4-8b-thinking

Text Generation • 9B • Updated Apr 24 • 27k • 38
llm-jp/llm-jp-4-32b-a3b-base

Text Generation • 32B • Updated Apr 24 • 538 • 5

Jagle

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models

llm-jp/Jagle

Updated 18 days ago • 1.13k • 15
llm-jp/Jagle-VL-2.2B-Jagle-FineVision

Feature Extraction • 2B • Updated 5 days ago • 48 • 2
llm-jp/Jagle-VL-2.2B-FineVision

Feature Extraction • 2B • Updated 20 days ago • 11 • 1
llm-jp/Jagle-VL-2.2B-Jagle

Feature Extraction • 2B • Updated Apr 13 • 44 • 4

WAON

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models

Paper • 2510.22276 • Published Oct 25, 2025 • 3
llm-jp/WAON-Bench

Viewer • Updated Apr 13 • 1.87k • 740 • 2
llm-jp/waon-siglip2-base-patch16-256

Zero-Shot Image Classification • 0.4B • Updated Nov 2, 2025 • 720 • 1
llm-jp/WAON

Updated Nov 6, 2025 • 104 • 8

Llama-Mimi

Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens

llm-jp/Llama-Mimi-1.3B

Audio-to-Audio • 1B • Updated Oct 2, 2025 • 1.58k • 11
llm-jp/Llama-Mimi-8B

Audio-to-Audio • 8B • Updated Sep 19, 2025 • 9.86k • 12
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens

Paper • 2509.14882 • Published Sep 18, 2025 • 2

Optimal Sparsity Code

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

llm-jp/optimal-sparsity-code-d512-E8-k2-320M-A170M

Text Generation • 0.3B • Updated Feb 19 • 7
llm-jp/optimal-sparsity-code-d512-E16-k2-520M-A170M

Text Generation • 0.5B • Updated Feb 19 • 5
llm-jp/optimal-sparsity-code-d512-E32-k2-920M-A170M

Text Generation • 0.9B • Updated Feb 19 • 9
llm-jp/optimal-sparsity-code-d512-E64-k2-1.7B-A170M

Text Generation • 2B • Updated Feb 19 • 6

Optimal Sparsity Math

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M

Text Generation • 0.3B • Updated Feb 19 • 7
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M

Text Generation • 0.5B • Updated Feb 19 • 11
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M

Text Generation • 0.9B • Updated Feb 19 • 9
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M

Text Generation • 2B • Updated Feb 19 • 5

LLM-jp-3.1 Fine-tuned Models

Fine-tuned models in the LLM-jp-3 model series

llm-jp/llm-jp-3.1-8x13b-instruct4

Text Generation • 73B • Updated May 30, 2025 • 284 • 4
llm-jp/llm-jp-3.1-8x13b-32K-instruct4

Text Generation • 73B • Updated Feb 25 • 521 • 2
llm-jp/llm-jp-3.1-13b-instruct4

Text Generation • 14B • Updated May 30, 2025 • 1.59k • 19
llm-jp/llm-jp-3.1-1.8b-instruct4

Text Generation • 2B • Updated May 30, 2025 • 3.18k • 20

LLM-jp-3 Fine-tuned Models

Fine-tuned models in the LLM-jp-3 model series

llm-jp/llm-jp-3-8x13b-instruct3

Text Generation • 73B • Updated Apr 1, 2025 • 309 • 8
llm-jp/llm-jp-3-172b-instruct3

Text Generation • 172B • Updated Jan 20, 2025 • 282 • 11
llm-jp/llm-jp-3-13b-instruct3

Text Generation • 14B • Updated Feb 4, 2025 • 366 • 8
llm-jp/llm-jp-3-8x1.8b-instruct3

Text Generation • 9B • Updated Apr 1, 2025 • 218 • 4

Open Japanese LLM leaderboard

Runtime error

Agents

108

Open Japanese LLM Leaderboard

🌸

108

Explore and compare LLM models with interactive filters and visualizations
llm-jp/leaderboard-requests

Viewer • Updated Oct 23, 2025 • 3 • 430 • 2
llm-jp/leaderboard-contents

Viewer • Updated Oct 23, 2025 • 862 • 103 • 1
llm-jp/leaderboard-results

Updated Oct 23, 2025 • 17.1k • 1

Multi Modal Models

llm-jp/llm-jp-4-vl-9b-beta

Feature Extraction • 9B • Updated 18 days ago • 2.32k • 12
llm-jp/JAMMEval

Viewer • Updated Apr 8 • 1.59k • 823 • 5
llm-jp/llm-jp-3-vila-14b

Image-Text-to-Text • Updated Nov 18, 2024 • 16 • 11
llm-jp/llm-jp-clip-vit-base-patch16

Zero-Shot Image Classification • Updated Apr 30, 2025 • 84 • 1

Drop-Upcycling

llm-jp/FS-8x1.5B

9B • Updated Feb 27, 2025 • 4
llm-jp/BTX-8x1.5B

9B • Updated Feb 27, 2025 • 5
llm-jp/FS-8x3.7B

19B • Updated Feb 27, 2025 • 4
llm-jp/NU-8x1.5B

9B • Updated Feb 27, 2025 • 6

Sparse Autoencoders

llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c988240

0.1B • Updated Mar 12, 2025 • 10
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c100000

Updated Mar 18, 2025
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c10000

Updated Mar 18, 2025
llm-jp/llm-jp-3-1.8b-sae-l12-k32-16x-c1000

Updated Mar 18, 2025

LLM-jp-3.1 Pre-trained Models

Pre-trained models in the LLM-jp-3.1 model series

llm-jp/llm-jp-3.1-8x13b

Text Generation • 73B • Updated May 30, 2025 • 10
llm-jp/llm-jp-3.1-8x13b-32K

Text Generation • 73B • Updated Feb 25 • 32 • 1
llm-jp/llm-jp-3.1-13b

Text Generation • 14B • Updated May 30, 2025 • 189 • 2
llm-jp/llm-jp-3.1-1.8b

Text Generation • 2B • Updated May 30, 2025 • 774 • 13

LLM-jp-3 Pre-trained Models

Pre-trained models in the LLM-jp-3 model series

llm-jp/llm-jp-3-8x13b

Text Generation • 73B • Updated Mar 27, 2025 • 49
llm-jp/llm-jp-3-172b

Text Generation • 172B • Updated Dec 23, 2024 • 3 • 4
llm-jp/llm-jp-3-8x1.8b

Text Generation • 9B • Updated Mar 27, 2025 • 493
llm-jp/llm-jp-3-13b

Text Generation • 14B • Updated Sep 26, 2024 • 158 • 13

LLM-jp ver2.0 Models

Models in the LLM-jp ver2.0 model series

llm-jp/llm-jp-13b-v2.0

Text Generation • Updated Apr 30, 2024 • 809 • 15
llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 5
llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 8 • 1
llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0

Text Generation • 14B • Updated Apr 30, 2024 • 348 • 3

LLM-jp ver1.1 Models

Models in the LLM-jp ver1.1 model series

llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1

Text Generation • Updated Mar 12, 2024 • 1
llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1

Text Generation • 13B • Updated Feb 7, 2024 • 181 • 2
llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1

Text Generation • Updated Mar 12, 2024 • 1

LLM-jp ver1.0 Models

Models in the LLM-jp ver1.0 model series

llm-jp/llm-jp-13b-v1.0

Text Generation • Updated Oct 20, 2023 • 1.3k • 41
llm-jp/llm-jp-13b-instruct-full-jaster-v1.0

Text Generation • Updated Oct 20, 2023 • 944 • 15
llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0

Text Generation • Updated Oct 20, 2023 • 953 • 8
llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0

Text Generation • Updated Oct 20, 2023 • 946 • 4

AI & ML interests

Recent Activity

Papers

Team members 34

llm-jp 's collections 17

Open Japanese LLM Leaderboard

Open Japanese LLM Leaderboard