Datasets and models for the EMNLP paper "Scalable Data Ablation Approximations for Language Models through Modular Training and Merging"
Clara Na
claran
models 30
claran/s2orc-biology1994-1999-ind-130m • 8
claran/s2orc-biology2007-2008-ind-130m • 4
claran/s2orc-biology2013-2013-ind-130m • 3
claran/s2orc-biology2021-2021-ind-130m • 5
claran/s2orc-biology2019-2019-ind-130m • 5
claran/s2orc-biology2000-2003-ind-130m • 6
claran/s2orc-biology2015-2015-ind-130m • 7
claran/s2orc-biology2014-2014-ind-130m • 9
claran/s2orc-biology2004-2006-ind-130m • 5
claran/s2orc-biology2016-2016-ind-130m • 7
datasets 20
claran/chat_trace_2023 • 19.4k • 7
claran/code_trace_2023 • 8.82k • 18
claran/pg19-sample • 1.02k • 16
claran/wikitext-2-noheader-sample-v2 • 1.02k • 25
claran/wikitext-2-nonulls-sample-v2 • 1.02k • 9
claran/wmt14-fr-en-sample • 1.02k • 10
claran/imdb_sample • 1.02k • 14
claran/wikitext-2-noheader-sample • 10k • 8
claran/wikitext-2-nonulls-sample • 10k • 8
claran/samsum_sample • 1k • 3