-
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
Paper • 2411.19655 • Published • 20 -
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 56 • 6 -
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 66 • 5 -
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 39 • 6
AI & ML interests
Babelscape is a deep tech company founded in 2016 focused on multilingual Natural Language Processing.
Recent Activity
View all activity
-
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
Paper • 2411.19655 • Published • 20 -
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 56 • 6 -
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 66 • 5 -
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 39 • 6
Word Sense Linking is the task designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory.
models 17
Babelscape/Qwen2.5-Math-7B-PRM800k-PDDL-r
8B • Updated
Babelscape/Llama-3.1-8B-PRM800k-r
8B • Updated
Babelscape/Llama-3.1-8B-PRM800k-PDDL-r
8B • Updated • 39
Babelscape/Qwen2.5-Math-7B-PRM800k-r
8B • Updated
Babelscape/t5-base-summarization-claim-extractor
0.2B • Updated • 2.03k • 14
Babelscape/wsl-reader-deberta-v3-base
0.2B • Updated • 120 • 4
Babelscape/wsl-retriever-e5-base-v2
Updated • 94 • 3
Babelscape/wsl-retriever-e5-base-v2-wordnet-index
Updated • 49 • 5
Babelscape/wsl-base
Updated • 55 • 3
Babelscape/mdeberta-v3-base-triplet-critic-xnli
Text Classification • 0.3B • Updated • 31 • 10
datasets 16
Babelscape/PDDL2PRM
Updated • 17
Babelscape/wsl
Viewer • Updated • 1.31k • 13 • 7
Babelscape/LLM-Oasis_claim_falsification
Viewer • Updated • 52.4k • 37 • 6
Babelscape/LLM-Oasis_unfactual_text_generation
Viewer • Updated • 81.2k • 39 • 6
Babelscape/LLM-Oasis_paraphrase_generation
Viewer • Updated • 81.3k • 32 • 6
Babelscape/LLM-Oasis_claim_verification
Viewer • Updated • 2.66k • 66 • 5
Babelscape/LLM-Oasis_claim_extraction
Viewer • Updated • 81.3k • 56 • 6
Babelscape/story-summeval
Viewer • Updated • 319 • 35 • 8
Babelscape/ALERT_DPO
Viewer • Updated • 45.7k • 90 • 14
Babelscape/ALERT
Viewer • Updated • 45.7k • 1.32k • 16