Content moderation models and datasets - 2025 Collection Models and datasets that support automatic content moderation • 21 items • Updated Dec 1, 2025 • 4
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 260M • • 4.79k
awesome safety resources Collection a directory of helpful datasets, tools, papers, and resources related to open source online safety • 27 items • Updated Jan 12 • 3
Article Open Collaboration in Action: Inside the Open Safeguard Hackathon roosttools • Dec 18, 2025 • 8
Running Agents 3 Llm Moderation Testing 🦀 3 A model to test different models assessing content policies