Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
28
15
76
Nikita Kezins
entfane
Follow
nbeerbower's profile picture
frascuchon's profile picture
John6666's profile picture
10 followers
·
28 following
entfane
nikita-kezins
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated
a dataset
3 days ago
entfane/jailbreaks-only
published
a dataset
3 days ago
entfane/jailbreaks-only
updated
a model
3 days ago
entfane/llama-guard-binary
View all activity
Organizations
entfane
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
3 days ago
entfane/jailbreaks-only
Viewer
•
Updated
3 days ago
•
666
•
51
published
a dataset
3 days ago
entfane/jailbreaks-only
Viewer
•
Updated
3 days ago
•
666
•
51
updated
a model
3 days ago
entfane/llama-guard-binary
Text Classification
•
0.3B
•
Updated
3 days ago
•
63
published
a model
3 days ago
entfane/llama-guard-binary
Text Classification
•
0.3B
•
Updated
3 days ago
•
63
updated
a dataset
19 days ago
entfane/construction_points
Viewer
•
Updated
19 days ago
•
10k
•
170
published
a dataset
19 days ago
entfane/construction_points
Viewer
•
Updated
19 days ago
•
10k
•
170
updated
a model
22 days ago
entfane/Toxic_Llama8B
Text Classification
•
8B
•
Updated
22 days ago
•
124
published
a model
22 days ago
entfane/Toxic_Llama8B
Text Classification
•
8B
•
Updated
22 days ago
•
124
updated
a dataset
about 1 month ago
entfane/violent_eval
Viewer
•
Updated
Apr 9
•
22.4k
•
31
published
a dataset
about 1 month ago
entfane/violent_eval
Viewer
•
Updated
Apr 9
•
22.4k
•
31
updated
a model
about 1 month ago
entfane/gpt2_constitutional_classifier_violence
Text Classification
•
0.1B
•
Updated
Apr 7
•
5
published
a model
about 1 month ago
entfane/gpt2_constitutional_classifier_violence
Text Classification
•
0.1B
•
Updated
Apr 7
•
5
updated
a dataset
about 1 month ago
entfane/harmful_subsets
Viewer
•
Updated
Apr 7
•
571k
•
7
published
a dataset
about 1 month ago
entfane/harmful_subsets
Viewer
•
Updated
Apr 7
•
571k
•
7
updated
a dataset
about 1 month ago
entfane/preprocessed_toxigen
Viewer
•
Updated
Apr 3
•
10.1k
•
180
published
a dataset
about 1 month ago
entfane/preprocessed_toxigen
Viewer
•
Updated
Apr 3
•
10.1k
•
180
updated
a dataset
about 1 month ago
entfane/toxic_classification
Viewer
•
Updated
Apr 3
•
38.9k
•
7
published
a dataset
about 1 month ago
entfane/toxic_classification
Viewer
•
Updated
Apr 3
•
38.9k
•
7
updated
a model
about 1 month ago
entfane/bert_cyberharm
Text Classification
•
0.1B
•
Updated
Apr 1
•
26
published
a model
about 1 month ago
entfane/bert_cyberharm
Text Classification
•
0.1B
•
Updated
Apr 1
•
26
Load more