alt-gnome/telegram-spam
Viewer • Updated • 21.4k • 194 • 4
How to use corall88/russian_spam_detector with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="corall88/russian_spam_detector") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("corall88/russian_spam_detector")
model = AutoModelForSequenceClassification.from_pretrained("corall88/russian_spam_detector")Модель russian_spam_detector предназначена для бинарной классификации текстов на 2 категории:
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline
model_name = "corall88/russian_spam_detector"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)
detector = pipeline("text-classification", model=model, tokenizer=tokenizer)
message = "Поздравляем! Вы выиграли 1000000 рублей, пройдите по ссылке - ..."
predict = detector(message)
print(predict)
В качетсвете данных для файнтюнинга модели был выбран датасет cо спам сообщениями.
Модель основана на RuModernBERT-base и дообучена на задаче бинарной классификации.
| Metric | Value |
|---|---|
| Accuracy | 0.99 |
| F1-score | 0.99 |
| Precision | 0.99 |
| Recall | 0.99 |
@misc{russian_spam_detector,
title={russian_spam_detector: modern model for spam detection},
author={corall88},
url={https://huggingface.co/corall88/russian_spam_detector},
publisher={Hugging Face}
year={2025},
}
Base model
deepvk/RuModernBERT-base