Update README.md
Browse files
README.md
CHANGED
|
@@ -1,8 +1,20 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
|
| 2 |
# NetuArk Posts Classifier (Ensemble Architecture)
|
| 3 |
|
| 4 |
This model is a novel ensemble classifier designed to categorize technology-related social media posts into their respective news sources.
|
| 5 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
## Model Details
|
| 7 |
- **Architecture:** Voting Classifier (Multinomial Naive Bayes + Logistic Regression)
|
| 8 |
- **Vectorization:** TF-IDF (N-grams 1-3)
|
|
@@ -17,4 +29,4 @@ Trained on the [Xerv-AI/netuark-posts-6000](https://huggingface.co/datasets/Xerv
|
|
| 17 |
import joblib
|
| 18 |
model = joblib.load('netuark_ensemble_classifier.joblib')
|
| 19 |
prediction = model.predict(["New AI breakthrough on HackerNews"])
|
| 20 |
-
```
|
|
|
|
| 1 |
+
---
|
| 2 |
+
datasets:
|
| 3 |
+
- Xerv-AI/GRAD
|
| 4 |
+
---
|
| 5 |
|
| 6 |
# NetuArk Posts Classifier (Ensemble Architecture)
|
| 7 |
|
| 8 |
This model is a novel ensemble classifier designed to categorize technology-related social media posts into their respective news sources.
|
| 9 |
+
The model is trained to classify the following sources:
|
| 10 |
+
- ArsTechnica
|
| 11 |
+
- FT
|
| 12 |
+
- GuardianTech
|
| 13 |
+
- HackerNews
|
| 14 |
+
- Slashdot
|
| 15 |
+
- TechCrunch
|
| 16 |
+
- TheVerge
|
| 17 |
+
-
|
| 18 |
## Model Details
|
| 19 |
- **Architecture:** Voting Classifier (Multinomial Naive Bayes + Logistic Regression)
|
| 20 |
- **Vectorization:** TF-IDF (N-grams 1-3)
|
|
|
|
| 29 |
import joblib
|
| 30 |
model = joblib.load('netuark_ensemble_classifier.joblib')
|
| 31 |
prediction = model.predict(["New AI breakthrough on HackerNews"])
|
| 32 |
+
```
|