New multilingual severity classifiers for vulnerability analysis
from cedric@lemmy.ml to security@lemmy.ml on 06 Apr 15:55
https://lemmy.ml/post/45549798
🚀 We’ve just published a new article introducing a Russian-language severity classifier, along with improved English and Chinese models for vulnerability descriptions.
👉 vulnerability-lookup.org/…/russian-severity-class…
The models are trained with VulnTrain, served via ML-Gateway, and fully integrated into Vulnerability-Lookup.
🔍 What’s new
- 🇷🇺 Russian severity classifier based on FSTEC (BDU) data, using a ruRoBERTa-large model
- 🇬🇧 Improved English model trained on diverse sources (CVE, GitHub, PySec, CSAF…)
- 🇨🇳 Improved Chinese model leveraging CNVD data
- 📊 Better training insights with per-class precision / recall / F1 metrics
- 🧩 Multi-source datasets with traceable origins and dynamic dataset cards
All datasets and models are openly available on Hugging Face.
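For readers less familiar with the per-class metrics mentioned in the list above, here is a minimal sketch of how precision, recall and F1 can be computed for each severity class. This is not VulnTrain's actual code: the function name `per_class_metrics` and the label names (low, medium, high, critical) are assumptions chosen for illustration.

```python
from collections import Counter


def per_class_metrics(y_true, y_pred):
    """Return {label: {"precision", "recall", "f1"}} for every label seen.

    Illustrative helper only (hypothetical name, not part of VulnTrain).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class was wrong
            fn[truth] += 1   # true class was missed

    metrics = {}
    for label in sorted(set(y_true) | set(y_pred)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = {"precision": precision, "recall": recall, "f1": f1}
    return metrics


# Toy data with assumed severity labels, not real evaluation results.
y_true = ["low", "medium", "high", "critical", "high", "medium"]
y_pred = ["low", "high", "high", "critical", "medium", "medium"]

for label, scores in per_class_metrics(y_true, y_pred).items():
    print(label, scores)
```

Reporting these numbers per class rather than as a single accuracy figure makes it easier to spot when a model does well on common severities but confuses neighbouring ones, such as medium and high in the toy data.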
⚙️ Under the hood
- VulnTrain 3.1.0 adds FSTEC support, dataset traceability, and improved model selection
- ML-Gateway 0.5.0 now supports multilingual severity classification out of the box
This work is part of the AIPITCH project, supporting practical AI integration in cybersecurity workflows.