CATastrophe - Code Vulnerability Detector
This model is an autoencoder-based vulnerability detector for Python code. It uses TF-IDF vectorization and an autoencoder architecture to detect anomalies in code that may indicate vulnerabilities.
Model Details
- Architecture: Autoencoder (Input β 512 β 128 β 512 β Input)
- Input Features: 2000 (TF-IDF)
- Training Loss: 0.0005
- Framework: PyTorch
Usage
import torch
import pickle
from model import Autoencoder
# Load model
model = Autoencoder(input_dim=2000)
model.load_state_dict(torch.load('catastrophe_model.pth'))
model.eval()
# Load vectorizer
with open('vectorizer.pkl', 'rb') as f:
vectorizer = pickle.load(f)
# Analyze code
code_text = "your code here"
features = vectorizer.transform([code_text]).toarray()
features_tensor = torch.tensor(features, dtype=torch.float32)
with torch.no_grad():
reconstructed = model(features_tensor)
anomaly_score = torch.mean((features_tensor - reconstructed) ** 2, dim=1)
Training Configuration
- Batch Size: 256
- Epochs: 50
- Learning Rate: 0.001
- Optimizer: Adam
Limitations
This model is trained on vulnerable commits only and uses reconstruction error as an anomaly score. High scores indicate potential vulnerabilities, but manual review is recommended.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support