Phando/chemberta-v2-finetuned-uspto-50k-classification

Text Classification


This ChemBERTa-v2 checkpoint was fine-tuned on the USPTO-50k dataset for sequence classification.

The objective is to predict the reaction class label. The input is the canonicalized SMILES of either all reactants or all products of a reaction, with the individual molecules separated by ".".
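
The input format described above can be sketched as follows. This is a minimal illustration, not the author's preprocessing code: the helper names `build_input`, `rdkit_canonical`, and `predict_reaction_class` are hypothetical, and the use of RDKit for canonicalization is an assumption, since the card does not say which canonicalizer was used. Only the "." separator and the model ID come from the card.

```python
from typing import Callable, Iterable, Optional

# Model ID as published on the Hub.
MODEL_ID = "Phando/chemberta-v2-finetuned-uspto-50k-classification"


def rdkit_canonical(smiles: str) -> str:
    """Canonicalize one SMILES string with RDKit (assumed canonicalizer)."""
    from rdkit import Chem  # lazy import; requires RDKit to be installed

    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        raise ValueError(f"invalid SMILES: {smiles!r}")
    return Chem.MolToSmiles(mol)


def build_input(
    molecules: Iterable[str],
    canonicalize: Optional[Callable[[str], str]] = None,
) -> str:
    """Join all reactants (or all products) of one reaction with '.'."""
    # Drop empty entries and surrounding whitespace.
    parts = [m.strip() for m in molecules if m.strip()]
    if canonicalize is not None:
        parts = [canonicalize(p) for p in parts]
    return ".".join(parts)


def predict_reaction_class(text: str):
    """Run the checkpoint through the standard text-classification pipeline."""
    from transformers import pipeline  # lazy import; downloads the model

    classifier = pipeline("text-classification", model=MODEL_ID)
    # Returns a list of {"label": ..., "score": ...} dictionaries.
    return classifier(text)
```

For example, `build_input(["CCO", "CC(=O)O"], canonicalize=rdkit_canonical)` would produce a single "."-separated string that can be passed to `predict_reaction_class`. How the returned label names map to USPTO-50k reaction classes is not documented in the card.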

Train/test split: 0.99/0.01

Evaluation results:
- Accuracy: 87.11%
- Loss: 0.4272

Fine-tuning hyperparameters:
- seed = 233
- batch_size = 128
- num_epochs = 5 (training stopped early at epoch 4)
- learning_rate = 5e-4
- warmup_steps = 64
- weight_decay = 0.01
- lr_scheduler_type = "cosine"
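
To make the learning-rate settings concrete, the sketch below collects the listed hyperparameters in a dictionary and computes a cosine schedule with linear warmup. It assumes the common convention of a linear ramp from 0 over `warmup_steps` followed by a half-cosine decay to 0. The card does not state which training framework or scheduler implementation was used, and `total_steps` is a hypothetical parameter because the total number of optimizer steps is not reported.

```python
import math

# Hyperparameters as listed in the model card.
HYPERPARAMS = {
    "seed": 233,
    "batch_size": 128,
    "num_epochs": 5,
    "learning_rate": 5e-4,
    "warmup_steps": 64,
    "weight_decay": 0.01,
    "lr_scheduler_type": "cosine",
}


def lr_at_step(
    step: int,
    total_steps: int,
    base_lr: float = HYPERPARAMS["learning_rate"],
    warmup_steps: int = HYPERPARAMS["warmup_steps"],
) -> float:
    """Learning rate at a given optimizer step (assumed schedule shape)."""
    if step < warmup_steps:
        # Linear warmup from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the post-warmup phase completed, from 0 to 1.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from base_lr down to 0.
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    total = 1000  # hypothetical step count, for illustration only
    for s in (0, 32, 64, 532, 1000):
        print(s, lr_at_step(s, total))
```

Under these assumptions the rate reaches its peak of 5e-4 at step 64 and decays to 0 at the final step.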


Model size: 83.5M params (Safetensors, tensor type F32)
