InstaDeepAI/mace-organics

heloise-chomet committed on May 9

Commit d0a34bd (verified) · 1 parent: 74f6758

Update README.md

Updated description of MACE foundation model.

Files changed (1)

README.md +56 -0

@@ -1,3 +1,59 @@
 ## License summary
 1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.

+# MACE
+## Reference
+Ilyes Batatia, Dávid Péter Kovács, Gregor N. C. Simm, Christoph Ortner, and Gábor Csányi.
+MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields,
+2023. URL: https://arxiv.org/abs/2206.07697.
+## How to Use
+For complete usage instructions, please refer to our [documentation](https://instadeep.github.io/mlip).
+## Model architecture
+| Parameter                       | Value                | Description                                         |
+|---------------------------------|----------------------|-----------------------------------------------------|
+| `num_layers`                    | `2`                  | Number of MACE layers.                              |
+| `num_channels`                  | `128`                | Number of channels.                                 |
+| `l_max`                         | `3`                  | Maximal degree of spherical harmonics.              |
+| `node_symmetry`                 | `1`                  | Highest degree of node features.                    |
+| `correlation`                   | `3`                  | Maximum correlation order.                          |
+| `readout_irreps`                | `["16x0e", "0e"]`    | Irreps for the readout block.                       |
+| `num_readout_heads`             | `1`                  | Number of readout heads.                            |
+| `include_pseudotensors`         | `false`              | Whether all parities are kept.                      |
+| `num_bessel`                    | `8`                  | Number of Bessel basis functions.                   |
+| `activation`                    | `silu`               | Activation function in the non-linear readout block.|
+| `radial_envelope`               | `polynomial_envelope`| Radial envelope function.                           |
+| `symmetric_tensor_product_basis`| `false`              | Whether to use a symmetric tensor product basis.    |
+| `atomic_energies`               | `average`            | Treatment of atomic energies.                       |
+| `avg_num_neighbors`             | `null`               | Mean number of neighbors.                           |
+For more information about MACE hyperparameters,
+please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/models/mace.html#mlip.models.mace.config.MaceConfig).
+## Training
+Training runs for 220 epochs, with an exponential moving average (EMA) decay rate of 0.99.
+The loss is a Huber loss with scheduled weights for the energy and force terms.
+Initially, the energy term is weighted at 40 and the force term at 1000.
+At epoch 115, these weights are swapped, so the energy term is weighted at 1000 and the force term at 40.
+We use the default optimizer of the MLIP library (v1.0.0) with the following settings:
+| Parameter                        | Value   | Description                                         |
+|----------------------------------|---------|-----------------------------------------------------|
+| `init_learning_rate`             | `0.01`  | Initial learning rate.                              |
+| `peak_learning_rate`             | `0.01`  | Peak learning rate.                                 |
+| `final_learning_rate`            | `0.01`  | Final learning rate.                                |
+| `weight_decay`                   | `0`     | Weight decay.                                       |
+| `warmup_steps`                   | `4000`  | Number of optimizer warm-up steps.                  |
+| `transition_steps`               | `360000`| Number of optimizer transition steps.               |
+| `grad_norm`                      | `500`   | Gradient norm used for gradient clipping.           |
+| `num_gradient_accumulation_steps`| `1`     | Steps to accumulate before taking an optimizer step.|
+For more information about the optimizer,
+please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/training/optimizer.html#mlip.training.optimizer_config.OptimizerConfig).
+## Dataset
+| Parameter                   | Value | Description                                |
+|-----------------------------|-------|--------------------------------------------|
+| `graph_cutoff_angstrom`     | `5`   | Graph cutoff distance (in Å).              |
+| `max_n_node`                | `32`  | Maximum number of nodes allowed in a batch.|
+| `max_n_edge`                | `288` | Maximum number of edges allowed in a batch.|
+| `batch_size`                | `64`  | Number of graphs in a batch.               |
+This model was trained on the [SPICE2_curated dataset](https://huggingface.co/datasets/InstaDeepAI/SPICE2-curated).
+For more information about dataset configuration,
+please refer to our [documentation](https://instadeep.github.io/mlip/api_reference/data/dataset_configs.html#mlip.data.configs.GraphDatasetBuilderConfig).
 ## License summary
 1. The Licensed Models are **only** available under this License for Non-Commercial Purposes.
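
The loss-weight schedule described in the Training section added by this commit can be sketched in plain Python. This is only an illustration, not the MLIP library's implementation. The function names, the Huber threshold `delta=1.0`, the 0-indexed epoch convention, and the averaging over force components are all assumptions. Only the weights (40 and 1000) and the switch epoch (115) come from the README.

```python
def loss_weights(epoch: int, switch_epoch: int = 115) -> tuple[float, float]:
    """Return (energy_weight, force_weight) for a given epoch.

    Assumes 0-indexed epochs and a hard switch at `switch_epoch`.
    """
    if epoch < switch_epoch:
        return 40.0, 1000.0  # early phase: forces dominate the loss
    return 1000.0, 40.0      # late phase: weights are swapped


def huber(residual: float, delta: float = 1.0) -> float:
    """Standard Huber loss; `delta` is a hypothetical threshold."""
    a = abs(residual)
    if a <= delta:
        return 0.5 * a * a
    return delta * (a - 0.5 * delta)


def weighted_loss(energy_err: float, force_errs: list[float], epoch: int) -> float:
    """Combine Huber terms for energy and (mean) force errors with scheduled weights."""
    w_energy, w_force = loss_weights(epoch)
    force_term = sum(huber(f) for f in force_errs) / len(force_errs)
    return w_energy * huber(energy_err) + w_force * force_term


if __name__ == "__main__":
    for epoch in (0, 114, 115, 219):
        print(epoch, loss_weights(epoch))
```

With these assumptions, the same energy and force errors contribute very differently to the total loss before and after the switch, which is the point of the schedule: forces are emphasised first, energies later.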