Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Aman Gupta's picture

1 4

Aman Gupta PRO

amang1802

aastha6's profile picture

RodriMauri001's profile picture

·

AI & ML interests

None yet

Organizations

amang1802 's collections 7

ThinkTransformer experiments

Experiments with new architecture that enables latent space reasoning

amang1802/think_fineweb-edu_chkpts_exp2

Updated Feb 20
amang1802/think_fineweb-edu_chkpts_exp11

Updated Feb 22

Small model pretraining experiments

amang1802/llama_162M_fineweb100BT

Text Generation • 0.2B • Updated Aug 27 • 1
amang1802/llama_162M_fineweb10BT

Text Generation • 0.2B • Updated Dec 22, 2024

Synthetic Data rewrite (model checkpoints)

Models trained with synthetic data generated using various synthetic rewrite methods

amang1802/llama-3.1-8B-cpttest_mode1_fulltext

8B • Updated Jan 6
amang1802/llama-3.1-70B-cpttest_mode1_fulltext

71B • Updated Jan 6 • 1
amang1802/llama-3.1-8B-cpttest_mode2_qna_fulltext

8B • Updated Jan 7
amang1802/llama-3.1-70B-cpttest_mode2_qna_fulltext

71B • Updated Jan 7

WildeWeb Research

Soft Skills Data curation

amang1802/llama-3.1-70B-wildeweb-sample

71B • Updated Jan 18
amang1802/wildeweb_sample

Viewer • Updated Jan 17 • 38.3k • 6
amang1802/wildeweb_cls_1M

Viewer • Updated Jan 17 • 1M • 16
amang1802/wildeweb-safety-vibe-check

Viewer • Updated Jan 20 • 5 • 12

Smol-Math pretraining and post-training projects

amang1802/smol-math-400M

Text Generation • 0.4B • Updated Feb 9
amang1802/math-vibe-gsm-similar

Viewer • Updated Feb 9 • 5 • 14
amang1802/math-vibe-new

Viewer • Updated Feb 9 • 5 • 12

PPO experiments

Using PPO with simpler reward functions

amang1802/summary_train

Viewer • Updated Nov 21, 2024 • 1.28k • 19
amang1802/summary_train_med

Viewer • Updated Jun 6 • 18.4k • 16
amang1802/Llama3.2-1B-summary-length-1024-1ep

Text Generation • 1B • Updated Nov 21, 2024 • 1 •
amang1802/Llama3.2-1B-summary-length-exp2

Text Generation • 1B • Updated Nov 21, 2024 •

Synthetic Data rewrite research (training and eval datasets)

Researching methods for synthetic rewrites for CPT datasets and evaluating them in their ability to improve knowledge memorization

amang1802/synthetic_data_unconditioned_L3.1_70B

Viewer • Updated Dec 28, 2024 • 10.2k • 6
amang1802/synthetic_data_unconditioned_L3.1_70B_deduped

Viewer • Updated Dec 28, 2024 • 10.2k • 9
amang1802/synthetic_data_unconditioned_L3.1_405B_Instruct

Viewer • Updated Dec 29, 2024 • 10.2k • 8
amang1802/synthetic_data_unconditioned_L3.1_405B_Instruct_deduped

Viewer • Updated Dec 29, 2024 • 10.2k • 13

ThinkTransformer experiments

Experiments with new architecture that enables latent space reasoning

amang1802/think_fineweb-edu_chkpts_exp2

Updated Feb 20
amang1802/think_fineweb-edu_chkpts_exp11

Updated Feb 22

Smol-Math pretraining and post-training projects

amang1802/smol-math-400M

Text Generation • 0.4B • Updated Feb 9
amang1802/math-vibe-gsm-similar

Viewer • Updated Feb 9 • 5 • 14
amang1802/math-vibe-new

Viewer • Updated Feb 9 • 5 • 12

Small model pretraining experiments

amang1802/llama_162M_fineweb100BT

Text Generation • 0.2B • Updated Aug 27 • 1
amang1802/llama_162M_fineweb10BT

Text Generation • 0.2B • Updated Dec 22, 2024

PPO experiments

Using PPO with simpler reward functions

amang1802/summary_train

Viewer • Updated Nov 21, 2024 • 1.28k • 19
amang1802/summary_train_med

Viewer • Updated Jun 6 • 18.4k • 16
amang1802/Llama3.2-1B-summary-length-1024-1ep

Text Generation • 1B • Updated Nov 21, 2024 • 1 •
amang1802/Llama3.2-1B-summary-length-exp2

Text Generation • 1B • Updated Nov 21, 2024 •

Synthetic Data rewrite (model checkpoints)

Models trained with synthetic data generated using various synthetic rewrite methods

amang1802/llama-3.1-8B-cpttest_mode1_fulltext

8B • Updated Jan 6
amang1802/llama-3.1-70B-cpttest_mode1_fulltext

71B • Updated Jan 6 • 1
amang1802/llama-3.1-8B-cpttest_mode2_qna_fulltext

8B • Updated Jan 7
amang1802/llama-3.1-70B-cpttest_mode2_qna_fulltext

71B • Updated Jan 7

Synthetic Data rewrite research (training and eval datasets)

Researching methods for synthetic rewrites for CPT datasets and evaluating them in their ability to improve knowledge memorization

amang1802/synthetic_data_unconditioned_L3.1_70B

Viewer • Updated Dec 28, 2024 • 10.2k • 6
amang1802/synthetic_data_unconditioned_L3.1_70B_deduped

Viewer • Updated Dec 28, 2024 • 10.2k • 9
amang1802/synthetic_data_unconditioned_L3.1_405B_Instruct

Viewer • Updated Dec 29, 2024 • 10.2k • 8
amang1802/synthetic_data_unconditioned_L3.1_405B_Instruct_deduped

Viewer • Updated Dec 29, 2024 • 10.2k • 13

WildeWeb Research

Soft Skills Data curation

amang1802/llama-3.1-70B-wildeweb-sample

71B • Updated Jan 18
amang1802/wildeweb_sample

Viewer • Updated Jan 17 • 38.3k • 6
amang1802/wildeweb_cls_1M

Viewer • Updated Jan 17 • 1M • 16
amang1802/wildeweb-safety-vibe-check

Viewer • Updated Jan 20 • 5 • 12

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs