view post Post 2077 Only a single RTX 4090 running model pre-training is really slow, even for small language models!!! (https://huggingface.co/collections/JingzeShi/doge-slm-677fd879f8c4fd0f43e05458) See translation 2 replies ยท ๐ 8 8 ๐คฏ 6 6 ๐ 4 4 + Reply