I did some testing on the scalability of FWKV. It hits a speed bottleneck at 1B parameters because of the T4's bandwidth limitations. In theory, it should match RWKV's inference speed on a GPU with more bandwidth, so the speed I measured at the 1B size is not an accurate reflection of the architecture.
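
For anyone who wants to sanity-check the bandwidth argument, here is a rough back-of-the-envelope sketch. It is not how I ran the tests. It assumes single-stream decoding where every weight is read from GPU memory once per token, fp16 weights (2 bytes per parameter), and the T4's published peak memory bandwidth of 320 GB/s. The function name and the model sizes in the loop are made up for illustration, and real throughput will be lower than this ceiling.

```python
def max_tokens_per_second(n_params, bytes_per_param, bandwidth_gb_s):
    """Upper bound on decode speed if all weights are read once per token.

    Assumption: the workload is purely memory-bandwidth bound, so compute,
    kernel launch overhead, and state/activation traffic are ignored.
    """
    weight_bytes = n_params * bytes_per_param
    return bandwidth_gb_s * 1e9 / weight_bytes


T4_BANDWIDTH_GB_S = 320  # published peak for the T4; sustained is lower

# Illustrative model sizes only, not the sizes I tested.
for n_params in (1e8, 5e8, 1e9):
    ceiling = max_tokens_per_second(n_params, 2, T4_BANDWIDTH_GB_S)
    print(f"{n_params:.0e} params: at most {ceiling:.0f} tokens/s")
```

The ceiling scales inversely with model size and directly with bandwidth, which is why the same model should speed up on a card with more bandwidth.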