Android Models
Collection
LiteRT models that can run on Android
•
20 items
•
Updated
•
33
This model provides a few variants of the embedding model published in the Gecko paper that are ready for deployment on Android or iOS using LiteRT stack or google ai edge RAG SDK.
Note that all benchmark stats are from a Samsung S23 Ultra.
Backend | Max sequence length | Init time (ms) | Inference time (ms) | Memory (RSS in MB) | Model size (MB) | |
---|---|---|---|---|---|---|
dynamic_int8 |
GPU |
256 |
1306.06 |
76.2 |
604.5 |
114 |
dynamic_int8 |
GPU |
512 |
1363.38 |
173.2 |
604.6 |
120 |
dynamic_int8 |
GPU |
1024 |
1419.87 |
397 |
871.1 |
145 |
dynamic_int8 |
CPU |
256 |
11.03 |
147.6 |
126.3 |
114 |
dynamic_int8 |
CPU |
512 |
30.04 |
353.1 |
225.6 |
120 |
dynamic_int8 |
CPU |
1024 |
79.17 |
954 |
619.5 |
145 |