External large foundation model: How to efficiently serve trillions of parameters for online ads recommendation

Publication
Companion Proceedings of the ACM on Web Conference 2025