Skip to content
gqlxj1987's Blog
Go back

Bert Production intro

Edit page

原文链接

we needed to handle over 25,000 inferences per second (and over 1 billion inferences per day), at a latency of under 20ms

benchmark的重要性


Edit page
Share this post on:

Previous Post
Tars Intro
Next Post
Fedb Cluster