Skip to content

BentoML and the GIL #946

Answered by parano
jondoering asked this question in General
Jul 31, 2020 · 1 comments · 2 replies
Discussion options

You must be logged in to vote

Hi @jondoering, there is no scalability issue with GIL in BentoML. BentoML uses gunicorn to manage multiple processes of model backend workers. In BentoML, the frontend micro-batching layer(the MarshalServer in the code) is built with asyncio and aiohttp. It sends batched requests to the model backend, which runs a Flask app in a Gunicorn server.

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@jondoering
Comment options

@bojiang
Comment options

Answer selected by parano
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants