chapter: mlops-engines #28
Conversation
- intro too long
- intro mostly off-topic (what does it have to do with MLOps Engines? a lot of this looks more relevant to other chapters)
- missing clear definition of "MLOps Engine"
- missing list/table of MLOps Engines & feature comparison (see e.g. this)
- missing justifications (every claim needs to be backed up, you can't just state personal opinions)
- use clean book-not-blog language, e.g.
## Some Thoughts About The Future
->## Future
With large language models, the story is no different
->LLMs are similar
- stick to third-person, and definitely don't keep changing between first, second, and third
- avoid repetition
- did you see the original section outline? e.g. Python Bindings, Apache TVM, links to read before writing anything, etc.
mlops-engines.md (Outdated)
> ## The MLOps Lifecycle
>
> ![](https://static.premai.io/book/mlops-engines-LLMOps-diagram.jpg)
I'm not sure I understand this diagram - what is it trying to show?
This diagram shows the LLMOps lifecycle. It's just supposed to serve as a banner image.
I'm particularly struggling to understand the meaning of the arrows and the colours.
mlops-engines.md (Outdated)
> ## Challenges With Open-Source MLOps
>
> MLOps has always been available in two flavors. One is the managed version, where all the components are provided out of the box for a steep price. The other is a DIY setup where you stitch together various open-source components.
citation needed
citation added
mlops-engines.md (Outdated)
> Due to the challenge of running LLMs, enterprises will opt to use an inference server instead of containerizing the model in-house. Most companies don't have the expertise to optimize these models, but they still want the performance benefits. Inference servers, whether they are open-source or not, will be the path forward.
>
> Another pattern that's emerging is that models will move to the data instead of the data moving to the model. Right now if you call the ChatGPT API, you would be sending your data to the model. Enterprises have worked very hard over the past decade to set up robust data infrastructure in the cloud. It makes a lot more sense to bring the model into the same cloud environment where the data is. This is where open-source models being cloud agnostic have a huge advantage.
citations needed. If it's your personal opinion, state why. Otherwise assume the reader disagrees with every claim you make.
added citations
mlops-engines.md (Outdated)
> * Takes a while for new LLMs to be supported
>
> Many other open-source projects like [BentoML](https://www.bentoml.com/), [FastAPI](https://fastapi.tiangolo.com/), and [Flask](https://flask.palletsprojects.com/en/2.3.x/) have been used for serving models in the past. The reason I have not included them on this list is that these open-source tools do not provide the optimizations you need to run LLMs in production.
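For context on the "optimizations" the quoted text alludes to: one concrete example is request batching, which dedicated LLM inference servers do and plain web frameworks don't. The toy sketch below is illustrative only — the cost constants are made-up assumptions, not benchmarks — but it shows why amortising the fixed cost of a forward pass across a batch matters.

```python
# Toy model (illustrative numbers, not benchmarks): each forward pass has a
# fixed overhead (weight reads, kernel launches) plus a small per-sequence
# cost. A generic web framework runs one forward pass per request; an
# inference server groups concurrent requests into batches.

FIXED_OVERHEAD_MS = 40.0  # hypothetical fixed cost per forward pass
PER_SEQ_MS = 2.0          # hypothetical marginal cost per sequence in a batch

def serve_one_by_one(n_requests: int) -> float:
    """Flask/FastAPI-style serving: one forward pass per request."""
    return n_requests * (FIXED_OVERHEAD_MS + PER_SEQ_MS)

def serve_batched(n_requests: int, batch_size: int) -> float:
    """Inference-server-style serving: requests grouped into batches."""
    full, rem = divmod(n_requests, batch_size)
    batches = [batch_size] * full + ([rem] if rem else [])
    return sum(FIXED_OVERHEAD_MS + PER_SEQ_MS * b for b in batches)

naive = serve_one_by_one(64)     # 64 passes, overhead paid 64 times
batched = serve_batched(64, 16)  # 4 passes, overhead paid 4 times
print(f"naive: {naive:.0f} ms, batched: {batched:.0f} ms, "
      f"speedup: {naive / batched:.1f}x")
```

Real inference servers go further (continuous batching, paged KV-caches, fused/quantized kernels), but the amortisation effect above is the core reason a plain Flask endpoint leaves GPU throughput on the table.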
that's not a reason. Also, what are "the optimizations you need to run LLMs in production"?
force-pushed from 671da89 to 3c8bae8
force-pushed from 11eb2f9 to 53b7dca
Review checklist
- `# h1-Title`
- `## Summary/Introduction` (title or equivalent)
- `## Future` developments
- `{{ comments }}`
- filename (`new-chapter.md`), add `_toc.yml` entry & `index.md` table row
- `_config.yml:sphinx.config.linkcheck`*
- fixes #4