Skip to content

EzheZhezhe/ML-Real-time-serving-with-Embedded-Model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Real-time serving with embedded model

Real-time serving with embedded model is about distributed event-at-a-time processing with millisecond latency and high throughput.

What to optimize: latency and throughput

End user: usually no direct interactions with a model

Validation: offline and online via A/B testing


Where to start

Learn MLOps general concepts:

Next learn more about real-time serving with embedded ML models:


Next step: Advanced workshop: Real-time serving with embedded model

This workshop is WIP

It will cover a real-life use case of embedding a machine learning model into streaming app and its troubleshooting

About

[WIP] Advanced workshop covering ML Real-time serving with Embedded model

Topics

Resources

License

Stars

Watchers

Forks