Published On Mar 6, 2024
Check out the next episode of Beers with Engineers at AI Infra Club. Together with Michael Feil from Gradient, we will talk about Embeddings inference using Infinity; https://lnkd.in/dw73eqK8 π«
π Here is a short overview of the agenda:
- βWhy open-sourcing an embedding engine is important
β- Which models to run and why choose Python over Rust and C++ (I am especially looking forward to this one π )
β- TL;DR about some tricks to improve throughput, reranking, and classification
β- What Infinity does on a high level, their roadmap and demo
#beerswithengineers #aiinfrastructure #AIOps #mlops #AIDevOps #GPUComputing #CloudAI#AIScaling#MachineLearningInfrastructure #runai #aistack #ml