End-to-end ML lifecycle on enterprise AWS.
- Managing the end-to-end machine learning lifecycle on an enterprise AWS platform, empowering 50+ data scientists to seamlessly transition from project onboarding and Kubeflow environment provisioning to full scale model training.
- Developing a Gen AI chatbot leveraging Retrieval-Augmented Generation (RAG) against comprehensive ATO knowledge bases, leveraging LangChain, AWS etc, automating responses to 10,000+ monthly queries and reducing manual query resolution time by 40%.
- Performing critical BAU operations and ML infrastructure maintenance, executing monthly Amazon EKS cluster patching and optimizing model inference pipelines via KServe and ModelServe to guarantee 99.9% platform uptime.
- Administrating centralized ML libraries and dependencies via Artifactory, ensuring secure, standardized package distribution for 100+ concurrent projects, which accelerated environment configuration times by 50%.















