brand logo
View All Jobs

Gen AI + Python (Fast API) | MLE/Senior MLE | ML Platform

Bangalore
Job Description

JOB DESCRIPTION
Senior MLE (Platform)_ Gen AI + FastAPI
Locations: Bangalore (Hybrid)
Who we are :
Tiger Analytics is a global leader in AI and analytics, helping Fortune 1000 companies solve their toughest challenges. We offer fullstack AI and analytics services & solutions to empower businesses to achieve real outcomes and value at scale. We are on a mission to push the boundaries of what AI and analytics can do to help enterprises navigate uncertainty and move forward decisively. Our purpose is to provide certainty to shape a better tomorrow.
Our team of 4000+ technologists and consultants are based in the US, Canada, the UK, India, Singapore and Australia, working closely with clients across CPG, Retail, Insurance, BFS, Manufacturing, Life Sciences, and Healthcare. Many of our team leaders rank in Top 10 and 40 Under 40 lists, exemplifying our dedication to innovation and excellence.
We are a Great Place to Work-Certified™ (2022-24), recognized by analyst firms such as Forrester, Gartner, HFS, Everest, ISG and others. We have been ranked among the ʻBestʼ and ʻFastest Growingʼ analytics firms lists by Inc., Financial Times, Economic Times and Analytics India Magazine.
We are seeking an experienced and dynamic Python - FastAPI Lead to join our team. The ideal talent will have a strong technical background, extensive hands-on experience in Python and FastAPI, and the ability to lead a team of developers to build scalable and high-performing web applications and APIs.
About The Role
  • Lead Position involved in setting the direction and goals for the ML Engineering team, in terms of impact, ML system design, and ML Innovation
  • Drive the development and adoption of ML platform accelerators solutions in areas such as  MLOPS Automation, Model Monitoring and Observability, LLM Ops and LLM Observability.
  • Lead cross-functional technical teams spanning deep learning, machine learning, distributed systems, program management, app engineering, and product teams. Oversee all aspects of the design, development, and delivery of machine learning-enabled end-to-end pipelines and solutions
  • Design and architect services and pipelines to support the full machine learning lifecycle from Development to Deployment. Architect solutions for High Availability, Low Latency, and High Throughput.
  • Improve system reliability-Troubleshoot and investigate any identified issues, Own complex online/ production performance issues, leveraging in-depth knowledge of how the ML  system’s work
  • Identify possible issues and performance leakages in the system and perform optimization
  • Build and perfect test cases to introduce highly reliable and scalable application platforms
  • Leading a team from a technical perspective to develop ML best practices and influence engineering culture Writing high-performance, reliable, and maintainable code.
  • Support the delivery of the platform solution by providing technical oversight and guidance to the respective Project Teams and ensure High customer satisfaction.
  • Lead, mentor, and manage a team of ML engineers, fostering a collaborative and high-performance environment.
  • Stay informed on emerging technologies and trends, explore cutting-edge deep learning models, prototype innovative ideas, and conduct both offline and online experiments.
What do we expect?
  • Bachelor’s or master’s degree in Computer Science, Engineering, or related fields.
  • 5+ years of experience in Machine Learning, Generative AI, and algorithm design, with expertise in modern ML stacks (PyTorch, TensorFlow, Hugging Face, LangChain, etc.).
  • 3+ years of hands-on experience in building and deploying real-time ML systems, including model serving, streaming, and low-latency inference.
  • Expertise in API development using FastAPIFlask, or similar frameworks, with a strong focus on microservices architecture.
  • Proven experience with real-time communication protocols such as WebSockets and Server-Sent Events (SSE) for streaming and event-driven applications.
  • Strong proficiency in Python programming, with deep knowledge of asynciomulti-threading, and multi-processing for high-performance systems.
  • Deep understanding of MLOps and LLMOps, including model deployment, monitoring, and scaling using Docker and Kubernetes.
  • Experience with cloud platforms (AWS, GCP, Azure) for deploying and managing real-time ML systems and APIs.
  • Hands-on experience with Generative AI technologies, including LLM fine-tuningprompt engineering, and RAG (Retrieval-Augmented Generation) frameworks.
  • Proficient with SQL and NoSQL databases, and experience with vector databases (e.g., Pinecone, Weaviate) for Gen AI applications.
  • Strong understanding of DevOps practices, including CI/CD pipelines, containerization, orchestration, and observability tools.
  • Experience in building scalable, event-driven architectures and deploying ML models in production environments.
  • Comfortable with fast-paced development and zero-to-one build skills, particularly in the context of Generative AI and real-time systems.
  • Proven experience in designing and managing live products/platforms, collaborating with product managers and stakeholders to achieve business goals.
  • Strong intuition for product development and a deep understanding of customer requirements, particularly in the context of Generative AI applications.
  • Expertise in version control (Git), containerization (Docker), and orchestration (Kubernetes) for scalable deployments.
  • Enthusiastic about building high-performing teams and fostering effective organizational structures.
  • High integrity, curiosity, and a strong desire to learn and grow as a software engineer, with a focus on cutting-edge Gen AI technologies.
Preferred Skills (Nice-to-Have):
  • Experience with AI/ML security best practices and ethical AI considerations.
  • Familiarity with AI observability tools (e.g. MLflow) for monitoring and debugging ML models in production.
  • Knowledge of low-latency architectures and real-time inference systems for Gen AI models.
  • Experience with streaming technologies such as Apache Kafka or similar tools.


Preferred Qualifications:
  • Prior experience leading development teams, with strong mentoring and leadership abilities.
  • Proven track record of delivering high-quality, scalable solutions on time.
Soft Skills:
  • Strong verbal and written communication skills.
  • Ability to work collaboratively in a team environment.
  • Proactive approach to identifying challenges and proposing solutions.
You are important to us, letʼs stay connected!
Every individual comes with a different set of skills and qualities so even if you donʼt tick all the boxes for the role today, we urge you to apply as there might be a suitable/unique role for you tomorrow. We are an equal opportunity employer. Our diverse and inclusive culture and values guide us to listen, trust, respect, and encourage people to grow the way they desire.
Note: The designation will be commensurate with expertise and experience. Compensation packages are among the best in the industry.
Additional Benefits: Health insurance (self & family), virtual wellness platform, and knowledge communities.