We are seeking a highly skilled Senior AI/ML Core Engineer to lead the development and scaling of advanced AI and machine learning solutions across critical business platforms. The role focuses on building production-grade AI systems, optimizing ML infrastructure, and driving innovation in large language models (LLMs), intelligent automation, and real-time AI applications. The ideal candidate will combine strong research capability with hands-on engineering expertise to deliver scalable, reliable, and high-performing AI products.
Qualifications and Experience
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, Software Engineering, or a related field
- 5+ years of experience in AI/ML Engineering, Machine Learning Infrastructure, or Applied AI development
- Proven experience in building and deploying production-scale AI/ML systems
- Hands-on experience working with LLMs, deep learning frameworks, and distributed computing environments
- Experience leading technical initiatives and mentoring engineering teams is preferred
Job Description
- Design, develop, and optimize core AI products such as Trust Engine, KYC AI, eVA, and intelligent automation platforms
- Build and deploy scalable AI/ML systems integrating state-of-the-art LLMs and classical machine learning models into production environments
- Develop and maintain robust ML infrastructure, MLOps pipelines, and model-serving platforms with a focus on scalability, governance, and reliability
- Implement fine-tuning, retrieval-augmented generation (RAG), embeddings, and vector database solutions for AI applications
- Optimize model performance, inference latency, GPU utilization, and cloud infrastructure costs
- Collaborate with research, engineering, product, and infrastructure teams to define technical roadmaps and AI architecture standards
- Ensure compliance, auditability, monitoring, and observability of production AI systems
- Drive engineering excellence through coding best practices, architecture reviews, and technical mentorship
- Support real-time AI processing, distributed data systems, and large-scale model deployment pipelines
- Continuously evaluate emerging AI technologies, frameworks, and industry trends to improve product capabilities and engineering efficiency
Required Skills
- Expert-level proficiency in Python with a strong understanding of data structures, algorithms, and system design
- Hands-on experience with PyTorch and/or TensorFlow
- Strong familiarity with Hugging Face Transformers, scikit-learn, and XGBoost
- Experience building applications using Large Language Models (LLMs), including fine-tuning, RAG pipelines, embeddings, and vector databases (Pinecone, Weaviate, FAISS, pgvector, etc.)
- Working knowledge of MLOps platforms and tools such as MLflow, Kubeflow, SageMaker, or Vertex AI
- Experience with Docker, Kubernetes, and containerized deployment environments
- Production experience with cloud platforms including AWS, GCP, or Azure, especially GPU/accelerator workloads
- Strong SQL skills and experience with distributed data processing tools such as Spark, Ray, Dask, Airflow, or dbt
- Solid understanding of statistics, linear algebra, optimization techniques, and core machine learning theory
- Knowledge of monitoring, observability, and performance optimization in production AI systems
Benefits of Working at eSewa
- Stellar opportunity to work with a rising company
- An amazing, passionate young team and a beautiful office space
- The trust of the biggest FinTech company
- One-of-a-kind company culture and growth opportunities to accelerate your career progression
How to apply?
We are always keen to meet energetic and talented professionals who would like to join our team. To apply for the post, click the button below and submit your application.