We are looking for a Lead/Principal Machine Learning Engineer with strong expertise in Java and AI/ML development to design and deliver scalable, high-performing B2B SaaS solutions. In this role, you will develop and deploy intelligent agents within multi-agent frameworks, build agentic workflows, and contribute to innovative AI-driven applications. You will also design and implement microservices using Java, Spring Boot, Kafka, and MySQL, ensuring secure, reliable, and efficient systems throughout the development lifecycle. This is a hands-on leadership role where you will influence architecture decisions, drive adoption of GenAI and LLM-based solutions, and deliver impactful, production-ready applications.
Essential functions
- Develop agents in multi-agent frameworks and contribute to agentic workflow management and tool building.
- Design, develop, test, and document microservices and enterprise applications using Java, Spring Boot, Kafka, and MySQL.
- Apply appropriate standards, processes, and tools throughout the software development lifecycle to ensure secure, reliable solutions.
- Participate in architecture design and reviews, and perform code reviews to verify quality and compliance with requirements.
- Support code deployment across development, staging, and production environments, including authoring deployment procedures.
- Troubleshoot and resolve production and security issues, ensuring system stability and performance.
- Develop and optimize database queries, fix bugs, and improve the performance of existing applications.
- Build and maintain large-scale, multi-tier applications, delivering highly scalable and high-performing solutions aligned with business goals.
Qualifications
- Strong knowledge and working experience with GenAI, LLM, implementing and deploying AI Agents
- 6+ experience with Python in AI development
- 7+ experience with Core Java and Java web application development for RESTful web services
- 7+ years of strong knowledge with Spring Framework, Sprint boot, and Build Tools, such as Maven
- 5+ working knowledge in Reactive and Event-driven programming using Kafka
- Strong OOP Design Skills and experience with design patterns
- Ability to understand and author APIs
- Practice in Test-driven Development
- Good knowledge of multi-threading and concurrency
Would be a plus
- Experience with CI/CD using Jenkins, Docker, Kubernetes
- Experience in langchain, Langgraph, MVP, and other machine learning frameworks is a plus
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package — medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership inenterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.