Andersen is hiring an AI Platform Engineer to enhance a secure cloud-based platform by building scalable AI infrastructure, orchestration, and observability for reliable, intelligent performance.
The customer is an international company delivering professional and technology-enabled solutions that support effective collaboration, structured communication, and operational efficiency for organizations. It operates in a fast-growing environment, focusing on scalability, security, and continuous improvement while developing digital platforms used by diverse clients worldwide.
The project is focused on enhancing a secure, cloud-based board management platform with intuitive meeting tools, real-time collaboration, and advanced analytics. It also includes building and maintaining scalable AI infrastructure, orchestration patterns, and observability to ensure reliable and intelligent platform performance.
Responsibilities:
- Implementing multi-agent orchestration and tool-calling patterns.
- Building observability infrastructure (tracing, logging, cost tracking).
- Establishing API gateway patterns and runtime consistency.
- Designing failure handling, retry logic, and context routing.
- Enforcing tool schema standards across teams.
- Documenting and evangelizing platform patterns.
Must-have:
- Experience as an AI Platform Engineer, LLM Infrastructure Engineer or in a similar role for 4+ years.
- Solid experience working with LLM system architecture and infrastructure, including multi‑agent orchestration and tool‑calling.
- Practical experience with LLM frameworks (e.g., LangChain, LlamaIndex, Semantic Kernel, or equivalents).
- Knowledge of major cloud platforms (Azure/AWS/GCP) and distributed systems services.
- Experience with logging, metrics, and tracing tools: OpenTelemetry, Prometheus, Grafana, Elastic, Datadog.
- Strong proficiency in Python or another commonly used ML/infrastructure language.
- Good understanding of API design principles (REST, gRPC), JSON Schema, and OpenAPI specifications.
- Hands‑on experience implementing agent orchestration patterns and tool invocation in high‑load AI environments.
- Level of English – from Upper-Intermediate and above.
Nice to have:
- Proven ability to build observability infrastructure, including distributed tracing, logging, metrics, and cost tracking.
- Experience designing and implementing API gateway patterns: request routing, version control, unified responses, and runtime consistency.
- Strong understanding of resilience mechanisms such as error handling, retry logic, fallback strategies, and context routing.
- Experience establishing and enforcing tool schema standards across cross‑functional teams.
- Ability to document technical solutions and promote architectural patterns across the organization.
Reasons why this job would be interesting to you:
- Experience in teamwork with leaders in FinTech, Healthcare, Retail, Telecom, and others. Andersen cooperates with such businesses as Samsung, Siemens, Johnson & Johnson, BNP Paribas, Ryanair, Mercedes, TUI, Verivox, Allianz, T-Systems, etc..
- The opportunity to change the project and/or develop expertise in an interesting business domain.
- Job conditions – you can work both fully remotely and from the office or can choose a hybrid variant.
- Guarantee of professional, financial, and career growth! The company has introduced systems of mentoring and adaptation for each new employee.
- The opportunity to earn an additional up to 1,000 USD per month by participating in the company's activities.
- Access to the corporate training portal, where the entire knowledge base of the company is collected and which is constantly updated.
- Bright corporate life (parties / pizza days / PlayStation / fruits / coffee / snacks / movies).
- Certification compensation (AWS, PMP, etc).
- Referral program.
- English courses.
- Private health insurance and compensation for sports activities.
Join us!
Your personal data is protected in accordance with GDPR regulations. Learn more: https://andersenlab.com/privacy-policy/pl