Job Description
The AI Platform Engineer (Gen AI) designs, deploys, and optimizes scalable generative AI platforms, LLM workflows, cloud infrastructure, APIs, automation, and enterprise AI solutions securely and efficiently.
Responsibilities
- Build scalable Gen AI platforms for enterprise applications.
- Deploy LLMs securely using cloud-native infrastructure pipelines.
- Integrate APIs, vector databases, and AI orchestration tools.
- Monitor model performance, latency, reliability, and production costs.
- Implement prompt engineering, RAG workflows, and guardrails effectively.
- Automate CI/CD pipelines for AI model deployments safely.
- Collaborate with data, DevOps, and security teams daily.
- Troubleshoot platform issues and optimize Gen AI workloads.
Required Skills
- Strong Python programming and backend API development skills.
- Experience with LLMs, RAG, embeddings, and prompt engineering.
- Knowledge of cloud platforms (AWS, Azure, or GCP) or Kubernetes.
- Understanding of vector databases and AI model deployment.
- Familiarity with Docker, CI/CD, monitoring, and security.
- Ability to optimize Gen AI performance and scalability.
Note: Salary depends on experience, skills, and location, and is paid in local currency.