Senior Platform Engineer
About The Position
We are looking for an experienced Senior Platform Engineer who is passionate about software design, development, and deployment. The role involves writing production-grade, modern DevOps/Platform solutions as part of building the Dream Computing Services layer, which will be deployed anywhere.
Responsibilities
- Develop, build and maintain the best services and solutions for our Dream Computing Services platform.
- Apply an Everything as Code (EaC) approach using technologies such as Python, Ansible, Terraform, and Kubernetes.
- Build and develop services that will be part of Dream Computing Services and will run on remote infrastructure.
- Develop and maintain tools for automation, deployment, monitoring, and operations specifically tailored for remote environments.
- Troubleshoot issues in our development, production, and test environments.
- Collaborate effectively with team members and communicate complex technical issues clearly.
Skills
- At least 4 years of experience with DevOps or Platform technologies.
- Experience with the design, build, development, and maintenance of DevOps services and solutions.
- Proven experience working with remote environments and solutions such as on-premises or customer private cloud.
- Experience with deployment and CI/CD technologies in AWS for building and packaging.
- Strong knowledge of networking and security principles, including developing and deploying encryption and signing services.
- Proficiency in Python and Bash.
- Hands-on experience with developing infrastructure services that run remotely, including customized Kubernetes.
- Proven track record of delivering packages to remote customer sites.
- Experience with air-gapped on-premises solutions is a significant advantage.
- Experience with deploying and maintaining robust and automated services/pipelines.
Must-Have Skills:
- Kubernetes, Python, Bash, Ansible, Linux, Networking, Docker.
Advantages:
- Experience with AI components (training, inference, serving).
Tech Stack:
- Kubernetes, Jenkins, Ansible, Terraform, Docker + Compose, MLflow, KServe, MinIO, GitHub, Python, Bash, Linux, MongoDB, RabbitMQ, Redis, Neo4j.