ETH Zürich
Systems Engineer – Cloud and Platform Services
📍 Lugano
Rolle und Verantwortlichkeiten
The primary goal of this position is to design, implement, and operate scalable inference services within a multi-tenant HPC infrastructure. As a systems engineer, you will play a key role in the architecture deployment, operation, and documentation of production-grade inference platforms that support a wide range of AI-driven workloads. Your work will directly influence the evolution of CSCS’s AI and HPC capabilities, which will improve the performance, reliability, and efficiency of our inference services and enable researchers and engineers to deploy and run advanced models at scale.
Team / Beschreibung
The Swiss National Supercomputing Centre (CSCS) develops and operates a high-performance computing and data research infrastructure that supports world-class science in Switzerland. Its user laboratory is available to domestic and international researchers in academia, industry, and the business sector. The centre is operated by ETH Zurich and has offices at its data centre in Lugano and in Zurich.
Qualifikationen und Fähigkeiten
Bachelor’s degree or higher in computer engineering, computer science, a relevant technical field, or equivalent practical experience.
Sound knowledge of Linux administration skills.
Deploying services on Kubernetes.
Infrastructure-as-code tools (e.g. Terraform/openTofu).
Automation tools and framework, including CI/CD processes and ecosystem (e.g., Gitlab CI, HashiCorp Vault).
Experience with versioning systems and CI/CD workflows such as ArgoCD is preferred.
Experience with automated installation systems for physical servers, such as MAAS or Foreman is preferred.
Experience with developing Ansible configurations is preferred.
Experience with developing services on top of Kubernetes is preferred.
Self-motivated and proactive team player.
Strong communication skills or interest in developing them.
Strong problem-solving mindset with tolerance for uncertainty and change.
Understands user needs and works collaboratively to address them.
Adaptive and willing to learn new technologies with and from others.
Comfortable tackling complex or ambiguous problems.
Comfortable admitting when you don’t know, reaching out and leveraging the right expertise when needed.
Experience with working in self-organized teams is a plus.
Familiarity with Agile methodology is a plus.
Experience with test-driven development is a plus.