Job DescriptionWe are seeking talented Site Reliability Engineers well-versed in infrastructure of large complex systems/clusters built by automated solutions to provide huge compute capabilities.
You will be designing, building, and operating the infra components for a global private cloud, which is housing all core applications.
Design and deploy different clusters with large numbers of nodes, with consideration of running cost, redundancy, capacity, etc.
Design and deploy a fully-automated infrastructure cloud
Verify and introduce new open source solutions to improve functionality and performance
Monitor, troubleshoot and maintain critical production environments
Collaborate with other developers
Strong Linux/Unix system administrator skills
Intermediate programming/scripting skills, ideally in Java, Python or Ruby, but will consider experience in other object-oriented and functional languages
Understanding of virtualization and/or cloud computing
Experience with automation tools (ex. Chef, Puppet, Ansible)
Experience with CI/CD tools (ex. Jenkins, CircleCI)
Bachelor's degree in Computer Science or related field
Experience with cloud platform (ex. OpenStack)
Experience with container solution and tools (ex. LXC/Docker/Kubernetes/Mesos)
Experience with and working knowledge of IP networking systems and protocols
English Requirement:Business Level
Japanese Requirement:Not Required