Site Reliability Engineer (Enterprise Solution)

Taipei / KKStream - Engineering / Permanent

KKStream serves enterprise-level customers from top Japanese companies in the OTT industry and supports 24x7 operations.
At KKStream, we work closely with R&D and front-line service teams to build and sustain scalable, reliable, and operable services. We take strong ownership of our customers' needs, which means a lot of cross-team collaboration. We also work very closely with our cloud service partner and learn a great deal about cloud service technologies.
We are passionate about knowledge sharing, problem solving, and new technologies. If you have fresh ideas, love using cloud technologies to offer unique viewpoints, and enjoy collaborating with cross-functional teams to develop real-world solutions and fantastic user experiences, you are welcome to join us!


Responsibilities:

  • Develop and maintain the service monitoring software stack.
  • Develop and maintain infrastructure orchestration on clouds.
  • Improve the reliability and scalability of online services.
  • Engage in and improve the whole lifecycle of services.
  • Participate in the on-call rotation.
Requirements:

  • Bachelor's degree in Computer Science or a related technical field involving software or systems engineering, or equivalent practical experience.
Nice to Have:

  • Strong communication skills.
  • Experience with programming or scripting languages such as Python or Bash.
  • Experience with infrastructure deployment automation tools such as Terraform or AWS CloudFormation.
  • Working knowledge of Git.
  • Experience with CI/CD stacks such as GitLab.
  • Experience operating and deploying services on AWS.
  • Experience with cloud networking, storage, and computing services.
  • Experience building monitoring dashboards and collecting metrics, logs, and traces with tools such as AWS CloudWatch or Container Insights.