Site Reliability Engineer

Taipei / KKCompany - Engineering / Permanent

KKCompany is Asia's leading music entertainment company. Started by a group of music loving Internet software developers, we built and launched one of the world's first music streaming services in 2005. Based in Taipei, the heart of Chinese pop music, we gradually grew our business from Taiwan out to Hong Kong, Singapore, Malaysia and Japan. Ever curious towards reinvention and discovering new business models of the future, we have expanded our business scope from music streaming to live events, technology services, content, investments and continue to explore reinvention through innovation in the digital entertainment space.


  • Engage in and improve services lifecycle from design to deployment, operation, and refinement.
  • Support services before the production stage through system design consulting, platform development, capacity planning, and launch reviews.
  • Maintain services in the production stage by monitoring availability, performance, resources, and other related metrics.
  • Construct and scale systems or platforms through automation and infrastructure as code.
  • Practice incident response and blameless post-mortems.
  • Requirements:

  • 2+ years of relevant cloud computing development experience, especially AWS.
  • Experience with high-level scripts, like Python, PHP, or modern languages, like Go, elixir.
  • Knowledge of cloud computing techniques, especially Docker, serverless techniques, SQL or NoSQL.
  • BS/MS/PhD in Computer Science (or equivalent).
  • Excellent communication skills, both written and verbal.
  • Nice To Have:

  • Have experience in automating infrastructure configuration (Ansible, Chef, Puppet, Terraform, etc) and monitoring (Cacti, Nagios, Prometheus, etc).
  • Have experience in operating containerized environments (Docker Swarm, Kubernetes, Nomad, etc).
  • Have experience in managing distributed systems in cloud environments such as AWS or GCP.
  • Have experience in analyzing and troubleshooting large scale distributed systems.
  • Have experience in technical writing or documentation.
  • Apply Now