• Location
    • Santa Clara, CA
  • Date Posted
  • Sep. 23, 2021
  • Function
  • IT
  • Sector
  • Data

We are seeking a senior technical engineer for a cross-functional team that is tasked with maintaining the quality, availability, and reliability of the Clumio Data Platform.  The ideal candidate will be self-motivated, an excellent communicator, have experience with Linux, Amazon Web Services, Kubernetes, Docker, Terraform, VMWare vSphere, IP Networking, ITIL Service Management, and strict Change Control procedures.  This candidate should be very comfortable working in a fast-paced SDLC that includes the infrastructure-as-code model.   The role will work closely with software engineering, quality assurance, customer success and product management teams to ensure the timely delivery and high availability of platform features and services.  This role will personally drive continuous improvement throughout the organization.

Roles and Responsibilities

  • Work alongside Engineering and Quality Engineering teams to ensure a stable and scalable infrastructure
  • Work with developers to drive organizational requirements for resources, capacity, security, configuration management, deployment, and monitoring
  • Work with senior management to develop project plans and drive tasks accordingly
  • Develop automation strategies and implement automation code, tools, and frameworks to scale operations
  • Maintain escalation policies, incident communication, and follow-the-sun support between multiple geographically disperse virtual NOCs
  • Handle 24x7x365 Incident Management and drive Problem Management continuous improvements to enforce and maintain SLAs
  • Routinely survey and evaluate available technology options to improve processes, tooling, and monitoring
  • Collaborate with QE to help provide comprehensive coverage for software releases by building and maintaining suitable test environments
  • Collaborate with Sales to build and maintain demonstration/Proof of Concept environments
  • Interface with Customer Success to provide scope and detail for incident reports and maintenance activities
  • Acquire and maintain a thorough working knowledge of the products and services that are live and under development

Requirements

  • B.S. in Computer Science or equivalent experience.
  • Familiarity with ITIL practices
  • Extensive experience with cloud systems architecture, monitoring frameworks, network architecture
  • Experience using source control tools such as Git
  • Significant scripting (Python, Bash) experience
  • Experience with configuration management and orchestration tools such as Terraform or Ansible
  • Familiarity with AWS or GCP services, APIs, and paradigms

Desirable

  • Experience documenting policies and detailed procedures for an ISMS (ISO/IEC 27001)
  • Familiarity with unit testing in Python
  • Familiarity with ISO27001 Annex A controls as they apply to cloud hosted SaaS/PaaS/IaaS operations
  • Demonstrated experience managing projects and delivering results in an ITIL framework
  • Experience with continuous integration platforms such as Jenkins
  • Occasional travel may be required

Clumio provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.