The Global Technology Operations Center is a 24/7/365 first-line operations and incident management team that supports client.
Typical Day in the Role for a candidate:
The Systems Administrator responsibilities include the facilitation of detection, triage, correction, escalation and documentation of major and critical events.
The Systems Administrator will also ensure that Service Level Targets are met when managing incidents and service requests, trending recurring events and incidents to initiate problem investigations.
Job Responsibilities:
Monitor all production systems for quality and performance ensuring that any failures or critical issues are addressed immediately.
Ensure that Knowledge (i.e. processes, procedures, troubleshooting guides, etc.) are developed and maintained for the team.
Collaborate with other groups to set standards and document production configurations.
Recommend policies and procedures to be used to support the Digital Services environment.
Daily monitoring of the ticket queue, performing analysis of problems across multiple systems and groups.
Improve performance and scalability within operations support environments by researching, evaluating and recommending new technology products and services
Continuously updates job knowledge by participating in educational opportunities, open-source user groups, reading professional publications and white papers.
Stays abreast of current and cutting-edge Internet operations best practices, establishes personal professional network
Support scheduled installations/upgrades and maintenance in accordance with established IT policies and procedures and business requirements.
Works closely with business owners on change and maintenance windows/requests
Must be able to support on-call duties as needed.
Must Have Skills / Requirements:
1) 2+ years of experience within a media or internet technology operations environment with production support experience in a Linux/Unix.
2) Excellent Internet application support experience within open-source platform environments.
3) Deep technical expertise in server administration, client service, user authentication, configuration management tools, internal & external monitoring tools, networking troubleshooting and concepts and shell scripting.
4) Advanced level experience and support of cloud technologies and container-based environments based on Kubernetes, Docker and other container orchestration solutions.
5) Configuration experience with commercial, open source and SaaS based monitoring tools {Zenoss, Nagios, Zabbix, Monit/mmonit, Kibana, Catchpoint, Conviva, Circonus, DataDog, BigPanda, Rigor, New Relic, Cloudwatch and etc} .
6) Should be knowledgeable in configuration management, automation, orchestration systems (e.g Chef, Puppet, CFEngine, etc)
7) Must be able to support on-call duties as needed.
"Nice to Have Skills / Preferred" Requirements
1) Demonstrable experience with one or more web/systems programming languages such as Python, PHP, Ruby, Javascript, Perl, HTML/CSS.
2) Experienced with ServiceNow, PagerDuty, JIRA, and Confluence.
3) Knowledgeable in video encoding and streaming concepts and technologies
Education / Certifications
Four-year college diploma or university degree in computer science or software engineering or equivalent practical work experience.
Must be able to willing and able work a rotational 24x7 schedule including weekends, and on-call rotations and duties as needed