Senior Operations Engineer – Santa Monica, CA, USA
Our Client is seeking an experienced, best-in-class Senior Operations Engineer to join their Live-Site team. Ideal candidates will have several years of experience working at industry-leading companies, creating cutting-edge online experiences closely tied to games and deploying them at very large scale. They will also have an excellent aptitude for critical thinking and analysis, superior people skills and a strong desire to shape the future of gaming.
Responsibilities:
- Develop detailed project plans for large-scale server deployments.
- Manage, prioritize and successfully execute multiple concurrent mission-critical server deployment projects.
- Balance competing demands for resources and adapt to changing priorities.
- Perform quality assurance to ensure the quality of vendor deployments.
- Manage infrastructure.
- Track and schedule the following: OS/OS vendor updates, network equipment software/firmware updates, external tools/service updates (e.g., Hadoop) and the addition/removal of hardware to/from service clusters.
- Ensure monitoring and metrics systems are operational and that their configuration is up to date.
- Respond to problems and outages.
- Participate in on-call rotations.
Requirements:
- Experience administering Red Hat- and Debian-based Linux servers.
- Very comfortable working with Linux-based systems.
- Good understanding of database technologies.
- Experience with Load Balancing technologies.
- Experience with Cloud Computing technologies.
- Able to automate every task he or she does.
- Strong scripting skills: Unix shell programming and, ideally, at least one of Perl, Python or Ruby.
- Should know how to script tasks involving SQL, XML and network operations in a Linux environment, including web service monitoring plugin development.
- Strong analytical / troubleshooting skills.
- Leadership skills: ability to mentor less experienced team members and take the lead when necessary.
- Excellent written and verbal communication skills.
- Ability to communicate and collaborate with team members.
- Ability to work well under high pressure.
- Strong prioritization skills.
Bonus Skills:
- Experience with Windows Servers
- In-depth knowledge of IP-based networking, SSL termination and load balancing (HAProxy, nginx, stunnel, stud, etc.)
- Cloud Computing (Amazon EC2, S3, RDS, etc.)
- Administration of services on Linux servers (Apache, MySQL, etc.)
- SQL databases – MySQL preferred
- NoSQL databases – MongoDB, Riak, Amazon SimpleDB, etc.
- Web services development (e.g. XML-RPC, REST)
- Experience with Nagios, Hyperic, Zabbix, Cacti and/or other monitoring systems
