Incident Management - Incident Manager, Central Technical Operations Services (CTOS) Job at Amazon.com Services LLC, Austin, TX

S2tPemhZaUhmekRUemZXcEVBTUEwZVpKTEE9PQ==
  • Amazon.com Services LLC
  • Austin, TX

Job Description

DESCRIPTION

Amazon Central Technical Operations Services (CTOS) is the first line of defense for maintaining high availability in the Amazon Retail Website. We make customer impacting events shorter, less frequent, and less severe by providing large scale event and incident management. The Amazon Retail Website has hundreds of millions of customers globally who can be impacted by these types of incidents. The work we do to mitigate these incidents helps to ensure our customers have a seamless experience. Our Incident Managers are front-and-center in driving down event duration by utilizing their operational experience, knowledge of best practices, and effective usage of incident management tools.

We’re looking for Incident Managers who have owned or participated in operational and/or incident management for at least one large-scale enterprise. The Amazon Retail Website is complex and constantly changing; it operates across dozens of countries, consists of thousands of cloud-based services, is built and maintained by tens of thousands of engineers, and serves hundreds of millions customers. When it experiences major issues, part of your job will be to respond to it within minutes and ensure the best course of action is taken. This experience will expose you to all things Amazon.

Our Incident Managers are encouraged to build solutions to problems while sharing the benefit of those solutions with other service teams. This is an excellent opportunity to join one of Amazon’s world-class teams, and work with some of the best and brightest while also developing your skills and career within one of the most dynamic, innovative and progressive technology companies anywhere. In addition to a stimulating and fun working environment, Amazon offers mentoring programs with experienced engineers, regular tech talks with technology Principals, and well-defined career paths for motivated engineers who want to contribute to our culture of operational excellence and customer-focused technical innovation.

This position will be part of a globally distributed team of 20+ Incident Managers and Engineers across Austin, Dublin, and Sydney to allow for 24x7 coverage. Each group will work 10 hour shifts for 4 days a week. If you're looking for a team with great growth potential and an opportunity to make a huge impact, this is the team to join.

Responsibilities
- Be a technology evangelist and use your deep knowledge to solve business problems
- Reduce mean time to resolution for all incident types
- Design and/or build world class listening systems
- Adapt and improve operations management systems and processes to accommodate rapidly increasing growth
- Participate in Agile sprints to evolve business processes and technologies
- Create and review documentation, design new standard operating procedures
- Identify and troubleshoot recurring platform issues and engage service owners to drive resolution
- Automate tasks through creation and maintenance of scripts and tools
- Respond to and complete customer requests within SLA via a trouble ticketing system
- Take part in a “follow the sun” rotation split between Austin, Dublin and Sydney sites, including weekends and holidays
- Mentor peers in your areas of technical and operational strength
- Participate in the interviewing process

BASIC QUALIFICATIONS

- Bachelor’s degree in Computer Science or at least 7 years of relevant experience in a large-scale online technical operations environment
- Experience maintaining a high standard of operations in a large scale environment
- Development/scripting skills in at least one interpreted language (e.g. Java, Python) as well as shell
- Working experience in Linux and networking fundamentals
- Excellent troubleshooting skills and a commitment to document findings
- Experience driving collaborative projects from conception to delivery
- Experience in Agile/Scrum or related collaborative workflow

Job Tags

Holiday work, Full time, Work experience placement, Shift work,

Similar Jobs

RPG LLC

Brand Ambassador Job at RPG LLC

 ...Job Title: Brand Ambassador Location: Atlanta, GA Position Type: Full-time, Non-exempt Salary: $48,000 annual base salary, with...  ...Ambassador, you will represent our brand in various capacities, working to engage with potential customers, support our sales efforts... 

AreaProbe

Real Estate Market Analyst Job at AreaProbe

Real Estate Market Analyst Location Washington, DC (Foggy Bottom area) : Job Summary: We are seeking a talented Real Estate Market Analyst to join our team. The ideal candidate will possess a keen analytical mind and a deep understanding of the real estate industry.... 

PepsiCo Beverages North America

Warehouse Loader - 2nd Shift Job at PepsiCo Beverages North America

 ...electronic pallet jack. Position is responsible for accuracy in loading and complying with shipper/receiver documentation. Follows warehouse instructions in building loads. Employees must maintain a clean and safe work environment. Position may be required to work across... 

Kaleidoscope Education Solutions

Special Education Teacher Job at Kaleidoscope Education Solutions

 ...Location: Detroit, MI Date Posted: 12/28/2024 Category: Education K12 Education: Bachelors Degree Certified Special Education Teacher - 24/25 School Year - Wayne County, Detroit Area, MI Love your work. Extraordinary People. Choose Kaleidoscope. Our... 

AECOM

Geologist II Job at AECOM

 ...commitment to workplace health and safety are imperative. Qualifications : Minimum Requirements ~ Bachelor of Science degree in geology or relevant geological science discipline (e.g. geophysics hydrogeology or geochemistry. ~2 years of experience or demonstrated...