Manager, Software Development & Engineering
Your Opportunity
Schwab remains committed to providing increased visibility to career growth opportunities and job requirements. This posting announcement is part of increased transparency and while all qualified applicants will be reviewed and considered, this organization has a preferred candidate identified for this role.
At Schwab, you’re empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us “challenge the status quo” and transform the finance industry together.
We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location.
We are looking for a skilled engineer with disciplines that incorporate aspects of software systems engineering and
operations. We are combining these skills to come up with better ways of managing and operating applications
• Champion and evangelize the SRE mindset by driving reliability, scalability, and efficiency through automation,
systematization, and AIOps-driven insights.
• Identify opportunities to design and deliver innovative tools that solve complex operational challenges across
large-scale, mission-critical enterprise applications.
• Build and maintain automation scripts and intelligent auto-remediation workflows, integrating them into core
infrastructure and operations.
• Leverage AIOps techniques (anomaly detection, event correlation, noise reduction, and predictive alerting) to
proactively identify and prevent incidents.
• Triage alerts, perform rapid root-cause analysis, and resolve high-severity production issues using data-driven
diagnostics and ML-assisted observability platforms.
• Develop tools, frameworks, and instrumentation to validate deployments, improve release confidence, and
increase rollout success for applications.
• Evaluate, forecast, and coordinate capacity planning using historical trends, predictive analytics, and AIOps-
based capacity insights.
• Design and enhance CI/CD orchestration systems, embedding reliability, policy enforcement, and automated
validation to reduce friction in production delivery.
• Perform real-time troubleshooting of mission-critical application workflows and feed operational and AIOps
insights back into product and engineering teams.
• Participate in on-call rotations and continuously improve incident response through post-incident analysis,
automation, and learning systems
What you have
To ensure that we have fulfilled our promise of "challenging the status quo," this role has specific qualifications that successful candidates should have.
Required Qualifications
• 5–7 years of experience supporting and administering enterprise-scale, production systems.
• 5–7 years of experience developing automation scripts, building observability dashboards, and
configuring alerts for proactive issue detection.
• Hands-on experience applying AIOps concepts, including alert correlation, anomaly detection, noise
reduction, and predictive analytics in production environments.
• 5–7 years of experience working within the Software Development Lifecycle (SDLC), including process
optimization and continuous improvement.
• 3–5 years of experience with public cloud platforms such as GCP, AWS, or Azure (GCP preferred).
• Strong hands-on experience with enterprise systems administration, monitoring, deployment, and
reliability engineering practices.
• Solid understanding of IP networking fundamentals including DNS, DHCP, firewalls, routing, and load
balancing.
• Experience with large-scale distributed systems, high-availability architectures, and fault-tolerant design.
• Proficiency in Linux and Windows system administration, troubleshooting, and performance tuning.
• Development experience in one or more languages such as Python, PowerShell, or Java, with a focus on
automation and operational tooling.
• Working knowledge of relational and NoSQL databases such as PostgreSQL, SQL Server, Oracle, or
MongoDB.
• Experience supporting or integrating with Actimize platforms.
• Familiarity with messaging and streaming technologies such as Kafka, RabbitMQ, IBM MQ, and Solace.
• Experience with observability and monitoring platforms such as Splunk, Grafana, Datadog, Synthetic Monitoring, and/or AIOps-enabled monitoring tools.
• Bachelor’s degree in Computer Science, Engineering, or a related discipline
Preferred Qualifications
• Experience implementing or operating AIOps platforms for incident prediction, automated root cause
analysis, or self-healing systems.
• Financial services industry experience.
• Familiarity with Agile, DevOps, and SRE best practices
In addition to the salary range, this role is also eligible for bonus or incentive opportunities
What’s in it for you
At Schwab, you’re empowered to shape your future. We champion your growth through meaningful work, continuous learning, and a culture of trust and collaboration—so you can build the skills to make a lasting impact. Our Hybrid Work and Flexibility approach balances our ongoing commitment to workplace flexibility, serving our clients, and our strong belief in the value of being together in person on a regular basis.
We offer a competitive benefits package that takes care of the whole you – both today and in the future:
- 401(k) with company match and Employee stock purchase plan
- Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
- Paid parental leave and family building benefits
- Tuition reimbursement
- Health, dental, and vision insurance