BNY Mellon Careers

Senior Specialist Infrastructure Operations Analyst

Jersey City, New Jersey
Information Technology

Job Description

The BNY Mellon Technology organization places great emphasis on reducing risk and increasing resiliency, which puts a strong focus on our engineering practices, including reliability engineering and development standards. The Resilient Systems Engineering (RSE) group is charged with addressing the need to continually enhance the stability, resilience, and recoverability of the firm’s critical assets and underlying infrastructure.


Uniquely, the RSE group bridges both infrastructure and application development teams, requiring deep expertise in the entire technology stack to achieve the highly resilient, scalable, and performant business services required by our clients.


The RSE group is seeking Cloud Developers and Systems Engineers to join our Application Runtime Platform team. We work on BNY Mellon Application Engine, a platform as a service that schedules and runs containerized and non-containerized applications on Linux and Windows across our data centers. Our systems power nearly a quarter of the global economy, and we continue to invest in uplifting the technologies that underpin our private and public clouds.


We're building our placement engine and container services platform to enable developer and operational efficiencies in our data centers and public cloud. Our team uses many technologies to enable innovation for our business, for example Docker, Mesos, Nomad, Consul, Puppet, Salt, and VMware. Our team includes Developers, DevOps Engineers, Systems Engineers, and SREs.


As a diverse platform team, we know how software is built, configured, and deployed. We write services, plugins, and agents. We configure, automate, and run many infrastructure and platform services, such as Mesos and Docker clusters, a centralized logging platform based on ELK, Docker registries, and Prometheus for monitoring. We understand middleware and infrastructure and provide the tools and services that allow developers to run their applications. Additionally, our on-boarding and engagement team helps developers understand and use the platform.


Because we are diverse, we are looking for a range of skillsets, from developer to Linux engineer, so please apply if this space interests you and you are passionate about what you do.

Key responsibilities include:

  • Work closely with Docker, Linux, and application orchestrators.
  • Research, design, and implement software components powering our cloud platform.
  • Develop features in an agile environment where we quickly prototype and iterate on functionality.
  • Develop robust functionality in a complex, distributed systems code-base.
  • Work extensively with open source software.  You may even modify or extend code maintained as part of an open source project.
  • Deploy and scale critical services and features that are used by thousands of developers and potentially impact millions of end users.
  • Employ both object-oriented development skills and systems engineering skills.
  • Code services and user interfaces in Golang, Java, Groovy, and JavaScript.
  • Use innovative tools and frameworks such as Vert.x, Spring Boot, Java, Angular, Docker, Mesos, Nomad, Vault, Consul, Marathon, Puppet, and Salt on Linux and Windows.


Sr. Specialist Infrastructure Operations Analyst: Designs, implements, integrates, and provides full support for complex software in a multi-tiered, multi-platform environment. Researches and recommends the appropriate system software to meet corporate standards and objectives for system performance. Identifies and solves complex and critical systems-related issues to meet the objectives for the corporation. Provides input into platform selection, version implementation, software product recommendation, and usage of enhanced functionality. Assists with complex issues in the installation, support, troubleshooting, and repair of data center equipment. Implements, integrates, and provides full support for middleware software in a multi-tiered, multi-platform environment. Collaborates and consults with application development teams to determine middleware platform selection, efficient transaction design, and use and recovery procedures in line with business requirements. Communicates with internal and external system users to address concerns and make sure that technical issues are dealt with appropriately. Manages ticket queues and handles complex escalated issues. Analyzes repeat incident patterns to identify opportunities for cost reduction and productivity enhancements. Works with vendors to ensure efficient incident resolution. Contributes to the achievement of area objectives.


  • Bachelor's degree in computer science or a related discipline, or equivalent work experience, is required; an advanced degree is preferred.

  • Eight to ten (8-10) years of related infrastructure experience is required, preferably in the design and implementation of large-scale distributed systems and web services, and in building complex software that is testable and designed for extensibility.

  • Experience in the securities or financial services industry is a plus.

  • Experience building and operating critical production systems.

  • Development experience with languages such as Java, Golang, and JavaScript.

  • Demonstrable hands-on experience with Linux, Docker and DevOps.

  • Experience with automation and configuration management tools such as Salt, Chef, Ansible, or Puppet.

  • Deep expertise with the architecture and implementation of PaaS software.

  • Deep understanding of Linux and OS tuning.

  • Deep understanding of how to build fault tolerance and scalability into cloud-based systems.

  • Good understanding of building, deploying, and maintaining critical applications in a cloud-based environment.

  • Strong software engineering skills in modular design, data structures, algorithms, and Unix systems development.

  • Experience with SDLC and Agile development tools (Jenkins/TeamCity, Maven, JIRA, Confluence).

BNY Mellon is an Equal Employment Opportunity/Affirmative Action Employer.
Minorities/Females/Individuals With Disabilities/Protected Veterans.

Primary Location: United States-New Jersey-Jersey City
Internal Jobcode: 45170
Job: Information Technology
Organization: Resilient Systems Engineering-HR17387
Requisition Number: 1901145