AVP, SRE Specialist, Enterprise Architecture - SRE, Technology and Operations (WD14679)
DBS BANK LTD.
SG, SG
2d ago
source : GrabJobs

Business Function

Group Technology and Operations (T&O) enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation.

In Group T&O, we manage the majority of the Bank's operational processes and aspire to delight our business partners through our multiple banking delivery channels.

Responsibilities

Site Reliability Engineering (SRE) at DBS combines software and systems engineering to build, run, and maintain high-performance, distributed, fault-tolerant and resilient financial systems.

Site Reliability Engineers focus on ensuring our customers and colleagues experience the best of DBS systems.

As a Site Reliability Engineer, you will fill a mission-critical role, ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale.

You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime.

Site Reliability Engineers utilize automation, continuous monitoring, tools and solid engineering principles around infrastructure and applications to optimize existing systems, build infrastructure and eliminate operational work.

DBS Technology and Operations is looking for passionate, creative and detail-oriented engineers who excel at solving operational problems and improving efficiency.

  • Drive the Site Reliability Engineering agenda forward at an Enterprise Level to improve availability, reliability, and performance of services.
  • Drive cross-team efforts in resiliency assessment exercises and reporting
  • Draft and / or contribute to internal SRE training materials
  • Support services before they go live through activities such as Chaos testing (failure injection), system design inputs, developing software platforms and frameworks, capacity planning and launch reviews.
  • Engage with product engineering teams to test against the relevant Chaos Engineering toolkit.
  • Sound understanding of CI / CD pipelines and SDLC (application delivery)
  • Assist application teams in setting up SLIs, SLOs and error budgets for their systems
  • Participate in Blameless Incident Retrospectives and follow up on action items
  • Work with application teams on observability, automating monitoring and auto-remediation of known issues.
  • Programming and scripting to automate failure scenarios, integration with pipelines and developing self-service portals.
  • Work with teams located across locations in Asia Pacific

Requirements

  • Experience in SRE transformation and adoption for large scale environments
  • Experience in one or more of the following: JavaScript, Java and Python.
  • Very good analytical and problem-solving skills with good understanding of technical risks emerging out of architecture decisions.
  • Experience with developing applications and setting up automations in a Linux environment, with sound knowledge of algorithms, data structures, complexity analysis and software design.
  • Understands complex architectures and is well versed in design patterns.
  • Development skills with experience in real time, distributed and highly secured environments.
  • Experience with developing test cases and ensuring appropriate test coverage through unit and automated testing.
  • Experience with one or more of ELK, Grafana, Prometheus, Dynatrace and AppDynamics.
  • Experience with proxies and load balancers such as HAProxy and Nginx.
  • Experience with CI / CD pipelines and release strategies
  • Understands key SRE concepts such as Error Budgets, MTTD, MTTR and Launch Control
  • Systematic problem-solving approach coupled with effective communication skills and a sense of ownership and drive.
  • Ability to debug and optimize code and to automate routine tasks.
  • Bachelor’s or Master’s degree in Computer Science, a related technical field that involves programming, or equivalent practical experience.
  • Minimum of 10 years of technology experience (preferably in the financial industry).
  • Highly motivated, proactive and capable of working under pressure without compromising development processes and productivity.
  • Strong, committed and reliable team player, able to take direction but also willing to contribute to discussions on design and strategy.
  • Strong interpersonal and communication skills, with the ability to build good relationships with the business and other technology groups through day-to-day support and project work.
  • Interest in financial technologies, new technology tools and the ability to learn.

We offer a competitive salary and benefits package and the professional advantages of a dynamic environment that supports your development and recognises your achievements.
