Intellect Minds is a Singapore-based company since 2008, specializing in talent acquisition, application development, and training.
We serve BIG MNCs and well-known clients in talent acquisition, application development, and training needs for Singapore, Malaysia, Brunei, Vietnam, and Thailand.
Our client is an established company, a leader within their industry is now looking for a Site Reliability Engineer to join their esteemed organization.
Roles & Responsibilities
Deep dive into development lines, learning and understanding the mechanism of every application component, and promoting product scalability, stability and performance.
Manage and maintain client product applications and services. Support and influence improvement in client product application to enhance the stability and availability of the systems Solve key problems that potentially may take with the production systems and create solutions to prevent incidents occurring again Analyze patterns of production incidents and set-up appropriate alerting / monitoring mechanisms in the system to catch the issues before hand.
To create a release template to ensure that architecture and testing efforts performed during development of service are sufficient to support availability and performance SLAs.
To test the playbook and scripts in the lower environment to ensure the accuracy and completeness. To provide and test the runbooks and healing / corrective automation scripts for restoration of runtime tools based services.
To work with development team to do full integration of services with the application monitoring system Improve application stability & operational efficiency by developing scripts to automate tasks.
Skills : Requirement
Experienced working with either C++, Go, Java, Python for scaling systems and services Good working record with either GCP or AWS environment Previously had responsibilities of SRE from previous projects e.
g designing, delivering and managing large scale platforms Has good knowledge and working experience in application monitoring systems (which include AppDynamics) Hands-on knowledge with Linux operating system (Ubuntu, CentOS, etc.
Knowledge of Computer Network (TCP / IP, DNS, etc.) Experienced in Incident Management process and ability to resolve Level 1 issue within agreed organization SLO.
All successful candidates can expect a very competitive remuneration package and a comprehensive range of benefits.
Interested Candidates, please submit your detailed resume online.
The Recruitment Team
Intellect Minds Pte Ltd (Singapore)