Site Reliability Engineer

About Anchanto:

Anchanto helps all businesses to exploit the full potential of e-commerce. Our suite of SaaS Products enables companies globally to springboard omnichannel sales, scale fulfilment operations, and use intelligent data to grow their e-commerce, logistics & warehousing activities. Leading, brands, distributors, retailers, and logistic enterprises such as L'Oréal, Decathlon, or DHL Supply Chain rely on our technology to scale their local and global e-commerce operations.

Headquartered in Singapore and with more than 10 local offices across Asia-Pacific, the Middle East and Europe, we are growing rapidly and looking for ambitious people to join our teams to build the future successes of Anchanto.

The Role (Describe the role): A senior DevOps/SRE engineer with at least 7+ years hands-on experience working in large and multi-account cloud  environment. Be able to contribute hands-on to availability, performance improvement at Infra and apps level along with an eye for cost optimization through templating and automation.

Key Responsibilities:

Responsibilities:

  • Build, operate maintain CI/CD pipeline/s using Github actions, Jenkins, SonarQube
  • Automate server monitoring and start stop using scripts (bash, lambda)
  • Deploy and manage EoL and EoS for all Infra systems on AWS
  • Manage version updates , upgrades and fixes for all Infra systems on AWS
  • Manage site 24×7 for Infra monitoring – create dashboards, manage users, tune alerts, suppress false positives, ensure all Infra is monitored
  • Design procedures for system troubleshooting and maintenance and maintain the existing SOP
  • Manage performance and cost optimization for all Infra components
  • Identify process and automations gaps and PoC tools to address these
  • Monitor Infra efficiency, Availability, in accordance with SLAs
  • Manage and ensure monthly maintenance windows are effectively utilised in co-ordination with ENG managers and Architects
  • Assist Cyber Security team in identifying and deploying cybersecurity measures by continuously performing vulnerability assessment and risk management
  • Mentoring and guiding the team members in engineering for Infra best practices – create and maintain DOs and DON’Ts and conduct trainings as may be necessary
  • Manage system hardening in line with the standards

Essential Requirements:

What You Need/ You have a track record of:

  • 7+ years of hands-on experience with DevOps in all the above areas or similar role
  • Hands-on experience as Linux admin, AWS – Compute, Storage and networking administration, operations and support
  • Hands-on experience with operations and support for Github, Jenkins, SonarQube, ELK, Monitoring tools including AWS cloud native tools
  • Knowledge of Distributed systems and micro services architectures and ability to identify gaps in monolithic architecture and recommend improvements for better efficiency and reduce costs
  • Experience working with config and operations for AWS API gateways, WAF, SSL certificates, R53 along with
  • Knowledge about CI/CD practices, deployment patterns, and relevant tool along with understanding and keen sense of Quality 
  • An understanding of writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform, Chef etc.
  • Used to observability practices and tool for Monitoring, Metrics, Logging, Alerts & Tracing
  • Knowledge about service discovery, networking security, multi-tenancy, database access, concurrency control, or cache consistency.
  • Built distributed solutions for configuration, monitoring, and auto-mitigation of services.
  • Working knowledge of RDS (MYSQL and Postgres) with knowledge of SQL, Redshift, SQS, SES, SNS, Kafka
  • Technical skill to review, verify, and validate the scripts written earlier and fine-tune to be become more effective and efficient
  • Strong communication and collaboration skills and be able to quantify objectively his/her observations/recommendations/findings
  • Ability to cope with pressure and be able to manage expectations and priorities based on sound logic
  • The ability and skill to articulate and communicate with senior leadership 
  • Self-motivated, results driven individual, passionate about technology,  – independent and requires least management overhead

Personal Attributes:

  • Good communicator and technically updated and sound and able to reason with Engineering team members as well as project and business team members to make a point effectively
  • Good problem-solving abilities to tackle complex automation challenges effectively.
  • Effective communication skills, adept at conveying ideas and collaborating with cross-functional teams.
  • Proactive approach to driving innovation and process improvements in automation.
  • Meticulous attention to detail and commitment to maintaining ambitious standards of quality and accuracy.
About cookies on this site

We use cookies to collect and analyse information on site performance and usage, to provide social media features and to enhance and customise content and advertisements. Learn more

Necessary cookies

Some cookies are required to provide core functionality. The website won't function properly without these cookies and they are enabled by default and cannot be disabled.

Analytical cookies

Analytical cookies help us improve our website by collecting and reporting information on its usage.

Marketing cookies

Marketing cookies are used to track visitors across websites to allow publishers to display relevant and engaging advertisements.