Lead Specialist, SRE

At Nadi Tech, part of TNG Digital Group, we build and run secure, high-performance digital infrastructure that powers essential services across Malaysia.

Our platforms support large-scale B2B and public sector initiatives used by millions of people every day. That means reliability, security, and performance are not nice-to-haves. They are built into everything we do from day one.

We are looking for people who enjoy solving complex, real-world problems. People who think in systems, understand how things connect, and take pride in building infrastructure that is stable, scalable, and built to last.

Here, you will take ownership of systems that matter and see your work used in real, everyday scenarios.

What You'll Do:

1.Service Reliability and Availability
  • Ensure uptime/availability of 99.99% are consistently met
  • Reduce Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) during incidents
  • Drive capacity planning and prevent reliability risks
2.Drive Automation and Operational Excellence
  • Deliver consistent and repeatable deployments with zero critical failures by maintaining and updating deployment scripts/templates
  • Reduce manual toil across the team by measurable percentages
  • Standardize and harden container images, CI/CD pipelines, and cloud infrastructure
3.Release and Disaster Recover
  • Reduce deployment incidents through adherence to best practices in release management
  • Reduce deployment duration via automation
  • Plan and execute disaster recovery and ensure RTO and RPO are met for cloud/multi-cloud environments
4.Incident Response and Troubleshooting
  • Reduce the frequency of recurring issues via problem management & root cause analysis.
  • Establish and enforce incident response processes
5.Security and Compliance
  • Ensure 100% compliance with regulatory and audit requirements for infrastructure security
  • Achieve zero critical security incidents by optimizing infrastructure and adhering to industry standards
  • Ensure infrastructure, container, and code security standards are enforced
  • Successfully implement secure architectures for all new deployments in collaboration with development teams
6.Team Leadership & Strategic Alignment
  • Lead, mentor, and grow the SRE team’s technical and operational capabilities
  • Establish on-call rotations, knowledge sharing sessions, and training programs
  • Foster a good culture of blameless accountability, learning, and continuous improvement
  • Partner with product and engineering teams to embed reliability into the SDLC
  • Influence architectural decisions with an SRE mindset
Role Requirements:

Qualification:

  • Bachelor's degree in computer science, Engineering, Network or related field
  • Professional cloud certification
Experiences:
  • Proven 8 years’ experience in a DevOps or SRE role
Skills:
  • Strong knowledge of scripting language and programming language (e.g. Bash, Python, Go) and experience with configuration management tools (e.g. Ansible, Chef)
  • Good mindset and implementation on CI/CD tools and release engineering
  • Experience with cloud platforms (e.g. AWS, Azure) and infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation).
  • Advanced cloud certification and project management is a plus.
  • Strong understanding in site reliability engineering, infrastructure engineering, cloud architecture service and mindset.
  • Experience with containerization technologies like Docker and container orchestration platforms such as Kubernetes.
  • Knowledge of networking principles and protocols with solid examples.
  • Strong knowledge on cloud architecture and services
  • Strong problem-solving skills and the ability to handle high-pressure situations calmly and effectively.
  • Strong attention to detail and a commitment to delivering high-quality results.
Personality:
  • Passionate, agile, flexible, and positive attitude.
  • Assertive, driven individual with a strong sense of urgency
  • Self-starter with continuous improvement mindset
What you get

Work your way

  • Flexible working hours
Your wellbeing matters
  • Medical coverage, with option to include dependants
  • Extra leave for family and caregiving needs
Rewards that grow with you
  • Monthly lifestyle allowance via TNG eWallet
  • <
Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...