Ikhsan Noor Rosyidin
Platform Engineer
Tangerang Selatan, Indonesia
ikhsan.rosyidin@gmail.com | linkedin.com/in/essanpupil | github.com/essanpupil
Professional Summary
Over 13 years of cross-functional experience spanning infrastructure architecture, software engineering, and cybersecurity.
Proven track record of greenfield engineering, designing fault-tolerant, multi-cloud (AWS, GCP), and on-premise ecosystems from scratch.
Leveraging a strong software engineering background to deliver code heavy, scalable Infrastructure as Code (IaC) solutions and bespoke third party integrations.
Adept at driving regional high availability, enforcing rigorous security automation, and implementing self-service Business Continuity Planning (BCP) across high-traffic, multi-tenant global platforms.
Technical Skills
- Cloud & On-Premise Infrastructure: AWS (EKS, ECS, EC2, Lambda, DynamoDB, S3, VPC, IAM), GCP (Compute Engine, GKE, Dataflow), Bare-metal/On-Premise (Ubuntu/Linux Servers).
- Content Delivery & Caching: AWS CloudFront, Akamai CDN, Varnish Cache.
- Networking & Edge Security: AWS WAF, GCP Cloud Armor, ngrok.
- Containerization & Orchestration: Kubernetes (AWS EKS, K3s), Helm, Docker.
- Infrastructure as Code & Configuration: Terraform, CloudFormation, Terragrunt, Ansible.
- Continuous Integration & Delivery (CI/CD): ArgoCD (GitOps), GitHub Actions, GitLab CI, AWS CodePipeline, AWS CodeBuild, Jenkins.
- Workflow Automation & RPA: Apache Airflow, UiPath, Automated Testing (Selenium/WebDriver, pytest, unittest).
- Observability & Monitoring: Datadog, CloudWatch, Prometheus, Grafana, Loki, Tempo, ELK Stack, Nagios.
- Reliability Engineering: Incident Management, SLOs, SLIs, Error Budgets, Chaos Engineering (AWS FIS), Disaster Recovery (DR), High Availability Architecture, SLA Management.
- Security & Governance: Cloud IAM, Bitnami Sealed Secrets, AWS WAF, GCP Cloud Armor, AlienVault OSSIM.
- Software Engineering & Backend: Django, Flask, Ruby on Rails.
- Scripting: Python, Bash, SQL.
- Databases & Storage: PostgreSQL, MySQL, Pgcat, Longhorn, AWS S3.
- Storage & Data Formats: Longhorn (Block Storage), NFS, Parquet.
Professional Experience
Ailytics (ailytics.ai) | Senior DevOps & Cloud Engineer | August 2025 – March 2026 | Singapore, Remote
Ailytics is a growing B2B AI video analytics startup. As a Senior DevOps Engineer, I architected the critical infrastructure required to run their Kubernetes-based platform.
- Reliability: Engineered highly available deployment strategies across cloud (AWS EKS) and on-premise (K3s on Ubuntu Servers) environments. Authored Terraform modules and Ansible playbooks to orchestrate K3s deployments across 5 bare-metal client servers. Architected distributed block storage using Longhorn.
- Operational Excellence: Architected a comprehensive cloud-native observability ecosystem (Prometheus, Grafana, Loki, Tempo) to accelerate MTTR. Refactored legacy Helm charts and implemented ArgoCD to establish GitOps-based continuous delivery, accelerating deployment times by ~40%.
- Performance Efficiency: Accelerated global content delivery and reduced application latency by implementing AWS CloudFront. Optimized PostgreSQL database performance utilizing Pgcat connection poolers for high-concurrency traffic.
- Security: Resolved strict client firewall constraints by deploying ngrok secure tunnels and configuring inter-node NAT via Ansible. Implemented encrypted secret management via Bitnami Sealed Secrets within Helm deployments.
Traveloka is Southeast Asia’s leading lifestyle superapp and a tech unicorn operating at massive enterprise scale, processing highly volatile booking and financial traffic for millions of users.
- Cost Optimization: Orchestrated enterprise AWS FinOps initiatives, reducing monthly compute expenditure by ~30%, by architecting scalable infrastructure utilizing AWS EC2 Spot Instances and Auto Scaling Groups (ASG). Debugged complex cost anomalies and implemented strict billing alerts.
- Operational Excellence: Upgraded the IaC ecosystem by implementing Terragrunt over 50 existing Terraform modules. Engineered automated CI/CD pipelines using AWS CodeBuild, CodePipeline, and GitHub Actions. Deployed UiPath (RPA) infrastructure on Windows Server to automate legacy partner workflows.
- Reliability: Participated in a high-stakes 24/7 on-call rotation for business-critical travel services, triaging production incidents and maintaining strict SLAs. Designed Disaster Recovery (DR) plans and implemented Datadog for real-time monitoring across ECS and RDS.
- Security: Implemented security perimeters by deploying AWS WAF across all public-facing endpoints. Designed and enforced strict AWS IAM roles and policies to guarantee least-privilege access controls across more than 100 multi-tenant accounts.
Vidio is one of Southeast Asia’s premier OTT video streaming platforms, operating at massive scale and regularly handling traffic spikes exceeding 200,000 Requests Per Second during live broadcast events.
- Cost Optimization: Executed a large-scale infrastructure migration from AWS to GCP. Reduced cloud billing by ~30% by rightsizing Linux server workloads and engineering fault-tolerant architectures utilizing Google Compute Engine preemptible instances.
- Performance Efficiency: Managed Akamai CDN and Varnish Cache configurations to ensure high-availability video streaming handling over 200,000 RPS. Maintained Thumbor image processing servers for dynamic content delivery.
- Operational Excellence: Automated infrastructure provisioning and configuration management using Jenkins and Ansible. Upgraded core data pipelines by migrating Apache Airflow to Python 3 to streamline workflows.
- Reliability & Observability: Integrated Datadog with a centralized ELK stack to provide full-stack visibility into distributed environments, accelerating incident response and drastically reducing MTTR.
- Developer Experience (DevEx): Championed internal developer productivity by standardizing and optimizing local Vagrant environments, successfully reducing local provisioning times by ~30 minutes and removing workflow friction for the software engineering teams.
- Security: Implemented Google Cloud Armor to secure public web services against external threats and enforced strict geo-blocking policies.
Aplikasi Pesan Indonesia - BBM | DevOps Engineer | November 2018 – July 2019 | Jakarta, On-site
Aplikasi Pesan Indonesia (BBM) operated one of the most widely used messaging and digital ecosystem platforms in the country, serving tens of millions of active users.
- Performance Efficiency: Engineered highly scalable data restoration pipelines utilizing Jenkins, Python, and GCP Dataflow to process ~9 GB of Parquet files from GCS and continuous live message streams from Google Pub/Sub directly into Elasticsearch clusters.
- Infrastructure: Automated server provisioning and configuration management using Ansible across Google Cloud Platform (GCP) Virtual Machines, ensuring the underlying stability of a high-throughput messaging ecosystem.
- Observability: Managed a centralized ELK Stack and implemented Datadog monitoring to proactively track the health, logging, and performance of massive Elasticsearch clusters.
- Reliability: Authored critical incident runbooks and participated in high-stakes 24/7 on-call rotations to rapidly triage production outages, ensuring mission-critical messaging uptime and eliminating single points of failure.
Jublia is a Singapore-based event technology company that provides matchmaking and networking platforms for global B2B conferences.
- Software Architecture & Backend: Engineered a custom Python application dispatcher capable of dynamically mapping URL prefixes to dedicated Flask micro-applications, significantly improving routing modularity. Modernized legacy systems by porting monolithic PHP web applications to a Flask stack and executing a critical core upgrade from Python 2 to Python 3.
- Data Engineering & Integration: Architected automated, two-way data synchronization pipelines utilizing Python to ensure seamless real-time data integrity between Google Spreadsheets and backend MySQL databases to provide alternative familiar interface for customers.
- Operational Excellence & Testing: Authored comprehensive
unittest suites and automated browser tests for distributed Flask applications. Engineered robust GitLab CI pipelines to enforce automated testing logic, accelerating safe deployment cycles to production Ubuntu servers.
- Mentorship & Leadership: Mentored intern software engineers, providing technical guidance on Python backend development and CI/CD best practices to accelerate their onboarding and impact.
Lewatmana was an early pioneer in smart-city traffic monitoring, processing real-time routing data and camera feeds across a distributed network spanning 5 international cities.
- Operational Excellence: Authored
unittest and automated browser tests for Django applications utilizing pytest and Selenium WebDriver. Engineered GitLab CI pipelines to execute these testing suites within isolated container environments. Automated background processes utilizing Bash scripting and deployed Vagrant to standardize local development workflows.
- Reliability: Deployed Nagios and Cacti to monitor the performance of distributed Python/Django applications and the servers. Managed PostgreSQL database replication systems using SkyTools.
- Performance Efficiency: Executed a major framework upgrade of a Django application from version 1.2-prealpha to 1.9, and optimized Django ORM queries to improve data retrieval speeds.
- Database Scalability: Resolved critical PostgreSQL maximum row limitations by changing to another field type and then synchronizing the ORM table declaration, ensuring continuous high-volume traffic data ingestion without downtime.
- Data Processing: Engineered robust Python backend features to accurately parse, synchronize, and calculate complex timestamp and time-series data across multiple timezones, ensuring strict data integrity for the traffic logging ecosystem.
- Data Ingestion & ETL: Architected Python data ingestion pipelines to automate the bulk import of legacy
.xls files directly into PostgreSQL. Developed complex parsing logic to dynamically resolve inconsistent file encodings and regional localization formats, preventing database ingestion failures.
Noosc Global (noosc.co.id) | IT Security Analyst | November 2012 – October 2014 | Jakarta, On-site
Noosc Global is an Indonesian Managed Security Service Provider (MSSP) delivering enterprise-class cybersecurity and network monitoring solutions.
- Security: Monitored and maintained AlienVault Open Source SIEM. Integrated critical log sources and network appliances including Snort, Palo Alto networks, and F5 Firewalls into the SIEM ecosystem to accelerate threat detection.
- Operational Excellence: Configured and aligned enterprise security tooling to enforce and comply with strict client security policies and regulatory frameworks.
Education
Universitas Darma Persada, Jakarta
Bachelor of Informatics Engineering
2009 - 2015
Courses and Certifications
-
| AWS Cloud Financial Management for Builders |
Amazon Web Services (2022) |
-
| Security Engineering on AWS |
Amazon Web Services (2022) |
-
| Red Hat Certified System Administrator (RHCSA) |
Red Hat (2014) |