Sharan Chenna
Site Reliability Engineersharan@sre:~$ curl -s -O https://www.sharanch.dev/resume.pdf
sharan@sre:~$ cat summary.txt
Results-driven Site Reliability Engineer with 4 years of experience in managing large-scale distributed systems. Specializing in improving system reliability, implementing robust monitoring and alerting solutions, and automating operational tasks. Skilled in incident response, performance optimization, and fostering a culture of blameless postmortems.
sharan@sre:~$ ls -l work-experience
Cloud Operations Engineer
Oracle Cloud Infrastructure (Feb 2024 - Sep 2025)
- > Managed and scaled a provisioning API core to compute services, Improving reliability by over 15%
- > Managed a fleet of 8000+ KVM Based Hypervisors end to end, Improved operational uptime by over 20%
- > Automated routine tasks to reduce man hours by over 3 hours a week
- > Designed and implemented a comprehensive observability stack using openTelemetry and Grafana.
- > Worked closely with development teams and improve incident response processes, fostering a culture of reliability and continuous learning.
- > Conducted RCA and blameless postmortems to avoid recurring issues
- > Coordinate and document critical incidents to ensure rapid resolution and effective communication with stakeholders
- > Optimized container images by using staged builds
- > Used Terraform and Chef to automate patching across regions worldwide
- > Devloped command line tools to ease Oncall Operations
- > Manged CI/CD which are built on TeamCity
- > Maintained software security by working closely with Dev team to ensure compliance
Associate Engineer - LinuxOps
CtrlS Datacenters (Apr 2021 - Dec 2023)
- > Maintained and monitored private cloud infrastructure onpremises, ensuring 99.9% uptime.
- > Developed scripts to automate routine tasks, saving over 10 hours of manual work per week.
- > Contributed to on-call rotation and performed root cause analysis for production incidents.
- > Setup Ansible automation pipelines for various enterprise clients
- > Developed a monitoring dashboards using Zabbix agents
- > Worked on VMware for provisioning of VMs
- > Worked on SOC to Implement IAM
- > Worked with LVM, NFS, Samba and CIFS
sharan@sre:~$ cat skills.txt
Infrastructure Automation
- > Terraform, Ansible, Chef
- > Docker, Kubernetes, OCI Build
- > AWS CLI, OCI, GitLab CI/CD
- > TeamCity, Jenkins (CI/CD pipelines)
Observability & Monitoring
- > Prometheus, Grafana, Alertmanager
- > OpenTelemetry, Blackbox Exporter
- > Zabbix, Node Exporter
- > Log analysis, RCA tooling
Scripting & Ops Engineering
- > Python, Bash, SaltStack
- > Disk usage analysis, batch ops
- > Custom CLI tooling for on-call
- > Incident coordination & postmortems
sharan@sre:~$ ls -l projects/
Monitoring Stack Deployment
Proof Of Concept project for Terraform for Infrastructure Provisioning, AWS Provider, Ansible Automation, Docker for portability, Grafana Dashboards, Prometheus for PromQL and Node Exporter for metrics
view →Jenkins CI/CD
Proof Of Concept project that uses Jenkins for CICD, This deploys a web app using flask and uses GitHub webhook payloads for triggering the Pipeline.
view →sharan@sre:~$ ls social
I'm currently open to new opportunities. Whether you have a question or just want to say hi, my inbox is always open.
Say Hello