Krishna Gopal Devops - 1
PROFESSIONAL SUMMARY:
Over 9 years of experience comprising Kubernetes administration, cloud services management, and
enterprise application and server administration.
Strong experience in managing Kubernetes environments for scalability, availability and zero downtime.
Experienced in deployment strategies such as Rolling update, Canary and Blue-Green in Kubernetes for
providing updates to the applications and creating custom ConfigMaps and secrets in an encoded format
for security.
Experienced in containerizing applications to deploy on the managed Kubernetes services EKS and AKS.
Created alerts and dashboards for monitoring SLO/SLI using Datadog and CloudWatch.
Good experience using Helm charts and Kustomize to compose deployment manifest files for K8s
objects/microservices.
Experience with Chaos testing for Network exhaustion, Pod failure, Node failure, High CPU load, Memory
exhaustion, new deployment failures, Horizontal Pod Autoscaling (HPA), Container startup failures and
Dependency failures
Experienced in infrastructure and application monitoring (observability) tools: Prometheus-Grafana,
Splunk, ELK/OpenSearch, OpenTelemetry, Datadog, New Relic, AWS CloudWatch, and AppDynamics.
Good understanding of Java, microservices architecture, and distributed data streaming systems such as
Kafka.
Efficient in writing Infrastructure as Code (IaC) in Terraform and AWS CloudFormation, with experience
using Ansible and Chef for configuration management.
Strong CI/CD experience with Git, Jenkins, and Azure DevOps for build automation and deployments, and
expertise in using tools like Maven, Ant, and MSBuild for building deployable artifacts from source code
repositories.
Experience working with AWS and its services, including VPC, EC2, IAM, ECS, EBS, RDS, S3, Lambda, ELB, Auto
Scaling, Route 53, CloudFront, CloudWatch, CloudTrail, SQS, and SNS.
Experienced in databases such as MySQL, Oracle, MariaDB, MongoDB, and DynamoDB.
Good understanding of OSI Model, TCP/IP protocol suite (IP, ARP, TCP, UDP, SMTP, FTP, and TFTP)
Good understanding of Observability and MELT implementation patterns for large-scale services. Solid
understanding of Site Reliability Engineering principles, with a substantiated history of successfully
applying SLAs, SLIs, and SLOs to enhance and quantify system dependability and efficiency.
TECHNICAL SKILLS:
Container Orchestration: Kubernetes, K3s, Docker, Rancher, Clair, Chaos Monkey, Gremlin, EKS, AKS, Containerd, Istio (Service Mesh)
Monitoring: Prometheus, Grafana, Nagios, CloudWatch, Splunk, Datadog, New Relic
Logging: Elasticsearch, Kibana, Promtail-Loki, Kafka, Fluentd, Logstash
Cloud Technologies: Amazon Web Services (AWS), Azure DevOps
Provisioning Tools: Terraform, OpenTofu, CloudFormation
Config Management: Ansible, Chef
Languages: Python, Java, Perl, JSON, YAML, PowerShell, Bash/Shell Scripting
CI-CD Tools: Jenkins, Azure DevOps
Build Tools: Maven, Ant, Gradle
Code Quality: SonarQube
Version Tools: Git, Bitbucket, Subversion, GitHub
Database: MySQL, Oracle, Amazon, MariaDB, MongoDB, Redis
Networking/Protocols: DNS, LDAP, TCP/IP, FTP, HTTP, HTTPS, SSH, SFTP, SCP, SSL
Operating Systems: Linux (RHEL 4/5/6/7), UNIX, Ubuntu, CentOS, Windows
App/Web Servers: Tomcat, Nginx, Apache Web Server, WebLogic, IBM WebSphere
CERTIFICATIONS:
AWS Certified Solutions Architect Associate:
https://www.credly.com/badges/e39fdf6a-24c4-4ff3-8dcb-7fffe1320ebe/public_url
Certified Kubernetes Administrator (CKA):
https://www.credly.com/badges/4629f9f1-fddb-4413-97d8-7dab05730158/public_url
WORK EXPERIENCE:
Client: AdvizeX, Cleveland, OH Oct 2023 - Present
Role: DevOps/Kubernetes Engineer
Responsibilities:
Designed and implemented Continuous Integration and Continuous Delivery (CI/CD) pipelines using Git,
Jenkins, Bamboo, and GitHub Actions to automate the build, test, and deployment processes across
development, QA, and production environments.
Automated deployments on AWS, using IAM to integrate Jenkins with AWS CodePipeline, and created EC2
instances for virtual servers.
Designed and deployed multiple applications using various AWS services (e.g., EC2, S3, RDS, VPC, IAM,
ELB, EMR, CloudWatch, Route 53, Lambda, and CloudFormation) with a focus on high availability and
fault tolerance.
Implemented event-driven architectures, integrated different AWS services and SaaS applications,
automated workflows based on events, and created real-time data processing applications.
Managed high-performance applications requiring robust database features, delivered fault-tolerant and
scalable database solutions, and migrated on-premises databases to the cloud using Amazon Aurora.
Implemented AWS Lambda functions to run scripts in response to events in Amazon S3 buckets, using
Amazon API Gateway.
Provided operational and maintenance support for AWS cloud resources, including launching,
maintaining, and troubleshooting EC2 instances, S3 buckets, VPCs, ELBs, and RDS.
Built Terraform configurations to create AWS infrastructure, pulling Terraform code from GitHub
repositories, and worked closely with teams to ensure high-quality and timely delivery of builds and
releases.
Built and managed Kubernetes clusters using Terraform to automate infrastructure deployment,
configuration, and scaling, resulting in increased efficiency and reduced deployment time. Managed and
scaled containerized applications using EKS.
Implemented Kubernetes network and service discovery concepts such as Services and Ingress for
application pods and end users to communicate.
Managed and configured Kubernetes application Deployment YAML files using kubectl and Helm.
Implemented interactive, customizable visualizations for tracking experiments using Weights &
Biases (W&B), making performance easier to analyze, and used its tools for tracking experiments,
managing datasets, and organizing models to ensure reproducibility and streamline the ML workflow.
Created alerts and dashboards for monitoring SLO/SLI using Datadog and CloudWatch, ensuring
proactive incident management.
Designed and implemented a robust CI/CD pipeline using Jenkins integrated with OpenShift to automate
the build, test, and deployment processes.
Worked with various Docker components, including Docker Engine, Hub, Machine, Compose, Swarm,
and Docker registry, and created custom Docker container images, tagged them, and pushed them to
Docker Hub.
Containerized legacy applications and microservices using Docker and deployed them on OpenShift
clusters, enabling scalable and efficient management of applications.
Migrated AWS infrastructure from Elastic Beanstalk to Docker with Kubernetes.
Implemented role-based access control (RBAC) and integrated OpenShift with LDAP for secure user
management and compliance with regulatory requirements.
Configured network policies and security groups to isolate sensitive data and ensure secure
communication between services, adhering to industry standards.
Configured Kubernetes for high availability, including pod autoscaling, node affinity, and anti-affinity
rules.
Performed SRE (Site Reliability Engineer) responsibilities with observability and monitoring expertise.
Supported applications experiencing downtime or degraded performance to ensure the reliability,
availability, and performance of applications in production.
Contributed to disaster recovery strategies and processes for restoring service availability and data
access following a catastrophic event.
Automated the collection and analysis of metrics, logs, and traces. Set up alerts to notify teams of
anomalies or threshold breaches.
Implemented Observability and APM by using Prometheus, Splunk, Grafana, Dynatrace, ELK Stack, and
Jaeger.
Scheduled regular backups and tested recovery processes to ensure data integrity and system
availability.
Automated deployments and integrated Rancher with the CI/CD pipeline to automatically deploy
changes to development and staging environments.
Used AppDynamics, Dynatrace, Kibana, Grafana dashboards, and Prometheus Alertmanager
to monitor the health of Kubernetes and OpenShift nodes.
Set up Redis with multiple replicas and automatic failover to ensure high availability and reliability,
and implemented monitoring solutions to track Redis performance and health.
Managed Oracle, MySQL, and DynamoDB databases and performed server-side database tasks.
Used Packer and Terraform to automate system operations for deployment automation.
Experienced in Agile and most recently in CI/CD practices.
Wrote automation scripts using Bash, JSON, Groovy, Python, and Maven for build automation.
Documented processes and configurations in Confluence, creating structured spaces for effective
collaboration among different scrum teams.
Environment: Git, SVN, Jenkins, Maven, AWS, Azure, Terraform, Kubernetes (Kops, Kubeadm, AKS,
OpenShift, EKS), ELK Stack (Elasticsearch, Logstash, Kibana), Fluent Bit, Splunk, Prometheus, Grafana,
Datadog, Dynatrace, Ansible, Argo CD, Docker (Engine, Hub, Machine, Compose, Swarm, Registry), AWS
Lambda, Amazon API Gateway, AWS RDS, VPC, ELB, Route 53, Ingress, Bash, JSON, Groovy, Python,
CloudWatch, IAM, AWS CloudFormation, Kubectl, Helm.
EDUCATION:
Master’s in Information Sciences, Trine University - 2022
Bachelor’s in Computer Applications, JNTUH - 2015