Handle technical inquiries from internal and external clients.
Complete the tasks and assignments given by managers.
Identify, troubleshoot, resolve or escalate incidents quickly and effectively.
Monitor and maintain the overall operational status of system services.
Assist in investigating and resolving problems reported by the platform's end users.
Develop tools, enhance operational processes, and provide automated solutions that enable better system service administration and support.
Perform root cause analysis; identify and resolve underlying problem patterns while driving the development of automated, self-healing solutions.
Participate in conference calls to help resolve outages.
Write clear and consumable documentation of the environment, operational procedures and project-specific tasks.
Being extremely proficient with the Unix/Linux command line, shell scripting, and configuring systems monitoring tools.
Being security-minded and knowing the security implications of every decision made.
Managing ambiguity and being able to seek out the information needed to make informed decisions.
Taking automation seriously and having experience implementing automated processes with tools such as Jenkins.
Having experience with configuration management, monitoring, and automation tools.
Having a thorough understanding of AWS/Aliyun is a plus; the ability to gain this understanding is a must.
Having knowledge of all things networking: TCP/IP, ICMP, SSH, DNS, SSL/TLS.
Having fought fires at scale, wrestled with lost instances on AWS, and being able to troubleshoot processes and servers gone awry with eyes closed.
Maintaining provisioning and continuous integration (CI) frameworks.
Participating in defining the DevOps technology roadmap, toolsets and strategies based on industry best practices.
Maintaining configuration management and continuous delivery (CD) frameworks.
Requirements:
Strong analytical and planning skills.
Good communication and presentation skills.
Excellent problem-solving skills.
5+ years of eCommerce domain experience.
At least 5 years of experience in enterprise-level IT projects using cloud-based technologies (PaaS, IaaS and SaaS).
3+ years of hands-on experience using cloud-based platforms such as ALIYUN and AWS.
Has experience with version control systems such as Git and Subversion, and with branching/merging strategies.
Has experience automating monitoring and alerting with tools such as New Relic, AppDynamics, Splunk and Sumo Logic.
Has expert knowledge of Linux operating systems and command line tools, as well as internet and network protocols.
Has experience with CI systems such as Jenkins, Bamboo, GitLab.
Has a background in networking, VIPs, firewalls and load balancers.
Familiar with containerization platforms such as Docker and Kubernetes.
Has strong persuasion, facilitation and influencing skills.
Must possess excellent critical thinking, reading and interpretation, writing, and communication skills.
Must be strongly fact- and data-oriented.
Deadline- and closure-oriented, as well as self-driven.
Experience in using CI/CD solutions such as Jenkins, TeamCity, Bamboo, Spinnaker and Artifactory.
Experience handling large numbers of diverse systems with configuration management systems.
Experience in using, implementing and administering logging, telemetry and monitoring tools like Splunk and Prometheus is an advantage.
Experience with Git – branching strategies and best practices.
Experience working with Linux internals and operating systems.
Fluency in scripting languages such as Bash or Python, and in regular expressions.
Software development experience (Java, Perl, Python, C) is an advantage.
Understanding of standard network protocols and components such as HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI model, subnetting and load balancing strategies.
Experience with cloud-based serverless, storage and developer tool technologies.
Experience in hybrid and multi-cloud implementations is an advantage.
Experience handling AWS/ALIYUN service offerings such as EC2/ECS, VPC, RDS, S3/OSS, IAM, CloudFormation, CloudWatch, CloudTrail and load balancing.
Experience handling ALIYUN/Huawei Cloud service offerings is also an advantage.
Strong analytical and problem-solving skills.
Ability to work through complex engineering challenges/obstacles.
Experience handling customer issues, with excellent verbal and written communication skills.
Passion for eliminating repetitive manual processes using automation.
Strong sense of ownership, customer service, and integrity demonstrated through clear communication.
Experience implementing systems and applications behind a CDN.
Experience implementing systems and applications with DDoS mitigation and a WAF.