The Role:
The DevOps Manager will be responsible and accountable for the infrastructure and security of the Production, Staging, and Development environments. The DevOps Manager will implement, support, and harmonize DevOps processes across the DevOps support teams (L1, L2, L3, and development for small changes) for Mobikwik. The role is responsible for ensuring that the support teams respond to Production incidents in the best possible way.
Key accountability and decision ownership:
- 10+ years of DevOps or related experience with gradually increasing responsibilities and a demonstrated understanding of DevOps and technical quality control processes, artifacts, and tools
- Ensure regular monitoring of routine processes and day-to-day job execution for the smooth running of all Production / Staging / Development environments
- Active participation in operations strategy & implementation, driving quality and efficiency, including best practices & metrics for system operations
- Ensure proper tracking and reporting of all measurements related to Enterprise IT systems, including system health reports, outage reports, L1, L2, predictive analysis, and management/operating summaries
- Responsible for all 24x7x365 Level 1/2 Operations in a proactive manner (Deployments, Monitoring, Troubleshooting, SLA/OLA Management, Service Capacity Management, Service Incident Management, Service Problem Management, Licensing, etc.) for the corresponding services.
- Implementing DevOps tools and life cycle for the services under the role's responsibility.
- Implementing Operational automation processes.
- Enable successful DevOps (Agile Operations) by transitioning code from Dev/Test through Staging to Production.
- Function as the escalation point for every P1 incident across Production operations
- Ensure the DevOps teams are adequately managing high- and critical-priority incidents in Production
- Ensure that solutions are of high quality and that security, performance, and operational requirements are met.
- Mentor - provide guidance, training, and problem-solving support to team members
- Identify opportunities for automation and architecture simplification
- Deploy, automate, and maintain hybrid cloud-based solutions
- Strong analytical and problem-solving skills.
Skills & Experience:
- Deep experience with Linux and associated OS concepts and services, especially security.
- Experience working with Tomcat/Apache/JBoss is a must
- Experience with MySQL / PostgreSQL (installation, recovery, master-slave configuration) is a must
- Experience with NoSQL (MongoDB, CouchDB, Redis).
- Ability to automate processes by building new tools. Scripting experience (Shell/PHP/Python) is mandatory.
- In-depth knowledge of databases (especially MySQL) and of how to configure and optimize them for scale.
- Understanding of network protocols (TCP/IP, BGP, HSRP, NAT, IPSec) and adept at configuring hardware/software firewalls.
- Experience setting up application servers and configuring them for monitoring and performance
- Experience in virtualization with Xen / KVM.
- Datacentre management / Cloud management.
- Knowledge of Python / Java / Perl
Additional Requirements:
- Experience working with Data Analysis, Hadoop, Cloud infra.
- Managing server infrastructure with knowledge of how the web (HTTP, APIs, REST) works