We believe work is not a place, but rather a thing you do. Our technology revolves around this core philosophy. We are relentlessly committed to helping people work and play from anywhere, on any device. Innovation, creativity and a passion for ever-improving performance drive our company and our people forward. We empower the original mobile device: YOU!
What we’re looking for:
You are a Senior Site Reliability Engineer having experience of at least 8 years in the software industry with a passion for DevOps and experience developing, deploying, monitoring and managing cloud services. You have expertise working with public cloud infrastructure providers like Microsoft Azure/Amazon AWS/Google Cloud. You have a solid understanding of Windows development, debugging, administration, automation frameworks and software development experience in C#./NET/Java/Python.
Position Overview
As a Site Reliability Engineer you will design and implement web applications and REST API services using a microservice-based infrastructure to replace our current monolith implementation. The major technology stack includes the Azure/AWS PaaS services, Docker/Kubernetes, RDBMS, NoSQL and advanced monitoring tools/technologies. Your focus will be on maximizing system uptime.You will help us build an autonomous cloud capable of reliably serving 1 billion users. We build autonomy into our products, tools, and environments to do fully automated operations for us. Engineering a solution that can withstand failure and scaling challenges is paramount to our success. SRE teams are responsible for the Security, Availability, Performance, Compliance, Cost/COGS, Change management, Monitoring, Emergency response and capacity planning of Citrix cloud services. You will build innovative automated solutions and tools to help debug and resolve problems in production and prevent them from recurring. Further, you will proactively seek out system weaknesses and find ways to fix them before they cause production issues using monitoring data, watching trends, and using Chaos Engineering.
Role Responsibilities
You will develop and architect our next generation Citrix Cloud platform
Will manage and architect authentication and authorization across a large organization including handling employee lifecycles and role-based access control
You will provide common solutions to multiple teams by identifying and addressing patterns in infrastructure and applications
Will work with other development teams to design scalable, robust systems using cloud native architecture principles
Will develop software and provide hands-on technical expertise to design, deploy, and optimize Cloud services
You will drive technical architecture and standards/process to deliver the best customer experience
Will participate in an on-call rotation for the Citrix cloud Services platform
You will build framework policy engine that will monitor all security in the cloud, provide guidance on remediation; offer a fully autonomous service
Keeping your assigned site or service up and running or getting it back up and running quickly when failure occurs
Working closely with internal partners and teams to ensure that we ship software that meets security, SLA, and performance requirements
Writing, updating, and using documentation, including runbooks/playbooks
Automating work including infrastructure needs, testing, failover solutions, failure mitigation, and much more
Debugging complex problems across an entire stack and creating solid solutions
Developing CI/CD processes to improve cadence
Using Chaos Engineering to test what you build under real-world conditions
Experience with monitoring and observability such as with Splunk, New Relic, and Nagios is a plus
Experience automating infrastructure, testing, and deployments using tools like Ansible, Chef, or Terraform and can explain the Infrastructure as Code paradigm
Experience with configuration management, such as with Terraform, Ansible, Chef, Puppet etc.
You will be the technical mentor and coach for a team of SRE’s and guide them on day-to-day engineering tasks while working on individual deliverables in collaboration with engineering leadership.
Basic Qualifications
You have an Engineering Degree (BE/BTech/BS) or higher in CS as your educational Qualification
You have at least 8 years of software development experience with one or more Object Oriented languages, Preferably in .Net and C# (Preferred), or Java, Python
Have a good understanding of automation and frameworks like Chef, Puppet, Ansible, Docker, Kubernetes, Serverless architectures, ARM, PowerShell
You have hands on experience building Cloud services and distributed systems – deployment, monitoring, scaling, debugging.
You are proficient in one or more large public cloud providers Azure (preferred), AWS or GCP, or in large-scale PaaS/SaaS on any platform
You have a good understanding of security concepts at scale
You are a software engineer who knows how to apply engineering principles to operations. You have demonstrable experience managing or developing a multi-tenanted SaaS. You are well versed in a large number of technologies and welcome new tools and techniques.
You are comfortable in working in conjunction with fellow developers and operations members to come to the best possible solution.
You have the skill and ability to notice/observe patterns and ways to increase efficiency, eliminate downtime, optimize costs, and maintain performance at scale.
You have worked to provide hands-on technical expertise to design, deploy, secure and optimize Cloud services.
Have experience with algorithms, data structures, complexity analysis, software design and debugging on Windows platforms.
You come with a strong foundation in secure operational practices – key handling, access control, threat remediation
You are Interested in designing, analysing and troubleshooting large-scale distributed systems.
Have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Have experience with logging platforms and application performance using NewRelic, Splunk, SumoLogic, or App Insights Analytics
Have exposure to systems or applications at scale
Have experience on various Cloud Platforms: Azure (preferred), AWS or GCP
You have experience on CI/CD Tools such as TeamCity, Jenkins, Azure DevOps
Have experience on Container Tools , such as Docker, Kubernetes & SCM Tools: GIT, SVN, VSTS
You have experience on CM Tools such as Ansible, Terraform, Chef
What you’re looking for:
Our technology is built on the idea that everyone should be able to work from anywhere, at any time, and on any device. It’s a simple philosophy that guides everything we do — including how we work. If you’re an engineer, we’ll give you plenty of ways to test your skills on cutting edge technology. We want employees to do what they do best, every day.
Be bold. Take risks. Imagine a better way to work. If this sounds like you then we’d love to talk.
br{display:none;}.css-58vpdc ul > li{margin-left:0;}.css-58vpdc li{padding:0;}]]> Job Purpose: IDFC First Bank is looking for Salesforce Production Support Analyst who loves complex...
Apply For This JobFull Job Description Urgently looking for Female candidate for the role of Receptionist cum back office assistance.Must be computer proficientMust...
Apply For This JobAbout this role: Digital Quality Capability Leader We have an urgent opening for Digital Quality Capability Leader for our leading...
Apply For This JobJob Description Take classes to BBA Airline and Airport Management students and Diploma in Airline and Airport Management students on...
Apply For This Jobbr{display:none;}.css-58vpdc ul > li{margin-left:0;}.css-58vpdc li{padding:0;}]]> With over 80 years as an industrial technology leader, Kennametal Inc. delivers productivity to customers...
Apply For This JobThe Elevator Pitch: Why will you enjoy this new opportunity? VMware is the leader in virtualization and cloud infrastructure solutions...
Apply For This Job