Site Reliability Engineer
Location: Pune, India
Model of Work: Hybrid
About Quorum Software
Quorum Software connects people and information across the energy value chain. Twenty years ago, we built the first software for gas plant accountants. Pipeline operators came next, followed by land administrators, pumpers, and planners. Since 1998, Quorum has helped thousands of energy workers with business workflows that optimize profitability and growth. Our vision for the future connects the global energy ecosystem through cloud-first software, data standards, and integration. The trusted source of decision-ready data for 1,800+ companies, Quorum Software makes the essential connections that let us work better together in the connected energy workplace. For more information, visit quorumsoftware.com
Be a part of our legacy
Quorum Software is the world's largest provider of digital technology focused solely on business workflows that empower the next evolution of energy. From emerging companies to supermajors, throughout every region of the globe, customers rely on Quorum's proven innovation and unmatched global expertise to streamline business operations and make data-driven decisions that optimize profitability and growth. Our industry-leading solutions are transforming energy companies across the entire value chain, helping visionary leaders evolve their organizations into modern energy companies.
Who we are looking for:
Are you excited by challenges? Do you enjoy working in a fast-paced, international and dynamic environment? Then now is the time to join Quorum Software, a rapidly growing company and industry leader in oil & gas transformation.
What You Will Do:
- Observability: Design and implement scalable observability solutions, including monitoring, logging, tracing, and alerting with a strong focus on minimizing downtime. Experience with monitoring/ alerting solutions.
- Instrumentation and Integration: Work closely with development teams to ensure proper instrumentation of applications, services, and infrastructure components for comprehensive observability.
- Cloud Platforms: Strong understanding of public cloud services and architecture. Familiarity with VMware private cloud solutions. Hands-on experience with hybrid cloud environments.
- Incident Response and Resolution: Play a key role in incident response by utilizing observability tools to quickly identify and resolve issues, minimizing downtime and impact on end-users. Familiarity with ITIL practices and incident management frameworks.
- Performance Optimization: Collaborate with development teams to analyze performance metrics and implement optimizations to enhance system reliability and efficiency.
- Automation and Scripting: Develop automation scripts and tools to streamline observability processes, ensuring timely data collection and analysis.
- Capacity Planning: Contribute to capacity planning efforts by analyzing observability data to forecast future resource requirements, proactively address potential bottlenecks and identify under or unutilized resources.
- Collaboration, Knowledge Sharing, Documentation: Foster a culture of collaboration by sharing insights and best practices with cross-functional teams. Provide training and guidance to promote observability best practices. Assist with creation of solution documentation. Partner with internal and vendor resources to evangelize solutions.
- Continuous Improvement: Stay abreast of industry trends and emerging technologies in observability, and contribute to the continuous improvement of our observability strategy and tools.
- And other duties as assigned.
What to Bring:
- Overall 5-8 years of relevant experience
- Experience with public and private cloud platforms.
- Background in it operations including managing datacenter/ public cloud infrastructure, IT Services
- A track record of actively seeking and implementing improvements in cloud operations processes and technologies
- Experience with ensuring adherence to SLO for response and resolution of alerts and incidents
- Excellent written and verbal communication skills and customer empathy
- Knowledge of major cloud platforms such as Amazon Web Services (AWS), Microsoft Azure, and Private Datacenters
- Proficiency in operationalizing and managing alerts to proactively identify and address issues.
- Familiarity with ITIL (Information Technology Infrastructure Library) framework, particularly incident management, problem management, and change management.
- Understanding of cloud security best practices, and compliance standards.
- Strong problem-solving skills to analyse complex issues, identify root causes, and implement effective solutions.
- Strong documentation skills to maintain records of operational procedures, incident reports, and other relevant documentation.
- Ability to review trends and identify areas of opportunities within reported alerts.
- Understanding of ITSM processes and their integration with cloud operations
- Technologies:
- Windows Server, Active Directory, Linux
- Application and infrastructure stack
- Public/ Private cloud: Azure , AWS, VMWare,
- Security: encryption, HTTPS, firewall, best practices
- High availability, clustering, backup, disaster recovery, business continuity
- Identity: SAML, OAuth, Azure AD, Okta
- Database: MSSQL, Oracle, DB PaaS/ DBaaS
- Azure – AppInsights, Log Analytiics, Monitoring
- Monitoring - PRTG, Datadog, Zabbix, AWS Cloudwatch, PagerDuty.
- Automation and scripting - Powershell, Ansible, Terraform
Nice to Have:
- Certifications related to AWS,Azure and ITIL frameworks a plus.
Additional Details
- Visa Sponsorship: Employment eligibility to work with Quorum Software in India is required as the company will not pursue visa sponsorship for this position.
Diversity Statement: At Quorum, we are committed to fostering, cultivating and preserving a culture of diversity, equity and inclusion. We want to be the place where a diverse pool of talented people join us, stay with us and do their best work. With a diverse team of employees, we grow and learn better together. The collective sum of the individual differences, life experiences, knowledge, innovation, self-expression, and talent that our employees invest in their work represents not only part of our culture, but our reputation and our achievements. We are fully focused on equality and believe deeply in diversity of race, gender, sexual orientation, religion, ethnicity, national origin and all the other characteristics that make us unique. We have a DEI committee focused on Culture, Advocacy and Talent, have company-wide Unconscious Bias training and more.
Quorum Business Solutions and Quorum Software are Equal Opportunity Employers. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, ancestry, veteran status, disability, genetic information, or any other basis protected by law.
Those applicants requiring reasonable accommodation to the application and/or interview process should notify a member of the Human Resources Department