Senior Staff Software Engineer: C++, Linux, Kubernetes
Cohesity
Cohesity is the leader in AI-powered data security. Over 13,600 enterprise customers, including over 85 of the Fortune 100 and nearly 70% of the Global 500, rely on Cohesity to strengthen their resilience while providing Gen AI insights into their vast amounts of data. Formed from the combination of Cohesity with Veritas’ enterprise data protection business, the company’s solutions secure and protect data on-premises, in the cloud, and at the edge. Backed by NVIDIA, IBM, HPE, Cisco, AWS, Google Cloud, and others, Cohesity is headquartered in Santa Clara, CA, with offices around the globe.
We’ve been named a Leader by multiple analyst firms and have been globally recognized for Innovation, Product Strength, and Simplicity in Design, as well as for our culture.
Want to join the leader in AI-powered data security?
We are seeking a Senior Staff Technical Leader to drive the architecture, design, and delivery of highly reliable, scalable distributed systems at Cohesity. This role requires deep technical expertise, strong architectural judgment, and the ability to influence engineering direction across multiple teams while remaining hands-on with complex technical problems.
You will own critical subsystems and platform capabilities that directly impact product scalability, operability, and customer experience across on-prem and cloud deployments. Beyond feature development, you will lead cross-functional efforts to improve system reliability, performance, and diagnosability, and act as a senior technical authority during high-severity customer and production incidents.
As part of the Product and Sustenance Engineering organization, you will partner closely with Product Management, Architecture, Customer Support, Field Engineering, and customers, driving technical strategy while mentoring senior engineers and raising the overall engineering bar across the organization.
HOW YOU’LL SPEND YOUR TIME HERE
Technical Leadership & Strategic Ownership
Act as a technical authority for one or more major product areas or platform components
Drive long-term technical vision and architecture, aligning system design with business and product strategy
Influence technical direction across teams through design leadership, technical reviews, and mentorship
Serve as the final escalation point for complex production and customer issues
Architecture & System Design
Own and evolve distributed system architectures with a strong focus on scalability, availability, performance, and security
Lead design of systems deployed across Kubernetes-based platforms, on-prem environments, and public clouds
Make data-driven architectural trade-offs across compute, storage, networking, and orchestration layers
Review and approve designs for critical initiatives, ensuring alignment with platform standards and best practices
Execution & Delivery Excellence
Own end-to-end delivery of large, cross-team initiatives including scope definition, sequencing, and risk management
Partner with Product Management to shape roadmaps, refine backlogs, and balance short-term delivery with long-term platform health
Proactively identify technical debt, reliability gaps, and scalability risks, and drive execution of mitigation plans
Hands-on Engineering & Deep Problem Solving
Design and implement complex features and infrastructure-level improvements across on-prem and cloud deployments
Perform deep root-cause analysis of complex functional, performance, and networking-related issues
Lead debugging efforts involving distributed systems, containerized workloads, networking stacks, and storage subsystems
Drive development and validation of patches, hot-fixes, and emergency releases in collaboration with QA and Field teams
Mentorship, Influence & Collaboration
Mentor senior and staff engineers on system design, debugging techniques, and engineering best practices
Raise the bar for code quality, testing, observability, and operational excellence
Collaborate effectively with globally distributed teams, influencing without authority through technical credibility
WE’D LOVE TO TALK TO YOU IF YOU HAVE MANY OF THE FOLLOWING
BS/MS in Computer Science, Engineering, or equivalent practical experience
15+ years of experience designing, building, and operating large-scale distributed systems
Strong hands-on coding experience in one or more of: C, C++, Go, Python
Deep expertise in Linux-based systems and production debugging in complex environments
Advanced debugging and troubleshooting skills using tools such as GDB, Python debuggers, profilers, logs, and metrics
Strong understanding of microservices and SaaS architectures
Kubernetes & Cloud-Native Systems
Strong experience designing and operating systems on Kubernetes
Deep understanding of container orchestration, service discovery, configuration management, and workload scheduling
Experience debugging issues related to pod lifecycle, resource contention, networking, and storage in Kubernetes environments
Networking & Distributed Systems
Solid understanding of networking fundamentals: TCP/IP, DNS, load balancing, proxies, TLS, and service networking
Experience diagnosing and resolving distributed networking issues across containers, VMs, and physical hosts
Familiarity with cloud networking concepts (VPCs, subnets, security groups, ingress/egress); experience with service meshes is a plus
Additional Skills
Strong grasp of testing methodologies, reliability engineering, and production readiness practices
Proven ability to articulate architectural decisions, evaluate trade-offs, and influence stakeholders using data
Excellent communication skills across technical and non-technical audiences
Experience working in Agile/Scrum development environments
Preferred
Experience in data management, storage, backup/recovery, or infrastructure platforms
Track record of leading cross-team, multi-quarter initiatives
Experience building systems with stringent availability, performance, and security requirements
Demonstrated experience leveraging AI tools to streamline workflows, enhance productivity, and support high-quality decision-making
Data Privacy Notice for Job Candidates:
For information on personal data processing, please see our Privacy Policy.
Equal Employment Opportunity Employer (EEOE)
Cohesity is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status or any other category protected by law.
If you are an individual with a disability and require a reasonable accommodation to complete any part of the application process, or are limited in the ability or unable to access or use this online application process and need an alternative method for applying, you may contact us at 1-855-9COHESITY or talent@cohesity.com for assistance.
In-Office Expectations
Cohesity employees who are within a reasonable commute (e.g. within a forty-five (45) minute average travel time) work out of our core offices 2-3 days a week of their choosing.
Interested candidates based outside of the designated areas are welcome to apply, provided they have the right to work in the job location.