hero

We bind our fortunes to those who dare to burn
away the obsolete and forge the unimagined future.

Reliability and Failure Analysis Technician

Cerebras

Cerebras

Sunnyvale, CA, USA
Posted on Nov 12, 2025

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

The Role:

The Reliability and Failure Analysis Technician will be responsible for performing detailed root cause analysis on Cerebras products and delivering comprehensive reports on findings. This role requires extensive hands-on lab work, including troubleshooting, measurement, precision inspection, repair, and assembly of company products and sub-assemblies. The Technician will execute product audits from the manufacturing line, perform GD&T-based dimensional inspections using CMM tools, and validate product functionality and reliability through structured testing and measurement experiments.

This position also supports reliability testing and AI hardware validation to ensure long-term performance, durability, and mechanical integrity under demanding AI workloads. Collaboration with Engineering, Manufacturing Engineering, Quality Assurance, and Technical Support teams is essential to drive continuous improvement and product insights.

Responsibilities:

  • Conducts root cause failure analysis on Cerebras products and publishes detailed reports with supporting data, images, and conclusions.
    • Performs dimensional inspections and GD&T validation on critical mechanical and electronic assemblies using CMM, optical measurement tools, and precision gauges.
    • Executes cross-sectioning and microscopy analysis to identify solder joint defects, delamination, voiding, and interconnect quality across components and substrates.
    • Troubleshoots, repairs, and reworks complex assemblies at the component and board level, including electronic, thermal, and mechanical interfaces.
    • Designs and conducts measurement and correlation experiments to evaluate mechanical tolerance stack-ups, solder integrity, and part-to-part variability.
    • Performs reliability and AI hardware validation testing using temperature chambers, leak testers, pressure testers, and other specialized test tools to simulate operational conditions.
    • Conducts product audits from the manufacturing line, verifying both functional and dimensional conformity to design intent and reliability standards.
    • Manages and maintains Failure Analysis and Reliability Lab infrastructure, ensuring tools, fixtures, and instruments (CMM, X-ray, CSAM, microscopes) are calibrated and operational.
    • Supports documentation, training, and process improvement efforts across Reliability, Quality, and Manufacturing teams.

Skills & Qualifications:

  • 10+ years of experience in a manufacturing, reliability, or lab environment, with 5+ years in testing, troubleshooting, measurement, and repair of high-complexity electronic or electromechanical systems.
    • Proven expertise in failure analysis methods (electrical, mechanical, and material), including fault isolation, signal tracing, and root cause determination.
    • Strong understanding of GD&T principles, dimensional tolerance analysis, and part inspection methods.
    • Hands-on experience with CMM, digital microscopes, laser profilometers, and precision measurement tools.
    • Skilled in cross-sectioning, polishing, and sample preparation for microstructural or solder joint analysis.
    • Proficiency in X-ray, CSAM, and optical microscopy for detecting voids, cracks, and interfacial defects.
    • Experience with AI hardware validation and reliability stress testing under thermal and mechanical load conditions.
    • Advanced soldering and desoldering skills (SMT, BGA, and lead-free).
    • Ability to read and interpret engineering drawings, schematics, and GD&T feature control frames.
    • Excellent analytical, documentation, and problem-solving skills.
    • Self-directed, detail-oriented, and able to collaborate effectively across multiple engineering disciplines.

The base salary range for this position is $180,000 to $250,000 annually. Actual compensation may include bonus and equity, and will be determined based on factors such as experience, skills, and qualifications.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  1. Build a breakthrough AI platform beyond the constraints of the GPU.
  2. Publish and open source their cutting-edge AI research.
  3. Work on one of the fastest AI supercomputers in the world.
  4. Enjoy job stability with startup vitality.
  5. Our simple, non-corporate work culture that respects individual beliefs.

Read our blog: Five Reasons to Join Cerebras in 2025.

Apply today and become part of the forefront of groundbreaking advancements in AI!


Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.


This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.