Sr. System Engineer

Company: Support Revolution
Location: San Jose
Posted on: February 17, 2025

Job Description:

Select how often (in days) to receive an alert: Create AlertLocation: San Jose, California, United StatesAbout Supermicro:Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.
Job Summary:As a Sr. System Engineer, you'll be the go-to person to roll out and maintain business critical applications and services for Supermicro. You are also responsible for resolving escalated service issues, coaching other engineers to resolutions, engineering and implementing complex projects. You will be a person who is independent with leadership to drive the technical development and with excellent communication skills.Essential Duties and Responsibilities:Includes the following essential duties and responsibilities (other duties may also be assigned):
--- Execute comprehensive system-level rack tests on latest NVidia and AMD GPUs, ARM-based, Intel Xeon, and AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and reliability testing, leveraging proprietary in-house tools.
--- Establish expertise in HPC/AI applications and benchmarks, delivering impactful training sessions to customers and partners, while addressing complex customer support issues, demonstrating innovative problem-solving skills and building robust processes and procedures for HPC/AI solutions.
--- Conduct proof of concept design and testing, providing optimized benchmarks for HPC/AI applications in a timely manner. Fine-tune BIOS settings, optimize OS/network configurations, and develop diverse simulation configurations to enhance efficiency across various workloads.
--- Deliver on-site deployment services, ensuring customer acceptance verification and providing post-level 1&2 support. Create and maintain technical documentation, including technical notes, blogs, and diagrams, to facilitate knowledge dissemination.
--- Identify and document hardware and software quality issues and collaborate with Product Management and other Engineering teams to integrate customer feedback into future product enhancements.
--- Proactively engage in HPC roadmap development, planning software and hardware upgrades to sustain exceptional HPC infrastructure performance.
--- Document and analyze test plans, reports, logs, and actively contribute to the development of test utilities and automation scripts to streamline testing processes.Qualifications:--- BS/MS in Electrical Engineering, Computer Engineering or Computer Science
--- 8+ years of work-related experience in Deep Learning and Machine Learning
--- 8+ years of Linux/networking debugging/testing or relevant experience preferred
--- Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, ONNX, etc.
--- Experience with DevOps or in cloud environments, including but not limited to Docker/Containers and Kubernetes
--- Hands-on experience with workload/scheduler Managers (Slurm) for rack/cluster
--- Familiar with MLPerf Training/Inference benchmark, LLM, HPL-AI or RCCL/NCCL
--- Programming experience with windows and Linux shell scripting
--- Strong sense of teamwork and good team player, strong communication skills
--- Familiar with Intel/AMD/NVIDIA development tool kits such as CUDA, oneAPI, ROCm is a plus
--- Experience with server/network hardware debugging and troubleshooting is a plus
--- CCNA, OpenStack, OpenShift, Azure or AWS is a plusSalary Range$140,000 - $158,000The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.EEO StatementSupermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
#J-18808-Ljbffr

Keywords: Support Revolution, San Jose , Sr. System Engineer, Other , San Jose, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Jose recruiters find you. Post your resume for free!

Get San Jose Other jobs via email.

View more San Jose Other jobs

Other Other Jobs

staff - Registered Nurse (RN) - Director Hospice - $72K-90K per year
Description: Gentiva is seeking a Registered Nurse RN Director Hospice for a nursing job in San Jose, California.Job Description Requirements ul li Specialty: Hospice li Discipline: RN li Duration: Ongoing (more...)
Company: Gentiva
Location: San Jose
Posted on: 02/13/2025

Local Contract Occupational Therapist - $54-63 per hour
Description: Host Healthcare is seeking a local contract Occupational Therapist for a local contract job in San Jose, California.Job Description Requirements ul li Specialty: Occupational Therapist li Discipline: (more...)
Company: Host Healthcare
Location: San Jose
Posted on: 02/13/2025

Travel Physical Therapist - $2,505 per week
Description: Centra Healthcare Allied is seeking a travel Physical Therapist for a travel job in San Jose, California.Job Description Requirements ul li Specialty: Physical Therapist li Discipline: Therapy (more...)
Company: Centra Healthcare Allied
Location: San Jose
Posted on: 02/13/2025

Salary in San Jose, California Area | More details for San Jose, California Jobs |Salary

Travel Medical Technologist - $2,255 per week
Description: Host Healthcare is seeking a travel Medical Technologist for a travel job in San Jose, California.Job Description Requirements ul li Specialty: Medical Technologist li Discipline: Allied Health (more...)
Company: Host Healthcare
Location: San Jose
Posted on: 02/13/2025

Travel Surgical Technologist - $2,176 per week
Description: Supplemental Health Care is seeking a travel Surgical Technologist for a travel job in San Jose, California.Job Description Requirements ul li Specialty: Surgical Technologist li Discipline: Allied (more...)
Company: Supplemental Health Care
Location: San Jose
Posted on: 02/13/2025

Regional Sales Executive - Western Region
Description: SUMMARY OF POSITIONThe Regional Sales Executives are individual contributor field salespeople who are responsible for delivering new business bookings primarily from new customers in their territories. (more...)
Company: ePac LLC
Location: San Jose
Posted on: 02/13/2025

Travel Occupational Therapist - $2,092 per week
Description: Advantis Medical is seeking a travel Occupational Therapist for a travel job in San Jose, California.Job Description Requirements ul li Specialty: Occupational Therapist li Discipline: Therapy (more...)
Company: Advantis Medical
Location: San Jose
Posted on: 02/13/2025

Travel Skilled Nursing Facility Speech Language Pathologist - $2,613 per week
Description: Preferred Healthcare Staffing is seeking a travel Skilled Nursing Facility Speech Language Pathologist for a travel job in San Jose, California.Job Description Requirements ul li Specialty: Speech (more...)
Company: Preferred Healthcare Staffing
Location: San Jose
Posted on: 02/13/2025

Travel Nurse RN - Wound Care - $2,597 per week
Description: Medical Solutions is seeking a travel nurse RN Wound Care for a travel nursing job in San Jose, California.Job Description Requirements ul li Specialty: Wound Care li Discipline: RN li Duration: (more...)
Company: Medical Solutions
Location: San Jose
Posted on: 02/13/2025

Research Research Scientist Co-Op; AD for Accelerated Discovery San Jose, US
Description: We are seeking candidates with a proven interest in implementing innovative software solutions and applications using AI and machine learning. You will be engaged in solving problems in biochemistry, (more...)
Company: Avature
Location: San Jose
Posted on: 02/13/2025

Loading more jobs...

Sr. System Engineer

Didn't find what you're looking for? Search again!

Other Other Jobs

Log In or Create An Account