Lead Machine Learning Engineer, Performance and Scalability, Generative AI
Company: Adobe
Location: San Jose
Posted on: February 14, 2025
Job Description:
Lead Machine Learning Engineer, Performance and Scalability,
Generative AILead Machine Learning Engineer, Performance and
Scalability, Generative AIApply locations San Jose Seattle New York
time type Full time posted on Posted 2 Days Ago job requisition id
R153541Our CompanyChanging the world through digital experiences is
what Adobe's all about. We give everyone-from emerging artists to
global brands-everything they need to design and deliver
exceptional digital experiences! We're passionate about empowering
people to create beautiful and powerful images, videos, and apps,
and transform how companies interact with customers across every
screen.We're on a mission to hire the very best and are committed
to creating exceptional employee experiences where everyone is
respected and has access to equal opportunity. We realize that new
ideas can come from everywhere in the organization, and we know the
next big idea could be yours!About the Role
- Adobe Firefly is seeking a Lead Engineer to focus on
Performance and Scalability for our Generative AI systems, powering
flagship products like Photoshop, Illustrator, Express, and
firefly.adobe.com. In this senior role, you will be responsible for
optimizing high-performance, scalable AI pipelines, supporting
millions of users worldwide.Responsibilities
- Architect and optimize ML pipelines to support scalable
inference and model deployment on cloud-based GPU infrastructure
(e.g., AWS P5 instances).
- Develop and maintain high-throughput serving pipelines for
generative AI models, ensuring low-latency, high-performance
execution.
- Enable model serving optimizations by designing systems that
support tensor parallelism, quantization, distillation, and
caching, in collaboration with ML research teams.
- Develop automated monitoring and profiling tools to track
system efficiency, detect performance regressions, and optimize
resource utilization.
- Optimize GPU resource allocation and orchestration across
cloud-based ML workloads.
- Integrate scalable load testing frameworks to validate model
inference performance under high-traffic conditions.
- Collaborate with infrastructure and applied ML teams to
transition models from experimentation to production-ready,
cloud-optimized deployments.
- Establish standard methodologies for scaling and cloud-native
ML architectures, ensuring efficient deployment across multi-region
cloud environments.Qualifications
- 8+ years of proven track record in building high-performance ML
infrastructure and scalable AI systems.
- MS, or PhD in computer science or related field.
- Strong programming skills in Python and C++, with expertise in
building ML pipelines and model deployment infrastructure.
- Experience deploying large-scale ML models in cloud
environments, including AWS GPU instances, Kubernetes, Ray, or
similar.
- Experience with model conversion and optimization frameworks
like ONNX and TensorRT, as well as AOT compilation techniques.
- Experience with cloud-native architectures, autoscaling
strategies, and fault-tolerant machine learning systems.
- Proficiency in GPU orchestration, CUDA, and accelerated
inference techniques.
- Hands-on experience with profiling tools (e.g., Nsight, PyTorch
Profiler, perf) for system performance analysis.
- Ability to work in a fast-paced, startup-like environment with
multi-functional teams.Why Join Us?Firefly is Adobe's
groundbreaking family of AI models, crafted to transform content
creation in our products. Join us to shape the future of creativity
and enhance pipelines for millions of users in Photoshop,
Illustrator, and Premiere Pro. This is a highly strategic and
visible role, where you'll have the chance to create a significant
impact on the future of generative AI at Adobe.#FireflyGenAIOur
compensation reflects the cost of labor across several U.S.
geographic markets, and we pay differently based on those defined
markets. The U.S. pay range for this position is $162,000 --
$301,200 annually. Pay within this range varies by work location
and may also depend on job-related knowledge, skills, and
experience. Your recruiter can share more about the specific salary
range for the job location during the hiring process.At Adobe, for
sales roles starting salaries are expressed as total target
compensation (TTC = base + commission), and short-term incentives
are in the form of sales commission plans. Non-sales roles starting
salaries are expressed as base salary and short-term incentives are
in the form of the Annual Incentive Plan (AIP).In addition, certain
roles may be eligible for long-term incentives in the form of a new
hire equity award.Adobe will consider qualified applicants with
arrest or conviction records for employment in accordance with
state and local laws and "fair chance" ordinances.Adobe is proud to
be an Equal Employment Opportunity and affirmative action employer.
We do not discriminate based on gender, race or color, ethnicity or
national origin, age, disability, religion, sexual orientation,
gender identity or expression, veteran status, or any other
applicable characteristics protected by law. Learn more.Adobe aims
to make Adobe.com accessible to any and all users. If you have a
disability or special need that requires accommodation to navigate
our website or complete the application process, email
accommodations@adobe.com or call (408) 536-3015.
#J-18808-Ljbffr
Keywords: Adobe, San Jose , Lead Machine Learning Engineer, Performance and Scalability, Generative AI, Engineering , San Jose, California
Didn't find what you're looking for? Search again!
Loading more jobs...