California Jobs
Senior AI Software Engineer, GenAI Framework
This job is now closed
Job Description
- Req#: 32553503652
- Design and develop the GenAI open-source NeMo Framework and Megatron Core.
- Address large-scale AI training and inference challenges across the full model lifecycle, from data curation and preprocessing to training, tuning, and deployment.
- Work at the intersection of deep learning applications, libraries, frameworks, and the entire software stack.
- Perform performance tuning and optimization of deep learning frameworks and software components.
- Research, prototype, and develop scalable AI tools and pipelines.
- MS, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field with 5+ years of industry experience.
- Experience with AI frameworks (e.g., PyTorch, JAX) and inference/deployment environments (e.g., TRT, ONNX, Triton).
- Proficiency in Python programming, software design, debugging, performance analysis, testing, and documentation.
- A proven track record of effective collaboration across multiple engineering initiatives and contributions to AI libraries with innovative improvements.
- Strong understanding of deep learning fundamentals and their practical applications.
- Expertise in large-scale AI training, with a deep understanding of core compute system concepts such as latency/throughput bottlenecks, pipelining, and multiprocessing, along with demonstrated excellence in performance analysis and tuning.
- Experience with Generative AI techniques applied to LLM and MM learning (Text, Image, Video, Speech).
- Knowledge of GPU/CPU architecture and related numerical software.
- Experience with cloud computing (e.g., AI training and inference pipelines on AWS, Azure, GCP, OCI).
- Contributions to open source deep learning frameworks.
Senior AI Software Engineer, GenAI Framework (Finance)
We are seeking AI Software Engineers for NeMo! NVIDIA NeMo is an open-source, scalable, and cloud-native framework designed for researchers and developers working on Large Language Models (LLMs), Multimodal (MM), and Speech AI. NeMo offers end-to-end model training, including data curation, alignment, customization, evaluation, deployment, and tooling to enhance performance and user experience.
In this critical role, you will enhance NeMo Framework's capabilities by designing and implementing new features and optimizations, defining robust APIs, analyzing and tuning performance, and expanding our toolkits and libraries. You will collaborate with internal teams, users, and the open-source community to develop highly optimized solutions.
What you'll be doing:
What we need to see:
Ways to stand out from the crowd:
NVIDIA is recognized as one of the most desirable employers in the technology industry, with a team of forward-thinking and dedicated professionals. If you are creative and autonomous, we want to hear from you!
The base salary range is $148,000 - $287,500, determined by location, experience, and comparable roles.
You will also be eligible for equity and benefits. NVIDIA accepts applications continuously.
NVIDIA is committed to diversity and equal opportunity, valuing inclusivity in hiring and promotion practices without discrimination based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability, or any other legally protected characteristic.
#J-18808-LjbffrAbout the company
Notice
Talentify is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
Talentify provides reasonable accommodations to qualified applicants with disabilities, including disabled veterans. Request assistance at accessibility@talentify.io or 407-000-0000.
Federal law requires every new hire to complete Form I-9 and present proof of identity and U.S. work eligibility.
An Automated Employment Decision Tool (AEDT) will score your job-related skills and responses. Bias-audit & data-use details: www.talentify.io/bias-audit-report. NYC applicants may request an alternative process or accommodation at aedt@talentify.io or 407-000-0000.