Jump to content

Senior Cloud Backend Engineer, NeMo LLM Service - US, CA, Santa Clara

11 days ago


 Share

Job Opportunity Details

Type

Full Time

Salary

Not Telling

Work from home

No

Weekly Working Hours

Not Telling

Positions

Not Telling

Working Location

US, CA, Santa Clara, United States, United States   [ View map ]

We are looking for a Senior Cloud Backend Engineer to work on a groundbreaking machine learning project to make customized, world-class Large Language Models available and easy to use. Our team is building next-generation services and interfaces for training and deploying AI at scale. We are dedicated to developing NLP and multi-modal technologies that tackle real problems. We contribute to all steps of the machine learning lifecycle: from conceptualization, to applied research, engineering for optimized inference, and deployment.

What you'll be doing

  • Creating customer-hosted microservices for training generative AI models

  • Development of distributed cloud applications, microservices and SaaS platform able to scale up to huge models

  • Implementing core infrastructure for cloud-native AI training and inference

  • Creating flexible systems that can integrate across the ML ecosystem

  • Relentlessly pursue speed of light performance under high load

What we need to see

  • BS, Masters, or equivalent experience in computer science, computer architecture, or related field

  • 5+ years of experience

  • Experience with the full software development lifecycle, particularly deploying and monitoring services in Cloud environments

  • Understanding of performance, security and reliability in complex distributed infrastructure

  • Excellent Golang, Rust, or C/C++ programming and software design skills, including debugging, performance and service health analysis, and test design.

  • Ability to work independently, define project goals and scope, interact directly with open source community, and manage your own development effort

Ways to stand out from the crowd

  • Experience deploying machine learning or statistical models into production environments, especially experience with frameworks such as PyTorch, Tensorflow, ONNX Runtime, and TensorRT

  • Knowledge of or experience with developing  production NLP or generative AI systems

  • Experience working with high availability environments

  • Kubernetes cluster administration experience

  • Experience providing software solutions for multiple customer environments with minimal engineering overhead

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

More Information

Application Details

  • Organization Details
    Nvidia
 Share


User Feedback

Recommended Comments

There are no comments to display.

Join the conversation

You are posting as a guest. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Add a comment...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Loading...
×
×
  • Create New...