Senior Software Engineer, Metropolis AI NIM
NVIDIA
We are seeking a senior software engineer for Metropolis AI NIM to develop and deliver the state-of-the-art AI models to the world in the form of NVIDIA Inference Microservices (NIM). You will collaborate across the organization to bring the latest flagship models (both CV and Vision-Language Models) from our community and partners—such as VILA and Florence-2—to life as optimized NVIDIA Inference Microservices (NIM). This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. Join our team of world-class software engineers and partners to deliver the most advanced models with lightning-fast inference.
In this role, you will develop hardware-accelerated solutions that enable rapid creation and deployment using the latest deep learning, artificial intelligence, and computer vision technologies. This position offers you the opportunity to collaborate within a worldwide matrixed software team focusing on core technologies for Multi-Modal and Streaming AI applications including CV and Vision-Language Model (VLM) inference pipelines and Omniverse-based simulation technologies among various exciting Multi-Modal AI technologies and have broad impact within our highly dynamic and technology-focused company.
What you’ll be doing:
Collaborate closely with our partners and the open-source community to deliver their flagship models as highly optimized NVIDIA Inference Microservices (NIM).
Research and develop innovative deep learning methodologies to accurately evaluate new model families across diverse domains.
Analyze, influence, and enhance AI/DL libraries, frameworks, and APIs, ensuring consistency with the best engineering practices.
You will design and develop accelerated streaming AI pipelines using CV and VLM models and lead technical design discussions
Profile and optimize the AI pipelines to ensure scalability, reliability, and efficiency.
Take on complex system-level optimization and resource utilization challenges.
Participate in a product development lifecycle that values high standards for clear requirements, software quality and performance.
Write code in Python and C++
What we need to see:
BS, MS, or PhD in Computer Science, AI, Applied Math, or a related field, or equivalent experience, with 5+ years of industry experience.
3+ years of hands-on experience in AI for computer vision (CV) and large language models (LLMs).
Complex system design and development using Python, C++ 14/17/20, and object-oriented programming.
Strong problem-solving, debugging, performance analysis, test design, and documentation skills.
Solid mathematical foundations and expertise in AI/DL algorithms.
Excellent written and verbal communication skills, with the ability to work both independently and collaboratively in a fast-paced environment.
Passion for expanding your technical knowledge into new areas.
Ability to excel in a multinational, multi-time-zone environment: excellent communication skill (verbal & written), collaborates well, represents our core values.
Ways to stand out from the crowd:
Demonstrated implementation of computer vision / machine learning applications, microservices, container and cloud-native application development.
Experience with cloud native architecture involving dockers, k8s, microservices.
Hands-on experience with inference and deployment environments like TensorRT, ONNX, Triton, or vLLM.
Experience working with High Availability environments
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and passionate people in the world working for us. Are you a creative problem solver with a passion for solving real-world problems with AI? If so, we want to hear from you.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.