H-1B Job Board

Finding companies that sponsor visas is a lot of work. We've made your life easier by compiling top companies and startups that hire foreign nationals.

Search

jobs

31,781

My job alerts

Senior Software Engineer- AI Hardware

Bloomberg

This job is no longer accepting applications

See open jobs at Bloomberg.See open jobs similar to "Senior Software Engineer- AI Hardware" Ellis H-1B.

Software Engineering, Other Engineering, Data Science

New York, NY, USA

Posted on Jan 12, 2025

The Role:

We are seeking an engineer to join our hardware management team. This team is responsible for the provisioning, monitoring, and support for thousands of servers supporting dozens of teams within Bloomberg, including the entire AI stack!

The ideal candidate will have experience in designing, implementing, and maintaining system software that enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems. This role will also be responsible for overseeing the ongoing monitoring, support, and maintenance of our HPC/AI clusters, ensuring peak performance and reliability.

We'll trust you to:

Design, build, and maintain highly reliable, scalable, and efficient infrastructure platforms that support our engineering teams and business needs.
Participate in system design discussions and contribute to architectural decisions
Ensure code quality through standard methodologies, code reviews, and alignment to clean code principles
Be able to produce clear and consumable documentation for a wide audience
Communicate effectively across diverse teams
Be willing to participate in on-call rotations as arranged
Be a self starter, manage priorities, and work independently
Stay up-to-date with the latest infrastructure technologies, and industry standard processes, and evaluate their potential impact on existing and future solutions

Who you are?

Hold yourself to high standards
Exude our ambitious, collaborative, and empathetic values
A self-starter mentality with an eagerness to solve previously unsolved problems
Excellent collaboration skills and are open to giving and receiving critical feedback across teams
Scalability and reliability are hardwired into your DNA
You have publicly available writing samples, blog posts, demos, or recordings of presentations on technical topics

What's in it for you?

A unique opportunity to be part of a rapidly growing team in one of the most exciting engineering teams in Bloomberg.
An inclusive and supportive work culture that fosters learning and growth.
Continuous professional development, product training, and career pathing
Intra-departmental mentor and buddy program for in-house networking
An inclusive company culture, ability to join our Community Guilds

You'll need to have:

4+ years of proficiency in Kubernetes environments (deployments, storage, services, jobs, ingress, egress, etc)
BA, BS, MS, PHD, in Computer Science, Electrical Engineering or related field
Hands-on management of GPU-based systems, including kernel and driver management, and developing software tooling to automate provisioning and maintenance of these systems.
Design, implemented, and maintained system software that enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems
Oversee the ongoing monitoring, support, and maintenance of our HPC/AI clusters, ensuring peak performance and reliability
Drive system upgrades, customization, and seamless integration with software developers, network operations, and data center teams
Manage and maintain a diverse range of computer systems and application software, ensuring they meet the highest standards of functionality and efficiency
Develop and maintain expertise in low-latency/high-bandwidth, interconnected infrastructure (including InfiniBand, Ethernet, RDMA/RoCE, and others)
Monitor and evaluate the efficiency and effectiveness of infrastructure service delivery methods and procedures
Partner with internal teams to develop prioritization, metrics, and processes around capacity planning and infrastructure availability. Periodically present capacity planning and performance reports to senior leaders during presentations and meetings
Benchmark, analyze, and make recommendations for improvement of IT infrastructure

We'd love to see:

Expertise with Kubernetes design patterns (operators, helm charts, kustomize, etc)
Experience with data center planning, including rack elevations, cabling plan, and cables/transceivers
Experience with data center operations and management

This job is no longer accepting applications

See open jobs at Bloomberg.See open jobs similar to "Senior Software Engineer- AI Hardware" Ellis H-1B.

See more open positions at Bloomberg

Privacy policy Cookie policy

See what Ellis can do for you

Get the peace of mind that comes from partnering with our experienced immigration lawyers

About Us

Pricing

Get Started

LinkedIn Facebook Twitter

Ellis Technologies, Inc. is not a law firm, but is affiliated with Ellis Legal, P.C. a law firm, authorized by the California Supreme Court. Nothing on this website, including guides and resources, is to be considered legal advice. For legal advice specific to your case, please consult with a licensed attorney. This website is for informational purposes only and does not constitute legal advice. The information provided on this website should not be used as a substitute for consultation with a licensed legal professional in your jurisdiction. The use of this website does not create an attorney-client relationship between the user and Ellis Legal, P.C. Ellis Legal, P.C. is not responsible for the content of external websites linked from this website. Past results do not guarantee similar outcomes in future cases.