GPU Network Software Engineer - 112183

Location: Austin, Texas, US

Company: Advanced Micro Devices



What you do at AMD changes everything 
 

At AMD, we push the boundaries of what is possible.  We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies – building blocks for gaming, immersive platforms, and the data center. 
 

Developing great technology takes more than talent: it takes amazing people who value collaboration and respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to look at the available opportunities and join our team.
 

GPU Network Software Engineer

The Role:

As a GPU network software engineer, you will design, implement, and test features in communication libraries, middleware, and frameworks to provide outstanding support for GPU applications running high-performance computing and machine learning workloads at scale. You will work with technical specialists within AMD, our partners, and the open-source community to implement these features as part of AMD’s Radeon Open Ecosystem (ROCm).

The Person:

You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and teamwork are paramount. You possess excellent written and verbal communication skills, and strong attention to detail. You are results-oriented and accustomed to tight deadlines and changing priorities. Most importantly, you are constantly thinking of ways to improve performance of multi-node GPU applications.

Key Responsibilities:

  • Design, implement, and test features to improve GPU support in communication libraries, middleware, and frameworks
  • Benchmark, profile, and optimize code to improve performance of multi-node GPU applications
  • Deliver high-quality code and documentation following standard methodologies for open-source software development
  • Work with key technical specialists across AMD and with our partners and customers to improve ROCm applications, libraries, and tools

Preferred Experience:

  • Strong background developing system software in C/C++
  • Experience with at least one of the following:
      • Implementing communication middleware like MPI/SHMEM
      • Implementing lower-level communication frameworks like UCX and libfabric, or development using RDMA APIs
      • Development and optimization of communication collective algorithms (e.g., All-reduce)
  • Familiarity with GPU programming in HIP or CUDA
  • In-depth knowledge of standard methodologies in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
  • Proven track record contributing to open-source projects

Academic Credentials:

  • B.Sc. or B.Eng. degree in Computer Science, Electrical Engineering, or equivalent
  • Advanced degrees, such as M.Sc., M.Eng., or Ph.D., are preferred

Location:

Austin, TX

Santa Clara, CA

 


 



Requisition Number: 112183 
Country: United States
State: Texas
City: Austin
Job Function: Design
  

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, or military or veteran status.
