Deep Learning Library GPU Software Development Engineer

Location: Taipei, Taiwan, TW

Company: Advanced Micro Devices

What you do at AMD changes everything 

At AMD, we push the boundaries of what is possible.  We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies – building blocks for gaming, immersive platforms, and the data center. 

Developing great technology takes more than talent: it takes amazing people who understand collaboration and respect, and who will go the “extra mile” to achieve unthinkable results. It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world. If you have this type of passion, we invite you to take a look at the opportunities available and join our team.

The Role:

AMD is looking for an individual to join a hardworking team developing Deep Learning and High-Performance Computing GPU kernels on the AMD Radeon Open Compute (ROCm) platform and MIOpen, AMD's Deep Learning primitives library, which provides highly optimized implementations of different operators.


The Person:

The successful candidate will be an experienced GPU-compute programmer with a background in the LLVM compiler and an eye toward hardware-aware performance optimizations.


Key Responsibilities:

The ideal candidate will be responsible for writing high-performance GPU kernels for AMD’s Machine Learning and Deep Learning Library, MIOpen.

  • Port and optimize algorithms for new GPU hardware.
  • Perform code reviews, build unit tests, author detailed documentation related to their work, and work with on-site and offshore teams to deliver software solutions on schedule.
  • Play a key role in all phases of software development, including system requirements analysis and the coordination of feature design and development across functional and organizational boundaries.


Preferred Experience:

  • Strong programming skills in C/C++; experience with CUDA programming and CUTLASS preferred.
  • Experience with the LLVM compiler and compiler optimization techniques for GPU computing is preferred.
  • Experience with or knowledge of BLAS operators and GEMM optimization.
  • Knowledge of computer architecture and GPU architecture.
  • Experience using version control software such as Git.
  • Strong understanding of Linux internals, servers, and debugging.
  • Basic knowledge of the software development lifecycle and software practices, including debugging, testing, revision control, documentation, and bug tracking.
  • Good teamwork and interpersonal skills required.
  • Ability to work independently and within complementary teams.
  • Demonstrated flexibility, strong motivation, and a proven track record of meeting results-oriented deadlines.
  • Knowledge of deep neural network machine learning technologies and modern machine learning programming frameworks.
  • Experience working with and developing virtualization containers and package managers for code deployment.


Academic Credentials:

Bachelor’s or Master’s in Computer Science, Computer Engineering, or related subjects, or equivalent experience



Requisition Number: 168928 
Job Function: Design  


AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status.
