Machine Learning Stack Integration Engineer - 162062

Location: Markham, Ontario, CA

Company: Advanced Micro Devices

Apply now

Apply for Job

What you do at AMD changes everything 

At AMD, we push the boundaries of what is possible.  We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies – building blocks for gaming, immersive platforms, and the data center. 

Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the “extra mile” to achieve unthinkable results.  It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world.   If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.



Machine Learning Stack Integration Engineer



 You will be working with Solution Validation & Debug within the Machine Learning Software Engineering group. As a team member you will be working closely on the debug and triage of Machine learning and High-Performance Computing related issues and add value to the Solution Validation of ROCm Stack.



Ideal candidate will bring in broad experience on dealing with complex software level issues related to Machine Learning and High-Performance Computing.



  • Debug Machine Learning/ High Performance Computing related issues on Radeon Open Compute Stack (ROCm)
  • Develop test contents for complex Machine learning algorithms on distributed nodes
  • Port High Performance computing application on ROCm
  • Reproduce field defects and develop appropriate tests to prevent future issues.
  • Design, develop and deploy testing tools and automation libraries necessary to perform testing.
  • Lead the adoption of tooling and industry best practices by means of advocacy and outreach to help our development communities level up.
  • Other duties as assigned



  • Languages: Python, C, C++, Linux Shell scripting.
  • Frameworks/Libraries: TensorFlow, PyTorch, ONNXRT
  • Tools: Prior experience with Linux, Docker, LLVM compilers
  • Desired Skills: Understanding of High-Performance Computing application, Machine learning and GPU Programming, MPI Parallel Programming



  • Bachelor's Degree or higher in Computer Science or related quantitative field.



Markham, Ontario, Canada



Requisition Number: 162062 
Country: Canada Province: Ontario City: Markham 
Job Function:Design


AMD is an inclusive employer dedicated to building a diverse workforce. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective provincial human rights codes throughout all stages of the recruitment and selection process. Any applicant who requires accommodation should contact

AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services.


Apply now

Apply for Job

Share this Job