Workload Automation Engineer-HPC/Machine Learning (Santa Clara or Austin)- 132082

Location: Santa Clara, California, US

Company: Advanced Micro Devices

Apply now

Apply for Job


What you do at AMD changes everything 
 

At AMD, we push the boundaries of what is possible.  We believe in changing the world for the better by driving innovation in high-performance computing, graphics, and visualization technologies – building blocks for gaming, immersive platforms, and the data center. 
 

Developing great technology takes more than talent: it takes amazing people who understand collaboration, respect, and who will go the “extra mile” to achieve unthinkable results.  It takes people who have the passion and desire to disrupt the status quo, push boundaries, deliver innovation, and change the world.   If you have this type of passion, we invite you to take a look at the opportunities available to come join our team.
 

Automated Machine Learning and HPC Framework / Web App Developer

 

Our team ensures AMD-based systems and GPUs are operating at their best before they are deployed to solve the world’s most challenging problems. We are seeking software developers to design, build and maintain a world-class workload automation system for running large scale, GPU-enabled data center applications. In this role, you will create a system to provision, run, monitor and analyze workloads used in supercomputing, academia and the largest data centers on the planet. You will also design and develop of web applications that enable users to sift through mountains of ML / HPC performance data and create insightful, graphically rich reports.

 

Key Responsibilities

  • Build and maintain a workload automation system based on Ansible using an infrastructure-as-code model
  • Work with ML/HPC experts to help them automate their workloads so they can self-serve
  • Create database schemas and interfaces that enable the automation system to store workload performance results
  • Develop a custom web application that allows users to search, retrieve, display and report performance results
  • Build a reporting system that automates the process of creating informative tables and graphs for engineers, business units and their management

 

Preferred Experience

  • Experience with workload automation and management systems 
  • Extensive Python and shell script experience
  • Experience with web application development frameworks such as Django
  • Experience with data visualization tools such as Tableau
  • Experience with SQL and NoSQL databases
  • Previous use of GitHub

 

Ways to Stand Out

  • Experience with Ansible
  • Experience with Django

 

Location:  The team is based in Santa Clara, CA, but we are open to hiring the following AMD sites (Austin, TX, Bellevue, WA, Orlando, FL, San Diego, CA, and Boxborough, MA).

 

#LI-RL1


Requisition Number: 132082 
Country: United States State: California City: Santa Clara 
Job Function: Design
  

 

AMD does not accept unsolicited resumes from headhunters, recruitment agencies or fee based recruitment services. AMD and its subsidiaries are equal opportunity employers. We consider candidates regardless of age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status. Please click here for more information.

Apply now

Apply for Job

Share this Job