Permanent
Posted on 07 November 25 by Junior Trujillo
Powered by Tracker
We are seeking a highly skilled Software/ML Engineer to join the Wearable System Architect team and work on power and performance optimization of on device ML.
NO C2C
The area of focus and main problems to address are:
RESPONSIBILITIES
Collect power and performance measurement results and traces of ML benchmarks (e.g., MLPerf-Tiny).
Execute ML benchmarks under different on-device configurations, for example:
Execute ML benchmark on different on device ML accelerators.
Execute ML benchmark on slow/external memory and fast/internal memory
For this, the candidate needs to able to compile an existing ML model against different ML accelerators using corresponding ML compilers. The candidate also needs to be familiar with RTOS and Android development and run-time environments.
This will eventually lead to the workload partition definition, i.e., which type of ML workload will be more suitable on which ML accelerators.
For example:
Increase # of MACs significantly while keeping memory throughput relatively steady or vice-versa.
For this, the candidate needs to able to modify an existing ML model by changing model parameters (e.g., increasing dimension of CNN layer) or model architecture (e.g., add a Fully Connected layer).
This will eventually lead to PnP guideline that can help project PnP metrics based on values of key ML model parameters.
QUALIFICATIONS
MINIMUM QUALIFICATIONS