Machine Learning using PCIe networks

Compare, benchmark and optimise machine learning frameworks.

Dolphin PXH830 Adapter card

Machine learning is a popular topic in data science, and there exist many frameworks that can be used for training and inference of neural networks. However, many of these frameworks are still optimised only for a single machine and only for the CPU architecture. In this thesis we want to survey the available machine learning frameworks and find out which frameworks support which features. After the survey is complete, we want to try to modify one or more of the frameworks to run on Dolphin's PCIe networks, either with socket support or by using Dolphin's low-level SISCI API.

Goal
Analyse several popular machine learning frameworks (such as TensorFlow, Caffe, etc.) and find out which features, such as GPU acceleration and multi-machine support, the different frameworks support. Select one of the open-source frameworks and try to modify its communication system to enable multi-machine support in Dolphin's PCIe networks.

Learning outcome
In-depth knowledge and understanding of optimising a machine learning framework.
 
Qualifications
Good understanding of low-level computer systems. The student should have completed INF3151 or equivalent. IN5050 is recommended as part of the degree.

Published 16 Sep. 2019 15:14 - Last modified 16 Sep. 2019 15:15

Supervisor(s)

Scope (credits)

60