Sunday 15:00–15:45 in Auditorium (LYST DEEP LEARNING)

Modelling a text corpus using Deep Boltzmann Machines in python

Ricardo Pio Monti

Audience level:
Experienced

Description

Deep Boltzmann machines (DBMs) are exciting for a variety of reasons, principal among which is the fact that they are able to learn probabilistic representations of data in an entirely unsupervised manner. In this talk I will discuss the process of fitting and interpreting DBMs using a topic modelling example as motivation.

Abstract

Deep Boltzmann machines (DBMs) are exciting for a variety of reasons, principal among which is the fact that they are able to learn probabilistic representations of data in an entirely unsupervised manner. This allows DBMs to leverage large quantities of unlabelled data which are often available. The resulting representations can then be fine-tuned using limited labelled data or studied to obtain a more comprehensive understanding of the data at hand.

This talk will begin by providing a high level description of DBMs and the training algorithms involved in learning such models. A topic modelling example will be used as a motivating example to discuss practical aspects of fitting DBMs and potential pitfalls. The entire code for this project is written in python using only standard libraries (e.g., numpy).