Disputation: Andrey Kutuzov

Doctoral candidate Andrey Kutuzov at the Department of Informatics, Faculty of Mathematics and Natural Sciences, is defending the thesis "Distributional word embeddings in modeling diachronic semantic change" for the degree of Philosophiae Doctor.


The PhD defence and trial lecture are fully digital and streamed via Zoom. The host of the session handles technical matters, while the chair of the defence moderates the disputation.

Ex auditorio questions: the chair of the defence will invite the audience to ask ex auditorio questions. To request the floor, click 'Participants -> Raise hand'.

Trial lecture

"Word Sense Induction"

Main research findings

This thesis studies how distributional vector representations capture changes in lexical meaning. In natural human languages, words change what they mean over time. These diachronic semantic shifts can be detected automatically. We do this by analyzing changes in the behavior of large-scale neural language models trained on texts created in different time periods.

Distributional semantic models based on dense vector representations (word embeddings) efficiently capture many aspects of word meaning. As such, they are central to natural language processing systems that aim to understand and generate human language. However, they are mostly applied synchronically, that is, to language data without taking temporal drift into account.

We move on to the diachronic realm and employ word embeddings to achieve unsupervised data-driven detection of temporal semantic change. We train diachronic models in different ways, and devise methods to solve the task of detecting how words change their meaning and usage over time. The findings in this thesis are important both for general linguistics and for practical applications like web search and digital humanities.
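As a rough illustration of the idea, the sketch below ranks words by how far their vectors move between two time periods. It is not taken from the thesis: it assumes one commonly used approach (orthogonal Procrustes alignment of the two embedding spaces, followed by cosine distance), and it uses synthetic vectors in place of embeddings trained on real corpora. All function and variable names are hypothetical.

```python
import numpy as np


def align_procrustes(old, new):
    """Rotate `old` into the space of `new` with the best orthogonal map."""
    u, _, vt = np.linalg.svd(old.T @ new)
    return old @ (u @ vt)


def cosine_distances(a, b):
    """Row-wise cosine distance between two matrices of equal shape."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return 1.0 - np.sum(a * b, axis=1)


def rank_shifts(vocab, old, new):
    """Return (word, distance) pairs, largest estimated shift first."""
    aligned = align_procrustes(old, new)
    dist = cosine_distances(aligned, new)
    order = np.argsort(-dist)
    return [(vocab[i], float(dist[i])) for i in order]


# Synthetic demo (made-up data): the "new" space is a rotated copy of the
# "old" one, except for a single word whose vector is flipped to simulate
# a drastic change in usage.
rng = np.random.default_rng(0)
vocab = [f"w{i}" for i in range(200)]
old = rng.normal(size=(200, 8))
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))
new = old @ rotation
new[3] = -new[3]

ranking = rank_shifts(vocab, old, new)
print(ranking[0])
```

In this toy setup the flipped word "w3" comes out on top, while all other words stay at a distance of practically zero. With real data, the two matrices would hold vectors for a shared vocabulary taken from models trained separately on each period.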

Department contact: Pernille Adine Nordby

Published Oct. 30, 2020 13:20 - Last modified June 29, 2021 10:30