Lensing Machines–Representing Perspective in Machine Learning
Committee:
Dr. Rosalind Picard (Chair)
Professor or Media Arts and Sciences
MIT Media Lab
Dr. Henry Lieberman
Research Scientist
Computer Science and Artificial Intelligence Laboratory
MIT
Dr. Eric Horvitz
Technical Fellow and Managing Director
Microsoft Research
Dr. Robert Selman
Roy Edward Larsen Professor of Human Development
Professor of Psychology in Psychiatry
Harvard University
Dr. David Blei
Professor of Statistics & Computer Science
Columbia University
Dr. Matthew Nock
Professor of Psychology
Harvard University
Generative models are venerated as full probabilistic models that randomly generate observable data given a set of latent variables that cannot be directly observed. They can be used to simulate values for variables in the model, allowing analysis by synthesis or model criticism, towards an iterative cycle of model specification, estimation, and critique. However, many datasets represent a combination of several viewpoints–different ways of looking at the same data that leads to various generalizations. For example, a corpus that has data generated by multiple people may be mixtures of several perspectives and can be viewed with different opinions by others. It isn't always possible to represent the viewpoints by clean separation, in advance, of examples representing each perspective and train a separate model for each point of view. In this thesis, we introduce lensing, a mixed-initiative technique to (1) extract lenses or mappings between machine-learned representations and perspectives of human experts, and (2) generate lensed models that afford multiple perspectives of the same dataset. We explore lensing of latent variable model in their configuration, parameter and evidential spaces. We apply lensing to three health applications, namely imbuing the perspectives of experts into latent variable models that analyze adolescent distress, crisis counseling, and self-harm.