PhD Candidate in Computer Science
I am a PhD candidate at the Center for Automated Reasoning at Stanford University, under the guidance of Prof. Clark Barrett. My research focuses on improving the prosocial tendencies of language models (LMs) through a series of unique developmental approaches. These include introducing communication channels during autoregressive training (akin to a kindergarten setting), allowing a parent LM to guide a child LM by curating its training data, and enhancing human feedback on LMs via a combined embedding of EEG data and speech.