Speaker: Tom Goldstein, Volpi-Cupal Endowed Professor of Computer Science, Director of the Maryland Center for Machine Learning, University of Maryland
Registration for all CUID holders is preferred. If you do not have an active CUID, registration is required and is due at 12:00 PM the day prior to the seminar. Unfortunately, we cannot guarantee entrance to Columbia’s Morningside campus if you register following 12:00 PM the day prior to the seminar. Thank you for understanding!
REGISTER
Title: Alternative Test-Time Compute Scaling Strategies for Generative Models
Abstract: Recent trends in LLM development have focused on “Reasoning” models that expend large amounts of compute to improve their performance at inference time by producing many tokens. In this talk, we consider alternatives to the many-token paradigm. We will focus on models that perform efficient latent reasoning without verbalizing their outputs as tokens. We will also consider new generation strategies that bypass arduous and expensive token generating processes altogether.