Ohio State is in the process of revising websites and program materials to accurately reflect compliance with the law. While this work occurs, language referencing protected class status or other activities prohibited by Ohio Senate Bill 1 may still appear in some places. However, all programs and activities are being administered in compliance with federal and state law.

Colloquium: Ryan Cotterell (ETH Zürich) "The Underlying Logic of Language Models"

Fri, September 26, 2025
3:55 pm - 5:30 pm
Oxley Hall 103

Title: The Underlying Logic of Language Models

Abstract: The formal basis of the theory of computation lies in the study of languages, subsets of Σ*, the set of all strings over an alphabet Σ. Models of computation can be taxonomized into the languages they can decide, i.e., which languages a model can determine membership in. For instance, finite-state automata can decide membership in the regular languages. Language models are probabilistic generalizations of formal languages, where the notion of a set is relaxed into one of a probability distribution over Σ*. Recently, language models parameterized using recurrent neural networks, transformers, and state-space models have achieved enormous success in natural language processing. Similar to how theorists have taxonomized models of deterministic computation, researchers have sought to taxonomize the expressivity of language models based on various architectures in terms of the distributions over strings they can represent. This tutorial presents a self-contained overview of the formal methods used to taxonomize the expressivity of language models, which encompass formal languages and automata theory, various forms of formal logic, circuit complexity, and programming languages such as RASP. For example, we illustrate how transformers, under varying assumptions, can be characterized by various fragments of formal logic.