Publications

Foundational research advancing the state of the art in AI.

Capital One’s Applied AI Research

publications

Refusal tokens: a simple way to calibrate refusals

Refusal tokens enable controlling a single model's refusal rates without the need of any further fine-tuning. (COLM)

publications

Imagine, verify, execute: memory-guided agentic exploration

An agentic exploration framework inspired by human curiosity. (CoRL)

publications

Crowdsource, crawl, or generate?

An open-source initiative dedicated to developing high-quality, culturally relevant data for SEA languages. (ACL)

publications

Do language models understand honorific systems in Javanese?

The ability of LMs to process Javanese honorifics through classification and machine translation tasks. (ACL)

publications

What causes knowledge loss in multilingual language models?

Exploring knowledge loss in multilingual LMs, focusing on linguistic differences affecting representational learning. (ACL)

publications

Training dynamics underlying language model scaling laws

Loss deceleration and ZSL provide new insights into the training dynamics underlying language model scaling laws. (ACL)

publications

Position: supervised classifiers answer the wrong questions

A critical re-examination of popular out-of-distribution (OOD) detection procedures. (ICML)

publications

Dynamic guardian models: realtime content moderation

Specialized classifiers that evaluate text based on predefined trustworthiness objectives. (ICML)

publications

Zero-shot tabular prediction via adversarial transformer

Introducing APT, an adversarially pre-trained transformer achieving SOTA on small tabular tasks. (ICML)