Publications

Foundational research advancing the state of the art in AI.

Capital One’s Applied AI research

publications

Routing with generated data

A setting in which routers are trained on generated queries and answers produced from high-level task descriptions. (ACL)

publications

CommonLID: Re-evaluating language identification performance

A community-driven, human-annotated LID benchmark for the web domain, covering 109 languages. (ACL)

publications

Macaron: Controlled, human-written benchmark

A template-first benchmark that factorizes reasoning type and cultural aspect across question languages. (ACL)

publications

M4-RAG: A multimodal RAG

A massive-scale benchmark for evaluating retrieval-augmented VQA across languages and modalities. (CVPR)

publications

VLMs are confused tourists

A novel cultural adversarial robustness suite designed to assess VLMs’ stability against perturbed geographical cues. (CVPR)

publications

DynaGuard: A dynamic guardian model

A suite of dynamic guardian models offering novel flexibility by evaluating text based on user-defined policies. (ICLR)

publications

Alignment-weighted DPO

A DPO that targets the most problematic parts of an output by assigning different preference weights. (ICLR)

publications

Zero-shot multivariate time series forecasting

A framework for multivariate time series forecasting using tabular foundation models. (ICLR)

publications

EPSVec: Efficient and Private Synthetic Data Generation

A private text generation method that steers LLM generation using dataset vectors. (ICLR)