Publications

publications

FB-RAG: Improving RAG with forward and backward lookup

A new training-free framework based on a simple yet powerful forward-looking strategy. (AACL)

article |

publications

Integrating sequential and relational modeling

A collection of public datasets and prediction tasks that incorporate personal and relational events. (LoG)

article |

publications

Tuning-free LLM can build a strong recommender

A novel framework that constructs an intent-centric knowledge graph where both users and items are explicitly linked. (LoG)

article |

publications

Leveraging parameter space symmetries

Utilizing an alignment-first strategy to transfer advanced reasoning skills to a non-reasoning model (NeurIPS).

article |

publications

Play by the type rules: inferring constraints for small LMs

An efficient solution to enforce the well-typedness of LLM functions. (EurIPS)

article |

publications

Continual pre-training of MoEs: how robust is your router?

A systematic study of Mixture of Experts (MoE) continual pre-training. (NeurIPS)

article |

publications

T1: a tool-oriented conversational dataset

A conversational dataset specifically designed to capture and manage inter-tool dependencies across diverse domains. (NeurIPS)

article |

publications

R3: robust rubric-agnostic reward models

A novel reward modeling framework that is rubric-agnostic, generalizable and provides reasoned score assignments. (NeurIPS)

article |

publications

SoTA with less: MCTS-guided sample selection

Visual reasoning models that achieve SoTA performance using an order of magnitude fewer training samples. (NeurIPS)

article |

Publications

Foundational research advancing the state of the art in AI.

Capital One’s Applied AI research

FB-RAG: Improving RAG with forward and backward lookup

Integrating sequential and relational modeling

Tuning-free LLM can build a strong recommender

Leveraging parameter space symmetries

Play by the type rules: inferring constraints for small LMs

Continual pre-training of MoEs: how robust is your router?

T1: a tool-oriented conversational dataset

R3: robust rubric-agnostic reward models

SoTA with less: MCTS-guided sample selection

Footnotes