AI publications

Preference tuning with human feedback: a survey

A survey of recent advancements in aligning deep generative models with human preferences across language, speech and vision.

RainbowPO: unified preference optimization

A unified framework for preference optimization, aimed at better aligning AI with human values.

Enhancing LLM security with chain-of-thought fine-tuning

Fine-tuning and aligning chain-of-thought responses in LLMs for safer conversational AI.

Zero-shot tabular prediction via adversarial transformer

Introducing APT, an adversarially pre-trained transformer that achieves state-of-the-art results on small tabular tasks.

Red teaming LLMs: an end-to-end safety overview

A survey covering attack methods, evaluation, metrics and tools for identifying and mitigating GenAI application vulnerabilities.

Refusal tokens: a simple way to calibrate refusals in LLMs

A simple technique using refusal tokens to control and calibrate refusal behavior in large language models.

Efficient linear layers for neural networks

A search for efficient linear operators with optimal scaling laws, leading to the development of the BTT-MoE architecture.

Scaling laws for large time-series models

Discovering power-law scaling relationships in large time-series transformer models, analogous to those found in language models.

Re-evaluating evaluation for multilingual summarization

Standard metrics fail in non-English summarization, prompting a need for more nuanced evaluation frameworks.