A routing approach that assigns each problem to the smallest model likely to solve it, reducing compute. (NeurIPS)
article |
You are now leaving the Capital One website
You're leaving the Capital one website and heading to an external site. It may have different privacy and security policies, so take a moment to check them out.