4. Deploy model whose predictions most resemble the ensemble mean (github.com) | 1 point | neehao | 5d ago | 1 comment
7. Dynamic E2E Agentic Simulation and Evaluation with Cypress (github.com) | 2 points | neehao | 12d ago | 0 comments
11. The User Is Stochastic: Testing Agentic Systems with Simulation and Evaluation (gojiberries.io) | 1 point | neehao | 19d ago | 1 comment
12. Slosizer: Right-size reserved LLM capacity Based on SLO (pypi.org) | 1 point | neehao | 22d ago | 0 comments
13. Pass-Through of Tariffs: Evidence from European Wine Imports (nber.org) | 76 points | neehao | 23d ago | 84 comments