1Merge and Conquer: Evolutionarily Optimizing AI for 2048 (opens in new tab)(arxiv.org)1xianshou5mo ago0
2Stuck in the Matrix: Probing Spatial Reasoning in Large Language Models (opens in new tab)(arxiv.org)1xianshou5mo ago0
3Reflection AI Raises $2B to Build "American DeepSeek" (opens in new tab)(nytimes.com)9xianshou5mo ago2
4Nvidia-backed Reflection AI raising at $5.5B valuation (opens in new tab)(reuters.com)2xianshou5mo ago1
6DeepSeek V3 0324 is now the best nonthinking model (Reddit) (opens in new tab)(old.reddit.com)1xianshou0y ago0
7DeepSeek V3 0324 outpaces GPT 4.5 and Claude 3.7 in coding, other benchmarks (opens in new tab)(huggingface.co)7xianshou0y ago0
9InvestorBench: A Benchmark for Financial Decision-Making Tasks with Agents (opens in new tab)(arxiv.org)1xianshou1y ago0