BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 5 days ago • 47
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 7 days ago • 36
facebook/dinov3-vits16-pretrain-lvd1689m Image Feature Extraction • 0.0B • Updated about 20 hours ago • 1.1k • 17
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated about 20 hours ago • 1.16k • 87
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 5 days ago • 180
view article Article TextQuests: How Good are LLMs at Text-Based Video Games? By justinphan3110 and 1 other • 8 days ago • 24