Interesting SSL papers EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Paper โข 2311.02077 โข Published Nov 3, 2023 โข 16 System 2 Attention (is something you might need too) Paper โข 2311.11829 โข Published Nov 20, 2023 โข 43 Large Language Models for Mathematicians Paper โข 2312.04556 โข Published Dec 7, 2023 โข 13 VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper โข 2403.00522 โข Published Mar 1, 2024 โข 47
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Paper โข 2311.02077 โข Published Nov 3, 2023 โข 16
System 2 Attention (is something you might need too) Paper โข 2311.11829 โข Published Nov 20, 2023 โข 43
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks Paper โข 2403.00522 โข Published Mar 1, 2024 โข 47
LLM databricks/dbrx-instruct Text Generation โข Updated Apr 19, 2024 โข 9.25k โข 1.11k Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper โข 2412.05271 โข Published Dec 6, 2024 โข 155 Running 2.48k 2.48k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper โข 2412.05271 โข Published Dec 6, 2024 โข 155
Running 2.48k 2.48k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters