CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge Paper • 2404.06664 • Published Apr 10, 2024
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs Paper • 2410.02677 • Published Oct 3, 2024
From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models Paper • 2407.00263 • Published Jun 28, 2024
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 22 days ago • 82
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 22 days ago • 82
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval Paper • 2503.08644 • Published Mar 11 • 16